Assuming you ran pktgen in H1 which is in embedded mode and testpmd in H2 which is in separated mode, can you elaborate if testpmd (a dpdk application ) can give better performance than ovs offload or ovs dpdk for pkt size between 64-512 bytes, is it recommended to use separated mode without offload for better performance ?