bad IOPS when running multiple btest/fio in parallel

* bad IOPS when running multiple btest/fio in parallel
@ 2018-10-12  4:44 Yao Lin
  2018-10-12 14:39 ` Keith Busch
  2018-10-12 15:49 ` Bart Van Assche
  0 siblings, 2 replies; 6+ messages in thread
From: Yao Lin @ 2018-10-12  4:44 UTC (permalink / raw)

Today I changed to a much simpler setup and the same issue persists.

Directly connect 2 PCs (identical hardware) with a pair of 100G rNICs. Create a null block device on the target PC and configure it as the NVMeOF target. So, there is no switch or SSD in this setup. And this is a single FIO, not the 4 FIO in parallel I mentioned earlier.

Start fio test against that null block device from the host, the best IOPS is 1550K. That's the best IOPS after I try out many different QD, # of job, and CPU affinity setting. Run the same fio test on the target, I get 2250K IOPS (it jumps to 3650K when I increased the number of threads). ?

So it seems to me that Linux NVMe stack is quite good and can support 100Gb/s + throughput. But the same can not be said of the NVMeOF stack. Any tuning possible?

^ permalink raw reply	[flat|nested] 6+ messages in thread