Optimizing mmap_queue on AVX/AVX2 CPUs

* Optimizing mmap_queue on AVX/AVX2 CPUs
@ 2017-08-26  1:46 Rebecca Cran
  2017-08-29 15:33 ` Jens Axboe
  0 siblings, 1 reply; 12+ messages in thread
From: Rebecca Cran @ 2017-08-26  1:46 UTC (permalink / raw)
  To: fio

I'm not sure how far we want to get into optimizing fio for specific CPUs?

I've done some testing and found that when running the mmap ioengine 
against an NVDIMM-N on a modern Intel CPU I can gain a few hundred MB/s 
by optimizing the memory copy using avx/avx2 versus the system's memcpy 
implementation.

Should I proceed with submitting a patch, or do we want to avoid getting 
into these sort of optimizations?

-- 
Rebecca

^ permalink raw reply	[flat|nested] 12+ messages in thread