All of lore.kernel.org
 help / color / mirror / Atom feed
* [Qemu-devel] [v2 0/2] add avx2 instruction optimization
@ 2015-11-10  2:51 Liang Li
  2015-11-10  2:51 ` [Qemu-devel] [v2 1/2] cutils: " Liang Li
                   ` (2 more replies)
  0 siblings, 3 replies; 35+ messages in thread
From: Liang Li @ 2015-11-10  2:51 UTC (permalink / raw)
  To: qemu-devel; +Cc: quintela, Liang Li, mst, amit.shah, pbonzini

buffer_find_nonzero_offset() is a hot function during live migration.
Now it use SSE2 intructions for optimization. For platform supports
AVX2 instructions, use the AVX2 instructions for optimization can help
to improve the performance about 30% comparing to SSE2.
Zero page check can be faster with this optimization, the test result
shows that for an 8GB RAM idle guest, this patch can help to shorten
the total live migration time about 6%.

This patch use the ifunc mechanism to select the proper function when
running, for platform supports AVX2, excute the AVX2 instructions,
else, excute the original code.

With patch, if build QEMU binary with AVX2 enabled, the binary can run
on both platforms support AVX2 or not.

If build QEMU binary with AVX2 diabled, or if compiler can not support
AVX2, the binary will not contain the AVX2 instruction, and it can run
on both platforms support AVX2 or not.

 
Liang Li (2):
  cutils: add avx2 instruction optimization
  configure: add options to config avx2

 configure             | 29 ++++++++++++++++++++++
 include/qemu-common.h | 28 +++++++++++++++------
 util/Makefile.objs    |  2 ++
 util/avx2.c           | 69 +++++++++++++++++++++++++++++++++++++++++++++++++++
 util/cutils.c         | 53 +++++++++++++++++++++++++++++++++++++--
 5 files changed, 172 insertions(+), 9 deletions(-)
 create mode 100644 util/avx2.c

-- 
1.9.1

^ permalink raw reply	[flat|nested] 35+ messages in thread

end of thread, other threads:[~2016-04-07 13:55 UTC | newest]

Thread overview: 35+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2015-11-10  2:51 [Qemu-devel] [v2 0/2] add avx2 instruction optimization Liang Li
2015-11-10  2:51 ` [Qemu-devel] [v2 1/2] cutils: " Liang Li
2015-11-12 10:08   ` Paolo Bonzini
2015-11-12 10:12     ` Li, Liang Z
2015-11-12 11:30     ` Juan Quintela
2015-11-13  2:49     ` Li, Liang Z
2015-11-13  9:30       ` Paolo Bonzini
2015-11-12 14:43   ` Richard Henderson
2015-11-10  2:51 ` [Qemu-devel] [v2 2/2] configure: add options to config avx2 Liang Li
2015-11-10  3:43 ` [Qemu-devel] [v2 0/2] add avx2 instruction optimization Eric Blake
2015-11-10  5:48   ` Li, Liang Z
2015-11-10  9:13     ` Juan Quintela
2015-11-10  9:26       ` Li, Liang Z
2015-11-10  9:35         ` Paolo Bonzini
2015-11-10  9:41           ` Li, Liang Z
2015-11-10  9:50             ` Paolo Bonzini
2015-11-10  9:56               ` Li, Liang Z
2015-11-10 10:00                 ` Paolo Bonzini
2015-11-10 10:04                   ` Li, Liang Z
2015-11-12  2:49           ` Li, Liang Z
2015-11-12  8:43             ` Paolo Bonzini
2015-11-12  8:53               ` Li, Liang Z
2015-11-12  9:04                 ` Paolo Bonzini
2015-11-12  9:40                   ` Li, Liang Z
2015-11-12  9:45                     ` Paolo Bonzini
2015-11-12  9:53                       ` Li, Liang Z
2015-11-12 11:34                         ` Juan Quintela
2015-11-12 11:42                           ` Li, Liang Z
2015-11-12 19:56                             ` Dr. David Alan Gilbert
2015-11-12 20:20                               ` Eric Blake
2016-04-07 11:09                                 ` Dr. David Alan Gilbert
2016-04-07 12:54                                   ` Michael S. Tsirkin
2016-04-07 13:42                                     ` Dr. David Alan Gilbert
2016-04-07 13:54                                     ` Paolo Bonzini
2015-11-10  9:30       ` Paolo Bonzini

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.