* [Qemu-devel] [PATCH 00/10] tb hash improvements
@ 2016-04-05  5:30 Emilio G. Cota
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 01/10] translate-all: add missing fold of tb_ctx into tcg_ctx Emilio G. Cota
                   ` (11 more replies)
  0 siblings, 12 replies; 62+ messages in thread
From: Emilio G. Cota @ 2016-04-05  5:30 UTC (permalink / raw)
  To: QEMU Developers, MTTCG Devel
  Cc: Peter Maydell, Peter Crosthwaite, Sergey Fedorov, Paolo Bonzini,
	Alex Bennée, Richard Henderson

This patchset is derived from my ongoing work on MTTCG, but does
not depend on it and brings improvements that we can already
benefit from. It applies cleanly on the current master and
is checkpatch-clean.

The key goal is to make the TB hash table faster and, while at it,
scalable. Tested on two different host machines, the execution time
improvement brought by this series when booting a debian jessie arm
image[*] with arm-softmmu is:

- Intel Xeon E5-2690: 21.2% less time
- Intel i7-4790K: 23.5% less time

This workload is particularly sensitive to TB hash performance.
Workloads that are less sensitive might see a slight performance
degradation with this patchset, since the hashing and lookup
functions now take more instructions. In any case, no significant
slowdowns should occur.

The commit logs are sometimes long because I have lots of numbers
to share.

The only bits I'm not too comfortable with in this series are patches
2 and 5; I don't develop on Windows so I'm shooting in the dark there.

Please take a look and if possible, test on workloads you care about!

Thanks,

		Emilio

[*] taskset -c 0 arm-softmmu/qemu-system-arm -machine type=virt -nographic \
     -smp 1 -m 4096 -netdev user,id=unet,hostfwd=tcp::2222-:22 \
     -device virtio-net-device,netdev=unet \
     -drive file=jessie-arm32.qcow2,id=myblock,index=0,if=none \
     -device virtio-blk-device,drive=myblock \
     -kernel aarch32-current-linux-kernel-only.img \
     -append 'console=ttyAMA0 root=/dev/vda1' \
     -name arm,debug-threads=on -smp 1 -tb-size 1024
The image is taken from:
  http://people.linaro.org/~alex.bennee/images/jessie-arm32.qcow2
The image was modified to call `shutdown -h now` right after boot.
The kernel is taken from:
  http://people.linaro.org/~alex.bennee/images/aarch32-current-linux-kernel-only.img


* [Qemu-devel] [PATCH 01/10] translate-all: add missing fold of tb_ctx into tcg_ctx
  2016-04-05  5:30 [Qemu-devel] [PATCH 00/10] tb hash improvements Emilio G. Cota
@ 2016-04-05  5:30 ` Emilio G. Cota
  2016-04-05  8:49   ` Paolo Bonzini
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 02/10] compiler.h: add QEMU_CACHELINE + QEMU_ALIGN() + QEMU_CACHELINE_ALIGNED Emilio G. Cota
                   ` (10 subsequent siblings)
  11 siblings, 1 reply; 62+ messages in thread
From: Emilio G. Cota @ 2016-04-05  5:30 UTC (permalink / raw)
  To: QEMU Developers, MTTCG Devel
  Cc: Peter Maydell, Peter Crosthwaite, Sergey Fedorov, Paolo Bonzini,
	Alex Bennée, Richard Henderson

Since 5e5f07e08 "TCG: Move translation block variables
to new context inside tcg_ctx: tb_ctx" on Feb 1 2013, compilation
of usermode + TB_DEBUG_CHECK has been broken. Fix it.

Signed-off-by: Emilio G. Cota <cota@braap.org>
---
 translate-all.c | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/translate-all.c b/translate-all.c
index b4df1ec..8329ea6 100644
--- a/translate-all.c
+++ b/translate-all.c
@@ -861,7 +861,8 @@ static void tb_invalidate_check(target_ulong address)
 
     address &= TARGET_PAGE_MASK;
     for (i = 0; i < CODE_GEN_PHYS_HASH_SIZE; i++) {
-        for (tb = tb_ctx.tb_phys_hash[i]; tb != NULL; tb = tb->phys_hash_next) {
+        for (tb = tcg_ctx.tb_ctx.tb_phys_hash[i]; tb != NULL;
+             tb = tb->phys_hash_next) {
             if (!(address + TARGET_PAGE_SIZE <= tb->pc ||
                   address >= tb->pc + tb->size)) {
                 printf("ERROR invalidate: address=" TARGET_FMT_lx
-- 
2.5.0


* [Qemu-devel] [PATCH 02/10] compiler.h: add QEMU_CACHELINE + QEMU_ALIGN() + QEMU_CACHELINE_ALIGNED
  2016-04-05  5:30 [Qemu-devel] [PATCH 00/10] tb hash improvements Emilio G. Cota
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 01/10] translate-all: add missing fold of tb_ctx into tcg_ctx Emilio G. Cota
@ 2016-04-05  5:30 ` Emilio G. Cota
  2016-04-05  7:57   ` Peter Maydell
                     ` (2 more replies)
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 03/10] seqlock: remove optional mutex Emilio G. Cota
                   ` (9 subsequent siblings)
  11 siblings, 3 replies; 62+ messages in thread
From: Emilio G. Cota @ 2016-04-05  5:30 UTC (permalink / raw)
  To: QEMU Developers, MTTCG Devel
  Cc: Peter Maydell, Peter Crosthwaite, Sergey Fedorov, Paolo Bonzini,
	Alex Bennée, Richard Henderson

I'm assuming Windows compilers don't support this attribute.
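
As a rough illustration of the intent (hypothetical struct, not part of
this change), the new macro would be used like this:

    /* keep per-CPU stats on their own cache line to avoid false sharing */
    struct cpu_stats {
        uint64_t tb_lookups;
        uint64_t tb_misses;
    } QEMU_CACHELINE_ALIGNED;

On _WIN32 the attribute expands to nothing, so there it is only a hint.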

Signed-off-by: Emilio G. Cota <cota@braap.org>
---
 include/qemu/compiler.h | 10 ++++++++++
 1 file changed, 10 insertions(+)

diff --git a/include/qemu/compiler.h b/include/qemu/compiler.h
index 8f1cc7b..fb946f1 100644
--- a/include/qemu/compiler.h
+++ b/include/qemu/compiler.h
@@ -41,6 +41,16 @@
 # define QEMU_PACKED __attribute__((packed))
 #endif
 
+#define QEMU_CACHELINE (64)
+
+#if defined(_WIN32)
+#define QEMU_ALIGN(B)
+#else
+#define QEMU_ALIGN(B) __attribute__((aligned(B)))
+#endif
+
+#define QEMU_CACHELINE_ALIGNED QEMU_ALIGN(QEMU_CACHELINE)
+
 #ifndef glue
 #define xglue(x, y) x ## y
 #define glue(x, y) xglue(x, y)
-- 
2.5.0


* [Qemu-devel] [PATCH 03/10] seqlock: remove optional mutex
  2016-04-05  5:30 [Qemu-devel] [PATCH 00/10] tb hash improvements Emilio G. Cota
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 01/10] translate-all: add missing fold of tb_ctx into tcg_ctx Emilio G. Cota
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 02/10] compiler.h: add QEMU_CACHELINE + QEMU_ALIGN() + QEMU_CACHELINE_ALIGNED Emilio G. Cota
@ 2016-04-05  5:30 ` Emilio G. Cota
  2016-04-06  8:38   ` Alex Bennée
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 04/10] seqlock: rename write_lock/unlock to write_begin/end Emilio G. Cota
                   ` (8 subsequent siblings)
  11 siblings, 1 reply; 62+ messages in thread
From: Emilio G. Cota @ 2016-04-05  5:30 UTC (permalink / raw)
  To: QEMU Developers, MTTCG Devel
  Cc: Peter Maydell, Peter Crosthwaite, Sergey Fedorov, Paolo Bonzini,
	Alex Bennée, Richard Henderson

This option is unused; besides, it bloats the struct when not needed.
Let's just let writers define their own locks elsewhere.
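
A minimal sketch of what a writer looks like after this change (the
external lock and the protected field are hypothetical; writers are now
expected to provide their own serialization):

    static QemuMutex lock;        /* writer-side lock, defined elsewhere */
    static QemuSeqLock sl;
    static int64_t shared_value;  /* field protected by the seqlock */

    static void update_value(int64_t v)
    {
        qemu_mutex_lock(&lock);      /* serialize writers externally */
        seqlock_write_lock(&sl);     /* bump sequence; readers will retry */
        shared_value = v;
        seqlock_write_unlock(&sl);
        qemu_mutex_unlock(&lock);
    }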

Signed-off-by: Emilio G. Cota <cota@braap.org>
---
 cpus.c                 |  2 +-
 include/qemu/seqlock.h | 10 +---------
 2 files changed, 2 insertions(+), 10 deletions(-)

diff --git a/cpus.c b/cpus.c
index 8ae4777..7f550b9 100644
--- a/cpus.c
+++ b/cpus.c
@@ -611,7 +611,7 @@ int cpu_throttle_get_percentage(void)
 
 void cpu_ticks_init(void)
 {
-    seqlock_init(&timers_state.vm_clock_seqlock, NULL);
+    seqlock_init(&timers_state.vm_clock_seqlock);
     vmstate_register(NULL, 0, &vmstate_timers, &timers_state);
     throttle_timer = timer_new_ns(QEMU_CLOCK_VIRTUAL_RT,
                                            cpu_throttle_timer_tick, NULL);
diff --git a/include/qemu/seqlock.h b/include/qemu/seqlock.h
index 70b01fd..e673482 100644
--- a/include/qemu/seqlock.h
+++ b/include/qemu/seqlock.h
@@ -19,22 +19,17 @@
 typedef struct QemuSeqLock QemuSeqLock;
 
 struct QemuSeqLock {
-    QemuMutex *mutex;
     unsigned sequence;
 };
 
-static inline void seqlock_init(QemuSeqLock *sl, QemuMutex *mutex)
+static inline void seqlock_init(QemuSeqLock *sl)
 {
-    sl->mutex = mutex;
     sl->sequence = 0;
 }
 
 /* Lock out other writers and update the count.  */
 static inline void seqlock_write_lock(QemuSeqLock *sl)
 {
-    if (sl->mutex) {
-        qemu_mutex_lock(sl->mutex);
-    }
     ++sl->sequence;
 
     /* Write sequence before updating other fields.  */
@@ -47,9 +42,6 @@ static inline void seqlock_write_unlock(QemuSeqLock *sl)
     smp_wmb();
 
     ++sl->sequence;
-    if (sl->mutex) {
-        qemu_mutex_unlock(sl->mutex);
-    }
 }
 
 static inline unsigned seqlock_read_begin(QemuSeqLock *sl)
-- 
2.5.0


* [Qemu-devel] [PATCH 04/10] seqlock: rename write_lock/unlock to write_begin/end
  2016-04-05  5:30 [Qemu-devel] [PATCH 00/10] tb hash improvements Emilio G. Cota
                   ` (2 preceding siblings ...)
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 03/10] seqlock: remove optional mutex Emilio G. Cota
@ 2016-04-05  5:30 ` Emilio G. Cota
  2016-04-06  8:42   ` Alex Bennée
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 05/10] include: add spinlock wrapper Emilio G. Cota
                   ` (7 subsequent siblings)
  11 siblings, 1 reply; 62+ messages in thread
From: Emilio G. Cota @ 2016-04-05  5:30 UTC (permalink / raw)
  To: QEMU Developers, MTTCG Devel
  Cc: Peter Maydell, Peter Crosthwaite, Sergey Fedorov, Paolo Bonzini,
	Alex Bennée, Richard Henderson

It is a more appropriate name, now that the mutex embedded
in the seqlock is gone.
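
For context, a minimal sketch of the read side that these begin/end
calls pair with, modeled on how cpus.c reads the timer state:

    static int64_t read_clock_offset(void)
    {
        unsigned start;
        int64_t v;

        do {
            start = seqlock_read_begin(&timers_state.vm_clock_seqlock);
            v = timers_state.cpu_clock_offset;   /* protected field */
        } while (seqlock_read_retry(&timers_state.vm_clock_seqlock, start));
        return v;
    }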

Signed-off-by: Emilio G. Cota <cota@braap.org>
---
 cpus.c                 | 28 ++++++++++++++--------------
 include/qemu/seqlock.h |  4 ++--
 2 files changed, 16 insertions(+), 16 deletions(-)

diff --git a/cpus.c b/cpus.c
index 7f550b9..7dad2e6 100644
--- a/cpus.c
+++ b/cpus.c
@@ -247,13 +247,13 @@ int64_t cpu_get_clock(void)
 void cpu_enable_ticks(void)
 {
     /* Here, the really thing protected by seqlock is cpu_clock_offset. */
-    seqlock_write_lock(&timers_state.vm_clock_seqlock);
+    seqlock_write_begin(&timers_state.vm_clock_seqlock);
     if (!timers_state.cpu_ticks_enabled) {
         timers_state.cpu_ticks_offset -= cpu_get_host_ticks();
         timers_state.cpu_clock_offset -= get_clock();
         timers_state.cpu_ticks_enabled = 1;
     }
-    seqlock_write_unlock(&timers_state.vm_clock_seqlock);
+    seqlock_write_end(&timers_state.vm_clock_seqlock);
 }
 
 /* disable cpu_get_ticks() : the clock is stopped. You must not call
@@ -263,13 +263,13 @@ void cpu_enable_ticks(void)
 void cpu_disable_ticks(void)
 {
     /* Here, the really thing protected by seqlock is cpu_clock_offset. */
-    seqlock_write_lock(&timers_state.vm_clock_seqlock);
+    seqlock_write_begin(&timers_state.vm_clock_seqlock);
     if (timers_state.cpu_ticks_enabled) {
         timers_state.cpu_ticks_offset += cpu_get_host_ticks();
         timers_state.cpu_clock_offset = cpu_get_clock_locked();
         timers_state.cpu_ticks_enabled = 0;
     }
-    seqlock_write_unlock(&timers_state.vm_clock_seqlock);
+    seqlock_write_end(&timers_state.vm_clock_seqlock);
 }
 
 /* Correlation between real and virtual time is always going to be
@@ -292,7 +292,7 @@ static void icount_adjust(void)
         return;
     }
 
-    seqlock_write_lock(&timers_state.vm_clock_seqlock);
+    seqlock_write_begin(&timers_state.vm_clock_seqlock);
     cur_time = cpu_get_clock_locked();
     cur_icount = cpu_get_icount_locked();
 
@@ -313,7 +313,7 @@ static void icount_adjust(void)
     last_delta = delta;
     timers_state.qemu_icount_bias = cur_icount
                               - (timers_state.qemu_icount << icount_time_shift);
-    seqlock_write_unlock(&timers_state.vm_clock_seqlock);
+    seqlock_write_end(&timers_state.vm_clock_seqlock);
 }
 
 static void icount_adjust_rt(void *opaque)
@@ -345,7 +345,7 @@ static void icount_warp_rt(void)
         return;
     }
 
-    seqlock_write_lock(&timers_state.vm_clock_seqlock);
+    seqlock_write_begin(&timers_state.vm_clock_seqlock);
     if (runstate_is_running()) {
         int64_t clock = REPLAY_CLOCK(REPLAY_CLOCK_VIRTUAL_RT,
                                      cpu_get_clock_locked());
@@ -364,7 +364,7 @@ static void icount_warp_rt(void)
         timers_state.qemu_icount_bias += warp_delta;
     }
     vm_clock_warp_start = -1;
-    seqlock_write_unlock(&timers_state.vm_clock_seqlock);
+    seqlock_write_end(&timers_state.vm_clock_seqlock);
 
     if (qemu_clock_expired(QEMU_CLOCK_VIRTUAL)) {
         qemu_clock_notify(QEMU_CLOCK_VIRTUAL);
@@ -389,9 +389,9 @@ void qtest_clock_warp(int64_t dest)
         int64_t deadline = qemu_clock_deadline_ns_all(QEMU_CLOCK_VIRTUAL);
         int64_t warp = qemu_soonest_timeout(dest - clock, deadline);
 
-        seqlock_write_lock(&timers_state.vm_clock_seqlock);
+        seqlock_write_begin(&timers_state.vm_clock_seqlock);
         timers_state.qemu_icount_bias += warp;
-        seqlock_write_unlock(&timers_state.vm_clock_seqlock);
+        seqlock_write_end(&timers_state.vm_clock_seqlock);
 
         qemu_clock_run_timers(QEMU_CLOCK_VIRTUAL);
         timerlist_run_timers(aio_context->tlg.tl[QEMU_CLOCK_VIRTUAL]);
@@ -458,9 +458,9 @@ void qemu_start_warp_timer(void)
              * It is useful when we want a deterministic execution time,
              * isolated from host latencies.
              */
-            seqlock_write_lock(&timers_state.vm_clock_seqlock);
+            seqlock_write_begin(&timers_state.vm_clock_seqlock);
             timers_state.qemu_icount_bias += deadline;
-            seqlock_write_unlock(&timers_state.vm_clock_seqlock);
+            seqlock_write_end(&timers_state.vm_clock_seqlock);
             qemu_clock_notify(QEMU_CLOCK_VIRTUAL);
         } else {
             /*
@@ -471,11 +471,11 @@ void qemu_start_warp_timer(void)
              * you will not be sending network packets continuously instead of
              * every 100ms.
              */
-            seqlock_write_lock(&timers_state.vm_clock_seqlock);
+            seqlock_write_begin(&timers_state.vm_clock_seqlock);
             if (vm_clock_warp_start == -1 || vm_clock_warp_start > clock) {
                 vm_clock_warp_start = clock;
             }
-            seqlock_write_unlock(&timers_state.vm_clock_seqlock);
+            seqlock_write_end(&timers_state.vm_clock_seqlock);
             timer_mod_anticipate(icount_warp_timer, clock + deadline);
         }
     } else if (deadline == 0) {
diff --git a/include/qemu/seqlock.h b/include/qemu/seqlock.h
index e673482..4dfc055 100644
--- a/include/qemu/seqlock.h
+++ b/include/qemu/seqlock.h
@@ -28,7 +28,7 @@ static inline void seqlock_init(QemuSeqLock *sl)
 }
 
 /* Lock out other writers and update the count.  */
-static inline void seqlock_write_lock(QemuSeqLock *sl)
+static inline void seqlock_write_begin(QemuSeqLock *sl)
 {
     ++sl->sequence;
 
@@ -36,7 +36,7 @@ static inline void seqlock_write_lock(QemuSeqLock *sl)
     smp_wmb();
 }
 
-static inline void seqlock_write_unlock(QemuSeqLock *sl)
+static inline void seqlock_write_end(QemuSeqLock *sl)
 {
     /* Write other fields before finalizing sequence.  */
     smp_wmb();
-- 
2.5.0


* [Qemu-devel] [PATCH 05/10] include: add spinlock wrapper
  2016-04-05  5:30 [Qemu-devel] [PATCH 00/10] tb hash improvements Emilio G. Cota
                   ` (3 preceding siblings ...)
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 04/10] seqlock: rename write_lock/unlock to write_begin/end Emilio G. Cota
@ 2016-04-05  5:30 ` Emilio G. Cota
  2016-04-05  8:51   ` Paolo Bonzini
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 06/10] include: add xxhash.h Emilio G. Cota
                   ` (6 subsequent siblings)
  11 siblings, 1 reply; 62+ messages in thread
From: Emilio G. Cota @ 2016-04-05  5:30 UTC (permalink / raw)
  To: QEMU Developers, MTTCG Devel
  Cc: Peter Maydell, Peter Crosthwaite, Sergey Fedorov, Paolo Bonzini,
	Alex Bennée, Richard Henderson

Wrap pthread_spin on POSIX, or QemuMutex on Windows.

AFAIK there is no off-the-shelf spinlock implementation for
Windows, so we'll just use QemuMutex.
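
Usage is the same on both platforms; a minimal sketch (the counter is
hypothetical, just to illustrate a short critical section):

    static QemuSpinLock lock;
    static unsigned long counter;

    static void counter_init(void)
    {
        qemu_spinlock_init(&lock);
    }

    static void counter_inc(void)
    {
        qemu_spinlock_lock(&lock);    /* keep critical sections short */
        counter++;
        qemu_spinlock_unlock(&lock);
    }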

Signed-off-by: Emilio G. Cota <cota@braap.org>
---
 include/qemu/spinlock-posix.h | 60 +++++++++++++++++++++++++++++++++++++++++++
 include/qemu/spinlock-win32.h | 33 ++++++++++++++++++++++++
 include/qemu/spinlock.h       | 10 ++++++++
 3 files changed, 103 insertions(+)
 create mode 100644 include/qemu/spinlock-posix.h
 create mode 100644 include/qemu/spinlock-win32.h
 create mode 100644 include/qemu/spinlock.h

diff --git a/include/qemu/spinlock-posix.h b/include/qemu/spinlock-posix.h
new file mode 100644
index 0000000..51c2c08
--- /dev/null
+++ b/include/qemu/spinlock-posix.h
@@ -0,0 +1,60 @@
+#ifndef QEMU_SPINLOCK_POSIX_H
+#define QEMU_SPINLOCK_POSIX_H
+
+#include <qemu/thread.h>
+#include <qemu/osdep.h>
+
+typedef pthread_spinlock_t QemuSpinLock;
+
+static inline void qemu_spinlock_error_exit(int err, const char *msg)
+{
+    fprintf(stderr, "qemu: %s: %s\n", msg, strerror(err));
+    abort();
+}
+
+static inline void qemu_spinlock_init(QemuSpinLock *lock)
+{
+    int rc;
+
+    rc = pthread_spin_init(lock, PTHREAD_PROCESS_SHARED);
+    if (unlikely(rc)) {
+        qemu_spinlock_error_exit(rc, __func__);
+    }
+}
+
+static inline void qemu_spinlock_destroy(QemuSpinLock *lock)
+{
+    int rc;
+
+    rc = pthread_spin_destroy(lock);
+    if (unlikely(rc)) {
+        qemu_spinlock_error_exit(rc, __func__);
+    }
+}
+
+static inline void qemu_spinlock_lock(QemuSpinLock *lock)
+{
+    int rc;
+
+    rc = pthread_spin_lock(lock);
+    if (unlikely(rc)) {
+        qemu_spinlock_error_exit(rc, __func__);
+    }
+}
+
+static inline int qemu_spinlock_trylock(QemuSpinLock *lock)
+{
+    return pthread_spin_trylock(lock);
+}
+
+static inline void qemu_spinlock_unlock(QemuSpinLock *lock)
+{
+    int rc;
+
+    rc = pthread_spin_unlock(lock);
+    if (unlikely(rc)) {
+        qemu_spinlock_error_exit(rc, __func__);
+    }
+}
+
+#endif /* QEMU_SPINLOCK_POSIX_H */
diff --git a/include/qemu/spinlock-win32.h b/include/qemu/spinlock-win32.h
new file mode 100644
index 0000000..5a105fb
--- /dev/null
+++ b/include/qemu/spinlock-win32.h
@@ -0,0 +1,33 @@
+#ifndef QEMU_SPINLOCK_WIN32_H
+#define QEMU_SPINLOCK_WIN32_H
+
+#include <qemu/thread.h>
+
+typedef QemuMutex QemuSpinLock;
+
+static inline void qemu_spinlock_init(QemuSpinLock *lock)
+{
+    qemu_mutex_init(lock);
+}
+
+static inline void qemu_spinlock_destroy(QemuSpinLock *lock)
+{
+    qemu_mutex_destroy(lock);
+}
+
+static inline void qemu_spinlock_lock(QemuSpinLock *lock)
+{
+    qemu_mutex_lock(lock);
+}
+
+static inline int qemu_spinlock_trylock(QemuSpinLock *lock)
+{
+    return qemu_mutex_trylock(lock);
+}
+
+static inline void qemu_spinlock_unlock(QemuSpinLock *lock)
+{
+    qemu_mutex_unlock(lock);
+}
+
+#endif /* QEMU_SPINLOCK_WIN32_H */
diff --git a/include/qemu/spinlock.h b/include/qemu/spinlock.h
new file mode 100644
index 0000000..001b55e
--- /dev/null
+++ b/include/qemu/spinlock.h
@@ -0,0 +1,10 @@
+#ifndef QEMU_SPINLOCK_H
+#define QEMU_SPINLOCK_H
+
+#ifdef _WIN32
+#include "spinlock-win32.h"
+#else
+#include "spinlock-posix.h"
+#endif
+
+#endif /* QEMU_SPINLOCK_H */
-- 
2.5.0


* [Qemu-devel] [PATCH 06/10] include: add xxhash.h
  2016-04-05  5:30 [Qemu-devel] [PATCH 00/10] tb hash improvements Emilio G. Cota
                   ` (4 preceding siblings ...)
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 05/10] include: add spinlock wrapper Emilio G. Cota
@ 2016-04-05  5:30 ` Emilio G. Cota
  2016-04-06 11:39   ` Alex Bennée
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 07/10] tb hash: hash phys_pc, pc, and flags with xxhash Emilio G. Cota
                   ` (5 subsequent siblings)
  11 siblings, 1 reply; 62+ messages in thread
From: Emilio G. Cota @ 2016-04-05  5:30 UTC (permalink / raw)
  To: QEMU Developers, MTTCG Devel
  Cc: Peter Maydell, Peter Crosthwaite, Sergey Fedorov, Paolo Bonzini,
	Alex Bennée, Richard Henderson

xxhash is a fast, high-quality hashing function. The appended patch
brings in the 32-bit version of it, with the small modification that
it assumes the data to be hashed is made of 32-bit chunks; this slightly
increases speed for the use case we care about, i.e. tb-hash.

The original algorithm, as well as a 64-bit implementation, can be found at:
  https://github.com/Cyan4973/xxHash
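
A minimal sketch of the intended call pattern (hypothetical key; the
real user is the tb-hash patch later in this series):

    static uint32_t hash_example(void)
    {
        struct key {
            uint32_t a;
            uint32_t b;
        } QEMU_PACKED k = { .a = 0xdeadbeef, .b = 0xcafe };

        /* n is the number of u32 chunks, not bytes */
        return qemu_xxh32((const uint32_t *)&k,
                          sizeof(k) / sizeof(uint32_t), /* seed */ 1);
    }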

Signed-off-by: Emilio G. Cota <cota@braap.org>
---
 include/qemu/xxhash.h | 106 ++++++++++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 106 insertions(+)
 create mode 100644 include/qemu/xxhash.h

diff --git a/include/qemu/xxhash.h b/include/qemu/xxhash.h
new file mode 100644
index 0000000..a13a665
--- /dev/null
+++ b/include/qemu/xxhash.h
@@ -0,0 +1,106 @@
+/*
+ * xxHash - Fast Hash algorithm
+ * Copyright (C) 2012-2016, Yann Collet
+ *
+ * BSD 2-Clause License (http://www.opensource.org/licenses/bsd-license.php)
+ *
+ * Redistribution and use in source and binary forms, with or without
+ * modification, are permitted provided that the following conditions are
+ * met:
+ *
+ * + Redistributions of source code must retain the above copyright
+ * notice, this list of conditions and the following disclaimer.
+ * + Redistributions in binary form must reproduce the above
+ * copyright notice, this list of conditions and the following disclaimer
+ * in the documentation and/or other materials provided with the
+ * distribution.
+ *
+ * THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS
+ * "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT
+ * LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR
+ * A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT
+ * OWNER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL,
+ * SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT
+ * LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE,
+ * DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY
+ * THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT
+ * (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE
+ * OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
+ *
+ * You can contact the author at :
+ * - xxHash source repository : https://github.com/Cyan4973/xxHash
+ */
+#ifndef QEMU_XXHASH_H
+#define QEMU_XXHASH_H
+
+#define PRIME32_1   2654435761U
+#define PRIME32_2   2246822519U
+#define PRIME32_3   3266489917U
+#define PRIME32_4    668265263U
+#define PRIME32_5    374761393U
+
+/*
+ * Note : although _rotl exists for minGW (GCC under windows), performance
+ * seems poor.
+ */
+#if defined(_MSC_VER)
+#  define XXH_rotl32(x, r) _rotl(x, r)
+#else
+#  define XXH_rotl32(x, r) ((x << r) | (x >> (32 - r)))
+#endif
+
+/* u32 hash of @n contiguous chunks of u32's */
+static inline uint32_t qemu_xxh32(const uint32_t *p, size_t n, uint32_t seed)
+{
+    const uint32_t *end = p + n;
+    uint32_t h32;
+
+    if (n >= 4) {
+        const uint32_t * const limit = end - 4;
+        uint32_t v1 = seed + PRIME32_1 + PRIME32_2;
+        uint32_t v2 = seed + PRIME32_2;
+        uint32_t v3 = seed + 0;
+        uint32_t v4 = seed - PRIME32_1;
+
+        do {
+            v1 += *p * PRIME32_2;
+            v1 = XXH_rotl32(v1, 13);
+            v1 *= PRIME32_1;
+            p++;
+            v2 += *p * PRIME32_2;
+            v2 = XXH_rotl32(v2, 13);
+            v2 *= PRIME32_1;
+            p++;
+            v3 += *p * PRIME32_2;
+            v3 = XXH_rotl32(v3, 13);
+            v3 *= PRIME32_1;
+            p++;
+            v4 += *p * PRIME32_2;
+            v4 = XXH_rotl32(v4, 13);
+            v4 *= PRIME32_1;
+            p++;
+        } while (p <= limit);
+        h32 = XXH_rotl32(v1, 1) + XXH_rotl32(v2, 7) + XXH_rotl32(v3, 12) +
+            XXH_rotl32(v4, 18);
+    } else {
+        h32  = seed + PRIME32_5;
+    }
+
+    h32 += n * sizeof(uint32_t);
+
+    while (p < end) {
+        h32 += *p * PRIME32_3;
+        h32  = XXH_rotl32(h32, 17) * PRIME32_4 ;
+        p++;
+    }
+
+    h32 ^= h32 >> 15;
+    h32 *= PRIME32_2;
+    h32 ^= h32 >> 13;
+    h32 *= PRIME32_3;
+    h32 ^= h32 >> 16;
+
+    return h32;
+}
+
+#endif /* QEMU_XXHASH_H */
-- 
2.5.0


* [Qemu-devel] [PATCH 07/10] tb hash: hash phys_pc, pc, and flags with xxhash
  2016-04-05  5:30 [Qemu-devel] [PATCH 00/10] tb hash improvements Emilio G. Cota
                   ` (5 preceding siblings ...)
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 06/10] include: add xxhash.h Emilio G. Cota
@ 2016-04-05  5:30 ` Emilio G. Cota
  2016-04-05 15:41   ` Richard Henderson
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 08/10] qht: QEMU's fast, resizable and scalable Hash Table Emilio G. Cota
                   ` (4 subsequent siblings)
  11 siblings, 1 reply; 62+ messages in thread
From: Emilio G. Cota @ 2016-04-05  5:30 UTC (permalink / raw)
  To: QEMU Developers, MTTCG Devel
  Cc: Peter Maydell, Peter Crosthwaite, Sergey Fedorov, Paolo Bonzini,
	Alex Bennée, Richard Henderson

For some workloads, such as arm bootup, tb_phys_hash is performance-critical.
This is due to the high frequency of accesses to the hash table, caused
by (frequent) TLB flushes that wipe out the cpu-private tb_jmp_cache's.
More info:
  https://lists.nongnu.org/archive/html/qemu-devel/2016-03/msg05098.html

To dig further into this I modified an arm image booting debian jessie to
immediately shut down after boot. Analysis revealed that quite a bit of time
is unnecessarily spent in tb_phys_hash: the cause is poor hashing that
results in very uneven loading of chains in the hash table's buckets;
the longest observed chain had ~550 elements.

The appended patch addresses this with two changes:

1) Use xxhash as the hash table's hash function. xxhash is a fast,
   high-quality hashing function.

2) Feed the hashing function with not just tb_phys, but also pc and flags.

This improves performance over using just tb_phys for hashing, since that
resulted in some hash buckets having many TBs while others got very few;
with these changes, the longest observed chain on a single hash bucket is
brought down from ~550 to ~40.

Tests show that the other element checked for in tb_find_physical,
cs_base, is always a match when tb_phys+pc+flags are a match,
so hashing cs_base is wasteful. It could be that this is an ARM-only
thing, though.

BTW, after this change the hash table should not be called "tb_phys_hash"
anymore; this is addressed later in this series.

This change gives consistent bootup time improvements. I tested two
host machines:
- Intel Xeon E5-2690: 11.6% less time
- Intel i7-4790K: 19.2% less time

Increasing the number of hash buckets yields further improvements. However,
using a larger, fixed number of buckets can degrade performance for other
workloads that do not translate as many blocks (600K+ for debian-jessie arm
bootup). This is dealt with later in this series.

Signed-off-by: Emilio G. Cota <cota@braap.org>
---
 cpu-exec.c             |  2 +-
 include/exec/tb-hash.h | 19 +++++++++++++++++--
 translate-all.c        |  6 +++---
 3 files changed, 21 insertions(+), 6 deletions(-)

diff --git a/cpu-exec.c b/cpu-exec.c
index bbfcbfb..4194a4a 100644
--- a/cpu-exec.c
+++ b/cpu-exec.c
@@ -233,7 +233,7 @@ static TranslationBlock *tb_find_physical(CPUState *cpu,
     /* find translated block using physical mappings */
     phys_pc = get_page_addr_code(env, pc);
     phys_page1 = phys_pc & TARGET_PAGE_MASK;
-    h = tb_phys_hash_func(phys_pc);
+    h = tb_hash_func(phys_pc, pc, flags);
     ptb1 = &tcg_ctx.tb_ctx.tb_phys_hash[h];
     for(;;) {
         tb = *ptb1;
diff --git a/include/exec/tb-hash.h b/include/exec/tb-hash.h
index 0f4e8a0..c4942e1 100644
--- a/include/exec/tb-hash.h
+++ b/include/exec/tb-hash.h
@@ -20,6 +20,9 @@
 #ifndef EXEC_TB_HASH
 #define EXEC_TB_HASH
 
+#include "exec/exec-all.h"
+#include "qemu/xxhash.h"
+
 /* Only the bottom TB_JMP_PAGE_BITS of the jump cache hash bits vary for
    addresses on the same page.  The top bits are the same.  This allows
    TLB invalidation to quickly clear a subset of the hash table.  */
@@ -43,9 +46,21 @@ static inline unsigned int tb_jmp_cache_hash_func(target_ulong pc)
            | (tmp & TB_JMP_ADDR_MASK));
 }
 
-static inline unsigned int tb_phys_hash_func(tb_page_addr_t pc)
+static inline
+uint32_t tb_hash_func(tb_page_addr_t phys_pc, target_ulong pc, uint64_t flags)
 {
-    return (pc >> 2) & (CODE_GEN_PHYS_HASH_SIZE - 1);
+    struct key {
+        tb_page_addr_t phys_pc;
+        target_ulong pc;
+        uint64_t flags;
+    } QEMU_PACKED k;
+    unsigned int hash;
+
+    k.phys_pc = phys_pc;
+    k.pc = pc;
+    k.flags = flags;
+    hash = qemu_xxh32((uint32_t *)&k, sizeof(k) / sizeof(uint32_t), 1);
+    return hash & (CODE_GEN_PHYS_HASH_SIZE - 1);
 }
 
 #endif
diff --git a/translate-all.c b/translate-all.c
index 8329ea6..8e7f9a7 100644
--- a/translate-all.c
+++ b/translate-all.c
@@ -972,7 +972,7 @@ void tb_phys_invalidate(TranslationBlock *tb, tb_page_addr_t page_addr)
 
     /* remove the TB from the hash list */
     phys_pc = tb->page_addr[0] + (tb->pc & ~TARGET_PAGE_MASK);
-    h = tb_phys_hash_func(phys_pc);
+    h = tb_hash_func(phys_pc, tb->pc, tb->flags);
     tb_hash_remove(&tcg_ctx.tb_ctx.tb_phys_hash[h], tb);
 
     /* remove the TB from the page list */
@@ -1474,8 +1474,8 @@ static void tb_link_page(TranslationBlock *tb, tb_page_addr_t phys_pc,
     unsigned int h;
     TranslationBlock **ptb;
 
-    /* add in the physical hash table */
-    h = tb_phys_hash_func(phys_pc);
+    /* add in the hash table */
+    h = tb_hash_func(phys_pc, tb->pc, tb->flags);
     ptb = &tcg_ctx.tb_ctx.tb_phys_hash[h];
     tb->phys_hash_next = *ptb;
     *ptb = tb;
-- 
2.5.0


* [Qemu-devel] [PATCH 08/10] qht: QEMU's fast, resizable and scalable Hash Table
  2016-04-05  5:30 [Qemu-devel] [PATCH 00/10] tb hash improvements Emilio G. Cota
                   ` (6 preceding siblings ...)
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 07/10] tb hash: hash phys_pc, pc, and flags with xxhash Emilio G. Cota
@ 2016-04-05  5:30 ` Emilio G. Cota
  2016-04-05  9:01   ` Paolo Bonzini
                     ` (2 more replies)
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 09/10] qht: add test program Emilio G. Cota
                   ` (3 subsequent siblings)
  11 siblings, 3 replies; 62+ messages in thread
From: Emilio G. Cota @ 2016-04-05  5:30 UTC (permalink / raw)
  To: QEMU Developers, MTTCG Devel
  Cc: Peter Maydell, Peter Crosthwaite, Sergey Fedorov, Paolo Bonzini,
	Alex Bennée, Richard Henderson

This is a hash table with optional auto-resizing and MRU promotion for
reads and writes. Its implementation goal is to stay fast while
scaling for read-mostly workloads.

A hash table with these features will be necessary for the scalability
of the ongoing MTTCG work; before those changes arrive we can already
benefit from the single-threaded speedup that qht also provides.
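
A minimal sketch of the API added below (the item type is hypothetical;
the next patch adds a proper test program):

    struct item {
        uint32_t id;
    };

    static bool item_cmp(const void *obj, const void *userp)
    {
        const struct item *a = obj;
        const struct item *b = userp;

        return a->id == b->id;
    }

    static struct qht ht;

    /* writers (insert/remove) need an external lock; lookups do not */
    static void example(struct item *it, uint32_t hash)
    {
        struct item *p;

        qht_init(&ht, 1 << 16, QHT_MODE_AUTO_RESIZE);
        qht_insert(&ht, it, hash);
        p = qht_lookup(&ht, item_cmp, it, hash);
        assert(p == it);
        qht_remove(&ht, it, hash);
        qht_destroy(&ht);
    }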

Signed-off-by: Emilio G. Cota <cota@braap.org>
---
 include/qemu/qht.h | 150 ++++++++++++++++++
 util/Makefile.objs |   2 +-
 util/qht.c         | 458 +++++++++++++++++++++++++++++++++++++++++++++++++++++
 3 files changed, 609 insertions(+), 1 deletion(-)
 create mode 100644 include/qemu/qht.h
 create mode 100644 util/qht.c

diff --git a/include/qemu/qht.h b/include/qemu/qht.h
new file mode 100644
index 0000000..63244e4
--- /dev/null
+++ b/include/qemu/qht.h
@@ -0,0 +1,150 @@
+/*
+ * Copyright (C) 2016, Emilio G. Cota <cota@braap.org>
+ *
+ * License: GNU GPL, version 2 or later.
+ *   See the COPYING file in the top-level directory.
+ */
+#ifndef QHT_H
+#define QHT_H
+
+#include <stdbool.h>
+#include <assert.h>
+#include <stdint.h>
+#include <stdio.h>
+
+#include "qemu/osdep.h"
+#include "qemu-common.h"
+#include "qemu/spinlock.h"
+#include "qemu/seqlock.h"
+#include "qemu/rcu.h"
+
+#if HOST_LONG_BITS == 32
+#define QHT_BUCKET_ENTRIES 6
+#else /* 64-bit */
+#define QHT_BUCKET_ENTRIES 4
+#endif
+
+struct qht_bucket {
+    QemuSpinLock lock;
+    QemuSeqLock sequence;
+    uint32_t hashes[QHT_BUCKET_ENTRIES];
+    void *pointers[QHT_BUCKET_ENTRIES];
+    struct qht_bucket *next;
+} QEMU_CACHELINE_ALIGNED;
+
+struct qht_map {
+    struct qht_bucket *buckets;
+    uint64_t n;
+    uint64_t n_items;
+    uint64_t n_items_threshold;
+    struct rcu_head rcu;
+};
+
+struct qht {
+    struct qht_map *map;
+    unsigned int mode;
+};
+
+typedef bool (*qht_lookup_func_t)(const void *obj, const void *userp);
+typedef void (*qht_iter_func_t)(struct qht *ht, void *p, uint32_t h, void *up);
+
+#define QHT_MODE_MRU_LOOKUP  0x1 /* move looked-up items to head */
+#define QHT_MODE_MRU_INSERT  0x2 /* insert new elements at the head */
+#define QHT_MODE_AUTO_RESIZE 0x4 /* auto-resize when heavily loaded */
+
+void qht_init(struct qht *ht, uint64_t n_elems, unsigned int mode);
+
+/* call only when there are no readers left */
+void qht_destroy(struct qht *ht);
+
+/* call with an external lock held */
+void qht_reset(struct qht *ht);
+
+/* call with an external lock held */
+void qht_reset_size(struct qht *ht, uint64_t n_elems);
+
+/* call with an external lock held */
+void qht_insert(struct qht *ht, void *p, uint32_t hash);
+
+/* call with an external lock held */
+bool qht_remove(struct qht *ht, const void *p, uint32_t hash);
+
+/* call with an external lock held */
+void qht_iter(struct qht *ht, qht_iter_func_t func, void *userp);
+
+/* call with an external lock held */
+void qht_grow(struct qht *ht);
+
+void qht_bucket_mru(struct qht_bucket *b, struct qht_bucket *orig,
+                    const void *p, int pos);
+
+static inline
+struct qht_bucket *qht_map_to_bucket(struct qht_map *map, uint32_t hash)
+{
+    return &map->buckets[hash & (map->n - 1)];
+}
+
+static inline
+void *__qht_lookup(struct qht_bucket *b, struct qht_bucket **far_bucket,
+                    int *pos, qht_lookup_func_t func, const void *userp,
+                    uint32_t hash)
+{
+    unsigned int count = 0;
+    int i;
+
+    do {
+        for (i = 0; i < QHT_BUCKET_ENTRIES; i++) {
+            if (b->hashes[i] == hash) {
+                void *p = atomic_read(&b->pointers[i]);
+
+                /*
+                 * p could be NULL if we're racing with a writer. We could use
+                 * barriers here but for performance we only issue the ones
+                 * in the seqlock.
+                 */
+                if (likely(p && func(p, userp))) {
+                    if (unlikely(count)) {
+                        *far_bucket = b;
+                        *pos = i;
+                    }
+                    return p;
+                }
+            }
+        }
+        count++;
+        b = atomic_read(&b->next);
+        /*
+         * This barrier guarantees that we will read a properly initialized
+         * b->next; it is paired with an smp_wmb() before setting b->next.
+         */
+        smp_rmb();
+    } while (b);
+    return NULL;
+}
+
+static inline void *qht_lookup(struct qht *ht, qht_lookup_func_t func,
+                                const void *userp, uint32_t hash)
+{
+    struct qht_bucket *far_bucket = NULL;
+    struct qht_bucket *b;
+    struct qht_map *map;
+    uint32_t version;
+    int pos = 0;
+    void *ret;
+
+    map = atomic_read(&ht->map);
+    /* paired with smp_wmb() before setting ht->map */
+    smp_rmb();
+    b = qht_map_to_bucket(map, hash);
+
+    do {
+        version = seqlock_read_begin(&b->sequence);
+        ret = __qht_lookup(b, &far_bucket, &pos, func, userp, hash);
+    } while (seqlock_read_retry(&b->sequence, version));
+    if ((ht->mode & QHT_MODE_MRU_LOOKUP) && unlikely(far_bucket)) {
+        qht_bucket_mru(b, far_bucket, ret, pos);
+    }
+    return ret;
+}
+
+#endif /* QHT_H */
diff --git a/util/Makefile.objs b/util/Makefile.objs
index a8a777e..ae893dc 100644
--- a/util/Makefile.objs
+++ b/util/Makefile.objs
@@ -1,4 +1,4 @@
-util-obj-y = osdep.o cutils.o unicode.o qemu-timer-common.o
+util-obj-y = osdep.o cutils.o unicode.o qemu-timer-common.o qht.o
 util-obj-$(CONFIG_POSIX) += compatfd.o
 util-obj-$(CONFIG_POSIX) += event_notifier-posix.o
 util-obj-$(CONFIG_POSIX) += mmap-alloc.o
diff --git a/util/qht.c b/util/qht.c
new file mode 100644
index 0000000..590cd8a
--- /dev/null
+++ b/util/qht.c
@@ -0,0 +1,458 @@
+/*
+ * qht.c - QEMU Hash Table, designed to scale for read-mostly workloads.
+ *
+ * Copyright (C) 2016, Emilio G. Cota <cota@braap.org>
+ *
+ * License: GNU GPL, version 2 or later.
+ *   See the COPYING file in the top-level directory.
+ *
+ * Assumptions:
+ * - Writers and iterators must take an external lock.
+ * - A hash value of 0 is invalid.
+ * - NULL cannot be inserted as a pointer value.
+ *
+ * Features:
+ * - Optional auto-resizing: the hash table resizes up if the load surpasses
+ *   a certain threshold. Resizing is done concurrently with readers.
+ * - Optional bucket MRU promotion policy.
+ *
+ * The key structure is the bucket, which is cacheline-sized. Buckets
+ * contain a few hash values and pointers; the u32 hash values are stored in
+ * full so that resizing is fast. Having this structure instead of directly
+ * chaining items has three advantages:
+ * - Failed lookups fail fast, and touch a minimum number of cache lines.
+ * - Resizing the hash table with concurrent lookups is easy.
+ * - We can have a few Most-Recently-Used (MRU) hash-pointer pairs in the same
+ *   head bucket. This helps scalability, since MRU promotions (i.e. writes to
+ *   the bucket) become less common.
+ *
+ * For concurrent lookups we use a per-bucket seqlock; per-bucket spinlocks
+ * allow readers (lookups) to upgrade to writers and thus implement an MRU
+ * promotion policy; these MRU-induced writes do not touch the cache lines of
+ * other head buckets.
+ *
+ * Note that there are two types of buckets:
+ * 1. "head" buckets are the ones allocated in the array of buckets in qht_map.
+ * 2. all "non-head" buckets (i.e. all others) are members of a chain that
+ *    starts from a head bucket.
+ * Note that the seqlock and spinlock of a head bucket applies to all buckets
+ * chained to it; these two fields are unused in non-head buckets.
+ *
+ * Resizing is done by taking all spinlocks (so that no readers-turned-writers
+ * can race with us) and then placing all elements into a new hash table. Last,
+ * the ht->map pointer is set, and the old map is freed once no RCU readers can
+ * see it anymore.
+ *
+ * Related Work:
+ * - Idea of cacheline-sized buckets with full hashes taken from:
+ *   David, Guerraoui & Trigonakis, "Asynchronized Concurrency:
+ *   The Secret to Scaling Concurrent Search Data Structures", ASPLOS'15.
+ * - Why not RCU-based hash tables? They would allow us to get rid of the
+ *   seqlock, but resizing would take forever since RCU read critical
+ *   sections in QEMU take quite a long time.
+ *   More info on relativistic hash tables:
+ *   + Triplett, McKenney & Walpole, "Resizable, Scalable, Concurrent Hash
+ *     Tables via Relativistic Programming", USENIX ATC'11.
+ *   + Corbet, "Relativistic hash tables, part 1: Algorithms", @ lwn.net, 2014.
+ *     https://lwn.net/Articles/612021/
+ */
+#include "qemu/qht.h"
+#include "qemu/atomic.h"
+
+static inline uint64_t qht_elems_to_buckets(uint64_t n_elems)
+{
+    return pow2ceil(n_elems / QHT_BUCKET_ENTRIES);
+}
+
+static inline void qht_head_init(struct qht_bucket *b)
+{
+    memset(b, 0, sizeof(*b));
+    qemu_spinlock_init(&b->lock);
+    seqlock_init(&b->sequence);
+}
+
+static inline void qht_chain_destroy(struct qht_bucket *head)
+{
+    struct qht_bucket *curr = head->next;
+    struct qht_bucket *prev;
+
+    while (curr) {
+        prev = curr;
+        curr = curr->next;
+        qemu_vfree(prev);
+    }
+}
+
+/* pass only an orphan map */
+static void qht_map_destroy(struct qht_map *map)
+{
+    uint64_t i;
+
+    for (i = 0; i < map->n; i++) {
+        qht_chain_destroy(&map->buckets[i]);
+    }
+    qemu_vfree(map->buckets);
+    g_free(map);
+}
+
+static void qht_map_reclaim(struct rcu_head *rcu)
+{
+    struct qht_map *map = container_of(rcu, struct qht_map, rcu);
+
+    qht_map_destroy(map);
+}
+
+static struct qht_map *qht_map_create(uint64_t n)
+{
+    struct qht_map *map;
+    uint64_t i;
+
+    assert(n <= (1ULL << 32)); /* we're using a 32-bit hash func */
+    map = g_malloc(sizeof(*map));
+    map->n = n;
+    map->n_items = 0;
+    map->n_items_threshold = n * QHT_BUCKET_ENTRIES / 2;
+    map->buckets = qemu_memalign(QEMU_CACHELINE, sizeof(*map->buckets) * n);
+    for (i = 0; i < n; i++) {
+        qht_head_init(&map->buckets[i]);
+    }
+    return map;
+}
+
+static inline void qht_publish(struct qht *ht, struct qht_map *new)
+{
+    /* Readers should see a properly initialized map; pair with smp_rmb() */
+    smp_wmb();
+    atomic_set(&ht->map, new);
+}
+
+void qht_init(struct qht *ht, uint64_t n_elems, unsigned int mode)
+{
+    struct qht_map *map;
+    uint64_t n = qht_elems_to_buckets(n_elems);
+
+    map = qht_map_create(n);
+    ht->mode = mode;
+    qht_publish(ht, map);
+}
+
+/* call only when there are no readers left */
+void qht_destroy(struct qht *ht)
+{
+    qht_map_destroy(ht->map);
+    memset(ht, 0, sizeof(*ht));
+}
+
+static void __qht_bucket_reset(struct qht_bucket *b)
+{
+    int i;
+
+    do {
+        for (i = 0; i < QHT_BUCKET_ENTRIES; i++) {
+            atomic_set(&b->hashes[i], 0);
+            atomic_set(&b->pointers[i], NULL);
+        }
+        b = b->next;
+    } while (b);
+}
+
+static void qht_bucket_reset(struct qht_bucket *b)
+{
+    qemu_spinlock_lock(&b->lock);
+    seqlock_write_begin(&b->sequence);
+    __qht_bucket_reset(b);
+    seqlock_write_end(&b->sequence);
+    qemu_spinlock_unlock(&b->lock);
+}
+
+/* call with an external lock held */
+void qht_reset(struct qht *ht)
+{
+    struct qht_map *map = ht->map;
+    uint64_t i;
+
+    for (i = 0; i < map->n; i++) {
+        qht_bucket_reset(&map->buckets[i]);
+    }
+}
+
+/* call with an external lock held */
+void qht_reset_size(struct qht *ht, uint64_t n_elems)
+{
+    struct qht_map *old = ht->map;
+
+    qht_reset(ht);
+    if (old->n == qht_elems_to_buckets(n_elems)) {
+        return;
+    }
+    qht_init(ht, n_elems, ht->mode);
+    call_rcu1(&old->rcu, qht_map_reclaim);
+}
+
+/* Can only be called when at least orig is the 3rd link i.e. head->2nd->orig */
+static void
+__qht_move_bucket_to_front(struct qht_bucket *head, struct qht_bucket *orig)
+{
+    struct qht_bucket *b = head->next;
+
+    for (;;) {
+        if (b->next == orig) {
+            atomic_set(&b->next, orig->next);
+            atomic_set(&orig->next, head->next);
+            atomic_set(&head->next, orig);
+            return;
+        }
+        b = b->next;
+    }
+}
+
+static inline void __qht_bucket_mru_head(struct qht_bucket *b, int pos)
+{
+    uint32_t orig_hash = b->hashes[pos];
+    void *orig_p = b->pointers[pos];
+    int i;
+
+    for (i = 0; i < pos; i++) {
+        atomic_set(&b->hashes[i + 1], b->hashes[i]);
+        atomic_set(&b->pointers[i + 1], b->pointers[i]);
+    }
+    atomic_set(&b->hashes[0], orig_hash);
+    atomic_set(&b->pointers[0], orig_p);
+}
+
+
+/* call with head->lock held */
+static inline void
+__qht_bucket_mru(struct qht_bucket *head, struct qht_bucket *orig, int pos)
+{
+    uint32_t *dest_hash;
+    void **dest_p;
+    void *p;
+    uint32_t hash;
+    int i;
+
+    if (head == orig) {
+        return __qht_bucket_mru_head(head, pos);
+    }
+    for (i = 0; i < QHT_BUCKET_ENTRIES; i++) {
+        if (i == QHT_BUCKET_ENTRIES - 1) {
+            dest_hash = &orig->hashes[pos];
+            dest_p = &orig->pointers[pos];
+        } else {
+            dest_hash = &head->hashes[i + 1];
+            dest_p = &head->pointers[i + 1];
+        }
+        hash = *dest_hash;
+        p = *dest_p;
+
+        atomic_set(dest_hash, head->hashes[i]);
+        atomic_set(dest_p, head->pointers[i]);
+
+        atomic_set(&head->hashes[i], hash);
+        atomic_set(&head->pointers[i], p);
+    }
+    if (head->next != orig) {
+        __qht_move_bucket_to_front(head, orig);
+    }
+}
+
+void qht_bucket_mru(struct qht_bucket *head, struct qht_bucket *orig,
+                    const void *p, int pos)
+{
+    qemu_spinlock_lock(&head->lock);
+    if (unlikely(orig->pointers[pos] != p)) {
+        /* while we acquired the lock, the bucket was updated, so bail out */
+        goto out;
+    }
+    seqlock_write_begin(&head->sequence);
+    __qht_bucket_mru(head, orig, pos);
+    seqlock_write_end(&head->sequence);
+ out:
+    qemu_spinlock_unlock(&head->lock);
+}
+
+/* call with b->lock held */
+static void __qht_insert(struct qht *ht, struct qht_map *map,
+                         struct qht_bucket *b, void *p, uint32_t hash)
+{
+    struct qht_bucket *head = b;
+    struct qht_bucket *prev = NULL;
+    struct qht_bucket *new = NULL;
+    unsigned int count = 0;
+    int i;
+
+    for (;;) {
+        if (b == NULL) {
+            b = qemu_memalign(QEMU_CACHELINE, sizeof(*b));
+            memset(b, 0, sizeof(*b));
+            new = b;
+        }
+        for (i = 0; i < QHT_BUCKET_ENTRIES; i++) {
+            if (b->hashes[i]) {
+                continue;
+            }
+            /* found an empty key: acquire the seqlock and write */
+            seqlock_write_begin(&head->sequence);
+            if (new) {
+                /*
+                 * This barrier is paired with smp_rmb() after reading
+                 * b->next when not holding b->lock.
+                 */
+                smp_wmb();
+                atomic_set(&prev->next, b);
+            }
+            atomic_set(&b->hashes[i], hash);
+            atomic_set(&b->pointers[i], p);
+            if ((ht->mode & QHT_MODE_MRU_INSERT) && count) {
+                __qht_bucket_mru(head, b, i);
+            }
+            seqlock_write_end(&head->sequence);
+            map->n_items++;
+            return;
+        }
+        prev = b;
+        b = b->next;
+        count++;
+    }
+}
+
+/* call with an external lock held */
+void qht_insert(struct qht *ht, void *p, uint32_t hash)
+{
+    struct qht_map *map = ht->map;
+    struct qht_bucket *b = qht_map_to_bucket(map, hash);
+
+    qemu_spinlock_lock(&b->lock);
+    __qht_insert(ht, map, b, p, hash);
+    qemu_spinlock_unlock(&b->lock);
+
+    if ((ht->mode & QHT_MODE_AUTO_RESIZE) &&
+        unlikely(map->n_items > map->n_items_threshold)) {
+        qht_grow(ht);
+    }
+}
+
+/* call with b->lock held */
+static inline bool __qht_remove(struct qht_map *map, struct qht_bucket *b,
+                                const void *p, uint32_t hash)
+{
+    int i;
+
+    do {
+        for (i = 0; i < QHT_BUCKET_ENTRIES; i++) {
+            if (b->hashes[i] == hash && b->pointers[i] == p) {
+                atomic_set(&b->hashes[i], 0);
+                atomic_set(&b->pointers[i], NULL);
+                map->n_items--;
+                return true;
+            }
+        }
+        b = b->next;
+    } while (b);
+    return false;
+}
+
+/* call with an external lock held */
+bool qht_remove(struct qht *ht, const void *p, uint32_t hash)
+{
+    struct qht_map *map = ht->map;
+    struct qht_bucket *b = qht_map_to_bucket(map, hash);
+    bool ret;
+
+    qemu_spinlock_lock(&b->lock);
+    seqlock_write_begin(&b->sequence);
+    ret = __qht_remove(map, b, p, hash);
+    seqlock_write_end(&b->sequence);
+    qemu_spinlock_unlock(&b->lock);
+    return ret;
+}
+
+/*
+ * acquire all spinlocks from a map, so that writers cannot race with
+ * readers-turned-writers.
+ */
+static void qht_lock(struct qht *ht)
+{
+    struct qht_map *map = ht->map;
+    uint64_t i;
+
+    /* if readers cannot upgrade, do nothing */
+    if (!(ht->mode & QHT_MODE_MRU_LOOKUP)) {
+        return;
+    }
+    for (i = 0; i < map->n; i++) {
+        struct qht_bucket *b = &map->buckets[i];
+
+        qemu_spinlock_lock(&b->lock);
+    }
+}
+
+static void qht_unlock(struct qht *ht)
+{
+    struct qht_map *map = ht->map;
+    uint64_t i;
+
+    if (!(ht->mode & QHT_MODE_MRU_LOOKUP)) {
+        return;
+    }
+    for (i = 0; i < map->n; i++) {
+        struct qht_bucket *b = &map->buckets[i];
+
+        qemu_spinlock_unlock(&b->lock);
+    }
+}
+
+/* external lock + all of the map's locks held (if !MRU_LOOKUP) */
+static inline void __qht_map_iter(struct qht *ht, struct qht_map *map,
+                                  qht_iter_func_t func, void *userp)
+{
+    uint64_t i;
+
+    for (i = 0; i < map->n; i++) {
+        struct qht_bucket *b = &map->buckets[i];
+        int j;
+
+        do {
+            for (j = 0; j < QHT_BUCKET_ENTRIES; j++) {
+                if (b->hashes[j]) {
+                    func(ht, b->pointers[j], b->hashes[j], userp);
+                }
+            }
+            b = b->next;
+        } while (b);
+    }
+}
+
+/* call with an external lock held */
+void qht_iter(struct qht *ht, qht_iter_func_t func, void *userp)
+{
+    qht_lock(ht);
+    __qht_map_iter(ht, ht->map, func, userp);
+    qht_unlock(ht);
+}
+
+static void qht_map_copy(struct qht *ht, void *p, uint32_t hash, void *userp)
+{
+    struct qht_map *new = userp;
+    struct qht_bucket *b = qht_map_to_bucket(new, hash);
+
+    /* no need to acquire b->lock because no thread has seen this map yet */
+    __qht_insert(ht, new, b, p, hash);
+}
+
+/* call with an external lock held */
+void qht_grow(struct qht *ht)
+{
+    struct qht_map *old = ht->map;
+    struct qht_map *new;
+    uint64_t n = old->n * 2;
+
+    if (unlikely(n > (1ULL << 32))) {
+        return;
+    }
+    new = qht_map_create(n);
+    qht_iter(ht, qht_map_copy, new);
+
+    qht_publish(ht, new);
+    call_rcu1(&old->rcu, qht_map_reclaim);
+}
-- 
2.5.0


* [Qemu-devel] [PATCH 09/10] qht: add test program
  2016-04-05  5:30 [Qemu-devel] [PATCH 00/10] tb hash improvements Emilio G. Cota
                   ` (7 preceding siblings ...)
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 08/10] qht: QEMU's fast, resizable and scalable Hash Table Emilio G. Cota
@ 2016-04-05  5:30 ` Emilio G. Cota
  2016-04-08 10:45   ` Alex Bennée
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 10/10] tb hash: track translated blocks with qht Emilio G. Cota
                   ` (2 subsequent siblings)
  11 siblings, 1 reply; 62+ messages in thread
From: Emilio G. Cota @ 2016-04-05  5:30 UTC (permalink / raw)
  To: QEMU Developers, MTTCG Devel
  Cc: Peter Maydell, Peter Crosthwaite, Sergey Fedorov, Paolo Bonzini,
	Alex Bennée, Richard Henderson

Signed-off-by: Emilio G. Cota <cota@braap.org>
---
 tests/.gitignore |   1 +
 tests/Makefile   |   6 +++-
 tests/test-qht.c | 100 +++++++++++++++++++++++++++++++++++++++++++++++++++++++
 3 files changed, 106 insertions(+), 1 deletion(-)
 create mode 100644 tests/test-qht.c

diff --git a/tests/.gitignore b/tests/.gitignore
index b7bf13e..d6d0700 100644
--- a/tests/.gitignore
+++ b/tests/.gitignore
@@ -47,6 +47,7 @@ test-qapi-visit.[ch]
 test-qdev-global-props
 test-qemu-opts
 test-qga
+test-qht
 test-qmp-commands
 test-qmp-commands.h
 test-qmp-event
diff --git a/tests/Makefile b/tests/Makefile
index 45b9048..b5a99d6 100644
--- a/tests/Makefile
+++ b/tests/Makefile
@@ -68,6 +68,8 @@ check-unit-y += tests/rcutorture$(EXESUF)
 gcov-files-rcutorture-y = util/rcu.c
 check-unit-y += tests/test-rcu-list$(EXESUF)
 gcov-files-test-rcu-list-y = util/rcu.c
+check-unit-y += tests/test-qht$(EXESUF)
+gcov-files-test-qht-y = util/qht.c
 check-unit-y += tests/test-bitops$(EXESUF)
 check-unit-$(CONFIG_HAS_GLIB_SUBPROCESS_TESTS) += tests/test-qdev-global-props$(EXESUF)
 check-unit-y += tests/check-qom-interface$(EXESUF)
@@ -387,7 +389,8 @@ test-obj-y = tests/check-qint.o tests/check-qstring.o tests/check-qdict.o \
 	tests/test-qmp-commands.o tests/test-visitor-serialization.o \
 	tests/test-x86-cpuid.o tests/test-mul64.o tests/test-int128.o \
 	tests/test-opts-visitor.o tests/test-qmp-event.o \
-	tests/rcutorture.o tests/test-rcu-list.o
+	tests/rcutorture.o tests/test-rcu-list.o \
+	tests/test-qht.o
 
 $(test-obj-y): QEMU_INCLUDES += -Itests
 QEMU_CFLAGS += -I$(SRC_PATH)/tests
@@ -425,6 +428,7 @@ tests/test-cutils$(EXESUF): tests/test-cutils.o util/cutils.o
 tests/test-int128$(EXESUF): tests/test-int128.o
 tests/rcutorture$(EXESUF): tests/rcutorture.o $(test-util-obj-y)
 tests/test-rcu-list$(EXESUF): tests/test-rcu-list.o $(test-util-obj-y)
+tests/test-qht$(EXESUF): tests/test-qht.o $(test-util-obj-y)
 
 tests/test-qdev-global-props$(EXESUF): tests/test-qdev-global-props.o \
 	hw/core/qdev.o hw/core/qdev-properties.o hw/core/hotplug.o\
diff --git a/tests/test-qht.c b/tests/test-qht.c
new file mode 100644
index 0000000..f553fbc
--- /dev/null
+++ b/tests/test-qht.c
@@ -0,0 +1,100 @@
+#include "qemu/osdep.h"
+#include "qemu/xxhash.h"
+#include "qemu/qht.h"
+
+#define N 5000
+#define SEED 1
+
+static struct qht ht;
+static int32_t arr[N];
+
+static bool is_equal(const void *obj, const void *userp)
+{
+    const int32_t *a = obj;
+    const int32_t *b = userp;
+
+    return *a == *b;
+}
+
+static void insert(int a, int b)
+{
+    int i;
+
+    for (i = a; i < b; i++) {
+        uint32_t hash;
+
+        arr[i] = i;
+        hash = qemu_xxh32((uint32_t *)&arr[i], 1, SEED);
+
+        qht_insert(&ht, &arr[i], hash);
+    }
+}
+
+static void rm(int init, int end)
+{
+    int i;
+
+    for (i = init; i < end; i++) {
+        uint32_t hash;
+
+        hash = qemu_xxh32((uint32_t *)&arr[i], 1, SEED);
+        assert(qht_remove(&ht, &arr[i], hash));
+    }
+}
+
+static void check(int a, int b, bool expected)
+{
+    int i;
+
+    for (i = a; i < b; i++) {
+        void *p;
+        uint32_t hash;
+        int32_t val;
+
+        val = i;
+        hash = qemu_xxh32((uint32_t *)&val, 1, SEED);
+        p = qht_lookup(&ht, is_equal, &val, hash);
+        assert(!!p == expected);
+    }
+}
+
+static void count_func(struct qht *ht, void *p, uint32_t hash, void *userp)
+{
+    unsigned int *curr = userp;
+
+    (*curr)++;
+}
+
+static void iter_check(unsigned int count)
+{
+    unsigned int curr = 0;
+
+    qht_iter(&ht, count_func, &curr);
+    assert(curr == count);
+}
+
+static void qht_test(unsigned int mode)
+{
+    qht_init(&ht, 0, mode);
+
+    insert(0, N);
+    check(0, N, true);
+    check(-N, -1, false);
+    iter_check(N);
+    rm(1, 2);
+    qht_reset_size(&ht, 0);
+    check(0, N, false);
+
+    qht_destroy(&ht);
+}
+
+int main(int argc, char *argv[])
+{
+    qht_test(0);
+    qht_test(QHT_MODE_MRU_LOOKUP);
+    qht_test(QHT_MODE_MRU_LOOKUP | QHT_MODE_MRU_INSERT);
+    qht_test(QHT_MODE_MRU_LOOKUP | QHT_MODE_MRU_INSERT | QHT_MODE_AUTO_RESIZE);
+    qht_test(QHT_MODE_AUTO_RESIZE | QHT_MODE_MRU_INSERT);
+    qht_test(QHT_MODE_MRU_LOOKUP | QHT_MODE_MRU_INSERT);
+    return 0;
+}
-- 
2.5.0


* [Qemu-devel] [PATCH 10/10] tb hash: track translated blocks with qht
  2016-04-05  5:30 [Qemu-devel] [PATCH 00/10] tb hash improvements Emilio G. Cota
                   ` (8 preceding siblings ...)
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 09/10] qht: add test program Emilio G. Cota
@ 2016-04-05  5:30 ` Emilio G. Cota
  2016-04-08 12:39   ` Alex Bennée
  2016-04-05  8:47 ` [Qemu-devel] [PATCH 00/10] tb hash improvements Alex Bennée
  2016-04-05  9:01 ` Paolo Bonzini
  11 siblings, 1 reply; 62+ messages in thread
From: Emilio G. Cota @ 2016-04-05  5:30 UTC (permalink / raw)
  To: QEMU Developers, MTTCG Devel
  Cc: Peter Maydell, Peter Crosthwaite, Sergey Fedorov, Paolo Bonzini,
	Alex Bennée, Richard Henderson

Having a fixed-size hash table for keeping track of all translation blocks
is suboptimal: some workloads are just too big or too small to get maximum
performance from the hash table. The MRU promotion policy helps improve
performance when the hash table is a little undersized, but it cannot
make up for severely undersized hash tables.

Furthermore, frequent MRU promotions result in writes that are a scalability
bottleneck. For scalability, lookups should only perform reads, not writes.
This is not a big deal for now, but it will become one once MTTCG matures.

The appended patch fixes these issues by using qht as the implementation of
the TB hash table. This solution is superior to the other alternatives
considered, namely:

- master: implementation in QEMU before this patchset
- xxhash: before this patch, i.e. fixed buckets + xxhash hashing + MRU.
- xxhash-rcu: fixed buckets + xxhash + RCU list + MRU.
              MRU is implemented here by adding an intermediate struct
              that contains the u32 hash and a pointer to the TB; this
              allows us, on an MRU promotion, to copy said struct (that is not
              at the head), and put this new copy at the head. After a grace
              period, the original non-head struct can be eliminated, and
              after another grace period, freed.
- qht-fixed-nomru: fixed buckets + xxhash + qht without auto-resize +
                   no MRU for lookups; MRU for inserts.
The appended solution is the following:
- qht-dyn-nomru: dynamic number of buckets + xxhash + qht w/ auto-resize +
                 no MRU for lookups; MRU for inserts.

The plots below compare the considered solutions. The Y axis shows the
boot time (in seconds) of a debian jessie image with arm-softmmu; the X axis
sweeps the number of buckets (or initial number of buckets for qht-autoresize).
The plots in PNG format (and with errorbars) can be seen here:
  http://imgur.com/a/Awgnq

Each test runs 5 times, and the entire QEMU process is pinned to a
single core for repeatability of results.

                            Host: Intel Xeon E5-2690

  28 ++------------+-------------+-------------+-------------+------------++
     A*****        +             +             +             master **A*** +
  27 ++    *                                                 xxhash ##B###++
     |      A******A******                               xxhash-rcu $$C$$$ |
  26 C$$                  A******A******            qht-fixed-nomru*%%D%%%++
     D%%$$                              A******A******A*qht-dyn-mru A*E****A
  25 ++ %%$$                                          qht-dyn-nomru &&F&&&++
     B#####%                                                               |
  24 ++    #C$$$$$                                                        ++
     |      B###  $                                                        |
     |          ## C$$$$$$                                                 |
  23 ++           #       C$$$$$$                                         ++
     |             B######       C$$$$$$                                %%%D
  22 ++                  %B######       C$$$$$$C$$$$$$C$$$$$$C$$$$$$C$$$$$$C
     |                    D%%%%%%B######      @E@@@@@@    %%%D%%%@@@E@@@@@@E
  21 E@@@@@@E@@@@@@F&&&@@@E@@@&&&D%%%%%%B######B######B######B######B######B
     +             E@@@   F&&&   +      E@     +      F&&&   +             +
  20 ++------------+-------------+-------------+-------------+------------++
     14            16            18            20            22            24
                             log2 number of buckets

                                 Host: Intel i7-4790K

  14.5 ++------------+------------+-------------+------------+------------++
       A**           +            +             +            master **A*** +
    14 ++ **                                                 xxhash ##B###++
  13.5 ++   **                                           xxhash-rcu $$C$$$++
       |                                            qht-fixed-nomru %%D%%% |
    13 ++     A******                                   qht-dyn-mru @@E@@@++
       |             A*****A******A******             qht-dyn-nomru &&F&&& |
  12.5 C$$                               A******A******A*****A******    ***A
    12 ++ $$                                                        A***  ++
       D%%% $$                                                             |
  11.5 ++  %%                                                             ++
       B###  %C$$$$$$                                                      |
    11 ++  ## D%%%%% C$$$$$                                               ++
       |     #      %      C$$$$$$                                         |
  10.5 F&&&&&&B######D%%%%%       C$$$$$$C$$$$$$C$$$$$$C$$$$$C$$$$$$    $$$C
    10 E@@@@@@E@@@@@@B#####B######B######E@@@@@@E@@@%%%D%%%%%D%%%###B######B
       +             F&&          D%%%%%%B######B######B#####B###@@@D%%%   +
   9.5 ++------------+------------+-------------+------------+------------++
       14            16           18            20           22            24
                              log2 number of buckets

Note that the original point before this patch series is X=15 for "master";
the little sensitivity to the increased number of buckets is due to the
poor hashing function in master.

xxhash-rcu has significant overhead due to the constant churn of allocating
and deallocating intermediate structs for implementing MRU. An alternative
would be to consider failed lookups as "maybe not there", and then
acquire the external lock (tb_lock in this case) to really confirm that
there was indeed a failed lookup. This, however, would not be enough
to implement dynamic resizing--this is more complex: see
"Resizable, Scalable, Concurrent Hash Tables via Relativistic
Programming" by Triplett, McKenney and Walpole. This solution was
discarded due to the very coarse RCU read critical sections that we have
in MTTCG; resizing requires waiting for readers after every pointer update,
and resizes require many pointer updates, so this would quickly become
prohibitive.

qht-fixed-nomru shows that MRU promotion is advisable for undersized
hash tables.

However, qht-dyn-mru shows that MRU promotion is not important if the
hash table is properly sized: there is virtually no difference in
performance between qht-dyn-nomru and qht-dyn-mru.

Before this patch, we're at X=15 on "xxhash"; after this patch, we're at
X=15 @ qht-dyn-nomru. This patch thus matches the best performance that we
can achieve with optimum sizing of the hash table, while keeping the hash
table scalable for readers.

The improvement we get before and after this patch for booting debian jessie
with arm-softmmu is:

- Intel Xeon E5-2690: 10.5% less time
- Intel i7-4790K: 5.2% less time

We could get this same improvement _for this particular workload_ by
statically increasing the size of the hash table. But this would hurt
workloads that do not need a large hash table. The dynamic (upward)
resizing allows us to start small and enlarge the hash table as needed.

A quick note on downsizing: the table is resized back to 2**15 buckets
on every tb_flush; this makes sense because it is not guaranteed that the
table will reach the same number of TBs later on (e.g. most bootup code is
thrown away after boot), so it is better to grow the hash table again as
more code blocks are translated. This also avoids the complication of
having to build downsizing hysteresis logic into qht.

Signed-off-by: Emilio G. Cota <cota@braap.org>
---
 cpu-exec.c              | 82 ++++++++++++++++++++++++-----------------------
 include/exec/exec-all.h |  9 +++---
 include/exec/tb-hash.h  |  5 +--
 translate-all.c         | 84 ++++++++++++++++++++++---------------------------
 4 files changed, 85 insertions(+), 95 deletions(-)

diff --git a/cpu-exec.c b/cpu-exec.c
index 4194a4a..e948ffd 100644
--- a/cpu-exec.c
+++ b/cpu-exec.c
@@ -217,55 +217,59 @@ static void cpu_exec_nocache(CPUState *cpu, int max_cycles,
     tb_free(tb);
 }
 
+struct tb_desc {
+    target_ulong pc;
+    target_ulong cs_base;
+    uint64_t flags;
+    tb_page_addr_t phys_page1;
+    CPUArchState *env;
+};
+
+static bool tb_cmp(const void *p, const void *d)
+{
+    const TranslationBlock *tb = p;
+    const struct tb_desc *desc = d;
+
+    if (tb->pc == desc->pc &&
+        tb->page_addr[0] == desc->phys_page1 &&
+        tb->cs_base == desc->cs_base &&
+        tb->flags == desc->flags) {
+        /* check next page if needed */
+        if (tb->page_addr[1] == -1) {
+            return true;
+        } else {
+            tb_page_addr_t phys_page2;
+            target_ulong virt_page2;
+
+            virt_page2 = (desc->pc & TARGET_PAGE_MASK) + TARGET_PAGE_SIZE;
+            phys_page2 = get_page_addr_code(desc->env, virt_page2);
+            if (tb->page_addr[1] == phys_page2) {
+                return true;
+            }
+        }
+    }
+    return false;
+}
+
 static TranslationBlock *tb_find_physical(CPUState *cpu,
                                           target_ulong pc,
                                           target_ulong cs_base,
                                           uint64_t flags)
 {
-    CPUArchState *env = (CPUArchState *)cpu->env_ptr;
-    TranslationBlock *tb, **ptb1;
     unsigned int h;
-    tb_page_addr_t phys_pc, phys_page1;
-    target_ulong virt_page2;
+    tb_page_addr_t phys_pc;
+    struct tb_desc desc;
 
     tcg_ctx.tb_ctx.tb_invalidated_flag = 0;
 
-    /* find translated block using physical mappings */
-    phys_pc = get_page_addr_code(env, pc);
-    phys_page1 = phys_pc & TARGET_PAGE_MASK;
+    desc.env = (CPUArchState *)cpu->env_ptr;
+    desc.cs_base = cs_base;
+    desc.flags = flags;
+    desc.pc = pc;
+    phys_pc = get_page_addr_code(desc.env, pc);
+    desc.phys_page1 = phys_pc & TARGET_PAGE_MASK;
     h = tb_hash_func(phys_pc, pc, flags);
-    ptb1 = &tcg_ctx.tb_ctx.tb_phys_hash[h];
-    for(;;) {
-        tb = *ptb1;
-        if (!tb) {
-            return NULL;
-        }
-        if (tb->pc == pc &&
-            tb->page_addr[0] == phys_page1 &&
-            tb->cs_base == cs_base &&
-            tb->flags == flags) {
-            /* check next page if needed */
-            if (tb->page_addr[1] != -1) {
-                tb_page_addr_t phys_page2;
-
-                virt_page2 = (pc & TARGET_PAGE_MASK) +
-                    TARGET_PAGE_SIZE;
-                phys_page2 = get_page_addr_code(env, virt_page2);
-                if (tb->page_addr[1] == phys_page2) {
-                    break;
-                }
-            } else {
-                break;
-            }
-        }
-        ptb1 = &tb->phys_hash_next;
-    }
-
-    /* Move the TB to the head of the list */
-    *ptb1 = tb->phys_hash_next;
-    tb->phys_hash_next = tcg_ctx.tb_ctx.tb_phys_hash[h];
-    tcg_ctx.tb_ctx.tb_phys_hash[h] = tb;
-    return tb;
+    return qht_lookup(&tcg_ctx.tb_ctx.htable, tb_cmp, &desc, h);
 }
 
 static TranslationBlock *tb_find_slow(CPUState *cpu,
diff --git a/include/exec/exec-all.h b/include/exec/exec-all.h
index 7362095..a894817 100644
--- a/include/exec/exec-all.h
+++ b/include/exec/exec-all.h
@@ -21,6 +21,7 @@
 #define _EXEC_ALL_H_
 
 #include "qemu-common.h"
+#include "qemu/qht.h"
 
 /* allow to see translation results - the slowdown should be negligible, so we leave it */
 #define DEBUG_DISAS
@@ -211,8 +212,8 @@ static inline void tlb_flush_by_mmuidx(CPUState *cpu, ...)
 
 #define CODE_GEN_ALIGN           16 /* must be >= of the size of a icache line */
 
-#define CODE_GEN_PHYS_HASH_BITS     15
-#define CODE_GEN_PHYS_HASH_SIZE     (1 << CODE_GEN_PHYS_HASH_BITS)
+#define CODE_GEN_HTABLE_BITS     15
+#define CODE_GEN_HTABLE_SIZE     (1 << CODE_GEN_HTABLE_BITS)
 
 /* Estimated block size for TB allocation.  */
 /* ??? The following is based on a 2015 survey of x86_64 host output.
@@ -248,8 +249,6 @@ struct TranslationBlock {
 
     void *tc_ptr;    /* pointer to the translated code */
     uint8_t *tc_search;  /* pointer to search data */
-    /* next matching tb for physical address. */
-    struct TranslationBlock *phys_hash_next;
     /* original tb when cflags has CF_NOCACHE */
     struct TranslationBlock *orig_tb;
     /* first and second physical page containing code. The lower bit
@@ -280,7 +279,7 @@ typedef struct TBContext TBContext;
 struct TBContext {
 
     TranslationBlock *tbs;
-    TranslationBlock *tb_phys_hash[CODE_GEN_PHYS_HASH_SIZE];
+    struct qht htable;
     int nb_tbs;
     /* any access to the tbs or the page table must use this lock */
     QemuMutex tb_lock;
diff --git a/include/exec/tb-hash.h b/include/exec/tb-hash.h
index c4942e1..fabc936 100644
--- a/include/exec/tb-hash.h
+++ b/include/exec/tb-hash.h
@@ -20,7 +20,6 @@
 #ifndef EXEC_TB_HASH
 #define EXEC_TB_HASH
 
-#include "exec/exec-all.h"
 #include "qemu/xxhash.h"
 
 /* Only the bottom TB_JMP_PAGE_BITS of the jump cache hash bits vary for
@@ -54,13 +53,11 @@ uint32_t tb_hash_func(tb_page_addr_t phys_pc, target_ulong pc, uint64_t flags)
         target_ulong pc;
         uint64_t flags;
     } QEMU_PACKED k;
-    unsigned int hash;
 
     k.phys_pc = phys_pc;
     k.pc = pc;
     k.flags = flags;
-    hash = qemu_xxh32((uint32_t *)&k, sizeof(k) / sizeof(uint32_t), 1);
-    return hash & (CODE_GEN_PHYS_HASH_SIZE - 1);
+    return qemu_xxh32((uint32_t *)&k, sizeof(k) / sizeof(uint32_t), 1);
 }
 
 #endif
diff --git a/translate-all.c b/translate-all.c
index 8e7f9a7..f0d7f1a 100644
--- a/translate-all.c
+++ b/translate-all.c
@@ -735,6 +735,13 @@ static inline void code_gen_alloc(size_t tb_size)
     qemu_mutex_init(&tcg_ctx.tb_ctx.tb_lock);
 }
 
+static void tb_htable_init(void)
+{
+    unsigned int mode = QHT_MODE_AUTO_RESIZE | QHT_MODE_MRU_INSERT;
+
+    qht_init(&tcg_ctx.tb_ctx.htable, CODE_GEN_HTABLE_SIZE, mode);
+}
+
 /* Must be called before using the QEMU cpus. 'tb_size' is the size
    (in bytes) allocated to the translation buffer. Zero means default
    size. */
@@ -742,6 +749,7 @@ void tcg_exec_init(unsigned long tb_size)
 {
     cpu_gen_init();
     page_init();
+    tb_htable_init();
     code_gen_alloc(tb_size);
 #if defined(CONFIG_SOFTMMU)
     /* There's no guest base to take into account, so go ahead and
@@ -843,7 +851,7 @@ void tb_flush(CPUState *cpu)
         memset(cpu->tb_jmp_cache, 0, sizeof(cpu->tb_jmp_cache));
     }
 
-    memset(tcg_ctx.tb_ctx.tb_phys_hash, 0, sizeof(tcg_ctx.tb_ctx.tb_phys_hash));
+    qht_reset_size(&tcg_ctx.tb_ctx.htable, CODE_GEN_HTABLE_SIZE);
     page_flush_tb();
 
     tcg_ctx.code_gen_ptr = tcg_ctx.code_gen_buffer;
@@ -854,60 +862,45 @@ void tb_flush(CPUState *cpu)
 
 #ifdef DEBUG_TB_CHECK
 
-static void tb_invalidate_check(target_ulong address)
+static void
+__tb_invalidate_check(struct qht *ht, void *p, uint32_t hash, void *userp)
 {
-    TranslationBlock *tb;
-    int i;
+    TranslationBlock *tb = p;
+    target_ulong addr = *(target_ulong *)userp;
 
-    address &= TARGET_PAGE_MASK;
-    for (i = 0; i < CODE_GEN_PHYS_HASH_SIZE; i++) {
-        for (tb = tcg_ctx.tb_ctx.tb_phys_hash[i]; tb != NULL;
-             tb = tb->phys_hash_next) {
-            if (!(address + TARGET_PAGE_SIZE <= tb->pc ||
-                  address >= tb->pc + tb->size)) {
-                printf("ERROR invalidate: address=" TARGET_FMT_lx
-                       " PC=%08lx size=%04x\n",
-                       address, (long)tb->pc, tb->size);
-            }
-        }
+    if (!(addr + TARGET_PAGE_SIZE <= tb->pc || addr >= tb->pc + tb->size)) {
+        printf("ERROR invalidate: address=" TARGET_FMT_lx
+               " PC=%08lx size=%04x\n", addr, (long)tb->pc, tb->size);
     }
 }
 
-/* verify that all the pages have correct rights for code */
-static void tb_page_check(void)
+static void tb_invalidate_check(target_ulong address)
 {
-    TranslationBlock *tb;
-    int i, flags1, flags2;
-
-    for (i = 0; i < CODE_GEN_PHYS_HASH_SIZE; i++) {
-        for (tb = tcg_ctx.tb_ctx.tb_phys_hash[i]; tb != NULL;
-                tb = tb->phys_hash_next) {
-            flags1 = page_get_flags(tb->pc);
-            flags2 = page_get_flags(tb->pc + tb->size - 1);
-            if ((flags1 & PAGE_WRITE) || (flags2 & PAGE_WRITE)) {
-                printf("ERROR page flags: PC=%08lx size=%04x f1=%x f2=%x\n",
-                       (long)tb->pc, tb->size, flags1, flags2);
-            }
-        }
-    }
+    address &= TARGET_PAGE_MASK;
+    qht_iter(&tcg_ctx.tb_ctx.htable, __tb_invalidate_check, &address);
 }
 
-#endif
-
-static inline void tb_hash_remove(TranslationBlock **ptb, TranslationBlock *tb)
+static void __tb_page_check(struct qht *ht, void *p, uint32_t hash, void *userp)
 {
-    TranslationBlock *tb1;
+    TranslationBlock *tb = p;
+    int flags1, flags2;
 
-    for (;;) {
-        tb1 = *ptb;
-        if (tb1 == tb) {
-            *ptb = tb1->phys_hash_next;
-            break;
-        }
-        ptb = &tb1->phys_hash_next;
+    flags1 = page_get_flags(tb->pc);
+    flags2 = page_get_flags(tb->pc + tb->size - 1);
+    if ((flags1 & PAGE_WRITE) || (flags2 & PAGE_WRITE)) {
+        printf("ERROR page flags: PC=%08lx size=%04x f1=%x f2=%x\n",
+               (long)tb->pc, tb->size, flags1, flags2);
     }
 }
 
+/* verify that all the pages have correct rights for code */
+static void tb_page_check(void)
+{
+    qht_iter(&tcg_ctx.tb_ctx.htable, __tb_page_check, NULL);
+}
+
+#endif
+
 static inline void tb_page_remove(TranslationBlock **ptb, TranslationBlock *tb)
 {
     TranslationBlock *tb1;
@@ -973,7 +966,7 @@ void tb_phys_invalidate(TranslationBlock *tb, tb_page_addr_t page_addr)
     /* remove the TB from the hash list */
     phys_pc = tb->page_addr[0] + (tb->pc & ~TARGET_PAGE_MASK);
     h = tb_hash_func(phys_pc, tb->pc, tb->flags);
-    tb_hash_remove(&tcg_ctx.tb_ctx.tb_phys_hash[h], tb);
+    qht_remove(&tcg_ctx.tb_ctx.htable, tb, h);
 
     /* remove the TB from the page list */
     if (tb->page_addr[0] != page_addr) {
@@ -1472,13 +1465,10 @@ static void tb_link_page(TranslationBlock *tb, tb_page_addr_t phys_pc,
                          tb_page_addr_t phys_page2)
 {
     unsigned int h;
-    TranslationBlock **ptb;
 
     /* add in the hash table */
     h = tb_hash_func(phys_pc, tb->pc, tb->flags);
-    ptb = &tcg_ctx.tb_ctx.tb_phys_hash[h];
-    tb->phys_hash_next = *ptb;
-    *ptb = tb;
+    qht_insert(&tcg_ctx.tb_ctx.htable, tb, h);
 
     /* add in the page list */
     tb_alloc_page(tb, 0, phys_pc & TARGET_PAGE_MASK);
-- 
2.5.0

^ permalink raw reply related	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 02/10] compiler.h: add QEMU_CACHELINE + QEMU_ALIGN() + QEMU_CACHELINE_ALIGNED
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 02/10] compiler.h: add QEMU_CACHELINE + QEMU_ALIGN() + QEMU_CACHELINE_ALIGNED Emilio G. Cota
@ 2016-04-05  7:57   ` Peter Maydell
  2016-04-05 17:24     ` Emilio G. Cota
  2016-04-05  8:49   ` Paolo Bonzini
  2016-04-05 12:57   ` Lluís Vilanova
  2 siblings, 1 reply; 62+ messages in thread
From: Peter Maydell @ 2016-04-05  7:57 UTC (permalink / raw)
  To: Emilio G. Cota
  Cc: MTTCG Devel, Peter Crosthwaite, QEMU Developers, Sergey Fedorov,
	Paolo Bonzini, Alex Bennée, Richard Henderson

On 5 April 2016 at 06:30, Emilio G. Cota <cota@braap.org> wrote:
> I'm assuming windows compilers don't support this attribute.

Our windows compiler is gcc, so I would expect it to support
attribute aligned, and a quick grep through the sources shows
we already use it in several places.

> +#define QEMU_CACHELINE (64)

Why 64? Does anything bad happen if the host's cache line
size turns out to be greater than the value here ?

thanks
-- PMM

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 00/10] tb hash improvements
  2016-04-05  5:30 [Qemu-devel] [PATCH 00/10] tb hash improvements Emilio G. Cota
                   ` (9 preceding siblings ...)
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 10/10] tb hash: track translated blocks with qht Emilio G. Cota
@ 2016-04-05  8:47 ` Alex Bennée
  2016-04-05  9:01 ` Paolo Bonzini
  11 siblings, 0 replies; 62+ messages in thread
From: Alex Bennée @ 2016-04-05  8:47 UTC (permalink / raw)
  To: Emilio G. Cota
  Cc: MTTCG Devel, Peter Maydell, Peter Crosthwaite, QEMU Developers,
	Sergey Fedorov, Paolo Bonzini, Richard Henderson


Emilio G. Cota <cota@braap.org> writes:

> This patchset is derived from my ongoing work on MTTCG, but does
> not depend on it and brings improvements that we can already
> benefit from. It applies cleanly on the current master and
> is checkpatch-clean.
>
> The key goal is to make the TB hash table faster, and while at it,
> scalable. Tested on two different host machines, the execution time
> improvement before and after this series, when booting a debian
> jessie arm image[*] with arm-softmmu, is:
>
> - Intel Xeon E5-2690: 21.2% less time
> - Intel i7-4790K: 23.5% less time
>
> This workload is particularly sensitive to TB hash performance.
> Other workloads not as sensitive might see a slight performance
> degradation with this patchset, since the hashing + lookup
> functions take now more instructions. In any case, no significant
> slowdowns should occur.
>
> The commit logs are sometimes long because I have lots of numbers
> to share.
>
> The only bits I'm not too comfortable with in this series are patches
> 2 and 5; I don't develop on Windows so I'm shooting in the dark there.
>
> Please take a look and if possible, test on workloads you care about!

Excellent stuff, will have a look this week.

>
> Thanks,
>
> 		Emilio
>
> [*] taskset -c 0 arm-softmmu/qemu-system-arm -machine type=virt -nographic \
>      -smp 1 -m 4096 -netdev user,id=unet,hostfwd=tcp::2222-:22 \
>      -device virtio-net-device,netdev=unet \
>      -drive file=jessie-arm32.qcow2,id=myblock,index=0,if=none \
>      -device virtio-blk-device,drive=myblock \
>      -kernel aarch32-current-linux-kernel-only.img \
>      -append 'console=ttyAMA0 root=/dev/vda1' \
>      -name arm,debug-threads=on -smp 1 -tb-size 1024
> The image is taken from:
>   http://people.linaro.org/~alex.bennee/images/jessie-arm32.qcow2
> The image was modified to call `shutdown -h now` right after boot.
> The kernel is taken from:
>   http://people.linaro.org/~alex.bennee/images/aarch32-current-linux-kernel-only.img


--
Alex Bennée

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 01/10] translate-all: add missing fold of tb_ctx into tcg_ctx
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 01/10] translate-all: add missing fold of tb_ctx into tcg_ctx Emilio G. Cota
@ 2016-04-05  8:49   ` Paolo Bonzini
  0 siblings, 0 replies; 62+ messages in thread
From: Paolo Bonzini @ 2016-04-05  8:49 UTC (permalink / raw)
  To: Emilio G. Cota, QEMU Developers, MTTCG Devel
  Cc: Peter Maydell, Sergey Fedorov, Richard Henderson,
	Alex Bennée, Peter Crosthwaite



On 05/04/2016 07:30, Emilio G. Cota wrote:
> Since 5e5f07e08 "TCG: Move translation block variables
> to new context inside tcg_ctx: tb_ctx" on Feb 1 2013, compilation
> of usermode + TB_DEBUG_CHECK has been broken. Fix it.
> 
> Signed-off-by: Emilio G. Cota <cota@braap.org>
> ---
>  translate-all.c | 3 ++-
>  1 file changed, 2 insertions(+), 1 deletion(-)
> 
> diff --git a/translate-all.c b/translate-all.c
> index b4df1ec..8329ea6 100644
> --- a/translate-all.c
> +++ b/translate-all.c
> @@ -861,7 +861,8 @@ static void tb_invalidate_check(target_ulong address)
>  
>      address &= TARGET_PAGE_MASK;
>      for (i = 0; i < CODE_GEN_PHYS_HASH_SIZE; i++) {
> -        for (tb = tb_ctx.tb_phys_hash[i]; tb != NULL; tb = tb->phys_hash_next) {
> +        for (tb = tcg_ctx.tb_ctx.tb_phys_hash[i]; tb != NULL;
> +             tb = tb->phys_hash_next) {
>              if (!(address + TARGET_PAGE_SIZE <= tb->pc ||
>                    address >= tb->pc + tb->size)) {
>                  printf("ERROR invalidate: address=" TARGET_FMT_lx
> 

Thanks, queued for 2.6.

Paolo

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 02/10] compiler.h: add QEMU_CACHELINE + QEMU_ALIGN() + QEMU_CACHELINE_ALIGNED
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 02/10] compiler.h: add QEMU_CACHELINE + QEMU_ALIGN() + QEMU_CACHELINE_ALIGNED Emilio G. Cota
  2016-04-05  7:57   ` Peter Maydell
@ 2016-04-05  8:49   ` Paolo Bonzini
  2016-04-05 12:57   ` Lluís Vilanova
  2 siblings, 0 replies; 62+ messages in thread
From: Paolo Bonzini @ 2016-04-05  8:49 UTC (permalink / raw)
  To: Emilio G. Cota, QEMU Developers, MTTCG Devel
  Cc: Peter Maydell, Sergey Fedorov, Richard Henderson,
	Alex Bennée, Peter Crosthwaite



On 05/04/2016 07:30, Emilio G. Cota wrote:
> I'm assuming windows compilers don't support this attribute.

They actually do. :)

Paolo

> Signed-off-by: Emilio G. Cota <cota@braap.org>
> ---
>  include/qemu/compiler.h | 10 ++++++++++
>  1 file changed, 10 insertions(+)
> 
> diff --git a/include/qemu/compiler.h b/include/qemu/compiler.h
> index 8f1cc7b..fb946f1 100644
> --- a/include/qemu/compiler.h
> +++ b/include/qemu/compiler.h
> @@ -41,6 +41,16 @@
>  # define QEMU_PACKED __attribute__((packed))
>  #endif
>  
> +#define QEMU_CACHELINE (64)
> +
> +#if defined(_WIN32)
> +#define QEMU_ALIGN(B)
> +#else
> +#define QEMU_ALIGN(B) __attribute__((aligned(B)))
> +#endif
> +
> +#define QEMU_CACHELINE_ALIGNED QEMU_ALIGN(QEMU_CACHELINE)
> +
>  #ifndef glue
>  #define xglue(x, y) x ## y
>  #define glue(x, y) xglue(x, y)
> 

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 05/10] include: add spinlock wrapper
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 05/10] include: add spinlock wrapper Emilio G. Cota
@ 2016-04-05  8:51   ` Paolo Bonzini
  2016-04-06 15:51     ` Alex Bennée
  0 siblings, 1 reply; 62+ messages in thread
From: Paolo Bonzini @ 2016-04-05  8:51 UTC (permalink / raw)
  To: Emilio G. Cota, QEMU Developers, MTTCG Devel
  Cc: Peter Maydell, Sergey Fedorov, Richard Henderson,
	Alex Bennée, Peter Crosthwaite



On 05/04/2016 07:30, Emilio G. Cota wrote:
> Wrap pthread_spin on POSIX, or QemuMutex on Windows.
> 
> AFAIK there is no off-the-shelf spinlock implementation for
> Windows, so we'll just use QemuMutex.

It's much simpler to use a simple test-and-set spinlock.

GitHub is down, but this should be the link
http://github.com/bonzini/qemu/commit/e1361634 (it's in my mttcg branch).
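
Roughly (just a sketch here, not the actual commit above, and assuming the
gcc/clang __atomic builtins), something like:

#include <stdbool.h>

typedef struct QemuSpin {
    bool locked;
} QemuSpin;

static inline void qemu_spin_init(QemuSpin *spin)
{
    spin->locked = false;
}

static inline void qemu_spin_lock(QemuSpin *spin)
{
    /* spin until we observe the lock free and manage to take it */
    while (__atomic_test_and_set(&spin->locked, __ATOMIC_ACQUIRE)) {
    }
}

static inline bool qemu_spin_trylock(QemuSpin *spin)
{
    return !__atomic_test_and_set(&spin->locked, __ATOMIC_ACQUIRE);
}

static inline void qemu_spin_unlock(QemuSpin *spin)
{
    __atomic_clear(&spin->locked, __ATOMIC_RELEASE);
}

That also works on Windows, since mingw's gcc provides the same builtins.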

Paolo

> Signed-off-by: Emilio G. Cota <cota@braap.org>
> ---
>  include/qemu/spinlock-posix.h | 60 +++++++++++++++++++++++++++++++++++++++++++
>  include/qemu/spinlock-win32.h | 33 ++++++++++++++++++++++++
>  include/qemu/spinlock.h       | 10 ++++++++
>  3 files changed, 103 insertions(+)
>  create mode 100644 include/qemu/spinlock-posix.h
>  create mode 100644 include/qemu/spinlock-win32.h
>  create mode 100644 include/qemu/spinlock.h
> 
> diff --git a/include/qemu/spinlock-posix.h b/include/qemu/spinlock-posix.h
> new file mode 100644
> index 0000000..51c2c08
> --- /dev/null
> +++ b/include/qemu/spinlock-posix.h
> @@ -0,0 +1,60 @@
> +#ifndef QEMU_SPINLOCK_POSIX_H
> +#define QEMU_SPINLOCK_POSIX_H
> +
> +#include <qemu/thread.h>
> +#include <qemu/osdep.h>
> +
> +typedef pthread_spinlock_t QemuSpinLock;
> +
> +static inline void qemu_spinlock_error_exit(int err, const char *msg)
> +{
> +    fprintf(stderr, "qemu: %s: %s\n", msg, strerror(err));
> +    abort();
> +}
> +
> +static inline void qemu_spinlock_init(QemuSpinLock *lock)
> +{
> +    int rc;
> +
> +    rc = pthread_spin_init(lock, PTHREAD_PROCESS_SHARED);
> +    if (unlikely(rc)) {
> +        qemu_spinlock_error_exit(rc, __func__);
> +    }
> +}
> +
> +static inline void qemu_spinlock_destroy(QemuSpinLock *lock)
> +{
> +    int rc;
> +
> +    rc = pthread_spin_destroy(lock);
> +    if (unlikely(rc)) {
> +        qemu_spinlock_error_exit(rc, __func__);
> +    }
> +}
> +
> +static inline void qemu_spinlock_lock(QemuSpinLock *lock)
> +{
> +    int rc;
> +
> +    rc = pthread_spin_lock(lock);
> +    if (unlikely(rc)) {
> +        qemu_spinlock_error_exit(rc, __func__);
> +    }
> +}
> +
> +static inline int qemu_spinlock_trylock(QemuSpinLock *lock)
> +{
> +    return pthread_spin_trylock(lock);
> +}
> +
> +static inline void qemu_spinlock_unlock(QemuSpinLock *lock)
> +{
> +    int rc;
> +
> +    rc = pthread_spin_unlock(lock);
> +    if (unlikely(rc)) {
> +        qemu_spinlock_error_exit(rc, __func__);
> +    }
> +}
> +
> +#endif /* QEMU_SPINLOCK_POSIX_H */
> diff --git a/include/qemu/spinlock-win32.h b/include/qemu/spinlock-win32.h
> new file mode 100644
> index 0000000..5a105fb
> --- /dev/null
> +++ b/include/qemu/spinlock-win32.h
> @@ -0,0 +1,33 @@
> +#ifndef QEMU_SPINLOCK_WIN32_H
> +#define QEMU_SPINLOCK_WIN32_H
> +
> +#include <qemu/thread.h>
> +
> +typedef QemuMutex QemuSpinLock;
> +
> +static inline void qemu_spinlock_init(QemuSpinLock *lock)
> +{
> +    qemu_mutex_init(lock);
> +}
> +
> +static inline void qemu_spinlock_destroy(QemuSpinLock *lock)
> +{
> +    qemu_mutex_destroy(lock);
> +}
> +
> +static inline void qemu_spinlock_lock(QemuSpinLock *lock)
> +{
> +    qemu_mutex_lock(lock);
> +}
> +
> +static inline int qemu_spinlock_trylock(QemuSpinLock *lock)
> +{
> +    return qemu_mutex_trylock(lock);
> +}
> +
> +static inline void qemu_spinlock_unlock(QemuSpinLock *lock)
> +{
> +    qemu_mutex_unlock(lock);
> +}
> +
> +#endif /* QEMU_SPINLOCK_WIN32_H */
> diff --git a/include/qemu/spinlock.h b/include/qemu/spinlock.h
> new file mode 100644
> index 0000000..001b55e
> --- /dev/null
> +++ b/include/qemu/spinlock.h
> @@ -0,0 +1,10 @@
> +#ifndef QEMU_SPINLOCK_H
> +#define QEMU_SPINLOCK_H
> +
> +#ifdef _WIN32
> +#include "spinlock-win32.h"
> +#else
> +#include "spinlock-posix.h"
> +#endif
> +
> +#endif /* QEMU_SPINLOCK_H */
> 

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 08/10] qht: QEMU's fast, resizable and scalable Hash Table
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 08/10] qht: QEMU's fast, resizable and scalable Hash Table Emilio G. Cota
@ 2016-04-05  9:01   ` Paolo Bonzini
  2016-04-05 15:50   ` Richard Henderson
  2016-04-08 10:27   ` Alex Bennée
  2 siblings, 0 replies; 62+ messages in thread
From: Paolo Bonzini @ 2016-04-05  9:01 UTC (permalink / raw)
  To: Emilio G. Cota, QEMU Developers, MTTCG Devel
  Cc: Peter Maydell, Sergey Fedorov, Richard Henderson,
	Alex Bennée, Peter Crosthwaite



On 05/04/2016 07:30, Emilio G. Cota wrote:
> +static void qht_bucket_reset(struct qht_bucket *b)
> +{
> +    qemu_spinlock_lock(&b->lock);
> +    seqlock_write_begin(&b->sequence);
> +    __qht_bucket_reset(b);

No __ names, please use names like qht_bucket_reset_locked.
"_locked" doesn't make much sense for seqlock read-side critical
sections, I think qht_do_lookup is good enough for that case.

Otherwise, this is great stuff. :)

Paolo

> +    seqlock_write_end(&b->sequence);
> +    qemu_spinlock_unlock(&b->lock);
> +}

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 00/10] tb hash improvements
  2016-04-05  5:30 [Qemu-devel] [PATCH 00/10] tb hash improvements Emilio G. Cota
                   ` (10 preceding siblings ...)
  2016-04-05  8:47 ` [Qemu-devel] [PATCH 00/10] tb hash improvements Alex Bennée
@ 2016-04-05  9:01 ` Paolo Bonzini
  11 siblings, 0 replies; 62+ messages in thread
From: Paolo Bonzini @ 2016-04-05  9:01 UTC (permalink / raw)
  To: Emilio G. Cota, QEMU Developers, MTTCG Devel
  Cc: Peter Maydell, Sergey Fedorov, Richard Henderson,
	Alex Bennée, Peter Crosthwaite



On 05/04/2016 07:30, Emilio G. Cota wrote:
> This patchset is derived from my ongoing work on MTTCG, but does
> not depend on it and brings improvements that we can already
> benefit from. It applies cleanly on the current master and
> is checkpatch-clean.
> 
> The key goal is to make the TB hash table faster, and while at it,
> scalable. Tested on two different host machines, the execution time
> improvement before and after this series, when booting a debian
> jessie arm image[*] with arm-softmmu, is:
> 
> - Intel Xeon E5-2690: 21.2% less time
> - Intel i7-4790K: 23.5% less time
> 
> This workload is particularly sensitive to TB hash performance.
> Other workloads not as sensitive might see a slight performance
> degradation with this patchset, since the hashing + lookup
> functions take now more instructions. In any case, no significant
> slowdowns should occur.
> 
> The commit logs are sometimes long because I have lots of numbers
> to share.
> 
> The only bits I'm not too comfortable with in this series are patches
> 2 and 5; I don't develop on Windows so I'm shooting in the dark there.
> 
> Please take a look and if possible, test on workloads you care about!

That's great stuff.  It will have to wait for 2.7, but it's really good.

Paolo

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 02/10] compiler.h: add QEMU_CACHELINE + QEMU_ALIGN() + QEMU_CACHELINE_ALIGNED
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 02/10] compiler.h: add QEMU_CACHELINE + QEMU_ALIGN() + QEMU_CACHELINE_ALIGNED Emilio G. Cota
  2016-04-05  7:57   ` Peter Maydell
  2016-04-05  8:49   ` Paolo Bonzini
@ 2016-04-05 12:57   ` Lluís Vilanova
  2016-04-05 12:58     ` Peter Maydell
  2 siblings, 1 reply; 62+ messages in thread
From: Lluís Vilanova @ 2016-04-05 12:57 UTC (permalink / raw)
  To: Emilio G. Cota
  Cc: MTTCG Devel, Peter Maydell, Peter Crosthwaite, QEMU Developers,
	Paolo Bonzini, Sergey Fedorov, Alex Bennée,
	Richard Henderson

Emilio G Cota writes:

> I'm assuming windows compilers don't support this attribute.
> Signed-off-by: Emilio G. Cota <cota@braap.org>
> ---
>  include/qemu/compiler.h | 10 ++++++++++
>  1 file changed, 10 insertions(+)

> diff --git a/include/qemu/compiler.h b/include/qemu/compiler.h
> index 8f1cc7b..fb946f1 100644
> --- a/include/qemu/compiler.h
> +++ b/include/qemu/compiler.h
> @@ -41,6 +41,16 @@
>  # define QEMU_PACKED __attribute__((packed))
>  #endif
 
> +#define QEMU_CACHELINE (64)
[...]

You should make this a value taken from configure. In linux you could use
something like:

  getconf LEVEL1_DCACHE_LINESIZE

Or an equivalent program using sysconf(3) if getconf is not present.
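
An (untested) sketch of such a probe, assuming glibc's
_SC_LEVEL1_DCACHE_LINESIZE and falling back to 64 when it is unavailable:

#include <stdio.h>
#include <unistd.h>

int main(void)
{
    long sz = -1;
#ifdef _SC_LEVEL1_DCACHE_LINESIZE
    sz = sysconf(_SC_LEVEL1_DCACHE_LINESIZE);
#endif
    /* sysconf() may return -1 or 0 if the size is unknown */
    printf("%ld\n", sz > 0 ? sz : 64);
    return 0;
}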


Cheers,
  Lluis

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 02/10] compiler.h: add QEMU_CACHELINE + QEMU_ALIGN() + QEMU_CACHELINE_ALIGNED
  2016-04-05 12:57   ` Lluís Vilanova
@ 2016-04-05 12:58     ` Peter Maydell
  2016-04-05 15:29       ` Paolo Bonzini
  2016-04-05 16:23       ` Lluís Vilanova
  0 siblings, 2 replies; 62+ messages in thread
From: Peter Maydell @ 2016-04-05 12:58 UTC (permalink / raw)
  To: Emilio G. Cota, QEMU Developers, MTTCG Devel, Peter Maydell,
	Peter Crosthwaite, Sergey Fedorov, Paolo Bonzini,
	Alex Bennée, Richard Henderson

On 5 April 2016 at 13:57, Lluís Vilanova <vilanova@ac.upc.edu> wrote:
> Emilio G Cota writes:
>
>> I'm assuming windows compilers don't support this attribute.
>> Signed-off-by: Emilio G. Cota <cota@braap.org>
>> ---
>>  include/qemu/compiler.h | 10 ++++++++++
>>  1 file changed, 10 insertions(+)
>
>> diff --git a/include/qemu/compiler.h b/include/qemu/compiler.h
>> index 8f1cc7b..fb946f1 100644
>> --- a/include/qemu/compiler.h
>> +++ b/include/qemu/compiler.h
>> @@ -41,6 +41,16 @@
>>  # define QEMU_PACKED __attribute__((packed))
>>  #endif
>
>> +#define QEMU_CACHELINE (64)
> [...]
>
> You should make this a value taken from configure. In linux you could use
> something like:
>
>   getconf LEVEL1_DCACHE_LINESIZE
>
> Or an equivalent program using sysconf(3) if getconf is not present.

The cache line size at compile time is not necessarily the cache
line size at runtime...

thanks
-- PMM

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 02/10] compiler.h: add QEMU_CACHELINE + QEMU_ALIGN() + QEMU_CACHELINE_ALIGNED
  2016-04-05 12:58     ` Peter Maydell
@ 2016-04-05 15:29       ` Paolo Bonzini
  2016-04-05 16:23       ` Lluís Vilanova
  1 sibling, 0 replies; 62+ messages in thread
From: Paolo Bonzini @ 2016-04-05 15:29 UTC (permalink / raw)
  To: Peter Maydell, Emilio G. Cota, QEMU Developers, MTTCG Devel,
	Peter Crosthwaite, Sergey Fedorov, Alex Bennée,
	Richard Henderson



On 05/04/2016 14:58, Peter Maydell wrote:
>> >
>> > You should make this a value taken from configure. In linux you could use
>> > something like:
>> >
>> >   getconf LEVEL1_DCACHE_LINESIZE
>> >
>> > Or an equivalent program using sysconf(3) if getconf is not present.
> The cache line size at compile time is not necessarily the cache
> line size at runtime...

But the size of data structures cannot change at run-time.  A bigger
worry is cross-compilation.

Linux has these defaults:

	Alpha	32 or 64
	ARM	configurable, 64 for ARMv7
	AArch64	128
	PPC     128
	s390	256
	x86	configurable, default 64

which should be easy to copy in QEMU too.
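
A rough sketch of those defaults (QEMU_CACHELINE as named in this series;
the arch macros below are just the usual compiler-defined ones, so
double-check them):

#if defined(__aarch64__) || defined(__powerpc__) || defined(__powerpc64__)
# define QEMU_CACHELINE 128
#elif defined(__s390__) || defined(__s390x__)
# define QEMU_CACHELINE 256
#elif defined(__alpha__)
# define QEMU_CACHELINE 64  /* 32 on some older cores */
#else
# define QEMU_CACHELINE 64  /* ARMv7, x86, and a safe-ish default elsewhere */
#endif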

Paolo

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 07/10] tb hash: hash phys_pc, pc, and flags with xxhash
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 07/10] tb hash: hash phys_pc, pc, and flags with xxhash Emilio G. Cota
@ 2016-04-05 15:41   ` Richard Henderson
  2016-04-05 15:48     ` Paolo Bonzini
  2016-04-05 16:33     ` Laurent Desnogues
  0 siblings, 2 replies; 62+ messages in thread
From: Richard Henderson @ 2016-04-05 15:41 UTC (permalink / raw)
  To: Emilio G. Cota, QEMU Developers, MTTCG Devel
  Cc: Peter Maydell, Paolo Bonzini, Sergey Fedorov, Alex Bennée,
	Peter Crosthwaite

On 04/04/2016 10:30 PM, Emilio G. Cota wrote:
> Tests show that the other element checked for in tb_find_physical,
> cs_base, is always a match when tb_phys+pc+flags are a match,
> so hashing cs_base is wasteful. It could be that this is an ARM-only
> thing, though.

The cs_base field is only used by i386 (in 16-bit modes), and sparc (for a TB 
consisting of only a delay slot).

It may well still turn out to be reasonable to ignore cs_base for hashing.

> +static inline
> +uint32_t tb_hash_func(tb_page_addr_t phys_pc, target_ulong pc, uint64_t flags)
>   {
> -    return (pc >> 2) & (CODE_GEN_PHYS_HASH_SIZE - 1);
> +    struct key {
> +        tb_page_addr_t phys_pc;
> +        target_ulong pc;
> +        uint64_t flags;

The flags field should be uint32_t, not uint64_t.
See its definition in TranslationBlock.

> +    } QEMU_PACKED k;
> +    unsigned int hash;
> +
> +    k.phys_pc = phys_pc;
> +    k.pc = pc;
> +    k.flags = flags;
> +    hash = qemu_xxh32((uint32_t *)&k, sizeof(k) / sizeof(uint32_t), 1);

I'm less than happy about putting data into a struct and hashing that memory. 
Why don't you just take the ideas from xxh32/xxh64 and perform the hash 
directly right here?

In particular, you've 3 data elements here, a maximum of 5 32-bit words, thus 
the loop within xxh32 will never roll, so it's mostly dead code.


r~

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 07/10] tb hash: hash phys_pc, pc, and flags with xxhash
  2016-04-05 15:41   ` Richard Henderson
@ 2016-04-05 15:48     ` Paolo Bonzini
  2016-04-05 16:07       ` Richard Henderson
  2016-04-05 16:33     ` Laurent Desnogues
  1 sibling, 1 reply; 62+ messages in thread
From: Paolo Bonzini @ 2016-04-05 15:48 UTC (permalink / raw)
  To: Richard Henderson, Emilio G. Cota, QEMU Developers, MTTCG Devel
  Cc: Peter Maydell, Sergey Fedorov, Alex Bennée, Peter Crosthwaite



On 05/04/2016 17:41, Richard Henderson wrote:
> 
>> +static inline
>> +uint32_t tb_hash_func(tb_page_addr_t phys_pc, target_ulong pc,
>> uint64_t flags)
>>   {
>> -    return (pc >> 2) & (CODE_GEN_PHYS_HASH_SIZE - 1);
>> +    struct key {
>> +        tb_page_addr_t phys_pc;
>> +        target_ulong pc;
>> +        uint64_t flags;
> 
> The flags field should be uint32_t, not uint64_t.
> See its definition in TranslationBlock.
> 
>> +    } QEMU_PACKED k;
>> +    unsigned int hash;
>> +
>> +    k.phys_pc = phys_pc;
>> +    k.pc = pc;
>> +    k.flags = flags;
>> +    hash = qemu_xxh32((uint32_t *)&k, sizeof(k) / sizeof(uint32_t), 1);
> 
> I'm less than happy about putting data into a struct and hashing that
> memory. Why don't you just take the ideas from xxh32/xxh64 and perform
> the hash directly right here?
> 
> In particular, you've 3 data elements here, a maximum of 5 32-bit words,
> thus the loop within xxh32 will never roll, so it's mostly dead code.

I think it's fine to use the struct.  The exact size of the struct
varies from 3 to 5 32-bit words, so it's hard to write nice
size-dependent code for the hash.

Paolo

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 08/10] qht: QEMU's fast, resizable and scalable Hash Table
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 08/10] qht: QEMU's fast, resizable and scalable Hash Table Emilio G. Cota
  2016-04-05  9:01   ` Paolo Bonzini
@ 2016-04-05 15:50   ` Richard Henderson
  2016-04-08 10:27   ` Alex Bennée
  2 siblings, 0 replies; 62+ messages in thread
From: Richard Henderson @ 2016-04-05 15:50 UTC (permalink / raw)
  To: Emilio G. Cota, QEMU Developers, MTTCG Devel
  Cc: Peter Maydell, Paolo Bonzini, Sergey Fedorov, Alex Bennée,
	Peter Crosthwaite

On 04/04/2016 10:30 PM, Emilio G. Cota wrote:
> +struct qht_map {
> +    struct qht_bucket *buckets;
> +    uint64_t n;
> +    uint64_t n_items;
> +    uint64_t n_items_threshold;
> +    struct rcu_head rcu;
> +};

There's no point in using 64-bit data for a 32-bit host.
You should be using e.g. size_t for the counts.


r~

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 07/10] tb hash: hash phys_pc, pc, and flags with xxhash
  2016-04-05 15:48     ` Paolo Bonzini
@ 2016-04-05 16:07       ` Richard Henderson
  2016-04-05 19:40         ` Emilio G. Cota
  0 siblings, 1 reply; 62+ messages in thread
From: Richard Henderson @ 2016-04-05 16:07 UTC (permalink / raw)
  To: Paolo Bonzini, Emilio G. Cota, QEMU Developers, MTTCG Devel
  Cc: Peter Maydell, Sergey Fedorov, Alex Bennée, Peter Crosthwaite

On 04/05/2016 08:48 AM, Paolo Bonzini wrote:
> I think it's fine to use the struct.  The exact size of the struct
> varies from 3 to 5 32-bit words, so it's hard to write nice
> size-dependent code for the hash.

I don't think it is.  We have 3 integers.  It is trivial to create a simple 
function of two multiplies, two adds, and a remainder.

Take the primes from the xxhash.h, for example:

   (phys_pc * PRIME32_2 + pc * PRIME32_3 + flags)
   % PRIME32_1
   & (CODE_GEN_PHYS_HASH_SIZE - 1)

Obviously, some bucket measurements should be taken, but I can well imagine 
that this might perform just as well as the fully generic hasher.
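
Something like this (untested; assuming the PRIME32_* constants from
xxhash.h are in scope, and keeping the pre-series mask):

static inline uint32_t
tb_hash_func_simple(tb_page_addr_t phys_pc, target_ulong pc, uint32_t flags)
{
    /* mix the three inputs with xxhash's primes, then reduce into the table */
    return (uint32_t)((phys_pc * PRIME32_2 + pc * PRIME32_3 + flags)
                      % PRIME32_1) & (CODE_GEN_PHYS_HASH_SIZE - 1);
}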


r~

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 02/10] compiler.h: add QEMU_CACHELINE + QEMU_ALIGN() + QEMU_CACHELINE_ALIGNED
  2016-04-05 12:58     ` Peter Maydell
  2016-04-05 15:29       ` Paolo Bonzini
@ 2016-04-05 16:23       ` Lluís Vilanova
  2016-04-05 16:31         ` Richard Henderson
  1 sibling, 1 reply; 62+ messages in thread
From: Lluís Vilanova @ 2016-04-05 16:23 UTC (permalink / raw)
  To: Peter Maydell
  Cc: MTTCG Devel, Peter Crosthwaite, QEMU Developers, Emilio G. Cota,
	Paolo Bonzini, Sergey Fedorov, Alex Bennée,
	Richard Henderson

Peter Maydell writes:

> On 5 April 2016 at 13:57, Lluís Vilanova <vilanova@ac.upc.edu> wrote:
>> Emilio G Cota writes:
>> 
>>> I'm assuming windows compilers don't support this attribute.
>>> Signed-off-by: Emilio G. Cota <cota@braap.org>
>>> ---
>>> include/qemu/compiler.h | 10 ++++++++++
>>> 1 file changed, 10 insertions(+)
>> 
>>> diff --git a/include/qemu/compiler.h b/include/qemu/compiler.h
>>> index 8f1cc7b..fb946f1 100644
>>> --- a/include/qemu/compiler.h
>>> +++ b/include/qemu/compiler.h
>>> @@ -41,6 +41,16 @@
>>> # define QEMU_PACKED __attribute__((packed))
>>> #endif
>> 
>>> +#define QEMU_CACHELINE (64)
>> [...]
>> 
>> You should make this a value taken from configure. In linux you could use
>> something like:
>> 
>> getconf LEVEL1_DCACHE_LINESIZE
>> 
>> Or an equivalent program using sysconf(3) if getconf is not present.

> The cache line size at compile time is not necessarily the cache
> line size at runtime...

True, but exposing it as a configure flag allows tuning for the target arch
during cross-compilation. It would be better to use the compiler to tell you
about the target cache line size, but I'm not aware of a way to do it.

Got it!

  gcc -march=native --help=params -v 2>&1 | grep "param l1-cache-line-size" | sed -e 's/.* --param l1-cache-line-size=\([0-9]\+\) .*/\1/'


Cheers,
  Lluis

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 02/10] compiler.h: add QEMU_CACHELINE + QEMU_ALIGN() + QEMU_CACHELINE_ALIGNED
  2016-04-05 16:23       ` Lluís Vilanova
@ 2016-04-05 16:31         ` Richard Henderson
  2016-04-05 16:56           ` Peter Maydell
  0 siblings, 1 reply; 62+ messages in thread
From: Richard Henderson @ 2016-04-05 16:31 UTC (permalink / raw)
  To: Peter Maydell, Emilio G. Cota, QEMU Developers, MTTCG Devel,
	Peter Crosthwaite, Sergey Fedorov, Paolo Bonzini,
	Alex Bennée

On 04/05/2016 09:23 AM, Lluís Vilanova wrote:
> Got it!
>
>    gcc -march=native --help=params -v 2>&1 | grep "param l1-cache-line-size" | sed -e 's/.* --param l1-cache-line-size=\([0-9]\+\) .*/\1/'

That will only work on x86, where we can be pretty sure it's 64 anyway.


r~

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 07/10] tb hash: hash phys_pc, pc, and flags with xxhash
  2016-04-05 15:41   ` Richard Henderson
  2016-04-05 15:48     ` Paolo Bonzini
@ 2016-04-05 16:33     ` Laurent Desnogues
  2016-04-05 17:19       ` Richard Henderson
  1 sibling, 1 reply; 62+ messages in thread
From: Laurent Desnogues @ 2016-04-05 16:33 UTC (permalink / raw)
  To: Richard Henderson
  Cc: MTTCG Devel, Peter Maydell, Peter Crosthwaite, QEMU Developers,
	Emilio G. Cota, Paolo Bonzini, Sergey Fedorov, Alex Bennée

On Tue, Apr 5, 2016 at 5:41 PM, Richard Henderson <rth@twiddle.net> wrote:
> On 04/04/2016 10:30 PM, Emilio G. Cota wrote:
>>
>> Tests show that the other element checked for in tb_find_physical,
>> cs_base, is always a match when tb_phys+pc+flags are a match,
>> so hashing cs_base is wasteful. It could be that this is an ARM-only
>> thing, though.
>
>
> The cs_base field is only used by i386 (in 16-bit modes), and sparc (for a
> TB consisting of only a delay slot).
>
> It may well still turn out to be reasonable to ignore cs_base for hashing.
>
>> +static inline
>> +uint32_t tb_hash_func(tb_page_addr_t phys_pc, target_ulong pc, uint64_t
>> flags)
>>   {
>> -    return (pc >> 2) & (CODE_GEN_PHYS_HASH_SIZE - 1);
>> +    struct key {
>> +        tb_page_addr_t phys_pc;
>> +        target_ulong pc;
>> +        uint64_t flags;
>
>
> The flags field should be uint32_t, not uint64_t.
> See its definition in TranslationBlock.

The 'flags' field is 64-bit.  You're thinking of cflags, I guess.


Laurent

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 02/10] compiler.h: add QEMU_CACHELINE + QEMU_ALIGN() + QEMU_CACHELINE_ALIGNED
  2016-04-05 16:31         ` Richard Henderson
@ 2016-04-05 16:56           ` Peter Maydell
  2016-04-05 19:02             ` Lluís Vilanova
  0 siblings, 1 reply; 62+ messages in thread
From: Peter Maydell @ 2016-04-05 16:56 UTC (permalink / raw)
  To: Richard Henderson
  Cc: MTTCG Devel, Peter Crosthwaite, QEMU Developers, Emilio G. Cota,
	Paolo Bonzini, Sergey Fedorov, Alex Bennée

On 5 April 2016 at 17:31, Richard Henderson <rth@twiddle.net> wrote:
> On 04/05/2016 09:23 AM, Lluís Vilanova wrote:
>>
>> Got it!
>>
>>    gcc -march=native --help=params -v 2>&1 | grep "param
>> l1-cache-line-size" | sed -e 's/.* --param l1-cache-line-size=\([0-9]\+\)
>> .*/\1/'
>
>
> That will only work on x86, where we can be pretty sure it's 64 anyway.

Doesn't work on OSX x86 either, since the gcc-to-clang wrapper
doesn't support --help=params :-)

thanks
-- PMM

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 07/10] tb hash: hash phys_pc, pc, and flags with xxhash
  2016-04-05 16:33     ` Laurent Desnogues
@ 2016-04-05 17:19       ` Richard Henderson
  2016-04-06  6:06         ` Laurent Desnogues
  0 siblings, 1 reply; 62+ messages in thread
From: Richard Henderson @ 2016-04-05 17:19 UTC (permalink / raw)
  To: Laurent Desnogues
  Cc: MTTCG Devel, Peter Maydell, Peter Crosthwaite, QEMU Developers,
	Emilio G. Cota, Paolo Bonzini, Sergey Fedorov, Alex Bennée

On 04/05/2016 09:33 AM, Laurent Desnogues wrote:
> The 'flags' field is 64-bit.  You're thinking of cflags, I guess.

Well that's silly.  Since it's filled in via

static inline void cpu_get_tb_cpu_state(CPUMIPSState *env, target_ulong *pc,
                                        target_ulong *cs_base, int *flags)

and passed back in to generate code with

TranslationBlock *tb_gen_code(CPUState *cpu,
                              target_ulong pc, target_ulong cs_base, int flags,
                              int cflags);

So while TranslationBlock stores "uint64_t", the producer and consumer see "int".


r~

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 02/10] compiler.h: add QEMU_CACHELINE + QEMU_ALIGN() + QEMU_CACHELINE_ALIGNED
  2016-04-05  7:57   ` Peter Maydell
@ 2016-04-05 17:24     ` Emilio G. Cota
  2016-04-05 18:01       ` Peter Maydell
  0 siblings, 1 reply; 62+ messages in thread
From: Emilio G. Cota @ 2016-04-05 17:24 UTC (permalink / raw)
  To: Peter Maydell, Paolo Bonzini
  Cc: MTTCG Devel, MTTCG Devel, Peter Crosthwaite, QEMU Developers,
	Sergey Fedorov, Alex Bennée, Richard Henderson

On Tue, Apr 05, 2016 at 08:57:45 +0100, Peter Maydell wrote:
> On 5 April 2016 at 06:30, Emilio G. Cota <cota@braap.org> wrote:
> > +#define QEMU_CACHELINE (64)
> 
> Why 64? Does anything bad happen if the host's cache line
> size turns out to be greater than the value here ?

Defining a number lower than the host's cache line could result
in some false sharing. No big deal for most workloads.

Defining a number greater than the host's cache line might result in
wasted memory, which we really should avoid.

I used 64 because it is common on x86, which I assume is what most
developers develop on.
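
To illustrate, with the macros from patch 2 (toy example, not from the
series):

/* one frequently-written counter per vCPU thread */
struct vcpu_counter {
    unsigned long count;
} QEMU_CACHELINE_ALIGNED;

struct vcpu_counter counters[8];

With QEMU_CACHELINE == 64 each counter gets its own 64-byte slot: no sharing
on a 64-byte-line host, merely wasted padding on a 32-byte-line host, but two
counters per line (i.e. false sharing again) on a 128-byte-line host.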

On Tue, Apr 05, 2016 at 17:29:05 +0200, Paolo Bonzini wrote:
> But the size of data structures cannot change at run-time.  A bigger
> worry is cross-compilation.
> 
> Linux has these defaults:
> 
> 	Alpha	32 or 64
> 	ARM	configurable, 64 for ARMv7
> 	AArch64	128
> 	PPC     128
> 	s390	256
> 	x86	configurable, default 64
> 
> which should be easy to copy in QEMU too.

So how about this:
we add these defaults, and also add an optional --configure
parameter to override said defaults.

Otherwise I'd just stick to 64.

Thanks,

		Emilio

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 02/10] compiler.h: add QEMU_CACHELINE + QEMU_ALIGN() + QEMU_CACHELINE_ALIGNED
  2016-04-05 17:24     ` Emilio G. Cota
@ 2016-04-05 18:01       ` Peter Maydell
  2016-04-05 19:13         ` Emilio G. Cota
  0 siblings, 1 reply; 62+ messages in thread
From: Peter Maydell @ 2016-04-05 18:01 UTC (permalink / raw)
  To: Emilio G. Cota
  Cc: MTTCG Devel, MTTCG Devel, Peter Crosthwaite, QEMU Developers,
	Sergey Fedorov, Paolo Bonzini, Alex Bennée,
	Richard Henderson

On 5 April 2016 at 18:24, Emilio G. Cota <cota@braap.org> wrote:
> On Tue, Apr 05, 2016 at 08:57:45 +0100, Peter Maydell wrote:
>> On 5 April 2016 at 06:30, Emilio G. Cota <cota@braap.org> wrote:
>> > +#define QEMU_CACHELINE (64)
>>
>> Why 64? Does anything bad happen if the host's cache line
>> size turns out to be greater than the value here ?
>
> Defining a number lower than the host's cache line could result
> in some false sharing. No big deal for most workloads.
>
> Defining a number greater than the host's cache might result in
> wasted memory, which we really should avoid.
>
> I used 64 because it is common on x86, which I assume is what most
> developers develop on.
>
> On Tue, Apr 05, 2016 at 17:29:05 +0200, Paolo Bonzini wrote:
>> But the size of data structures cannot change at run-time.  A bigger
>> worry is cross-compilation.
>>
>> Linux has these defaults:
>>
>>       Alpha   32 or 64
>>       ARM     configurable, 64 for ARMv7
>>       AArch64 128
>>       PPC     128
>>       s390    256
>>       x86     configurable, default 64
>>
>> which should be easy to copy in QEMU too.
>
> So how about this:
> we add these defaults, and also add an optional --configure
> parameter to override said defaults.

I think this definitely doesn't merit a configure parameter.

> Otherwise I'd just stick to 64.

If this is basically just a hashtable semi-tunable parameter knob,
then I don't mind if we use 64 everywhere (or some slightly more
architecture-aware default). But I would prefer that we not make
it a global define QEMU_CACHELINE if we can't actually guarantee
that it's the size of the cacheline (which we can't), because I
think that will be confusing and invite future misuse.

Unless we have another use case in the tree at the moment for
a number which is "probably the cacheline size, but might be
smaller or larger if you're unlucky", in which case we just want
a better name :-)

thanks
-- PMM

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 02/10] compiler.h: add QEMU_CACHELINE + QEMU_ALIGN() + QEMU_CACHELINE_ALIGNED
  2016-04-05 16:56           ` Peter Maydell
@ 2016-04-05 19:02             ` Lluís Vilanova
  2016-04-05 19:15               ` Richard Henderson
  0 siblings, 1 reply; 62+ messages in thread
From: Lluís Vilanova @ 2016-04-05 19:02 UTC (permalink / raw)
  To: Peter Maydell
  Cc: MTTCG Devel, Peter Crosthwaite, QEMU Developers, Emilio G. Cota,
	Sergey Fedorov, Paolo Bonzini, Alex Bennée,
	Richard Henderson

Peter Maydell writes:

> On 5 April 2016 at 17:31, Richard Henderson <rth@twiddle.net> wrote:
>> On 04/05/2016 09:23 AM, Lluís Vilanova wrote:
>>> 
>>> Got it!
>>> 
>>> gcc -march=native --help=params -v 2>&1 | grep "param
>>> l1-cache-line-size" | sed -e 's/.* --param l1-cache-line-size=\([0-9]\+\)
>>> .*/\1/'
>> 
>> 
>> That will only work on x86, where we can be pretty sure it's 64 anyway.

Oh, you mean l1-cache-line-size is an x86-only parameter in GCC?


> Doesn't work on OSX x86 either, since the gcc-to-clang wrapper
> doesn't support --help=params :-)

Well, my suggestion of adding a configure flag still applies :)


Cheers,
  Lluis

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 02/10] compiler.h: add QEMU_CACHELINE + QEMU_ALIGN() + QEMU_CACHELINE_ALIGNED
  2016-04-05 18:01       ` Peter Maydell
@ 2016-04-05 19:13         ` Emilio G. Cota
  0 siblings, 0 replies; 62+ messages in thread
From: Emilio G. Cota @ 2016-04-05 19:13 UTC (permalink / raw)
  To: Peter Maydell
  Cc: MTTCG Devel, MTTCG Devel, Peter Crosthwaite, QEMU Developers,
	Sergey Fedorov, Paolo Bonzini, Alex Bennée,
	Richard Henderson

On Tue, Apr 05, 2016 at 19:01:07 +0100, Peter Maydell wrote:
> On 5 April 2016 at 18:24, Emilio G. Cota <cota@braap.org> wrote:
> > So how about this:
> > we add these defaults, and also add an optional --configure
> > parameter to override said defaults.
> 
> I think this definitely doesn't merit a configure parameter.

Agreed :)

> > Otherwise I'd just stick to 64.
> 
> If this is basically just a hashtable semi-tunable parameter knob,
> then I don't mind if we use 64 everywhere (or some slightly more
> architecture-aware default). But I would prefer that we not make
> it a global define QEMU_CACHELINE if we can't actually guarantee
> that it's the size of the cacheline (which we can't), because I
> think that will be confusing and invite future misuse.
> 
> Unless we have another use case in the tree at the moment for
> a number which is "probably the cacheline size, but might be
> smaller or larger if you're unlucky", in which case we just want
> a better name :-)

Ok, so for now I'll only leave the ALIGN() macro, and then use
ALIGN(64) in the hash table. We can always reconsider adding a
more proper definition if this gets widespread use.

Thanks,

		Emilio

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 02/10] compiler.h: add QEMU_CACHELINE + QEMU_ALIGN() + QEMU_CACHELINE_ALIGNED
  2016-04-05 19:02             ` Lluís Vilanova
@ 2016-04-05 19:15               ` Richard Henderson
  2016-04-05 20:09                 ` Lluís Vilanova
  0 siblings, 1 reply; 62+ messages in thread
From: Richard Henderson @ 2016-04-05 19:15 UTC (permalink / raw)
  To: Peter Maydell, MTTCG Devel, Peter Crosthwaite, QEMU Developers,
	Emilio G. Cota, Paolo Bonzini, Sergey Fedorov, Alex Bennée

On 04/05/2016 12:02 PM, Lluís Vilanova wrote:
> Peter Maydell writes:
> 
>> On 5 April 2016 at 17:31, Richard Henderson <rth@twiddle.net> wrote:
>>> On 04/05/2016 09:23 AM, Lluís Vilanova wrote:
>>>>
>>>> Got it!
>>>>
>>>> gcc -march=native --help=params -v 2>&1 | grep "param
>>>> l1-cache-line-size" | sed -e 's/.* --param l1-cache-line-size=\([0-9]\+\)
>>>> .*/\1/'
>>>
>>>
>>> That will only work on x86, where we can be pretty sure it's 64 anyway.
> 
> Oh, you mean l1-cache-line-size is an x86-only parameter in GCC?

No, using -march=native to detect the host line size.


r~

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 07/10] tb hash: hash phys_pc, pc, and flags with xxhash
  2016-04-05 16:07       ` Richard Henderson
@ 2016-04-05 19:40         ` Emilio G. Cota
  2016-04-05 21:08           ` Richard Henderson
  0 siblings, 1 reply; 62+ messages in thread
From: Emilio G. Cota @ 2016-04-05 19:40 UTC (permalink / raw)
  To: Richard Henderson
  Cc: MTTCG Devel, Peter Maydell, Peter Crosthwaite, QEMU Developers,
	Sergey Fedorov, Paolo Bonzini, Alex Bennée

On Tue, Apr 05, 2016 at 09:07:57 -0700, Richard Henderson wrote:
> On 04/05/2016 08:48 AM, Paolo Bonzini wrote:
> >I think it's fine to use the struct.  The exact size of the struct
> >varies from 3 to 5 32-bit words, so it's hard to write nice
> >size-dependent code for the hash.
> 
> I don't think it is.  We have 3 integers.  It is trivial to create a simple
> function of 2 multiplies, two adds, and a remainder.
> 
> Take the primes from the xxhash.h, for example:
> 
>   (phys_pc * PRIME32_2 + pc * PRIME32_3 + flags)
>   % PRIME32_1
>   & (CODE_GEN_PHYS_HASH_SIZE - 1)
> 
> Obviously, some bucket measurements should be taken, but I can well imagine
> that this might perform just as well as the fully generic hasher.

That function doesn't perform well: 25.06s vs. 21.18s with xxh32.

Having the packed struct and passing it to an *inlined* xxhash is
virtually unbeatable; gcc (>=v4.6, dunno about older ones) optimizes the
inline function since it knows the size of the struct.
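
For reference, the packed-struct version being measured here is the one posted
in this series (field widths depend on the target configuration):

static inline
uint32_t tb_hash_func(tb_page_addr_t phys_pc, target_ulong pc, int flags)
{
    struct {
        tb_page_addr_t phys_pc;
        target_ulong pc;
        int flags;
    } QEMU_PACKED k;

    k.phys_pc = phys_pc;
    k.pc = pc;
    k.flags = flags;
    return qemu_xxh32((uint32_t *)&k, sizeof(k) / sizeof(uint32_t), 1);
}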

To show this I'm appending the generated code for tb_hash_func when xxh32
is inlined vs. when it is not, for x86_64-softmmu. Results are similar
for arm-softmmu.

Anyway (for the arm bootup test) we're talking about ~0.50% of runtime spent
in tb_hash_func (with xxh32 inlined), so whatever we did here could not
improve overall performance much.

Thanks,

		Emilio

* no inline:

00000000001a4e60 <qemu_xxh32>:
  1a4e60:	48 83 ec 18          	sub    $0x18,%rsp
  1a4e64:	4c 8d 0c b7          	lea    (%rdi,%rsi,4),%r9
  1a4e68:	64 48 8b 04 25 28 00 	mov    %fs:0x28,%rax
  1a4e6f:	00 00 
  1a4e71:	48 89 44 24 08       	mov    %rax,0x8(%rsp)
  1a4e76:	31 c0                	xor    %eax,%eax
  1a4e78:	48 83 fe 03          	cmp    $0x3,%rsi
  1a4e7c:	8d 82 b1 67 56 16    	lea    0x165667b1(%rdx),%eax
  1a4e82:	0f 86 92 00 00 00    	jbe    1a4f1a <qemu_xxh32+0xba>
  1a4e88:	4d 8d 59 f0          	lea    -0x10(%r9),%r11
  1a4e8c:	44 8d 82 28 44 23 24 	lea    0x24234428(%rdx),%r8d
  1a4e93:	8d 8a 77 ca eb 85    	lea    -0x7a143589(%rdx),%ecx
  1a4e99:	8d 82 4f 86 c8 61    	lea    0x61c8864f(%rdx),%eax
  1a4e9f:	90                   	nop
  1a4ea0:	44 8b 17             	mov    (%rdi),%r10d
  1a4ea3:	45 69 d2 77 ca eb 85 	imul   $0x85ebca77,%r10d,%r10d
  1a4eaa:	45 01 d0             	add    %r10d,%r8d
  1a4ead:	44 8b 57 04          	mov    0x4(%rdi),%r10d
  1a4eb1:	41 c1 c0 0d          	rol    $0xd,%r8d
  1a4eb5:	45 69 c0 b1 79 37 9e 	imul   $0x9e3779b1,%r8d,%r8d
  1a4ebc:	45 69 d2 77 ca eb 85 	imul   $0x85ebca77,%r10d,%r10d
  1a4ec3:	44 01 d1             	add    %r10d,%ecx
  1a4ec6:	44 8b 57 08          	mov    0x8(%rdi),%r10d
  1a4eca:	c1 c1 0d             	rol    $0xd,%ecx
  1a4ecd:	69 c9 b1 79 37 9e    	imul   $0x9e3779b1,%ecx,%ecx
  1a4ed3:	45 69 d2 77 ca eb 85 	imul   $0x85ebca77,%r10d,%r10d
  1a4eda:	44 01 d2             	add    %r10d,%edx
  1a4edd:	44 8b 57 0c          	mov    0xc(%rdi),%r10d
  1a4ee1:	48 83 c7 10          	add    $0x10,%rdi
  1a4ee5:	c1 c2 0d             	rol    $0xd,%edx
  1a4ee8:	69 d2 b1 79 37 9e    	imul   $0x9e3779b1,%edx,%edx
  1a4eee:	45 69 d2 77 ca eb 85 	imul   $0x85ebca77,%r10d,%r10d
  1a4ef5:	44 01 d0             	add    %r10d,%eax
  1a4ef8:	c1 c0 0d             	rol    $0xd,%eax
  1a4efb:	69 c0 b1 79 37 9e    	imul   $0x9e3779b1,%eax,%eax
  1a4f01:	49 39 fb             	cmp    %rdi,%r11
  1a4f04:	73 9a                	jae    1a4ea0 <qemu_xxh32+0x40>
  1a4f06:	c1 c9 19             	ror    $0x19,%ecx
  1a4f09:	41 c1 c8 1f          	ror    $0x1f,%r8d
  1a4f0d:	c1 ca 14             	ror    $0x14,%edx
  1a4f10:	44 01 c1             	add    %r8d,%ecx
  1a4f13:	c1 c8 0e             	ror    $0xe,%eax
  1a4f16:	01 ca                	add    %ecx,%edx
  1a4f18:	01 d0                	add    %edx,%eax
  1a4f1a:	4c 39 cf             	cmp    %r9,%rdi
  1a4f1d:	8d 34 b0             	lea    (%rax,%rsi,4),%esi
  1a4f20:	73 22                	jae    1a4f44 <qemu_xxh32+0xe4>
  1a4f22:	66 0f 1f 44 00 00    	nopw   0x0(%rax,%rax,1)
  1a4f28:	8b 17                	mov    (%rdi),%edx
  1a4f2a:	48 83 c7 04          	add    $0x4,%rdi
  1a4f2e:	69 c2 3d ae b2 c2    	imul   $0xc2b2ae3d,%edx,%eax
  1a4f34:	01 c6                	add    %eax,%esi
  1a4f36:	c1 c6 11             	rol    $0x11,%esi
  1a4f39:	69 f6 2f eb d4 27    	imul   $0x27d4eb2f,%esi,%esi
  1a4f3f:	49 39 f9             	cmp    %rdi,%r9
  1a4f42:	77 e4                	ja     1a4f28 <qemu_xxh32+0xc8>
  1a4f44:	89 f0                	mov    %esi,%eax
  1a4f46:	c1 e8 0f             	shr    $0xf,%eax
  1a4f49:	31 f0                	xor    %esi,%eax
  1a4f4b:	69 d0 77 ca eb 85    	imul   $0x85ebca77,%eax,%edx
  1a4f51:	89 d0                	mov    %edx,%eax
  1a4f53:	c1 e8 0d             	shr    $0xd,%eax
  1a4f56:	31 d0                	xor    %edx,%eax
  1a4f58:	69 d0 3d ae b2 c2    	imul   $0xc2b2ae3d,%eax,%edx
  1a4f5e:	89 d0                	mov    %edx,%eax
  1a4f60:	c1 e8 10             	shr    $0x10,%eax
  1a4f63:	31 d0                	xor    %edx,%eax
  1a4f65:	48 8b 54 24 08       	mov    0x8(%rsp),%rdx
  1a4f6a:	64 48 33 14 25 28 00 	xor    %fs:0x28,%rdx
  1a4f71:	00 00 
  1a4f73:	75 05                	jne    1a4f7a <qemu_xxh32+0x11a>
  1a4f75:	48 83 c4 18          	add    $0x18,%rsp
  1a4f79:	c3                   	retq   
  1a4f7a:	e8 f1 7a fe ff       	callq  18ca70 <__stack_chk_fail@plt>
  1a4f7f:	90                   	nop

00000000001a4f80 <tb_hash_func>:
  1a4f80:	48 83 ec 28          	sub    $0x28,%rsp
  1a4f84:	48 89 3c 24          	mov    %rdi,(%rsp)
  1a4f88:	48 89 74 24 08       	mov    %rsi,0x8(%rsp)
  1a4f8d:	48 89 e7             	mov    %rsp,%rdi
  1a4f90:	89 54 24 10          	mov    %edx,0x10(%rsp)
  1a4f94:	be 05 00 00 00       	mov    $0x5,%esi
  1a4f99:	ba 01 00 00 00       	mov    $0x1,%edx
  1a4f9e:	64 48 8b 04 25 28 00 	mov    %fs:0x28,%rax
  1a4fa5:	00 00 
  1a4fa7:	48 89 44 24 18       	mov    %rax,0x18(%rsp)
  1a4fac:	31 c0                	xor    %eax,%eax
  1a4fae:	e8 ad fe ff ff       	callq  1a4e60 <qemu_xxh32>
  1a4fb3:	48 8b 54 24 18       	mov    0x18(%rsp),%rdx
  1a4fb8:	64 48 33 14 25 28 00 	xor    %fs:0x28,%rdx
  1a4fbf:	00 00 
  1a4fc1:	75 05                	jne    1a4fc8 <tb_hash_func+0x48>
  1a4fc3:	48 83 c4 28          	add    $0x28,%rsp
  1a4fc7:	c3                   	retq   
  1a4fc8:	e8 a3 7a fe ff       	callq  18ca70 <__stack_chk_fail@plt>
  1a4fcd:	0f 1f 00             	nopl   (%rax)

* inline:

00000000001a6800 <tb_hash_func>:
  1a6800:	48 83 ec 28          	sub    $0x28,%rsp
  1a6804:	69 cf 77 ca eb 85    	imul   $0x85ebca77,%edi,%ecx
  1a680a:	48 89 3c 24          	mov    %rdi,(%rsp)
  1a680e:	48 c1 ef 20          	shr    $0x20,%rdi
  1a6812:	69 ff 77 ca eb 85    	imul   $0x85ebca77,%edi,%edi
  1a6818:	48 89 74 24 08       	mov    %rsi,0x8(%rsp)
  1a681d:	64 48 8b 04 25 28 00 	mov    %fs:0x28,%rax
  1a6824:	00 00 
  1a6826:	48 89 44 24 18       	mov    %rax,0x18(%rsp)
  1a682b:	31 c0                	xor    %eax,%eax
  1a682d:	81 c1 29 44 23 24    	add    $0x24234429,%ecx
  1a6833:	69 c6 77 ca eb 85    	imul   $0x85ebca77,%esi,%eax
  1a6839:	48 c1 ee 20          	shr    $0x20,%rsi
  1a683d:	81 ef 88 35 14 7a    	sub    $0x7a143588,%edi
  1a6843:	69 f6 77 ca eb 85    	imul   $0x85ebca77,%esi,%esi
  1a6849:	c1 c9 13             	ror    $0x13,%ecx
  1a684c:	c1 cf 13             	ror    $0x13,%edi
  1a684f:	83 c0 01             	add    $0x1,%eax
  1a6852:	69 c9 b1 79 37 9e    	imul   $0x9e3779b1,%ecx,%ecx
  1a6858:	c1 c8 13             	ror    $0x13,%eax
  1a685b:	81 c6 50 86 c8 61    	add    $0x61c88650,%esi
  1a6861:	69 ff b1 79 37 9e    	imul   $0x9e3779b1,%edi,%edi
  1a6867:	c1 ce 13             	ror    $0x13,%esi
  1a686a:	c1 c9 1f             	ror    $0x1f,%ecx
  1a686d:	69 c0 b1 79 37 9e    	imul   $0x9e3779b1,%eax,%eax
  1a6873:	c1 cf 19             	ror    $0x19,%edi
  1a6876:	69 f6 b1 79 37 9e    	imul   $0x9e3779b1,%esi,%esi
  1a687c:	8d 7c 39 14          	lea    0x14(%rcx,%rdi,1),%edi
  1a6880:	c1 c8 14             	ror    $0x14,%eax
  1a6883:	69 d2 3d ae b2 c2    	imul   $0xc2b2ae3d,%edx,%edx
  1a6889:	01 f8                	add    %edi,%eax
  1a688b:	c1 ce 0e             	ror    $0xe,%esi
  1a688e:	01 c6                	add    %eax,%esi
  1a6890:	01 f2                	add    %esi,%edx
  1a6892:	c1 ca 0f             	ror    $0xf,%edx
  1a6895:	69 d2 2f eb d4 27    	imul   $0x27d4eb2f,%edx,%edx
  1a689b:	89 d0                	mov    %edx,%eax
  1a689d:	c1 e8 0f             	shr    $0xf,%eax
  1a68a0:	31 d0                	xor    %edx,%eax
  1a68a2:	69 d0 77 ca eb 85    	imul   $0x85ebca77,%eax,%edx
  1a68a8:	89 d0                	mov    %edx,%eax
  1a68aa:	c1 e8 0d             	shr    $0xd,%eax
  1a68ad:	31 d0                	xor    %edx,%eax
  1a68af:	69 d0 3d ae b2 c2    	imul   $0xc2b2ae3d,%eax,%edx
  1a68b5:	89 d0                	mov    %edx,%eax
  1a68b7:	c1 e8 10             	shr    $0x10,%eax
  1a68ba:	31 d0                	xor    %edx,%eax
  1a68bc:	48 8b 54 24 18       	mov    0x18(%rsp),%rdx
  1a68c1:	64 48 33 14 25 28 00 	xor    %fs:0x28,%rdx
  1a68c8:	00 00 
  1a68ca:	75 05                	jne    1a68d1 <tb_hash_func+0xd1>
  1a68cc:	48 83 c4 28          	add    $0x28,%rsp
  1a68d0:	c3                   	retq   
  1a68d1:	e8 9a 61 fe ff       	callq  18ca70 <__stack_chk_fail@plt>
  1a68d6:	66 2e 0f 1f 84 00 00 	nopw   %cs:0x0(%rax,%rax,1)
  1a68dd:	00 00 00 

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 02/10] compiler.h: add QEMU_CACHELINE + QEMU_ALIGN() + QEMU_CACHELINE_ALIGNED
  2016-04-05 19:15               ` Richard Henderson
@ 2016-04-05 20:09                 ` Lluís Vilanova
  2016-04-06 11:44                   ` Paolo Bonzini
  0 siblings, 1 reply; 62+ messages in thread
From: Lluís Vilanova @ 2016-04-05 20:09 UTC (permalink / raw)
  To: Richard Henderson
  Cc: MTTCG Devel, Peter Maydell, Peter Crosthwaite, QEMU Developers,
	Emilio G. Cota, Sergey Fedorov, Paolo Bonzini, Alex Bennée

Richard Henderson writes:

> On 04/05/2016 12:02 PM, Lluís Vilanova wrote:
>> Peter Maydell writes:
>> 
>>> On 5 April 2016 at 17:31, Richard Henderson <rth@twiddle.net> wrote:
>>>> On 04/05/2016 09:23 AM, Lluís Vilanova wrote:
>>>>> 
>>>>> Got it!
>>>>> 
>>>>> gcc -march=native --help=params -v 2>&1 | grep "param
>>>>> l1-cache-line-size" | sed -e 's/.* --param l1-cache-line-size=\([0-9]\+\)
>>>>> .*/\1/'
>>>> 
>>>> 
>>>> That will only work on x86, where we can be pretty sure it's 64 anyway.
>> 
>> Oh, you mean l1-cache-line-size is an x86-only parameter in GCC?

> No, using -march=native to detect the host line size.

Ah. That's just an example. For cross-compilation you would use a different
march argument (or none to use the default target sub-arch) and get the
parameter for the target processor. This should already be known by configure as
part of the arguments to select the cross-compiler and target architecture
(e.g., CC).


Lluis

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 07/10] tb hash: hash phys_pc, pc, and flags with xxhash
  2016-04-05 19:40         ` Emilio G. Cota
@ 2016-04-05 21:08           ` Richard Henderson
  2016-04-06  0:52             ` Emilio G. Cota
  0 siblings, 1 reply; 62+ messages in thread
From: Richard Henderson @ 2016-04-05 21:08 UTC (permalink / raw)
  To: Emilio G. Cota
  Cc: MTTCG Devel, Peter Maydell, Peter Crosthwaite, QEMU Developers,
	Sergey Fedorov, Paolo Bonzini, Alex Bennée

On 04/05/2016 12:40 PM, Emilio G. Cota wrote:
> On Tue, Apr 05, 2016 at 09:07:57 -0700, Richard Henderson wrote:
>> On 04/05/2016 08:48 AM, Paolo Bonzini wrote:
>>> I think it's fine to use the struct.  The exact size of the struct
>>> varies from 3 to 5 32-bit words, so it's hard to write nice
>>> size-dependent code for the hash.
>>
>> I don't think it is.  We have 3 integers.  It is trivial to create a simple
>> function of 2 multiplies, two adds, and a remainder.
>>
>> Take the primes from the xxhash.h, for example:
>>
>>   (phys_pc * PRIME32_2 + pc * PRIME32_3 + flags)
>>   % PRIME32_1
>>   & (CODE_GEN_PHYS_HASH_SIZE - 1)
>>
>> Obviously, some bucket measurements should be taken, but I can well imagine
>> that this might perform just as well as the fully generic hasher.
>
> That function doesn't perform well: 25.06s vs. 21.18s with xxh32.

Sure, that's just what came off the top of my head.

But the point is that we can do better than dropping data into memory.
Particularly on hosts that do not support unaligned accesses, which is exactly
what the packed structure you created forces.


> * inline:
>
> 00000000001a6800 <tb_hash_func>:
>   1a6800:	48 83 ec 28          	sub    $0x28,%rsp
>   1a6804:	69 cf 77 ca eb 85    	imul   $0x85ebca77,%edi,%ecx
>   1a680a:	48 89 3c 24          	mov    %rdi,(%rsp)
>   1a680e:	48 c1 ef 20          	shr    $0x20,%rdi
>   1a6812:	69 ff 77 ca eb 85    	imul   $0x85ebca77,%edi,%edi
>   1a6818:	48 89 74 24 08       	mov    %rsi,0x8(%rsp)
>   1a681d:	64 48 8b 04 25 28 00 	mov    %fs:0x28,%rax
>   1a6824:	00 00
>   1a6826:	48 89 44 24 18       	mov    %rax,0x18(%rsp)
>   1a682b:	31 c0                	xor    %eax,%eax
>   1a682d:	81 c1 29 44 23 24    	add    $0x24234429,%ecx
>   1a6833:	69 c6 77 ca eb 85    	imul   $0x85ebca77,%esi,%eax
>   1a6839:	48 c1 ee 20          	shr    $0x20,%rsi
>   1a683d:	81 ef 88 35 14 7a    	sub    $0x7a143588,%edi
>   1a6843:	69 f6 77 ca eb 85    	imul   $0x85ebca77,%esi,%esi
>   1a6849:	c1 c9 13             	ror    $0x13,%ecx
>   1a684c:	c1 cf 13             	ror    $0x13,%edi
>   1a684f:	83 c0 01             	add    $0x1,%eax
>   1a6852:	69 c9 b1 79 37 9e    	imul   $0x9e3779b1,%ecx,%ecx
>   1a6858:	c1 c8 13             	ror    $0x13,%eax
>   1a685b:	81 c6 50 86 c8 61    	add    $0x61c88650,%esi
>   1a6861:	69 ff b1 79 37 9e    	imul   $0x9e3779b1,%edi,%edi
>   1a6867:	c1 ce 13             	ror    $0x13,%esi
>   1a686a:	c1 c9 1f             	ror    $0x1f,%ecx
>   1a686d:	69 c0 b1 79 37 9e    	imul   $0x9e3779b1,%eax,%eax
>   1a6873:	c1 cf 19             	ror    $0x19,%edi
>   1a6876:	69 f6 b1 79 37 9e    	imul   $0x9e3779b1,%esi,%esi
>   1a687c:	8d 7c 39 14          	lea    0x14(%rcx,%rdi,1),%edi
>   1a6880:	c1 c8 14             	ror    $0x14,%eax
>   1a6883:	69 d2 3d ae b2 c2    	imul   $0xc2b2ae3d,%edx,%edx
>   1a6889:	01 f8                	add    %edi,%eax
>   1a688b:	c1 ce 0e             	ror    $0xe,%esi
>   1a688e:	01 c6                	add    %eax,%esi
>   1a6890:	01 f2                	add    %esi,%edx
>   1a6892:	c1 ca 0f             	ror    $0xf,%edx
>   1a6895:	69 d2 2f eb d4 27    	imul   $0x27d4eb2f,%edx,%edx
>   1a689b:	89 d0                	mov    %edx,%eax
>   1a689d:	c1 e8 0f             	shr    $0xf,%eax
>   1a68a0:	31 d0                	xor    %edx,%eax
>   1a68a2:	69 d0 77 ca eb 85    	imul   $0x85ebca77,%eax,%edx
>   1a68a8:	89 d0                	mov    %edx,%eax
>   1a68aa:	c1 e8 0d             	shr    $0xd,%eax
>   1a68ad:	31 d0                	xor    %edx,%eax
>   1a68af:	69 d0 3d ae b2 c2    	imul   $0xc2b2ae3d,%eax,%edx
>   1a68b5:	89 d0                	mov    %edx,%eax
>   1a68b7:	c1 e8 10             	shr    $0x10,%eax
>   1a68ba:	31 d0                	xor    %edx,%eax
>   1a68bc:	48 8b 54 24 18       	mov    0x18(%rsp),%rdx
>   1a68c1:	64 48 33 14 25 28 00 	xor    %fs:0x28,%rdx
>   1a68c8:	00 00
>   1a68ca:	75 05                	jne    1a68d1 <tb_hash_func+0xd1>
>   1a68cc:	48 83 c4 28          	add    $0x28,%rsp
>   1a68d0:	c3                   	retq

If I inline everything explicitly, we do even better.


r~


0000000000000000 <tb_hash_func>:
    0:	44 69 c6 77 ca eb 85 	imul   $0x85ebca77,%esi,%r8d
    7:	48 c1 ee 20          	shr    $0x20,%rsi
    b:	69 cf 77 ca eb 85    	imul   $0x85ebca77,%edi,%ecx
   11:	48 c1 ef 20          	shr    $0x20,%rdi
   15:	41 83 c0 01          	add    $0x1,%r8d
   19:	41 c1 c0 0d          	rol    $0xd,%r8d
   1d:	81 c1 29 44 23 24    	add    $0x24234429,%ecx
   23:	69 c6 77 ca eb 85    	imul   $0x85ebca77,%esi,%eax
   29:	c1 c1 0d             	rol    $0xd,%ecx
   2c:	41 69 f0 b1 79 37 9e 	imul   $0x9e3779b1,%r8d,%esi
   33:	05 50 86 c8 61       	add    $0x61c88650,%eax
   38:	69 ff 77 ca eb 85    	imul   $0x85ebca77,%edi,%edi
   3e:	c1 c6 0c             	rol    $0xc,%esi
   41:	c1 c0 0d             	rol    $0xd,%eax
   44:	69 d2 3d ae b2 c2    	imul   $0xc2b2ae3d,%edx,%edx
   4a:	81 ef 88 35 14 7a    	sub    $0x7a143588,%edi
   50:	69 c9 b1 79 37 9e    	imul   $0x9e3779b1,%ecx,%ecx
   56:	8d 54 16 14          	lea    0x14(%rsi,%rdx,1),%edx
   5a:	c1 c7 0d             	rol    $0xd,%edi
   5d:	69 c0 b1 79 37 9e    	imul   $0x9e3779b1,%eax,%eax
   63:	d1 c1                	rol    %ecx
   65:	01 d1                	add    %edx,%ecx
   67:	c1 c8 0e             	ror    $0xe,%eax
   6a:	69 d7 b1 79 37 9e    	imul   $0x9e3779b1,%edi,%edx
   70:	01 c8                	add    %ecx,%eax
   72:	c1 c2 07             	rol    $0x7,%edx
   75:	01 d0                	add    %edx,%eax
   77:	c1 c8 0f             	ror    $0xf,%eax
   7a:	69 c0 2f eb d4 27    	imul   $0x27d4eb2f,%eax,%eax
   80:	89 c2                	mov    %eax,%edx
   82:	c1 ea 0f             	shr    $0xf,%edx
   85:	31 d0                	xor    %edx,%eax
   87:	69 c0 77 ca eb 85    	imul   $0x85ebca77,%eax,%eax
   8d:	89 c2                	mov    %eax,%edx
   8f:	c1 ea 0d             	shr    $0xd,%edx
   92:	31 d0                	xor    %edx,%eax
   94:	69 c0 3d ae b2 c2    	imul   $0xc2b2ae3d,%eax,%eax
   9a:	89 c2                	mov    %eax,%edx
   9c:	c1 ea 10             	shr    $0x10,%edx
   9f:	31 d0                	xor    %edx,%eax
   a1:	c3                   	retq


#define PRIME32_1   2654435761U
#define PRIME32_2   2246822519U
#define PRIME32_3   3266489917U
#define PRIME32_4    668265263U
#define PRIME32_5    374761393U
#define XXH_rotl32(x, r) ((x << r) | (x >> (32 - r)))

unsigned tb_hash_func(unsigned long phys_pc, unsigned long pc, unsigned flags)
{
   unsigned h32, seed = 1;
   unsigned w0, w1, w2, w3, w4;
   unsigned v1, v2, v3, v4;

   w0 = phys_pc;
   w1 = phys_pc >> 31 >> 1;
   w2 = pc;
   w3 = pc >> 31 >> 1;
   w4 = flags;

   v1 = seed + PRIME32_1 + PRIME32_2;
   v2 = seed + PRIME32_2;
   v3 = seed + 0;
   v4 = seed - PRIME32_1;

   v1 += w0 * PRIME32_2;
   v1 = XXH_rotl32(v1, 13);
   v1 *= PRIME32_1;
   v2 += w1 * PRIME32_2;
   v2 = XXH_rotl32(v2, 13);
   v2 *= PRIME32_1;
   v3 += w2 * PRIME32_2;
   v3 = XXH_rotl32(v3, 13);
   v3 *= PRIME32_1;
   v4 += w3 * PRIME32_2;
   v4 = XXH_rotl32(v4, 13);
   v4 *= PRIME32_1;

   h32 = XXH_rotl32(v1, 1) + XXH_rotl32(v2, 7)
	+ XXH_rotl32(v3, 12) + XXH_rotl32(v4, 18);

   h32 += 20;

   h32 += w4 * PRIME32_3;
   h32  = XXH_rotl32(h32, 17) * PRIME32_4;

   h32 ^= h32 >> 15;
   h32 *= PRIME32_2;
   h32 ^= h32 >> 13;
   h32 *= PRIME32_3;
   h32 ^= h32 >> 16;

   return h32;
}

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 07/10] tb hash: hash phys_pc, pc, and flags with xxhash
  2016-04-05 21:08           ` Richard Henderson
@ 2016-04-06  0:52             ` Emilio G. Cota
  2016-04-06 11:52               ` Paolo Bonzini
  0 siblings, 1 reply; 62+ messages in thread
From: Emilio G. Cota @ 2016-04-06  0:52 UTC (permalink / raw)
  To: Richard Henderson
  Cc: MTTCG Devel, Peter Maydell, Peter Crosthwaite, QEMU Developers,
	Sergey Fedorov, Paolo Bonzini, Alex Bennée

On Tue, Apr 05, 2016 at 14:08:13 -0700, Richard Henderson wrote:
> But the point is that we can do better than dropping data into memory.
> Particularly on hosts that do not support unaligned accesses, which is exactly
> what the packed structure you created forces.

If we made sure the fields in the struct were in the right order
(larger fields first), this shouldn't be an issue.
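
Illustrative only: with the widest members first, the packed layout keeps every
field at a naturally aligned offset (assuming 64-bit tb_page_addr_t and
target_ulong and a 32-bit flags field):

struct tb_hash_key {
    tb_page_addr_t phys_pc;   /* widest members first... */
    target_ulong pc;
    uint32_t flags;           /* ...narrowest member last */
} QEMU_PACKED;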

Anyway I took your proposal and implemented the patch below.
FWIW I cannot measure a perf. difference between this and the packed
struct for arm-softmmu (i.e. 16 bytes) on an x86_64 host.

How does the appended look?

Thanks,

		E.


commit af92a0690f49172621cd8b80759e3ca567d43567
Author: Emilio G. Cota <cota@braap.org>
Date:   Tue Apr 5 18:06:21 2016 -0400

    rth
    
    Signed-off-by: Emilio G. Cota <cota@braap.org>

diff --git a/include/exec/tb-hash.h b/include/exec/tb-hash.h
index 6b97a7c..349a856 100644
--- a/include/exec/tb-hash.h
+++ b/include/exec/tb-hash.h
@@ -45,19 +45,124 @@ static inline unsigned int tb_jmp_cache_hash_func(target_ulong pc)
            | (tmp & TB_JMP_ADDR_MASK));
 }
 
-static inline
-uint32_t tb_hash_func(tb_page_addr_t phys_pc, target_ulong pc, int flags)
+static inline uint32_t h32_finish(uint32_t h32)
 {
-    struct {
-        tb_page_addr_t phys_pc;
-        target_ulong pc;
-        int flags;
-    } QEMU_PACKED k;
-
-    k.phys_pc = phys_pc;
-    k.pc = pc;
-    k.flags = flags;
-    return qemu_xxh32((uint32_t *)&k, sizeof(k) / sizeof(uint32_t), 1);
+    h32 ^= h32 >> 15;
+    h32 *= PRIME32_2;
+    h32 ^= h32 >> 13;
+    h32 *= PRIME32_3;
+    h32 ^= h32 >> 16;
+
+    return h32;
+}
+
+static inline uint32_t tb_hash_func3(uint32_t a, uint32_t b, uint32_t c, int seed)
+{
+    uint32_t h32 = seed + PRIME32_5;
+
+    h32 += 12;
+
+    h32 += a * PRIME32_3;
+    h32  = XXH_rotl32(h32, 17) * PRIME32_4;
+
+    h32 += b * PRIME32_3;
+    h32  = XXH_rotl32(h32, 17) * PRIME32_4;
+
+    h32 += c * PRIME32_3;
+    h32  = XXH_rotl32(h32, 17) * PRIME32_4;
+
+    return h32_finish(h32);
+}
+
+static inline uint32_t tb_hash_func4(uint64_t a0, uint32_t c, uint32_t d, int seed)
+{
+    uint32_t v1 = seed + PRIME32_1 + PRIME32_2;
+    uint32_t v2 = seed + PRIME32_2;
+    uint32_t v3 = seed + 0;
+    uint32_t v4 = seed - PRIME32_1;
+    uint32_t a = a0 >> 31 >> 1;
+    uint32_t b = a0;
+    uint32_t h32;
+
+    v1 += a * PRIME32_2;
+    v1 = XXH_rotl32(v1, 13);
+    v1 *= PRIME32_1;
+
+    v2 += b * PRIME32_2;
+    v2 = XXH_rotl32(v2, 13);
+    v2 *= PRIME32_1;
+
+    v3 += c * PRIME32_2;
+    v3 = XXH_rotl32(v3, 13);
+    v3 *= PRIME32_1;
+
+    v4 += d * PRIME32_2;
+    v4 = XXH_rotl32(v4, 13);
+    v4 *= PRIME32_1;
+
+    h32 = XXH_rotl32(v1, 1) + XXH_rotl32(v2, 7) + XXH_rotl32(v3, 12) +
+          XXH_rotl32(v4, 18);
+    h32 += 16;
+
+    return h32_finish(h32);
+}
+
+static inline uint32_t tb_hash_func5(uint64_t a0, uint64_t b0, uint32_t e, int seed)
+{
+    uint32_t v1 = seed + PRIME32_1 + PRIME32_2;
+    uint32_t v2 = seed + PRIME32_2;
+    uint32_t v3 = seed + 0;
+    uint32_t v4 = seed - PRIME32_1;
+    uint32_t a = a0 >> 31 >> 1;
+    uint32_t b = a0;
+    uint32_t c = b0 >> 31 >> 1;
+    uint32_t d = b0;
+    uint32_t h32;
+
+    v1 += a * PRIME32_2;
+    v1 = XXH_rotl32(v1, 13);
+    v1 *= PRIME32_1;
+
+    v2 += b * PRIME32_2;
+    v2 = XXH_rotl32(v2, 13);
+    v2 *= PRIME32_1;
+
+    v3 += c * PRIME32_2;
+    v3 = XXH_rotl32(v3, 13);
+    v3 *= PRIME32_1;
+
+    v4 += d * PRIME32_2;
+    v4 = XXH_rotl32(v4, 13);
+    v4 *= PRIME32_1;
+
+    h32 = XXH_rotl32(v1, 1) + XXH_rotl32(v2, 7) + XXH_rotl32(v3, 12) +
+          XXH_rotl32(v4, 18);
+    h32 += 20;
+
+    h32 += e * PRIME32_3;
+    h32  = XXH_rotl32(h32, 17) * PRIME32_4;
+
+    return h32_finish(h32);
+}
+
+static __attribute__((noinline))
+unsigned tb_hash_func(tb_page_addr_t phys_pc, target_ulong pc, int flags)
+{
+#if TARGET_LONG_BITS == 64
+
+    if (sizeof(phys_pc) == sizeof(pc)) {
+        return tb_hash_func5(phys_pc, pc, flags, 1);
+    }
+    return tb_hash_func4(pc, phys_pc, flags, 1);
+
+#else /* 32-bit target */
+
+    if (sizeof(phys_pc) > sizeof(pc)) {
+        return tb_hash_func4(phys_pc, pc, flags, 1);
+    }
+    return tb_hash_func3(pc, phys_pc, flags, 1);
+
+#endif /* TARGET_LONG_BITS */
 }
 
 #endif

^ permalink raw reply related	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 07/10] tb hash: hash phys_pc, pc, and flags with xxhash
  2016-04-05 17:19       ` Richard Henderson
@ 2016-04-06  6:06         ` Laurent Desnogues
  2016-04-06 17:32           ` Emilio G. Cota
  0 siblings, 1 reply; 62+ messages in thread
From: Laurent Desnogues @ 2016-04-06  6:06 UTC (permalink / raw)
  To: Richard Henderson
  Cc: MTTCG Devel, Peter Maydell, Peter Crosthwaite, QEMU Developers,
	Emilio G. Cota, Paolo Bonzini, Sergey Fedorov, Alex Bennée

On Tue, Apr 5, 2016 at 7:19 PM, Richard Henderson <rth@twiddle.net> wrote:
> On 04/05/2016 09:33 AM, Laurent Desnogues wrote:
>> The 'flags' field is 64-bit.  You're thinking of cflags, I guess.
>
> Well that's silly.  Since it's filled in via
>
> static inline void cpu_get_tb_cpu_state(CPUMIPSState *env, target_ulong *pc,
>                                         target_ulong *cs_base, int *flags)
>
> and passed back in to generate code with
>
> TranslationBlock *tb_gen_code(CPUState *cpu,
>                               target_ulong pc, target_ulong cs_base, int flags,
>                               int cflags);
>
> So while TranslationBlock stores "uint64_t", the producer and consumer see "int".

I agree.  I guess TranslationBlock should be fixed to use uint32_t
(note several functions have to be changed from using int to uint32_t
or aarch64-softmmu will fail).


Laurent

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 03/10] seqlock: remove optional mutex
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 03/10] seqlock: remove optional mutex Emilio G. Cota
@ 2016-04-06  8:38   ` Alex Bennée
  0 siblings, 0 replies; 62+ messages in thread
From: Alex Bennée @ 2016-04-06  8:38 UTC (permalink / raw)
  To: Emilio G. Cota
  Cc: MTTCG Devel, Peter Maydell, Peter Crosthwaite, QEMU Developers,
	Sergey Fedorov, Paolo Bonzini, Richard Henderson


Emilio G. Cota <cota@braap.org> writes:

> This option is unused; besides, it bloats the struct when not needed.
> Let's just let writers define their own locks elsewhere.
>
> Signed-off-by: Emilio G. Cota <cota@braap.org>

Reviewed-by: Alex Bennée <alex.bennee@linaro.org>

> ---
>  cpus.c                 |  2 +-
>  include/qemu/seqlock.h | 10 +---------
>  2 files changed, 2 insertions(+), 10 deletions(-)
>
> diff --git a/cpus.c b/cpus.c
> index 8ae4777..7f550b9 100644
> --- a/cpus.c
> +++ b/cpus.c
> @@ -611,7 +611,7 @@ int cpu_throttle_get_percentage(void)
>
>  void cpu_ticks_init(void)
>  {
> -    seqlock_init(&timers_state.vm_clock_seqlock, NULL);
> +    seqlock_init(&timers_state.vm_clock_seqlock);
>      vmstate_register(NULL, 0, &vmstate_timers, &timers_state);
>      throttle_timer = timer_new_ns(QEMU_CLOCK_VIRTUAL_RT,
>                                             cpu_throttle_timer_tick, NULL);
> diff --git a/include/qemu/seqlock.h b/include/qemu/seqlock.h
> index 70b01fd..e673482 100644
> --- a/include/qemu/seqlock.h
> +++ b/include/qemu/seqlock.h
> @@ -19,22 +19,17 @@
>  typedef struct QemuSeqLock QemuSeqLock;
>
>  struct QemuSeqLock {
> -    QemuMutex *mutex;
>      unsigned sequence;
>  };
>
> -static inline void seqlock_init(QemuSeqLock *sl, QemuMutex *mutex)
> +static inline void seqlock_init(QemuSeqLock *sl)
>  {
> -    sl->mutex = mutex;
>      sl->sequence = 0;
>  }
>
>  /* Lock out other writers and update the count.  */
>  static inline void seqlock_write_lock(QemuSeqLock *sl)
>  {
> -    if (sl->mutex) {
> -        qemu_mutex_lock(sl->mutex);
> -    }
>      ++sl->sequence;
>
>      /* Write sequence before updating other fields.  */
> @@ -47,9 +42,6 @@ static inline void seqlock_write_unlock(QemuSeqLock *sl)
>      smp_wmb();
>
>      ++sl->sequence;
> -    if (sl->mutex) {
> -        qemu_mutex_unlock(sl->mutex);
> -    }
>  }
>
>  static inline unsigned seqlock_read_begin(QemuSeqLock *sl)


--
Alex Bennée

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 04/10] seqlock: rename write_lock/unlock to write_begin/end
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 04/10] seqlock: rename write_lock/unlock to write_begin/end Emilio G. Cota
@ 2016-04-06  8:42   ` Alex Bennée
  0 siblings, 0 replies; 62+ messages in thread
From: Alex Bennée @ 2016-04-06  8:42 UTC (permalink / raw)
  To: Emilio G. Cota
  Cc: MTTCG Devel, Peter Maydell, Peter Crosthwaite, QEMU Developers,
	Sergey Fedorov, Paolo Bonzini, Richard Henderson


Emilio G. Cota <cota@braap.org> writes:

> It is a more appropriate name, now that the mutex embedded
> in the seqlock is gone.
>
> Signed-off-by: Emilio G. Cota <cota@braap.org>

Reviewed-by: Alex Bennée <alex.bennee@linaro.org>

> ---
>  cpus.c                 | 28 ++++++++++++++--------------
>  include/qemu/seqlock.h |  4 ++--
>  2 files changed, 16 insertions(+), 16 deletions(-)
>
> diff --git a/cpus.c b/cpus.c
> index 7f550b9..7dad2e6 100644
> --- a/cpus.c
> +++ b/cpus.c
> @@ -247,13 +247,13 @@ int64_t cpu_get_clock(void)
>  void cpu_enable_ticks(void)
>  {
>      /* Here, the really thing protected by seqlock is cpu_clock_offset. */
> -    seqlock_write_lock(&timers_state.vm_clock_seqlock);
> +    seqlock_write_begin(&timers_state.vm_clock_seqlock);
>      if (!timers_state.cpu_ticks_enabled) {
>          timers_state.cpu_ticks_offset -= cpu_get_host_ticks();
>          timers_state.cpu_clock_offset -= get_clock();
>          timers_state.cpu_ticks_enabled = 1;
>      }
> -    seqlock_write_unlock(&timers_state.vm_clock_seqlock);
> +    seqlock_write_end(&timers_state.vm_clock_seqlock);
>  }
>
>  /* disable cpu_get_ticks() : the clock is stopped. You must not call
> @@ -263,13 +263,13 @@ void cpu_enable_ticks(void)
>  void cpu_disable_ticks(void)
>  {
>      /* Here, the really thing protected by seqlock is cpu_clock_offset. */
> -    seqlock_write_lock(&timers_state.vm_clock_seqlock);
> +    seqlock_write_begin(&timers_state.vm_clock_seqlock);
>      if (timers_state.cpu_ticks_enabled) {
>          timers_state.cpu_ticks_offset += cpu_get_host_ticks();
>          timers_state.cpu_clock_offset = cpu_get_clock_locked();
>          timers_state.cpu_ticks_enabled = 0;
>      }
> -    seqlock_write_unlock(&timers_state.vm_clock_seqlock);
> +    seqlock_write_end(&timers_state.vm_clock_seqlock);
>  }
>
>  /* Correlation between real and virtual time is always going to be
> @@ -292,7 +292,7 @@ static void icount_adjust(void)
>          return;
>      }
>
> -    seqlock_write_lock(&timers_state.vm_clock_seqlock);
> +    seqlock_write_begin(&timers_state.vm_clock_seqlock);
>      cur_time = cpu_get_clock_locked();
>      cur_icount = cpu_get_icount_locked();
>
> @@ -313,7 +313,7 @@ static void icount_adjust(void)
>      last_delta = delta;
>      timers_state.qemu_icount_bias = cur_icount
>                                - (timers_state.qemu_icount << icount_time_shift);
> -    seqlock_write_unlock(&timers_state.vm_clock_seqlock);
> +    seqlock_write_end(&timers_state.vm_clock_seqlock);
>  }
>
>  static void icount_adjust_rt(void *opaque)
> @@ -345,7 +345,7 @@ static void icount_warp_rt(void)
>          return;
>      }
>
> -    seqlock_write_lock(&timers_state.vm_clock_seqlock);
> +    seqlock_write_begin(&timers_state.vm_clock_seqlock);
>      if (runstate_is_running()) {
>          int64_t clock = REPLAY_CLOCK(REPLAY_CLOCK_VIRTUAL_RT,
>                                       cpu_get_clock_locked());
> @@ -364,7 +364,7 @@ static void icount_warp_rt(void)
>          timers_state.qemu_icount_bias += warp_delta;
>      }
>      vm_clock_warp_start = -1;
> -    seqlock_write_unlock(&timers_state.vm_clock_seqlock);
> +    seqlock_write_end(&timers_state.vm_clock_seqlock);
>
>      if (qemu_clock_expired(QEMU_CLOCK_VIRTUAL)) {
>          qemu_clock_notify(QEMU_CLOCK_VIRTUAL);
> @@ -389,9 +389,9 @@ void qtest_clock_warp(int64_t dest)
>          int64_t deadline = qemu_clock_deadline_ns_all(QEMU_CLOCK_VIRTUAL);
>          int64_t warp = qemu_soonest_timeout(dest - clock, deadline);
>
> -        seqlock_write_lock(&timers_state.vm_clock_seqlock);
> +        seqlock_write_begin(&timers_state.vm_clock_seqlock);
>          timers_state.qemu_icount_bias += warp;
> -        seqlock_write_unlock(&timers_state.vm_clock_seqlock);
> +        seqlock_write_end(&timers_state.vm_clock_seqlock);
>
>          qemu_clock_run_timers(QEMU_CLOCK_VIRTUAL);
>          timerlist_run_timers(aio_context->tlg.tl[QEMU_CLOCK_VIRTUAL]);
> @@ -458,9 +458,9 @@ void qemu_start_warp_timer(void)
>               * It is useful when we want a deterministic execution time,
>               * isolated from host latencies.
>               */
> -            seqlock_write_lock(&timers_state.vm_clock_seqlock);
> +            seqlock_write_begin(&timers_state.vm_clock_seqlock);
>              timers_state.qemu_icount_bias += deadline;
> -            seqlock_write_unlock(&timers_state.vm_clock_seqlock);
> +            seqlock_write_end(&timers_state.vm_clock_seqlock);
>              qemu_clock_notify(QEMU_CLOCK_VIRTUAL);
>          } else {
>              /*
> @@ -471,11 +471,11 @@ void qemu_start_warp_timer(void)
>               * you will not be sending network packets continuously instead of
>               * every 100ms.
>               */
> -            seqlock_write_lock(&timers_state.vm_clock_seqlock);
> +            seqlock_write_begin(&timers_state.vm_clock_seqlock);
>              if (vm_clock_warp_start == -1 || vm_clock_warp_start > clock) {
>                  vm_clock_warp_start = clock;
>              }
> -            seqlock_write_unlock(&timers_state.vm_clock_seqlock);
> +            seqlock_write_end(&timers_state.vm_clock_seqlock);
>              timer_mod_anticipate(icount_warp_timer, clock + deadline);
>          }
>      } else if (deadline == 0) {
> diff --git a/include/qemu/seqlock.h b/include/qemu/seqlock.h
> index e673482..4dfc055 100644
> --- a/include/qemu/seqlock.h
> +++ b/include/qemu/seqlock.h
> @@ -28,7 +28,7 @@ static inline void seqlock_init(QemuSeqLock *sl)
>  }
>
>  /* Lock out other writers and update the count.  */
> -static inline void seqlock_write_lock(QemuSeqLock *sl)
> +static inline void seqlock_write_begin(QemuSeqLock *sl)
>  {
>      ++sl->sequence;
>
> @@ -36,7 +36,7 @@ static inline void seqlock_write_lock(QemuSeqLock *sl)
>      smp_wmb();
>  }
>
> -static inline void seqlock_write_unlock(QemuSeqLock *sl)
> +static inline void seqlock_write_end(QemuSeqLock *sl)
>  {
>      /* Write other fields before finalizing sequence.  */
>      smp_wmb();


--
Alex Bennée

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 06/10] include: add xxhash.h
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 06/10] include: add xxhash.h Emilio G. Cota
@ 2016-04-06 11:39   ` Alex Bennée
  2016-04-06 22:59     ` Emilio G. Cota
  0 siblings, 1 reply; 62+ messages in thread
From: Alex Bennée @ 2016-04-06 11:39 UTC (permalink / raw)
  To: Emilio G. Cota
  Cc: MTTCG Devel, Peter Maydell, Peter Crosthwaite, QEMU Developers,
	Sergey Fedorov, Paolo Bonzini, Richard Henderson


Emilio G. Cota <cota@braap.org> writes:

> xxhash is a fast, high-quality hashing function. The appended
> brings in the 32-bit version of it, with the small modification that
> it assumes the data to be hashed is made of 32-bit chunks; this increases
> speed slightly for the use-case we care about, i.e. tb-hash.
>
> The original algorithm, as well as a 64-bit implementation, can be found at:
>   https://github.com/Cyan4973/xxHash
>
> Signed-off-by: Emilio G. Cota <cota@braap.org>
> ---
>  include/qemu/xxhash.h | 106 ++++++++++++++++++++++++++++++++++++++++++++++++++
>  1 file changed, 106 insertions(+)
>  create mode 100644 include/qemu/xxhash.h
>
> diff --git a/include/qemu/xxhash.h b/include/qemu/xxhash.h
> new file mode 100644
> index 0000000..a13a665
> --- /dev/null
> +++ b/include/qemu/xxhash.h
> @@ -0,0 +1,106 @@
<snip>
> +
> +/* u32 hash of @n contiguous chunks of u32's */
> +static inline uint32_t qemu_xxh32(const uint32_t *p, size_t n, uint32_t seed)
> +{

What is the point of seed here? I looked on the original site to see if
there was any guidance on tuning seed but couldn't find anything. I
appreciate the compiler can inline the constant away but perhaps we
should #define it and drop the parameter if we are not intending to
modify it?

Also it might be helpful to wrap the call to avoid getting the
boilerplate sizing wrong:

  #define qemu_xxh32(s) qemu_xxh32_impl((const uint32_t *)s, sizeof(*s)/sizeof(uint32_t), 1)

Then calls become a little simpler for the user:

  return qemu_xxh32(&k);

Do we need to include a compile time check for structures that don't
neatly divide into uint32_t chunks?
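
If such a check were wanted, one minimal option (just a sketch;
QEMU_BUILD_BUG_ON is the existing compile-time assert in qemu/compiler.h)
would be to assert the size at the call site before hashing:

    QEMU_BUILD_BUG_ON(sizeof(k) % sizeof(uint32_t) != 0);
    return qemu_xxh32((uint32_t *)&k, sizeof(k) / sizeof(uint32_t), 1);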

> +    const uint32_t *end = p + n;
> +    uint32_t h32;
> +
> +    if (n >= 4) {
> +        const uint32_t * const limit = end - 4;
> +        uint32_t v1 = seed + PRIME32_1 + PRIME32_2;
> +        uint32_t v2 = seed + PRIME32_2;
> +        uint32_t v3 = seed + 0;
> +        uint32_t v4 = seed - PRIME32_1;
> +
> +        do {
> +            v1 += *p * PRIME32_2;
> +            v1 = XXH_rotl32(v1, 13);
> +            v1 *= PRIME32_1;
> +            p++;
> +            v2 += *p * PRIME32_2;
> +            v2 = XXH_rotl32(v2, 13);
> +            v2 *= PRIME32_1;
> +            p++;
> +            v3 += *p * PRIME32_2;
> +            v3 = XXH_rotl32(v3, 13);
> +            v3 *= PRIME32_1;
> +            p++;
> +            v4 += *p * PRIME32_2;
> +            v4 = XXH_rotl32(v4, 13);
> +            v4 *= PRIME32_1;
> +            p++;
> +        } while (p <= limit);
> +        h32 = XXH_rotl32(v1, 1) + XXH_rotl32(v2, 7) + XXH_rotl32(v3, 12) +
> +            XXH_rotl32(v4, 18);
> +    } else {
> +        h32  = seed + PRIME32_5;
> +    }

I don't claim any particular knowledge of hashing code, but I note that the
test cases we add only exercise the n == 1 path, while in actual usage we
exercise n = 5 (at least for arm32; I guess aarch64 would be more).

> +
> +    h32 += n * sizeof(uint32_t);
> +
> +    while (p < end) {
> +        h32 += *p * PRIME32_3;
> +        h32  = XXH_rotl32(h32, 17) * PRIME32_4 ;
> +        p++;
> +    }
> +
> +    h32 ^= h32 >> 15;
> +    h32 *= PRIME32_2;
> +    h32 ^= h32 >> 13;
> +    h32 *= PRIME32_3;
> +    h32 ^= h32 >> 16;
> +
> +    return h32;
> +}
> +
> +#endif /* QEMU_XXHASH_H */


--
Alex Bennée

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 02/10] compiler.h: add QEMU_CACHELINE + QEMU_ALIGN() + QEMU_CACHELINE_ALIGNED
  2016-04-05 20:09                 ` Lluís Vilanova
@ 2016-04-06 11:44                   ` Paolo Bonzini
  2016-04-06 12:02                     ` Laurent Desnogues
  0 siblings, 1 reply; 62+ messages in thread
From: Paolo Bonzini @ 2016-04-06 11:44 UTC (permalink / raw)
  To: Richard Henderson, Peter Maydell, MTTCG Devel, Peter Crosthwaite,
	QEMU Developers, Emilio G. Cota, Sergey Fedorov,
	Alex Bennée



On 05/04/2016 22:09, Lluís Vilanova wrote:
> Ah. That's just an example. For cross-compilation you would use a different
> march argument (or none to use the default target sub-arch) and get the
> parameter for the target processor. This should already be known by configure as
> part of the arguments to select the cross-compiler and target architecture
> (e.g., CC).

I would just use 64, with special cases for PPC and s390... even for
AArch64 128 seems a little wasteful.
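
Concretely, something along these lines would do (a sketch only; the macro name
is made up here, and 128 for POWER / 256 for s390 are the commonly quoted line
sizes rather than values taken from this series):

#if defined(__s390x__)
#define QEMU_CACHELINE_SIZE 256
#elif defined(__powerpc__) || defined(__powerpc64__)
#define QEMU_CACHELINE_SIZE 128
#else
#define QEMU_CACHELINE_SIZE 64
#endif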

Paolo

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 07/10] tb hash: hash phys_pc, pc, and flags with xxhash
  2016-04-06  0:52             ` Emilio G. Cota
@ 2016-04-06 11:52               ` Paolo Bonzini
  2016-04-06 17:44                 ` Emilio G. Cota
  0 siblings, 1 reply; 62+ messages in thread
From: Paolo Bonzini @ 2016-04-06 11:52 UTC (permalink / raw)
  To: Emilio G. Cota, Richard Henderson
  Cc: MTTCG Devel, Peter Maydell, Peter Crosthwaite, QEMU Developers,
	Sergey Fedorov, Alex Bennée



On 06/04/2016 02:52, Emilio G. Cota wrote:
> +static inline uint32_t tb_hash_func5(uint64_t a0, uint64_t b0, uint32_t e, int seed)

I would keep just this version and unconditionally zero-extend to
64-bits.  The compiler is able to detect the high 32 bits are zero, drop
the more expensive multiplications and constant fold everything.

For example if you write

unsigned tb_hash_func(uint32_t phys_pc, uint32_t pc, int flags)
{
    return tb_hash_func5(phys_pc, pc, flags, 1);
}

and check the optimized code with -fdump-tree-optimized you'll see that
the rotated v1, the rotated v3 and the 20 merge into a single constant
1733907856.

Thanks,

Paolo

> +{
> +    uint32_t v1 = seed + PRIME32_1 + PRIME32_2;
> +    uint32_t v2 = seed + PRIME32_2;
> +    uint32_t v3 = seed + 0;
> +    uint32_t v4 = seed - PRIME32_1;
> +    uint32_t a = a0 >> 31 >> 1;
> +    uint32_t b = a0;
> +    uint32_t c = b0 >> 31 >> 1;
> +    uint32_t d = b0;
> +    uint32_t h32;
> +
> +    v1 += a * PRIME32_2;
> +    v1 = XXH_rotl32(v1, 13);
> +    v1 *= PRIME32_1;
> +
> +    v2 += b * PRIME32_2;
> +    v2 = XXH_rotl32(v2, 13);
> +    v2 *= PRIME32_1;
> +
> +    v3 += c * PRIME32_2;
> +    v3 = XXH_rotl32(v3, 13);
> +    v3 *= PRIME32_1;
> +
> +    v4 += d * PRIME32_2;
> +    v4 = XXH_rotl32(v4, 13);
> +    v4 *= PRIME32_1;
> +
> +    h32 = XXH_rotl32(v1, 1) + XXH_rotl32(v2, 7) + XXH_rotl32(v3, 12) +
> +          XXH_rotl32(v4, 18);
> +    h32 += 20;
> +
> +    h32 += e * PRIME32_3;
> +    h32  = XXH_rotl32(h32, 17) * PRIME32_4;
> +
> +    return h32_finish(h32);
> +}
> +
> +static __attribute__((noinline))
> +unsigned tb_hash_func(tb_page_addr_t phys_pc, target_ulong pc, int flags)
> +{
> +#if TARGET_LONG_BITS == 64
> +
> +    if (sizeof(phys_pc) == sizeof(pc)) {
> +        return tb_hash_func5(phys_pc, pc, flags, 1);
> +    }

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 02/10] compiler.h: add QEMU_CACHELINE + QEMU_ALIGN() + QEMU_CACHELINE_ALIGNED
  2016-04-06 11:44                   ` Paolo Bonzini
@ 2016-04-06 12:02                     ` Laurent Desnogues
  0 siblings, 0 replies; 62+ messages in thread
From: Laurent Desnogues @ 2016-04-06 12:02 UTC (permalink / raw)
  To: Paolo Bonzini
  Cc: MTTCG Devel, Peter Maydell, Peter Crosthwaite, QEMU Developers,
	Emilio G. Cota, Sergey Fedorov, Alex Bennée,
	Richard Henderson

On Wed, Apr 6, 2016 at 1:44 PM, Paolo Bonzini <pbonzini@redhat.com> wrote:
>
>
> On 05/04/2016 22:09, Lluís Vilanova wrote:
>> Ah. That's just an example. For cross-compilation you would use a different
>> march argument (or none to use the default target sub-arch) and get the
>> parameter for the target processor. This should already be known by configure as
>> part of the arguments to select the cross-compiler and target architecture
>> (e.g., CC).
>
> I would just use 64, with special cases for PPC and s390... even for
> AArch64 128 seems a little wasteful.

As far as I know, the only AArch64 CPU with a 128-byte cache line is the
Cavium ThunderX (the same seems to hold for Cavium's MIPS-based CPUs).


Laurent

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 05/10] include: add spinlock wrapper
  2016-04-05  8:51   ` Paolo Bonzini
@ 2016-04-06 15:51     ` Alex Bennée
  0 siblings, 0 replies; 62+ messages in thread
From: Alex Bennée @ 2016-04-06 15:51 UTC (permalink / raw)
  To: Paolo Bonzini
  Cc: MTTCG Devel, Peter Maydell, Peter Crosthwaite, QEMU Developers,
	Emilio G. Cota, Sergey Fedorov, Richard Henderson


Paolo Bonzini <pbonzini@redhat.com> writes:

> On 05/04/2016 07:30, Emilio G. Cota wrote:
>> Wrap pthread_spin on POSIX, or QemuMutex on Windows.
>>
>> AFAIK there is no off-the-shelf spinlock implementation for
>> Windows, so we'll just use QemuMutex.
>
> It's much simpler to use a simple test-and-set spinlock.
>
> GitHub is down, but this should be the link
> http://github.com/bonzini/qemu/commit/e1361634 (it's in my mttcg
> branch).

In addition this version breaks on the OSX/clang build:

https://travis-ci.org/stsquad/qemu/jobs/121002991

>
> Paolo
>
>> Signed-off-by: Emilio G. Cota <cota@braap.org>
>> ---
>>  include/qemu/spinlock-posix.h | 60 +++++++++++++++++++++++++++++++++++++++++++
>>  include/qemu/spinlock-win32.h | 33 ++++++++++++++++++++++++
>>  include/qemu/spinlock.h       | 10 ++++++++
>>  3 files changed, 103 insertions(+)
>>  create mode 100644 include/qemu/spinlock-posix.h
>>  create mode 100644 include/qemu/spinlock-win32.h
>>  create mode 100644 include/qemu/spinlock.h
>>
>> diff --git a/include/qemu/spinlock-posix.h b/include/qemu/spinlock-posix.h
>> new file mode 100644
>> index 0000000..51c2c08
>> --- /dev/null
>> +++ b/include/qemu/spinlock-posix.h
>> @@ -0,0 +1,60 @@
>> +#ifndef QEMU_SPINLOCK_POSIX_H
>> +#define QEMU_SPINLOCK_POSIX_H
>> +
>> +#include <qemu/thread.h>
>> +#include <qemu/osdep.h>
>> +
>> +typedef pthread_spinlock_t QemuSpinLock;
>> +
>> +static inline void qemu_spinlock_error_exit(int err, const char *msg)
>> +{
>> +    fprintf(stderr, "qemu: %s: %s\n", msg, strerror(err));
>> +    abort();
>> +}
>> +
>> +static inline void qemu_spinlock_init(QemuSpinLock *lock)
>> +{
>> +    int rc;
>> +
>> +    rc = pthread_spin_init(lock, PTHREAD_PROCESS_SHARED);
>> +    if (unlikely(rc)) {
>> +        qemu_spinlock_error_exit(rc, __func__);
>> +    }
>> +}
>> +
>> +static inline void qemu_spinlock_destroy(QemuSpinLock *lock)
>> +{
>> +    int rc;
>> +
>> +    rc = pthread_spin_destroy(lock);
>> +    if (unlikely(rc)) {
>> +        qemu_spinlock_error_exit(rc, __func__);
>> +    }
>> +}
>> +
>> +static inline void qemu_spinlock_lock(QemuSpinLock *lock)
>> +{
>> +    int rc;
>> +
>> +    rc = pthread_spin_lock(lock);
>> +    if (unlikely(rc)) {
>> +        qemu_spinlock_error_exit(rc, __func__);
>> +    }
>> +}
>> +
>> +static inline int qemu_spinlock_trylock(QemuSpinLock *lock)
>> +{
>> +    return pthread_spin_trylock(lock);
>> +}
>> +
>> +static inline void qemu_spinlock_unlock(QemuSpinLock *lock)
>> +{
>> +    int rc;
>> +
>> +    rc = pthread_spin_unlock(lock);
>> +    if (unlikely(rc)) {
>> +        qemu_spinlock_error_exit(rc, __func__);
>> +    }
>> +}
>> +
>> +#endif /* QEMU_SPINLOCK_POSIX_H */
>> diff --git a/include/qemu/spinlock-win32.h b/include/qemu/spinlock-win32.h
>> new file mode 100644
>> index 0000000..5a105fb
>> --- /dev/null
>> +++ b/include/qemu/spinlock-win32.h
>> @@ -0,0 +1,33 @@
>> +#ifndef QEMU_SPINLOCK_WIN32_H
>> +#define QEMU_SPINLOCK_WIN32_H
>> +
>> +#include <qemu/thread.h>
>> +
>> +typedef QemuMutex QemuSpinLock;
>> +
>> +static inline void qemu_spinlock_init(QemuSpinLock *lock)
>> +{
>> +    qemu_mutex_init(lock);
>> +}
>> +
>> +static inline void qemu_spinlock_destroy(QemuSpinLock *lock)
>> +{
>> +    qemu_mutex_destroy(lock);
>> +}
>> +
>> +static inline void qemu_spinlock_lock(QemuSpinLock *lock)
>> +{
>> +    qemu_mutex_lock(lock);
>> +}
>> +
>> +static inline int qemu_spinlock_trylock(QemuSpinLock *lock)
>> +{
>> +    return qemu_mutex_trylock(lock);
>> +}
>> +
>> +static inline void qemu_spinlock_unlock(QemuSpinLock *lock)
>> +{
>> +    qemu_mutex_unlock(lock);
>> +}
>> +
>> +#endif /* QEMU_SPINLOCK_WIN32_H */
>> diff --git a/include/qemu/spinlock.h b/include/qemu/spinlock.h
>> new file mode 100644
>> index 0000000..001b55e
>> --- /dev/null
>> +++ b/include/qemu/spinlock.h
>> @@ -0,0 +1,10 @@
>> +#ifndef QEMU_SPINLOCK_H
>> +#define QEMU_SPINLOCK_H
>> +
>> +#ifdef _WIN32
>> +#include "spinlock-win32.h"
>> +#else
>> +#include "spinlock-posix.h"
>> +#endif
>> +
>> +#endif /* QEMU_SPINLOCK_H */
>>


--
Alex Bennée

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 07/10] tb hash: hash phys_pc, pc, and flags with xxhash
  2016-04-06  6:06         ` Laurent Desnogues
@ 2016-04-06 17:32           ` Emilio G. Cota
  2016-04-06 17:42             ` Richard Henderson
  0 siblings, 1 reply; 62+ messages in thread
From: Emilio G. Cota @ 2016-04-06 17:32 UTC (permalink / raw)
  To: Laurent Desnogues
  Cc: MTTCG Devel, Peter Maydell, Peter Crosthwaite, QEMU Developers,
	Paolo Bonzini, Sergey Fedorov, Alex Bennée,
	Richard Henderson

On Wed, Apr 06, 2016 at 08:06:57 +0200, Laurent Desnogues wrote:
> On Tue, Apr 5, 2016 at 7:19 PM, Richard Henderson <rth@twiddle.net> wrote:
> > On 04/05/2016 09:33 AM, Laurent Desnogues wrote:
> >> The 'flags' field is 64-bit.  You're thinking of cflags, I guess.
> >
> > Well that's silly.  Since it's filled in via
> >
> > static inline void cpu_get_tb_cpu_state(CPUMIPSState *env, target_ulong *pc,
> >                                         target_ulong *cs_base, int *flags)
> >
> > and passed back in to generate code with
> >
> > TranslationBlock *tb_gen_code(CPUState *cpu,
> >                               target_ulong pc, target_ulong cs_base, int flags,
> >                               int cflags);
> >
> > So while TranslationBlock stores "uint64_t", the producer and consumer see "int".
> 
> I agree.  I guess TranslationBlock should be fixed to use uint32_t
> (note several functions have to be changed from using int to uint32_t
> or aarch64-softmmu will fail).

Can you please elaborate on this?

FWIW aarch64-softmmu boots OK for me with the patch below. I'm booting
it as per the instructions in
  http://www.bennee.com/~alex/blog/2014/05/09/running-linux-in-qemus-aarch64-system-emulation-mode/

Thanks,

		Emilio

commit e70474788fa37a85df21e1c63101a879103758f5
Author: Emilio G. Cota <cota@braap.org>
Date:   Tue Apr 5 13:55:16 2016 -0400

    tb: consistently use 'int' type for tb->flags
    
    Reported-by: Richard Henderson <rth@twiddle.net>
    Signed-off-by: Emilio G. Cota <cota@braap.org>

diff --git a/cpu-exec.c b/cpu-exec.c
index bbfcbfb..5abbf57 100644
--- a/cpu-exec.c
+++ b/cpu-exec.c
@@ -220,7 +220,7 @@ static void cpu_exec_nocache(CPUState *cpu, int max_cycles,
 static TranslationBlock *tb_find_physical(CPUState *cpu,
                                           target_ulong pc,
                                           target_ulong cs_base,
-                                          uint64_t flags)
+                                          int flags)
 {
     CPUArchState *env = (CPUArchState *)cpu->env_ptr;
     TranslationBlock *tb, **ptb1;
@@ -271,7 +271,7 @@ static TranslationBlock *tb_find_physical(CPUState *cpu,
 static TranslationBlock *tb_find_slow(CPUState *cpu,
                                       target_ulong pc,
                                       target_ulong cs_base,
-                                      uint64_t flags)
+                                      int flags)
 {
     TranslationBlock *tb;
 
diff --git a/include/exec/exec-all.h b/include/exec/exec-all.h
index 7362095..277e6f1 100644
--- a/include/exec/exec-all.h
+++ b/include/exec/exec-all.h
@@ -235,7 +235,7 @@ static inline void tlb_flush_by_mmuidx(CPUState *cpu, ...)
 struct TranslationBlock {
     target_ulong pc;   /* simulated PC corresponding to this block (EIP + CS base) */
     target_ulong cs_base; /* CS base for this block */
-    uint64_t flags; /* flags defining in which context the code was generated */
+    int flags; /* flags defining in which context the code was generated */
     uint16_t size;      /* size of target code for this block (1 <=
                            size <= TARGET_PAGE_SIZE) */
     uint16_t icount;
diff --git a/target-i386/translate.c b/target-i386/translate.c
index 1a1214d..4024ad4 100644
--- a/target-i386/translate.c
+++ b/target-i386/translate.c
@@ -8178,7 +8178,7 @@ void gen_intermediate_code(CPUX86State *env, TranslationBlock *tb)
     CPUState *cs = CPU(cpu);
     DisasContext dc1, *dc = &dc1;
     target_ulong pc_ptr;
-    uint64_t flags;
+    int flags;
     target_ulong pc_start;
     target_ulong cs_base;
     int num_insns;
diff --git a/translate-all.c b/translate-all.c
index 8329ea6..27b4d57 100644
--- a/translate-all.c
+++ b/translate-all.c
@@ -1593,7 +1593,7 @@ void cpu_io_recompile(CPUState *cpu, uintptr_t retaddr)
     TranslationBlock *tb;
     uint32_t n, cflags;
     target_ulong pc, cs_base;
-    uint64_t flags;
+    int flags;
 
     tb = tb_find_pc(retaddr);
     if (!tb) {

^ permalink raw reply related	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 07/10] tb hash: hash phys_pc, pc, and flags with xxhash
  2016-04-06 17:32           ` Emilio G. Cota
@ 2016-04-06 17:42             ` Richard Henderson
  2016-04-07  8:12               ` Laurent Desnogues
  0 siblings, 1 reply; 62+ messages in thread
From: Richard Henderson @ 2016-04-06 17:42 UTC (permalink / raw)
  To: Emilio G. Cota, Laurent Desnogues
  Cc: MTTCG Devel, Peter Maydell, Peter Crosthwaite, QEMU Developers,
	Paolo Bonzini, Sergey Fedorov, Alex Bennée

On 04/06/2016 10:32 AM, Emilio G. Cota wrote:
> On Wed, Apr 06, 2016 at 08:06:57 +0200, Laurent Desnogues wrote:
>> On Tue, Apr 5, 2016 at 7:19 PM, Richard Henderson <rth@twiddle.net> wrote:
>>> On 04/05/2016 09:33 AM, Laurent Desnogues wrote:
>>>> The 'flags' field is 64-bit.  You're thinking of cflags, I guess.
>>>
>>> Well that's silly.  Since it's filled in via
>>>
>>> static inline void cpu_get_tb_cpu_state(CPUMIPSState *env, target_ulong *pc,
>>>                                         target_ulong *cs_base, int *flags)
>>>
>>> and passed back in to generate code with
>>>
>>> TranslationBlock *tb_gen_code(CPUState *cpu,
>>>                               target_ulong pc, target_ulong cs_base, int flags,
>>>                               int cflags);
>>>
>>> So while TranslationBlock stores "uint64_t", the producer and consumer see "int".
>>
>> I agree.  I guess TranslationBlock should be fixed to use uint32_t
>> (note several functions have to be changed from using int to uint32_t
>> or aarch64-softmmu will fail).
> 
> Can you please elaborate on this?

The arm port is using some high bits, including

#define ARM_TBFLAG_AARCH64_STATE_SHIFT 31
#define ARM_TBFLAG_AARCH64_STATE_MASK  (1U << ARM_TBFLAG_AARCH64_STATE_SHIFT)

So, I would certainly be ok switching everything to use uint32_t over int.


r~

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 07/10] tb hash: hash phys_pc, pc, and flags with xxhash
  2016-04-06 11:52               ` Paolo Bonzini
@ 2016-04-06 17:44                 ` Emilio G. Cota
  2016-04-06 18:23                   ` Paolo Bonzini
  0 siblings, 1 reply; 62+ messages in thread
From: Emilio G. Cota @ 2016-04-06 17:44 UTC (permalink / raw)
  To: Paolo Bonzini
  Cc: MTTCG Devel, Peter Maydell, Peter Crosthwaite, QEMU Developers,
	Sergey Fedorov, Alex Bennée, Richard Henderson

On Wed, Apr 06, 2016 at 13:52:21 +0200, Paolo Bonzini wrote:
> 
> 
> On 06/04/2016 02:52, Emilio G. Cota wrote:
> > +static inline uint32_t tb_hash_func5(uint64_t a0, uint64_t b0, uint32_t e, int seed)
> 
> I would keep just this version and unconditionally zero-extend to
> 64-bits.  The compiler is able to detect the high 32 bits are zero, drop
> the more expensive multiplications and constant fold everything.
> 
> For example if you write
> 
> unsigned tb_hash_func(uint32_t phys_pc, uint32_t pc, int flags)
> {
>     return tb_hash_func5(phys_pc, pc, flags, 1);
> }
> 
> and check the optimized code with -fdump-tree-optimized you'll see that
> the rotated v1, the rotated v3 and the 20 merge into a single constant
> 1733907856.

I like this idea, because the ugliness of the sizeof checks is significant.
However, the quality of the resulting hash is not as good when always using func5.
For instance, when we'd otherwise use func3, two fifths of every input contain
exactly the same bits: all 0's. This inevitably leads to more collisions.

Performance (for the debian arm bootup test) gets up to 15% worse.

		Emilio

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 07/10] tb hash: hash phys_pc, pc, and flags with xxhash
  2016-04-06 17:44                 ` Emilio G. Cota
@ 2016-04-06 18:23                   ` Paolo Bonzini
  2016-04-06 18:27                     ` Richard Henderson
  2016-04-07  0:37                     ` Emilio G. Cota
  0 siblings, 2 replies; 62+ messages in thread
From: Paolo Bonzini @ 2016-04-06 18:23 UTC (permalink / raw)
  To: Emilio G. Cota
  Cc: MTTCG Devel, Peter Maydell, Peter Crosthwaite, QEMU Developers,
	Sergey Fedorov, Alex Bennée, Richard Henderson



On 06/04/2016 19:44, Emilio G. Cota wrote:
> I like this idea, because the ugliness of the sizeof checks is significant.
> However, the quality of the resulting hash is not as good when always using func5.
> For instance, when we'd otherwise use func3, two fifths of every input contain
> exactly the same bits: all 0's. This inevitably leads to more collisions.

It doesn't necessarily lead to more collisions.  For a completely stupid
hash function "a+b+c+d+e", for example, adding zeros doesn't add more
collisions.

What if you rearrange the five words so that the "all 0" parts of the
input are all at the beginning, or all at the end?  Perhaps the problem
is that the odd words at the end are hashed less effectively.

Perhaps better is to always use a three-word xxhash, but pick the 64-bit
version if any of phys_pc and pc are 64-bits.  The unrolling would be
very effective, and the performance penalty not too important (64-bit on
32-bit is very slow anyway).

If you can fix the problems with the collisions, it also means that you
get good performance running 32-bit guests on qemu-system-x86_64 or
qemu-system-aarch64.  So, if you can do it, it's a very desirable property.

Paolo

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 07/10] tb hash: hash phys_pc, pc, and flags with xxhash
  2016-04-06 18:23                   ` Paolo Bonzini
@ 2016-04-06 18:27                     ` Richard Henderson
  2016-04-07  0:37                     ` Emilio G. Cota
  1 sibling, 0 replies; 62+ messages in thread
From: Richard Henderson @ 2016-04-06 18:27 UTC (permalink / raw)
  To: Paolo Bonzini, Emilio G. Cota
  Cc: MTTCG Devel, Peter Maydell, Peter Crosthwaite, QEMU Developers,
	Sergey Fedorov, Alex Bennée

On 04/06/2016 11:23 AM, Paolo Bonzini wrote:
> Perhaps better is to always use a three-word xxhash, but pick the 64-bit
> version if any of phys_pc and pc are 64-bits.  The unrolling would be
> very effective, and the performance penalty not too important (64-bit on
> 32-bit is very slow anyway).

True.

It would also be interesting to put an average bucket chain length into the
"info jit" dump, so that it's easy to see how the hashing is performing for a
given guest.
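
Something along these lines, maybe (rough, untested sketch against the qht
structures in this series; the function name is made up):

  /* average number of buckets per chain, i.e. how long lookups can get */
  static double qht_avg_chain_length(struct qht *ht)
  {
      struct qht_map *map = ht->map;
      uint64_t buckets = 0;
      uint64_t i;

      for (i = 0; i < map->n; i++) {
          struct qht_bucket *b = &map->buckets[i];

          do {
              buckets++;
              b = b->next;
          } while (b);
      }
      return (double)buckets / map->n;
  }

"info jit" could then print the result next to the existing TB counts.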


r~

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 06/10] include: add xxhash.h
  2016-04-06 11:39   ` Alex Bennée
@ 2016-04-06 22:59     ` Emilio G. Cota
  0 siblings, 0 replies; 62+ messages in thread
From: Emilio G. Cota @ 2016-04-06 22:59 UTC (permalink / raw)
  To: Alex Bennée
  Cc: MTTCG Devel, Peter Maydell, Peter Crosthwaite, QEMU Developers,
	Sergey Fedorov, Paolo Bonzini, Richard Henderson

On Wed, Apr 06, 2016 at 12:39:42 +0100, Alex Bennée wrote:
> 
> Emilio G. Cota <cota@braap.org> writes:
> 
> > xxhash is a fast, high-quality hashing function. The appended
> > brings in the 32-bit version of it, with the small modification that
> > it assumes the data to be hashed is made of 32-bit chunks; this increases
> > speed slightly for the use-case we care about, i.e. tb-hash.
> >
> > The original algorithm, as well as a 64-bit implementation, can be found at:
> >   https://github.com/Cyan4973/xxHash
> >
> > Signed-off-by: Emilio G. Cota <cota@braap.org>
> > ---
> >  include/qemu/xxhash.h | 106 ++++++++++++++++++++++++++++++++++++++++++++++++++
> >  1 file changed, 106 insertions(+)
> >  create mode 100644 include/qemu/xxhash.h
> >
> > diff --git a/include/qemu/xxhash.h b/include/qemu/xxhash.h
> > new file mode 100644
> > index 0000000..a13a665
> > --- /dev/null
> > +++ b/include/qemu/xxhash.h
> > @@ -0,0 +1,106 @@
> <snip>
> > +
> > +/* u32 hash of @n contiguous chunks of u32's */
> > +static inline uint32_t qemu_xxh32(const uint32_t *p, size_t n, uint32_t seed)
> > +{
> 
> What is the point of seed here? I looked on the original site to see if
> there was any guidance on tuning seed but couldn't find anything. I
> appreciate the compiler can inline the constant away but perhaps we
> should #define it and drop the parameter if we are not intending to
> modify it?

The seed value would only matter if we needed consistent hashes (we don't).

Any seed value should give similar performance, given xxhash's quality.

I'll set a define for this seed if it is used in more than one place.

> Also it might be helpful to wrap the call to avoid getting the
> boilerplate sizing wrong:
> 
>   #define qemu_xxh32(s) qemu_xxh32_impl((const uint32_t *)s, sizeof(*s)/sizeof(uint32_t), 1)
> 
> Then calls become a little simpler for the user:
> 
>   return qemu_xxh32(&k);
> 
> Do we need to include a compile time check for structures that don't
> neatly divide into uint32_t chunks?

I'll give the original byte-sized function a shot and come back to this
if necessary.

Thanks,

		E.

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 07/10] tb hash: hash phys_pc, pc, and flags with xxhash
  2016-04-06 18:23                   ` Paolo Bonzini
  2016-04-06 18:27                     ` Richard Henderson
@ 2016-04-07  0:37                     ` Emilio G. Cota
  2016-04-07  8:46                       ` Paolo Bonzini
  1 sibling, 1 reply; 62+ messages in thread
From: Emilio G. Cota @ 2016-04-07  0:37 UTC (permalink / raw)
  To: Paolo Bonzini
  Cc: MTTCG Devel, Peter Maydell, Peter Crosthwaite, QEMU Developers,
	Sergey Fedorov, Alex Bennée, Richard Henderson

On Wed, Apr 06, 2016 at 20:23:42 +0200, Paolo Bonzini wrote:
> On 06/04/2016 19:44, Emilio G. Cota wrote:
> > I like this idea, because the ugliness of the sizeof checks is significant.
> > However, the quality of the resulting hash is not as good when always using func5.
> > For instance, when we'd otherwise use func3, two fifths of every input contain
> > exactly the same bits: all 0's. This inevitably leads to more collisions.

I take this back. I don't know anymore what I measured earlier today--it's
been a long day and I was juggling quite a few things.

I essentially see the same chain lengths (within 0.2%) for either function, i.e.
func3 or func5 with the padded 0's when running arm-softmmu. So this
is good news :>

> Perhaps better is to always use a three-word xxhash, but pick the 64-bit
> version if any of phys_pc and pc are 64-bits.  The unrolling would be
> very effective, and the performance penalty not too important (64-bit on
> 32-bit is very slow anyway).

By "the 64-bit version" you mean what I called func5? That is:

if (sizeof(phys_pc) == sizeof(uint64_t) || sizeof(pc) == sizeof(uint64_t))
	return tb_hash_func5();
return tb_hash_func3();

or do you mean xxhash64 (which I did not include in my patchset)?
My tests with xxhash64 suggest that the quality of the results do not
improve over xxhash32, and the computation takes longer (it's more
instructions); not much, but measurable.

So we should probably just go with func5 always, as you suggested
initially. If so, I'm ready to send a v2.
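
Concretely, something like this (sketch only; leaving aside the int vs
uint32_t flags discussion from the other subthread):

  static inline
  uint32_t tb_hash_func(tb_page_addr_t phys_pc, target_ulong pc, uint32_t flags)
  {
      /* 32-bit phys_pc/pc are simply zero-extended to func5's u64 arguments */
      return tb_hash_func5(phys_pc, pc, flags, 1);
  }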

Thanks,

		Emilio

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 07/10] tb hash: hash phys_pc, pc, and flags with xxhash
  2016-04-06 17:42             ` Richard Henderson
@ 2016-04-07  8:12               ` Laurent Desnogues
  0 siblings, 0 replies; 62+ messages in thread
From: Laurent Desnogues @ 2016-04-07  8:12 UTC (permalink / raw)
  To: Richard Henderson
  Cc: MTTCG Devel, Peter Maydell, Peter Crosthwaite, QEMU Developers,
	Emilio G. Cota, Paolo Bonzini, Sergey Fedorov, Alex Bennée

On Wed, Apr 6, 2016 at 7:42 PM, Richard Henderson <rth@twiddle.net> wrote:
> On 04/06/2016 10:32 AM, Emilio G. Cota wrote:
>> On Wed, Apr 06, 2016 at 08:06:57 +0200, Laurent Desnogues wrote:
>>> On Tue, Apr 5, 2016 at 7:19 PM, Richard Henderson <rth@twiddle.net> wrote:
>>>> On 04/05/2016 09:33 AM, Laurent Desnogues wrote:
>>>>> The 'flags' field is 64-bit.  You're thinking of cflags, I guess.
>>>>
>>>> Well that's silly.  Since it's filled in via
>>>>
>>>> static inline void cpu_get_tb_cpu_state(CPUMIPSState *env, target_ulong *pc,
>>>>                                         target_ulong *cs_base, int *flags)
>>>>
>>>> and passed back in to generate code with
>>>>
>>>> TranslationBlock *tb_gen_code(CPUState *cpu,
>>>>                               target_ulong pc, target_ulong cs_base, int flags,
>>>>                               int cflags);
>>>>
>>>> So while TranslationBlock stores "uint64_t", the producer and consumer see "int".
>>>
>>> I agree.  I guess TranslationBlock should be fixed to use uint32_t
>>> (note several functions have to be changed from using int to uint32_t
>>> or aarch64-softmmu will fail).
>>
>> Can you please elaborate on this?
>
> The arm port is using some high bits, including
>
> #define ARM_TBFLAG_AARCH64_STATE_SHIFT 31
> #define ARM_TBFLAG_AARCH64_STATE_MASK  (1U << ARM_TBFLAG_AARCH64_STATE_SHIFT)

Yes, that's why I advocate the use of uint32_t over int.

Emilio, I can't reproduce the failure I had, but fixing the warnings
gcc issues when using uint32_t instead of int for cpu_get_tb_cpu_state
should be enough to make the change safe.  Note this doesn't really
belong to this patch set :-)


Laurent

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 07/10] tb hash: hash phys_pc, pc, and flags with xxhash
  2016-04-07  0:37                     ` Emilio G. Cota
@ 2016-04-07  8:46                       ` Paolo Bonzini
  0 siblings, 0 replies; 62+ messages in thread
From: Paolo Bonzini @ 2016-04-07  8:46 UTC (permalink / raw)
  To: Emilio G. Cota
  Cc: MTTCG Devel, Peter Maydell, Peter Crosthwaite, QEMU Developers,
	Sergey Fedorov, Alex Bennée, Richard Henderson



On 07/04/2016 02:37, Emilio G. Cota wrote:
> I take this back. I don't know anymore what I measured earlier today--it's
> been a long day and was juggling quite a few things.
> 
> I essentially see the same chain lengths (within 0.2%) for either function, i.e.
> func3 or func5 with the padded 0's when running arm-softmmu. So this
> is good news :>

It's also much more reasonable for a good hash function. :)

>> Perhaps better is to always use a three-word xxhash, but pick the 64-bit
>> version if any of phys_pc and pc are 64-bits.  The unrolling would be
>> very effective, and the performance penalty not too important (64-bit on
>> 32-bit is very slow anyway).
> 
> By "the 64-bit version" you mean what I called func5? That is:
> 
> if (sizeof(phys_pc) == sizeof(uint64_t) || sizeof(pc) == sizeof(uint64_t))
> 	return tb_hash_func5();
> return tb_hash_func3();
> 
> or do you mean xxhash64 (which I did not include in my patchset)?

I meant xxhash64, but using func5 unconditionally is by far my preferred
choice.

Paolo

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 08/10] qht: QEMU's fast, resizable and scalable Hash Table
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 08/10] qht: QEMU's fast, resizable and scalable Hash Table Emilio G. Cota
  2016-04-05  9:01   ` Paolo Bonzini
  2016-04-05 15:50   ` Richard Henderson
@ 2016-04-08 10:27   ` Alex Bennée
  2016-04-19 23:03     ` Emilio G. Cota
  2 siblings, 1 reply; 62+ messages in thread
From: Alex Bennée @ 2016-04-08 10:27 UTC (permalink / raw)
  To: Emilio G. Cota
  Cc: QEMU Developers, MTTCG Devel, Paolo Bonzini, Peter Crosthwaite,
	Richard Henderson, Peter Maydell, Sergey Fedorov


Emilio G. Cota <cota@braap.org> writes:

> This is a hash table with optional auto-resizing and MRU promotion for
> reads and writes. Its implementation goal is to stay fast while
> scaling for read-mostly workloads.
>
> A hash table with these features will be necessary for the scalability
> of the ongoing MTTCG work; before those changes arrive we can already
> benefit from the single-threaded speedup that qht also provides.
>
> Signed-off-by: Emilio G. Cota <cota@braap.org>
> ---
>  include/qemu/qht.h | 150 ++++++++++++++++++
>  util/Makefile.objs |   2 +-
>  util/qht.c         | 458 +++++++++++++++++++++++++++++++++++++++++++++++++++++
>  3 files changed, 609 insertions(+), 1 deletion(-)
>  create mode 100644 include/qemu/qht.h
>  create mode 100644 util/qht.c
>
> diff --git a/include/qemu/qht.h b/include/qemu/qht.h
> new file mode 100644
> index 0000000..63244e4
> --- /dev/null
> +++ b/include/qemu/qht.h
> @@ -0,0 +1,150 @@
> +/*
> + * Copyright (C) 2016, Emilio G. Cota <cota@braap.org>
> + *
> + * License: GNU GPL, version 2 or later.
> + *   See the COPYING file in the top-level directory.
> + */
> +#ifndef QHT_H
> +#define QHT_H
> +
> +#include <stdbool.h>
> +#include <assert.h>
> +#include <stdint.h>
> +#include <stdio.h>
> +
> +#include "qemu/osdep.h"
> +#include "qemu-common.h"
> +#include "qemu/spinlock.h"
> +#include "qemu/seqlock.h"
> +#include "qemu/rcu.h"
> +
> +#if HOST_LONG_BITS == 32
> +#define QHT_BUCKET_ENTRIES 6
> +#else /* 64-bit */
> +#define QHT_BUCKET_ENTRIES 4
> +#endif

Are these purely a function of the total size of qht_bucket below and of
keeping it nicely cacheline-aligned?

> +
> +struct qht_bucket {
> +    QemuSpinLock lock;
> +    QemuSeqLock sequence;
> +    uint32_t hashes[QHT_BUCKET_ENTRIES];
> +    void *pointers[QHT_BUCKET_ENTRIES];
> +    struct qht_bucket *next;
> +} QEMU_CACHELINE_ALIGNED;

For the TLB code we have compile-time checks via QEMU_BUILD_BUG_ON() to warn
developers if we go over a boundary. It might be worth adding a similar check
here.
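
Something like this next to the struct definition, perhaps (sketch, reusing
the QEMU_CACHELINE define this series adds):

  /* error out at build time if qht_bucket ever outgrows a cache line */
  QEMU_BUILD_BUG_ON(sizeof(struct qht_bucket) > QEMU_CACHELINE);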

> +struct qht_map {
> +    struct qht_bucket *buckets;
> +    uint64_t n;
> +    uint64_t n_items;
> +    uint64_t n_items_threshold;
> +    struct rcu_head rcu;
> +};
> +
> +struct qht {
> +    struct qht_map *map;
> +    unsigned int mode;
> +};
> +
> +typedef bool (*qht_lookup_func_t)(const void *obj, const void *userp);
> +typedef void (*qht_iter_func_t)(struct qht *ht, void *p, uint32_t h, void *up);
> +
> +#define QHT_MODE_MRU_LOOKUP  0x1 /* move looked-up items to head */
> +#define QHT_MODE_MRU_INSERT  0x2 /* insert new elements at the head */
> +#define QHT_MODE_AUTO_RESIZE 0x4 /* auto-resize when heavily loaded */
> +
> +void qht_init(struct qht *ht, uint64_t n_elems, unsigned int mode);
> +
> +/* call only when there are no readers left */
> +void qht_destroy(struct qht *ht);
> +

What's the rationale for making the caller responsible for locking here?
Are we going to be protected by tb_lock in the MTTCG case rather than a
finer grained locking inside the qht?

> +/* call with an external lock held */
> +void qht_reset(struct qht *ht);
> +
> +/* call with an external lock held */
> +void qht_reset_size(struct qht *ht, uint64_t n_elems);
> +
> +/* call with an external lock held */
> +void qht_insert(struct qht *ht, void *p, uint32_t hash);
> +
> +/* call with an external lock held */
> +bool qht_remove(struct qht *ht, const void *p, uint32_t hash);
> +
> +/* call with an external lock held */
> +void qht_iter(struct qht *ht, qht_iter_func_t func, void *userp);
> +
> +/* call with an external lock held */
> +void qht_grow(struct qht *ht);
> +
> +void qht_bucket_mru(struct qht_bucket *b, struct qht_bucket *orig,
> +                    const void *p, int pos);
> +
> +static inline
> +struct qht_bucket *qht_map_to_bucket(struct qht_map *map, uint32_t hash)
> +{
> +    return &map->buckets[hash & (map->n - 1)];
> +}
> +
> +static inline
> +void *__qht_lookup(struct qht_bucket *b, struct qht_bucket **far_bucket,
> +                    int *pos, qht_lookup_func_t func, const void *userp,
> +                    uint32_t hash)

Aside from the inline, is there any reason to keep such an implementation
detail in the header? I certainly couldn't measure a difference after I
moved them into qht.c.

> +{
> +    unsigned int count = 0;
> +    int i;
> +
> +    do {
> +        for (i = 0; i < QHT_BUCKET_ENTRIES; i++) {
> +            if (b->hashes[i] == hash) {
> +                void *p = atomic_read(&b->pointers[i]);
> +
> +                /*
> +                 * p could be NULL if we're racing with a writer. We could use
> +                 * barriers here but for performance we only issue the ones
> +                 * in the seqlock.
> +                 */
> +                if (likely(p && func(p, userp))) {

A comment about saving the position for MRU might be worthwhile here.

> +                    if (unlikely(count)) {
> +                        *far_bucket = b;
> +                        *pos = i;
> +                    }
> +                    return p;
> +                }
> +            }
> +        }
> +        count++;
> +        b = atomic_read(&b->next);
> +        /*
> +         * This barrier guarantees that we will read a properly initialized
> +         * b->next; it is paired with an smp_wmb() before setting b->next.
> +         */
> +        smp_rmb();
> +    } while (b);
> +    return NULL;
> +}
> +
> +static inline void *qht_lookup(struct qht *ht, qht_lookup_func_t func,
> +                                const void *userp, uint32_t hash)
> +{
> +    struct qht_bucket *far_bucket = NULL;
> +    struct qht_bucket *b;
> +    struct qht_map *map;
> +    uint32_t version;
> +    int pos = 0;
> +    void *ret;
> +
> +    map = atomic_read(&ht->map);
> +    /* paired with smp_wmb() before setting ht->map */
> +    smp_rmb();
> +    b = qht_map_to_bucket(map, hash);
> +
> +    do {
> +        version = seqlock_read_begin(&b->sequence);
> +        ret = __qht_lookup(b, &far_bucket, &pos, func, userp, hash);
> +    } while (seqlock_read_retry(&b->sequence, version));

OK, I was slightly confused by this until I got further along. Maybe add some
comments in the qht_bucket structure to point out that the seqlock is
important when a bucket is at the head of the list.

> +    if ((ht->mode & QHT_MODE_MRU_LOOKUP) && unlikely(far_bucket)) {
> +        qht_bucket_mru(b, far_bucket, ret, pos);
> +    }
> +    return ret;
> +}
> +
> +#endif /* QHT_H */
> diff --git a/util/Makefile.objs b/util/Makefile.objs
> index a8a777e..ae893dc 100644
> --- a/util/Makefile.objs
> +++ b/util/Makefile.objs
> @@ -1,4 +1,4 @@
> -util-obj-y = osdep.o cutils.o unicode.o qemu-timer-common.o
> +util-obj-y = osdep.o cutils.o unicode.o qemu-timer-common.o qht.o
>  util-obj-$(CONFIG_POSIX) += compatfd.o
>  util-obj-$(CONFIG_POSIX) += event_notifier-posix.o
>  util-obj-$(CONFIG_POSIX) += mmap-alloc.o
> diff --git a/util/qht.c b/util/qht.c
> new file mode 100644
> index 0000000..590cd8a
> --- /dev/null
> +++ b/util/qht.c
> @@ -0,0 +1,458 @@
> +/*
> + * qht.c - QEMU Hash Table, designed to scale for read-mostly workloads.
> + *
> + * Copyright (C) 2016, Emilio G. Cota <cota@braap.org>
> + *
> + * License: GNU GPL, version 2 or later.
> + *   See the COPYING file in the top-level directory.
> + *
> + * Assumptions:
> + * - Writers and iterators must take an external lock.
> + * - A hash value of 0 is invalid.
> + * - NULL cannot be inserted as a pointer value.
> + *
> + * Features:
> + * - Optional auto-resizing: the hash table resizes up if the load surpasses
> + *   a certain threshold. Resizing is done concurrently with readers.
> + * - Optional bucket MRU promotion policy.
> + *
> + * The key structure is the bucket, which is cacheline-sized. Buckets
> + * contain a few hash values and pointers; the u32 hash values are stored in
> + * full so that resizing is fast. Having this structure instead of directly
> + * chaining items has three advantages:
> + * - Failed lookups fail fast, and touch a minimum number of cache lines.
> + * - Resizing the hash table with concurrent lookups is easy.
> + * - We can have a few Most-Recently-Used (MRU) hash-pointer pairs in the same
> + *   head bucket. This helps scalability, since MRU promotions (i.e. writes to
> + *   the bucket) become less common.
> + *
> + * For concurrent lookups we use a per-bucket seqlock; per-bucket spinlocks
> + * allow readers (lookups) to upgrade to writers and thus implement an MRU
> + * promotion policy; these MRU-induced writes do not touch the cache lines of
> + * other head buckets.
> + *
> + * Note that there are two types of buckets:
> + * 1. "head" buckets are the ones allocated in the array of buckets in qht_map.
> + * 2. all "non-head" buckets (i.e. all others) are members of a chain that
> + *    starts from a head bucket.
> + * Note that the seqlock and spinlock of a head bucket applies to all buckets
> + * chained to it; these two fields are unused in non-head buckets.
> + *
> + * Resizing is done by taking all spinlocks (so that no readers-turned-writers
> + * can race with us) and then placing all elements into a new hash table. Last,
> + * the ht->map pointer is set, and the old map is freed once no RCU readers can
> + * see it anymore.
> + *
> + * Related Work:
> + * - Idea of cacheline-sized buckets with full hashes taken from:
> + *   David, Guerraoui & Trigonakis, "Asynchronized Concurrency:
> + *   The Secret to Scaling Concurrent Search Data Structures", ASPLOS'15.
> + * - Why not RCU-based hash tables? They would allow us to get rid of the
> + *   seqlock, but resizing would take forever since RCU read critical
> + *   sections in QEMU take quite a long time.
> + *   More info on relativistic hash tables:
> + *   + Triplett, McKenney & Walpole, "Resizable, Scalable, Concurrent Hash
> + *     Tables via Relativistic Programming", USENIX ATC'11.
> + *   + Corbet, "Relativistic hash tables, part 1: Algorithms", @ lwn.net, 2014.
> + *     https://lwn.net/Articles/612021/

Very helpful links, thanks ;-)

> + */
> +#include "qemu/qht.h"
> +#include "qemu/atomic.h"
> +
> +static inline uint64_t qht_elems_to_buckets(uint64_t n_elems)
> +{
> +    return pow2ceil(n_elems / QHT_BUCKET_ENTRIES);
> +}
> +
> +static inline void qht_head_init(struct qht_bucket *b)
> +{
> +    memset(b, 0, sizeof(*b));
> +    qemu_spinlock_init(&b->lock);
> +    seqlock_init(&b->sequence);
> +}
> +
> +static inline void qht_chain_destroy(struct qht_bucket *head)
> +{
> +    struct qht_bucket *curr = head->next;
> +    struct qht_bucket *prev;
> +
> +    while (curr) {
> +        prev = curr;
> +        curr = curr->next;
> +        qemu_vfree(prev);
> +    }
> +}
> +
> +/* pass only an orphan map */
> +static void qht_map_destroy(struct qht_map *map)
> +{
> +    uint64_t i;
> +
> +    for (i = 0; i < map->n; i++) {
> +        qht_chain_destroy(&map->buckets[i]);
> +    }
> +    qemu_vfree(map->buckets);
> +    g_free(map);
> +}
> +
> +static void qht_map_reclaim(struct rcu_head *rcu)
> +{
> +    struct qht_map *map = container_of(rcu, struct qht_map, rcu);
> +
> +    qht_map_destroy(map);
> +}
> +
> +static struct qht_map *qht_map_create(uint64_t n)
> +{
> +    struct qht_map *map;
> +    uint64_t i;
> +
> +    assert(n <= (1ULL << 32)); /* we're using a 32-bit hash func */
> +    map = g_malloc(sizeof(*map));
> +    map->n = n;
> +    map->n_items = 0;
> +    map->n_items_threshold = n * QHT_BUCKET_ENTRIES / 2;
> +    map->buckets = qemu_memalign(QEMU_CACHELINE, sizeof(*map->buckets) * n);
> +    for (i = 0; i < n; i++) {
> +        qht_head_init(&map->buckets[i]);
> +    }
> +    return map;
> +}
> +
> +static inline void qht_publish(struct qht *ht, struct qht_map *new)
> +{
> +    /* Readers should see a properly initialized map; pair with smp_rmb() */
> +    smp_wmb();
> +    atomic_set(&ht->map, new);
> +}
> +
> +void qht_init(struct qht *ht, uint64_t n_elems, unsigned int mode)
> +{
> +    struct qht_map *map;
> +    uint64_t n = qht_elems_to_buckets(n_elems);
> +
> +    map = qht_map_create(n);
> +    ht->mode = mode;
> +    qht_publish(ht, map);
> +}
> +
> +/* call only when there are no readers left */
> +void qht_destroy(struct qht *ht)
> +{
> +    qht_map_destroy(ht->map);
> +    memset(ht, 0, sizeof(*ht));
> +}
> +
> +static void __qht_bucket_reset(struct qht_bucket *b)

I think Paolo has already mentioned __ names.

> +{
> +    int i;
> +
> +    do {
> +        for (i = 0; i < QHT_BUCKET_ENTRIES; i++) {
> +            atomic_set(&b->hashes[i], 0);
> +            atomic_set(&b->pointers[i], NULL);
> +        }
> +        b = b->next;
> +    } while (b);
> +}
> +
> +static void qht_bucket_reset(struct qht_bucket *b)
> +{
> +    qemu_spinlock_lock(&b->lock);
> +    seqlock_write_begin(&b->sequence);
> +    __qht_bucket_reset(b);

Personally I'd just fold __qht_bucket_reset into here, to make it clear the
seqlock protects us from using values that are halfway through being reset.
Otherwise, add a quick comment on the function above repeating the header
text to that effect.

> +    seqlock_write_end(&b->sequence);
> +    qemu_spinlock_unlock(&b->lock);
> +}
> +
> +/* call with an external lock held */
> +void qht_reset(struct qht *ht)
> +{
> +    struct qht_map *map = ht->map;
> +    uint64_t i;
> +
> +    for (i = 0; i < map->n; i++) {
> +        qht_bucket_reset(&map->buckets[i]);
> +    }
> +}
> +
> +/* call with an external lock held */
> +void qht_reset_size(struct qht *ht, uint64_t n_elems)
> +{
> +    struct qht_map *old = ht->map;
> +
> +    qht_reset(ht);
> +    if (old->n == qht_elems_to_buckets(n_elems)) {
> +        return;
> +    }
> +    qht_init(ht, n_elems, ht->mode);
> +    call_rcu1(&old->rcu, qht_map_reclaim);
> +}
> +
> +/* Can only be called when at least orig is the 3rd link i.e. head->2nd->orig */
> +static void
> +__qht_move_bucket_to_front(struct qht_bucket *head, struct qht_bucket *orig)
> +{
> +    struct qht_bucket *b = head->next;
> +
> +    for (;;) {
> +        if (b->next == orig) {
> +            atomic_set(&b->next, orig->next);
> +            atomic_set(&orig->next, head->next);
> +            atomic_set(&head->next, orig);
> +            return;
> +        }
> +        b = b->next;
> +    }
> +}
> +
> +static inline void __qht_bucket_mru_head(struct qht_bucket *b, int pos)
> +{
> +    uint32_t orig_hash = b->hashes[pos];
> +    void *orig_p = b->pointers[pos];
> +    int i;
> +
> +    for (i = 0; i < pos; i++) {
> +        atomic_set(&b->hashes[i + 1], b->hashes[i]);
> +        atomic_set(&b->pointers[i + 1], b->pointers[i]);
> +    }
> +    atomic_set(&b->hashes[0], orig_hash);
> +    atomic_set(&b->pointers[0], orig_p);
> +}
> +
> +
> +/* call with head->lock held */
> +static inline void
> +__qht_bucket_mru(struct qht_bucket *head, struct qht_bucket *orig, int pos)
> +{
> +    uint32_t *dest_hash;
> +    void **dest_p;
> +    void *p;
> +    uint32_t hash;
> +    int i;
> +
> +    if (head == orig) {
> +        return __qht_bucket_mru_head(head, pos);
> +    }

OK, this is calling out for a comment because I'm having trouble following
the logic. I think we are pushing the last entry in the head bucket out to
the bucket the MRU entry is coming from, while shuffling the rest up for the
new entry.

> +    for (i = 0; i < QHT_BUCKET_ENTRIES; i++) {
> +        if (i == QHT_BUCKET_ENTRIES - 1) {
> +            dest_hash = &orig->hashes[pos];
> +            dest_p = &orig->pointers[pos];
> +        } else {
> +            dest_hash = &head->hashes[i + 1];
> +            dest_p = &head->pointers[i + 1];
> +        }
> +        hash = *dest_hash;
> +        p = *dest_p;
> +
> +        atomic_set(dest_hash, head->hashes[i]);
> +        atomic_set(dest_p, head->pointers[i]);
> +
> +        atomic_set(&head->hashes[i], hash);
> +        atomic_set(&head->pointers[i], p);
> +    }

So the bucket that has just swapped the MRU entry for an evicted entry
gets placed after head. Is there a chance we end up just bouncing the
head->next bucket each hit?

> +    if (head->next != orig) {
> +        __qht_move_bucket_to_front(head, orig);
> +    }
> +}
> +
> +void qht_bucket_mru(struct qht_bucket *head, struct qht_bucket *orig,
> +                    const void *p, int pos)
> +{
> +    qemu_spinlock_lock(&head->lock);
> +    if (unlikely(orig->pointers[pos] != p)) {
> +        /* while we acquired the lock, the bucket was updated, so bail out */
> +        goto out;
> +    }
> +    seqlock_write_begin(&head->sequence);
> +    __qht_bucket_mru(head, orig, pos);
> +    seqlock_write_end(&head->sequence);
> + out:
> +    qemu_spinlock_unlock(&head->lock);
> +}
> +
> +/* call with b->lock held */
> +static void __qht_insert(struct qht *ht, struct qht_map *map,
> +                         struct qht_bucket *b, void *p, uint32_t hash)
> +{
> +    struct qht_bucket *head = b;
> +    struct qht_bucket *prev = NULL;
> +    struct qht_bucket *new = NULL;
> +    unsigned int count = 0;
> +    int i;
> +
> +    for (;;) {
> +        if (b == NULL) {
> +            b = qemu_memalign(QEMU_CACHELINE, sizeof(*b));
> +            memset(b, 0, sizeof(*b));
> +            new = b;
> +        }
> +        for (i = 0; i < QHT_BUCKET_ENTRIES; i++) {
> +            if (b->hashes[i]) {
> +                continue;
> +            }
> +            /* found an empty key: acquire the seqlock and write */
> +            seqlock_write_begin(&head->sequence);
> +            if (new) {
> +                /*
> +                 * This barrier is paired with smp_rmb() after reading
> +                 * b->next when not holding b->lock.
> +                 */
> +                smp_wmb();
> +                atomic_set(&prev->next, b);
> +            }
> +            atomic_set(&b->hashes[i], hash);
> +            atomic_set(&b->pointers[i], p);
> +            if ((ht->mode & QHT_MODE_MRU_INSERT) && count) {
> +                __qht_bucket_mru(head, b, i);
> +            }
> +            seqlock_write_end(&head->sequence);
> +            map->n_items++;
> +            return;
> +        }
> +        prev = b;
> +        b = b->next;
> +        count++;
> +    }
> +}
> +
> +/* call with an external lock held */
> +void qht_insert(struct qht *ht, void *p, uint32_t hash)
> +{
> +    struct qht_map *map = ht->map;
> +    struct qht_bucket *b = qht_map_to_bucket(map, hash);
> +
> +    qemu_spinlock_lock(&b->lock);
> +    __qht_insert(ht, map, b, p, hash);
> +    qemu_spinlock_unlock(&b->lock);
> +
> +    if ((ht->mode & QHT_MODE_AUTO_RESIZE) &&
> +        unlikely(map->n_items > map->n_items_threshold)) {
> +        qht_grow(ht);
> +    }
> +}
> +
> +/* call with b->lock held */
> +static inline bool __qht_remove(struct qht_map *map, struct qht_bucket *b,
> +                                const void *p, uint32_t hash)
> +{
> +    int i;
> +
> +    do {
> +        for (i = 0; i < QHT_BUCKET_ENTRIES; i++) {
> +            if (b->hashes[i] == hash && b->pointers[i] == p) {
> +                atomic_set(&b->hashes[i], 0);
> +                atomic_set(&b->pointers[i], NULL);
> +                map->n_items--;
> +                return true;
> +            }
> +        }
> +        b = b->next;
> +    } while (b);
> +    return false;
> +}
> +
> +/* call with an external lock held */
> +bool qht_remove(struct qht *ht, const void *p, uint32_t hash)
> +{
> +    struct qht_map *map = ht->map;
> +    struct qht_bucket *b = qht_map_to_bucket(map, hash);
> +    bool ret;
> +
> +    qemu_spinlock_lock(&b->lock);
> +    seqlock_write_begin(&b->sequence);
> +    ret = __qht_remove(map, b, p, hash);
> +    seqlock_write_end(&b->sequence);
> +    qemu_spinlock_unlock(&b->lock);
> +    return ret;
> +}
> +
> +/*
> + * acquire all spinlocks from a map, so that writers cannot race with
> + * readers-turned-writers.
> + */
> +static void qht_lock(struct qht *ht)
> +{
> +    struct qht_map *map = ht->map;
> +    uint64_t i;
> +
> +    /* if readers cannot upgrade, do nothing */
> +    if (!(ht->mode & QHT_MODE_MRU_LOOKUP)) {
> +        return;
> +    }
> +    for (i = 0; i < map->n; i++) {
> +        struct qht_bucket *b = &map->buckets[i];
> +
> +        qemu_spinlock_lock(&b->lock);
> +    }
> +}
> +
> +static void qht_unlock(struct qht *ht)
> +{
> +    struct qht_map *map = ht->map;
> +    uint64_t i;
> +
> +    if (!(ht->mode & QHT_MODE_MRU_LOOKUP)) {
> +        return;
> +    }
> +    for (i = 0; i < map->n; i++) {
> +        struct qht_bucket *b = &map->buckets[i];
> +
> +        qemu_spinlock_unlock(&b->lock);
> +    }
> +}
> +
> +/* external lock + all of the map's locks held (if !MRU_LOOKUP) */
> +static inline void __qht_map_iter(struct qht *ht, struct qht_map *map,
> +                                  qht_iter_func_t func, void *userp)
> +{
> +    uint64_t i;
> +
> +    for (i = 0; i < map->n; i++) {
> +        struct qht_bucket *b = &map->buckets[i];
> +        int j;
> +
> +        do {
> +            for (j = 0; j < QHT_BUCKET_ENTRIES; j++) {
> +                if (b->hashes[j]) {
> +                    func(ht, b->pointers[j], b->hashes[j], userp);
> +                }
> +            }
> +            b = b->next;
> +        } while (b);
> +    }
> +}
> +
> +/* call with an external lock held */
> +void qht_iter(struct qht *ht, qht_iter_func_t func, void *userp)
> +{
> +    qht_lock(ht);
> +    __qht_map_iter(ht, ht->map, func, userp);
> +    qht_unlock(ht);
> +}
> +
> +static void qht_map_copy(struct qht *ht, void *p, uint32_t hash, void *userp)
> +{
> +    struct qht_map *new = userp;
> +    struct qht_bucket *b = qht_map_to_bucket(new, hash);
> +
> +    /* no need to acquire b->lock because no thread has seen this map yet */
> +    __qht_insert(ht, new, b, p, hash);
> +}
> +
> +/* call with an external lock held */
> +void qht_grow(struct qht *ht)
> +{
> +    struct qht_map *old = ht->map;
> +    struct qht_map *new;
> +    uint64_t n = old->n * 2;
> +
> +    if (unlikely(n > (1ULL << 32))) {
> +        return;
> +    }
> +    new = qht_map_create(n);
> +    qht_iter(ht, qht_map_copy, new);
> +
> +    qht_publish(ht, new);
> +    call_rcu1(&old->rcu, qht_map_reclaim);
> +}

This looks good and I can see this being a useful utility across QEMU. My
main problem was following exactly what happens when entries are moved around
for the MRU case. As users don't need a deep knowledge of the implementation
details this isn't a major issue, although a few more expansive comments or
ASCII diagrams may help if you are feeling up to it ;-)

--
Alex Bennée

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 09/10] qht: add test program
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 09/10] qht: add test program Emilio G. Cota
@ 2016-04-08 10:45   ` Alex Bennée
  2016-04-19 23:06     ` Emilio G. Cota
  0 siblings, 1 reply; 62+ messages in thread
From: Alex Bennée @ 2016-04-08 10:45 UTC (permalink / raw)
  To: Emilio G. Cota
  Cc: QEMU Developers, MTTCG Devel, Paolo Bonzini, Peter Crosthwaite,
	Richard Henderson, Peter Maydell, Sergey Fedorov


Emilio G. Cota <cota@braap.org> writes:

> Signed-off-by: Emilio G. Cota <cota@braap.org>
> ---
>  tests/.gitignore |   1 +
>  tests/Makefile   |   6 +++-
>  tests/test-qht.c | 100 +++++++++++++++++++++++++++++++++++++++++++++++++++++++
>  3 files changed, 106 insertions(+), 1 deletion(-)
>  create mode 100644 tests/test-qht.c
>
> diff --git a/tests/.gitignore b/tests/.gitignore
> index b7bf13e..d6d0700 100644
> --- a/tests/.gitignore
> +++ b/tests/.gitignore
> @@ -47,6 +47,7 @@ test-qapi-visit.[ch]
>  test-qdev-global-props
>  test-qemu-opts
>  test-qga
> +test-qht
>  test-qmp-commands
>  test-qmp-commands.h
>  test-qmp-event
> diff --git a/tests/Makefile b/tests/Makefile
> index 45b9048..b5a99d6 100644
> --- a/tests/Makefile
> +++ b/tests/Makefile
> @@ -68,6 +68,8 @@ check-unit-y += tests/rcutorture$(EXESUF)
>  gcov-files-rcutorture-y = util/rcu.c
>  check-unit-y += tests/test-rcu-list$(EXESUF)
>  gcov-files-test-rcu-list-y = util/rcu.c
> +check-unit-y += tests/test-qht$(EXESUF)
> +gcov-files-test-qht-y = util/qht.c
>  check-unit-y += tests/test-bitops$(EXESUF)
>  check-unit-$(CONFIG_HAS_GLIB_SUBPROCESS_TESTS) += tests/test-qdev-global-props$(EXESUF)
>  check-unit-y += tests/check-qom-interface$(EXESUF)
> @@ -387,7 +389,8 @@ test-obj-y = tests/check-qint.o tests/check-qstring.o tests/check-qdict.o \
>  	tests/test-qmp-commands.o tests/test-visitor-serialization.o \
>  	tests/test-x86-cpuid.o tests/test-mul64.o tests/test-int128.o \
>  	tests/test-opts-visitor.o tests/test-qmp-event.o \
> -	tests/rcutorture.o tests/test-rcu-list.o
> +	tests/rcutorture.o tests/test-rcu-list.o \
> +	tests/test-qht.o
>
>  $(test-obj-y): QEMU_INCLUDES += -Itests
>  QEMU_CFLAGS += -I$(SRC_PATH)/tests
> @@ -425,6 +428,7 @@ tests/test-cutils$(EXESUF): tests/test-cutils.o util/cutils.o
>  tests/test-int128$(EXESUF): tests/test-int128.o
>  tests/rcutorture$(EXESUF): tests/rcutorture.o $(test-util-obj-y)
>  tests/test-rcu-list$(EXESUF): tests/test-rcu-list.o $(test-util-obj-y)
> +tests/test-qht$(EXESUF): tests/test-qht.o $(test-util-obj-y)
>
>  tests/test-qdev-global-props$(EXESUF): tests/test-qdev-global-props.o \
>  	hw/core/qdev.o hw/core/qdev-properties.o hw/core/hotplug.o\
> diff --git a/tests/test-qht.c b/tests/test-qht.c
> new file mode 100644
> index 0000000..f553fbc
> --- /dev/null
> +++ b/tests/test-qht.c
> @@ -0,0 +1,100 @@
> +#include "qemu/osdep.h"
> +#include "qemu/xxhash.h"
> +#include "qemu/qht.h"
> +
> +#define N 5000
> +#define SEED 1
> +
> +static struct qht ht;
> +static int32_t arr[N];
> +
> +static bool is_equal(const void *obj, const void *userp)
> +{
> +    const int32_t *a = obj;
> +    const int32_t *b = userp;
> +
> +    return *a == *b;
> +}
> +
> +static void insert(int a, int b)
> +{
> +    int i;
> +
> +    for (i = a; i < b; i++) {
> +        uint32_t hash;
> +
> +        arr[i] = i;
> +        hash = qemu_xxh32((uint32_t *)&arr[i], 1, SEED);
> +
> +        qht_insert(&ht, &arr[i], hash);
> +    }
> +}
> +
> +static void rm(int init, int end)
> +{
> +    int i;
> +
> +    for (i = init; i < end; i++) {
> +        uint32_t hash;
> +
> +        hash = qemu_xxh32((uint32_t *)&arr[i], 1, SEED);
> +        assert(qht_remove(&ht, &arr[i], hash));
> +    }
> +}
> +
> +static void check(int a, int b, bool expected)
> +{
> +    int i;
> +
> +    for (i = a; i < b; i++) {
> +        void *p;
> +        uint32_t hash;
> +        int32_t val;
> +
> +        val = i;
> +        hash = qemu_xxh32((uint32_t *)&val, 1, SEED);
> +        p = qht_lookup(&ht, is_equal, &val, hash);
> +        assert(!!p == expected);
> +    }
> +}
> +
> +static void count_func(struct qht *ht, void *p, uint32_t hash, void *userp)
> +{
> +    unsigned int *curr = userp;
> +
> +    (*curr)++;
> +}
> +
> +static void iter_check(unsigned int count)
> +{
> +    unsigned int curr = 0;
> +
> +    qht_iter(&ht, count_func, &curr);
> +    assert(curr == count);
> +}
> +
> +static void qht_test(unsigned int mode)
> +{
> +    qht_init(&ht, 0, mode);
> +
> +    insert(0, N);
> +    check(0, N, true);
> +    check(-N, -1, false);
> +    iter_check(N);
> +    rm(1, 2);
> +    qht_reset_size(&ht, 0);
> +    check(0, N, false);
> +
> +    qht_destroy(&ht);
> +}
> +
> +int main(int argc, char *argv[])
> +{
> +    qht_test(0);
> +    qht_test(QHT_MODE_MRU_LOOKUP);
> +    qht_test(QHT_MODE_MRU_LOOKUP | QHT_MODE_MRU_INSERT);
> +    qht_test(QHT_MODE_MRU_LOOKUP | QHT_MODE_MRU_INSERT | QHT_MODE_AUTO_RESIZE);
> +    qht_test(QHT_MODE_AUTO_RESIZE | QHT_MODE_MRU_INSERT);
> +    qht_test(QHT_MODE_MRU_LOOKUP | QHT_MODE_MRU_INSERT);
> +    return 0;
> +}

A couple of notes:

  - these should use the gtester boilerplate for reporting results (see
    the sketch below)
  - AFAICT they are not exercising the multi-element hashing we actually
    use in the main code
  - it would be nice to add a check on the bucket/map distribution to
    defend against the algorithm being accidentally weakened by follow-up
    patches

However tests are good :-)
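
For the first point, the boilerplate is roughly this (sketch; the wrapper
name and test path are made up):

  static void test_default(void)
  {
      qht_test(0);
  }

  int main(int argc, char *argv[])
  {
      g_test_init(&argc, &argv, NULL);
      g_test_add_func("/qht/default", test_default);
      return g_test_run();
  }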

--
Alex Bennée

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 10/10] tb hash: track translated blocks with qht
  2016-04-05  5:30 ` [Qemu-devel] [PATCH 10/10] tb hash: track translated blocks with qht Emilio G. Cota
@ 2016-04-08 12:39   ` Alex Bennée
  0 siblings, 0 replies; 62+ messages in thread
From: Alex Bennée @ 2016-04-08 12:39 UTC (permalink / raw)
  To: Emilio G. Cota
  Cc: QEMU Developers, MTTCG Devel, Paolo Bonzini, Peter Crosthwaite,
	Richard Henderson, Peter Maydell, Sergey Fedorov


Emilio G. Cota <cota@braap.org> writes:

> Having a fixed-size hash table for keeping track of all translation blocks
> is suboptimal: some workloads are just too big or too small to get maximum
> performance from the hash table. The MRU promotion policy helps improve
> performance when the hash table is a little undersized, but it cannot
> make up for severely undersized hash tables.
>
> Furthermore, frequent MRU promotions result in writes that are a scalability
> bottleneck. For scalability, lookups should only perform reads, not writes.
> This is not a big deal for now, but it will become one once MTTCG matures.
>
> The appended fixes these issues by using qht as the implementation of
> the TB hash table. This solution is superior to other alternatives considered,
> namely:
>
> - master: implementation in QEMU before this patchset
> - xxhash: before this patch, i.e. fixed buckets + xxhash hashing + MRU.
> - xxhash-rcu: fixed buckets + xxhash + RCU list + MRU.
>               MRU is implemented here by adding an intermediate struct
>               that contains the u32 hash and a pointer to the TB; this
>               allows us, on an MRU promotion, to copy said struct (that is not
>               at the head), and put this new copy at the head. After a grace
>               period, the original non-head struct can be eliminated, and
>               after another grace period, freed.
> - qht-fixed-nomru: fixed buckets + xxhash + qht without auto-resize +
>                    no MRU for lookups; MRU for inserts.
> The appended solution is the following:
> - qht-dyn-nomru: dynamic number of buckets + xxhash + qht w/ auto-resize +
>                  no MRU for lookups; MRU for inserts.
>
> The plots below compare the considered solutions. The Y axis shows the
> boot time (in seconds) of a debian jessie image with arm-softmmu; the X axis
> sweeps the number of buckets (or initial number of buckets for qht-autoresize).
> The plots in PNG format (and with errorbars) can be seen here:
>   http://imgur.com/a/Awgnq
>
> Each test runs 5 times, and the entire QEMU process is pinned to a
> single core for repeatability of results.
>
>                             Host: Intel Xeon E5-2690
>
>   28 ++------------+-------------+-------------+-------------+------------++
>      A*****        +             +             +             master **A*** +
>   27 ++    *                                                 xxhash ##B###++
>      |      A******A******                               xxhash-rcu $$C$$$ |
>   26 C$$                  A******A******            qht-fixed-nomru*%%D%%%++
>      D%%$$                              A******A******A*qht-dyn-mru A*E****A
>   25 ++ %%$$                                          qht-dyn-nomru &&F&&&++
>      B#####%                                                               |
>   24 ++    #C$$$$$                                                        ++
>      |      B###  $                                                        |
>      |          ## C$$$$$$                                                 |
>   23 ++           #       C$$$$$$                                         ++
>      |             B######       C$$$$$$                                %%%D
>   22 ++                  %B######       C$$$$$$C$$$$$$C$$$$$$C$$$$$$C$$$$$$C
>      |                    D%%%%%%B######      @E@@@@@@    %%%D%%%@@@E@@@@@@E
>   21 E@@@@@@E@@@@@@F&&&@@@E@@@&&&D%%%%%%B######B######B######B######B######B
>      +             E@@@   F&&&   +      E@     +      F&&&   +             +
>   20 ++------------+-------------+-------------+-------------+------------++
>      14            16            18            20            22            24
>                              log2 number of buckets
>
>                                  Host: Intel i7-4790K
>
>   14.5 ++------------+------------+-------------+------------+------------++
>        A**           +            +             +            master **A*** +
>     14 ++ **                                                 xxhash ##B###++
>   13.5 ++   **                                           xxhash-rcu $$C$$$++
>        |                                            qht-fixed-nomru %%D%%% |
>     13 ++     A******                                   qht-dyn-mru @@E@@@++
>        |             A*****A******A******             qht-dyn-nomru &&F&&& |
>   12.5 C$$                               A******A******A*****A******    ***A
>     12 ++ $$                                                        A***  ++
>        D%%% $$                                                             |
>   11.5 ++  %%                                                             ++
>        B###  %C$$$$$$                                                      |
>     11 ++  ## D%%%%% C$$$$$                                               ++
>        |     #      %      C$$$$$$                                         |
>   10.5 F&&&&&&B######D%%%%%       C$$$$$$C$$$$$$C$$$$$$C$$$$$C$$$$$$    $$$C
>     10 E@@@@@@E@@@@@@B#####B######B######E@@@@@@E@@@%%%D%%%%%D%%%###B######B
>        +             F&&          D%%%%%%B######B######B#####B###@@@D%%%   +
>    9.5 ++------------+------------+-------------+------------+------------++
>        14            16           18            20           22            24
>                               log2 number of buckets
>
> Note that the original point before this patch series is X=15 for "master";
> the little sensitivity to the increased number of buckets is due to the
> poor hashing function in master.
>
> xxhash-rcu has significant overhead due to the constant churn of allocating
> and deallocating intermediate structs for implementing MRU. An alternative
> would be to consider failed lookups as "maybe not there", and then
> acquire the external lock (tb_lock in this case) to really confirm that
> there was indeed a failed lookup. This, however, would not be enough
> to implement dynamic resizing--this is more complex: see
> "Resizable, Scalable, Concurrent Hash Tables via Relativistic
> Programming" by Triplett, McKenney and Walpole. This solution was
> discarded due to the very coarse RCU read critical sections that we have
> in MTTCG; resizing requires waiting for readers after every pointer update,
> and resizes require many pointer updates, so this would quickly become
> prohibitive.
>
> qht-fixed-nomru shows that MRU promotion is advisable for undersized
> hash tables.
>
> However, qht-dyn-mru shows that MRU promotion is not important if the
> hash table is properly sized: there is virtually no difference in
> performance between qht-dyn-nomru and qht-dyn-mru.
>
> Before this patch, we're at X=15 on "xxhash"; after this patch, we're at
> X=15 @ qht-dyn-nomru. This patch thus matches the best performance that we
> can achieve with optimum sizing of the hash table, while keeping the hash
> table scalable for readers.
>
> The improvement we get before and after this patch for booting debian jessie
> with arm-softmmu is:
>
> - Intel Xeon E5-2690: 10.5% less time
> - Intel i7-4790K: 5.2% less time
>
> We could get this same improvement _for this particular workload_ by
> statically increasing the size of the hash table. But this would hurt
> workloads that do not need a large hash table. The dynamic (upward)
> resizing allows us to start small and enlarge the hash table as needed.
>
> A quick note on downsizing: the table is resized back to 2**15 buckets
> on every tb_flush; this makes sense because it is not guaranteed that the
> table will reach the same number of TBs later on (e.g. most bootup code is
> thrown away after boot); it makes sense to grow the hash table as
> more code blocks are translated. This also avoids the complication of
> having to build downsizing hysteresis logic into qht.
>
> Signed-off-by: Emilio G. Cota <cota@braap.org>
> ---
>  cpu-exec.c              | 82 ++++++++++++++++++++++++-----------------------
>  include/exec/exec-all.h |  9 +++---
>  include/exec/tb-hash.h  |  5 +--
>  translate-all.c         | 84 ++++++++++++++++++++++---------------------------
>  4 files changed, 85 insertions(+), 95 deletions(-)
>
> diff --git a/cpu-exec.c b/cpu-exec.c
> index 4194a4a..e948ffd 100644
> --- a/cpu-exec.c
> +++ b/cpu-exec.c
> @@ -217,55 +217,59 @@ static void cpu_exec_nocache(CPUState *cpu, int max_cycles,
>      tb_free(tb);
>  }
>
> +struct tb_desc {
> +    target_ulong pc;
> +    target_ulong cs_base;
> +    uint64_t flags;
> +    tb_page_addr_t phys_page1;
> +    CPUArchState *env;
> +};
> +
> +static bool tb_cmp(const void *p, const void *d)
> +{
> +    const TranslationBlock *tb = p;
> +    const struct tb_desc *desc = d;
> +
> +    if (tb->pc == desc->pc &&
> +        tb->page_addr[0] == desc->phys_page1 &&
> +        tb->cs_base == desc->cs_base &&
> +        tb->flags == desc->flags) {
> +        /* check next page if needed */
> +        if (tb->page_addr[1] == -1) {
> +            return true;
> +        } else {
> +            tb_page_addr_t phys_page2;
> +            target_ulong virt_page2;
> +
> +            virt_page2 = (desc->pc & TARGET_PAGE_MASK) + TARGET_PAGE_SIZE;
> +            phys_page2 = get_page_addr_code(desc->env, virt_page2);
> +            if (tb->page_addr[1] == phys_page2) {
> +                return true;
> +            }
> +        }
> +    }
> +    return false;
> +}
> +
>  static TranslationBlock *tb_find_physical(CPUState *cpu,
>                                            target_ulong pc,
>                                            target_ulong cs_base,
>                                            uint64_t flags)
>  {
> -    CPUArchState *env = (CPUArchState *)cpu->env_ptr;
> -    TranslationBlock *tb, **ptb1;
>      unsigned int h;
> -    tb_page_addr_t phys_pc, phys_page1;
> -    target_ulong virt_page2;
> +    tb_page_addr_t phys_pc;
> +    struct tb_desc desc;
>
>      tcg_ctx.tb_ctx.tb_invalidated_flag = 0;
>
> -    /* find translated block using physical mappings */
> -    phys_pc = get_page_addr_code(env, pc);
> -    phys_page1 = phys_pc & TARGET_PAGE_MASK;
> +    desc.env = (CPUArchState *)cpu->env_ptr;
> +    desc.cs_base = cs_base;
> +    desc.flags = flags;
> +    desc.pc = pc;
> +    phys_pc = get_page_addr_code(desc.env, pc);
> +    desc.phys_page1 = phys_pc & TARGET_PAGE_MASK;
>      h = tb_hash_func(phys_pc, pc, flags);
> -    ptb1 = &tcg_ctx.tb_ctx.tb_phys_hash[h];
> -    for(;;) {
> -        tb = *ptb1;
> -        if (!tb) {
> -            return NULL;
> -        }
> -        if (tb->pc == pc &&
> -            tb->page_addr[0] == phys_page1 &&
> -            tb->cs_base == cs_base &&
> -            tb->flags == flags) {
> -            /* check next page if needed */
> -            if (tb->page_addr[1] != -1) {
> -                tb_page_addr_t phys_page2;
> -
> -                virt_page2 = (pc & TARGET_PAGE_MASK) +
> -                    TARGET_PAGE_SIZE;
> -                phys_page2 = get_page_addr_code(env, virt_page2);
> -                if (tb->page_addr[1] == phys_page2) {
> -                    break;
> -                }
> -            } else {
> -                break;
> -            }
> -        }
> -        ptb1 = &tb->phys_hash_next;
> -    }
> -
> -    /* Move the TB to the head of the list */
> -    *ptb1 = tb->phys_hash_next;
> -    tb->phys_hash_next = tcg_ctx.tb_ctx.tb_phys_hash[h];
> -    tcg_ctx.tb_ctx.tb_phys_hash[h] = tb;
> -    return tb;
> +    return qht_lookup(&tcg_ctx.tb_ctx.htable, tb_cmp, &desc, h);
>  }
>
>  static TranslationBlock *tb_find_slow(CPUState *cpu,
> diff --git a/include/exec/exec-all.h b/include/exec/exec-all.h
> index 7362095..a894817 100644
> --- a/include/exec/exec-all.h
> +++ b/include/exec/exec-all.h
> @@ -21,6 +21,7 @@
>  #define _EXEC_ALL_H_
>
>  #include "qemu-common.h"
> +#include "qemu/qht.h"
>
>  /* allow to see translation results - the slowdown should be negligible, so we leave it */
>  #define DEBUG_DISAS
> @@ -211,8 +212,8 @@ static inline void tlb_flush_by_mmuidx(CPUState *cpu, ...)
>
>  #define CODE_GEN_ALIGN           16 /* must be >= of the size of a icache line */
>
> -#define CODE_GEN_PHYS_HASH_BITS     15
> -#define CODE_GEN_PHYS_HASH_SIZE     (1 << CODE_GEN_PHYS_HASH_BITS)
> +#define CODE_GEN_HTABLE_BITS     15
> +#define CODE_GEN_HTABLE_SIZE     (1 << CODE_GEN_HTABLE_BITS)
>
>  /* Estimated block size for TB allocation.  */
>  /* ??? The following is based on a 2015 survey of x86_64 host output.
> @@ -248,8 +249,6 @@ struct TranslationBlock {
>
>      void *tc_ptr;    /* pointer to the translated code */
>      uint8_t *tc_search;  /* pointer to search data */
> -    /* next matching tb for physical address. */
> -    struct TranslationBlock *phys_hash_next;
>      /* original tb when cflags has CF_NOCACHE */
>      struct TranslationBlock *orig_tb;
>      /* first and second physical page containing code. The lower bit
> @@ -280,7 +279,7 @@ typedef struct TBContext TBContext;
>  struct TBContext {
>
>      TranslationBlock *tbs;
> -    TranslationBlock *tb_phys_hash[CODE_GEN_PHYS_HASH_SIZE];
> +    struct qht htable;
>      int nb_tbs;
>      /* any access to the tbs or the page table must use this lock */
>      QemuMutex tb_lock;
> diff --git a/include/exec/tb-hash.h b/include/exec/tb-hash.h
> index c4942e1..fabc936 100644
> --- a/include/exec/tb-hash.h
> +++ b/include/exec/tb-hash.h
> @@ -20,7 +20,6 @@
>  #ifndef EXEC_TB_HASH
>  #define EXEC_TB_HASH
>
> -#include "exec/exec-all.h"
>  #include "qemu/xxhash.h"
>
>  /* Only the bottom TB_JMP_PAGE_BITS of the jump cache hash bits vary for
> @@ -54,13 +53,11 @@ uint32_t tb_hash_func(tb_page_addr_t phys_pc, target_ulong pc, uint64_t flags)
>          target_ulong pc;
>          uint64_t flags;
>      } QEMU_PACKED k;
> -    unsigned int hash;
>
>      k.phys_pc = phys_pc;
>      k.pc = pc;
>      k.flags = flags;
> -    hash = qemu_xxh32((uint32_t *)&k, sizeof(k) / sizeof(uint32_t), 1);
> -    return hash & (CODE_GEN_PHYS_HASH_SIZE - 1);
> +    return qemu_xxh32((uint32_t *)&k, sizeof(k) / sizeof(uint32_t), 1);
>  }
>
>  #endif
> diff --git a/translate-all.c b/translate-all.c
> index 8e7f9a7..f0d7f1a 100644
> --- a/translate-all.c
> +++ b/translate-all.c
> @@ -735,6 +735,13 @@ static inline void code_gen_alloc(size_t tb_size)
>      qemu_mutex_init(&tcg_ctx.tb_ctx.tb_lock);
>  }
>
> +static void tb_htable_init(void)
> +{
> +    unsigned int mode = QHT_MODE_AUTO_RESIZE | QHT_MODE_MRU_INSERT;
> +
> +    qht_init(&tcg_ctx.tb_ctx.htable, CODE_GEN_HTABLE_SIZE, mode);
> +}
> +
>  /* Must be called before using the QEMU cpus. 'tb_size' is the size
>     (in bytes) allocated to the translation buffer. Zero means default
>     size. */
> @@ -742,6 +749,7 @@ void tcg_exec_init(unsigned long tb_size)
>  {
>      cpu_gen_init();
>      page_init();
> +    tb_htable_init();
>      code_gen_alloc(tb_size);
>  #if defined(CONFIG_SOFTMMU)
>      /* There's no guest base to take into account, so go ahead and
> @@ -843,7 +851,7 @@ void tb_flush(CPUState *cpu)
>          memset(cpu->tb_jmp_cache, 0, sizeof(cpu->tb_jmp_cache));
>      }
>
> -    memset(tcg_ctx.tb_ctx.tb_phys_hash, 0, sizeof(tcg_ctx.tb_ctx.tb_phys_hash));
> +    qht_reset_size(&tcg_ctx.tb_ctx.htable, CODE_GEN_HTABLE_SIZE);
>      page_flush_tb();
>
>      tcg_ctx.code_gen_ptr = tcg_ctx.code_gen_buffer;
> @@ -854,60 +862,45 @@ void tb_flush(CPUState *cpu)
>
>  #ifdef DEBUG_TB_CHECK
>
> -static void tb_invalidate_check(target_ulong address)
> +static void
> +__tb_invalidate_check(struct qht *ht, void *p, uint32_t hash, void *userp)

No __ functions please.

>  {
> -    TranslationBlock *tb;
> -    int i;
> +    TranslationBlock *tb = p;
> +    target_ulong addr = *(target_ulong *)userp;
>
> -    address &= TARGET_PAGE_MASK;
> -    for (i = 0; i < CODE_GEN_PHYS_HASH_SIZE; i++) {
> -        for (tb = tcg_ctx.tb_ctx.tb_phys_hash[i]; tb != NULL;
> -             tb = tb->phys_hash_next) {
> -            if (!(address + TARGET_PAGE_SIZE <= tb->pc ||
> -                  address >= tb->pc + tb->size)) {
> -                printf("ERROR invalidate: address=" TARGET_FMT_lx
> -                       " PC=%08lx size=%04x\n",
> -                       address, (long)tb->pc, tb->size);
> -            }
> -        }
> +    if (!(addr + TARGET_PAGE_SIZE <= tb->pc || addr >= tb->pc + tb->size)) {
> +        printf("ERROR invalidate: address=" TARGET_FMT_lx
> +               " PC=%08lx size=%04x\n", addr, (long)tb->pc, tb->size);
>      }
>  }
>
> -/* verify that all the pages have correct rights for code */
> -static void tb_page_check(void)
> +static void tb_invalidate_check(target_ulong address)
>  {
> -    TranslationBlock *tb;
> -    int i, flags1, flags2;
> -
> -    for (i = 0; i < CODE_GEN_PHYS_HASH_SIZE; i++) {
> -        for (tb = tcg_ctx.tb_ctx.tb_phys_hash[i]; tb != NULL;
> -                tb = tb->phys_hash_next) {
> -            flags1 = page_get_flags(tb->pc);
> -            flags2 = page_get_flags(tb->pc + tb->size - 1);
> -            if ((flags1 & PAGE_WRITE) || (flags2 & PAGE_WRITE)) {
> -                printf("ERROR page flags: PC=%08lx size=%04x f1=%x f2=%x\n",
> -                       (long)tb->pc, tb->size, flags1, flags2);
> -            }
> -        }
> -    }
> +    address &= TARGET_PAGE_MASK;
> +    qht_iter(&tcg_ctx.tb_ctx.htable, __tb_invalidate_check, &address);
>  }
>
> -#endif
> -
> -static inline void tb_hash_remove(TranslationBlock **ptb, TranslationBlock *tb)
> +static void __tb_page_check(struct qht *ht, void *p, uint32_t hash, void *userp)

And here.

>  {
> -    TranslationBlock *tb1;
> +    TranslationBlock *tb = p;
> +    int flags1, flags2;
>
> -    for (;;) {
> -        tb1 = *ptb;
> -        if (tb1 == tb) {
> -            *ptb = tb1->phys_hash_next;
> -            break;
> -        }
> -        ptb = &tb1->phys_hash_next;
> +    flags1 = page_get_flags(tb->pc);
> +    flags2 = page_get_flags(tb->pc + tb->size - 1);
> +    if ((flags1 & PAGE_WRITE) || (flags2 & PAGE_WRITE)) {
> +        printf("ERROR page flags: PC=%08lx size=%04x f1=%x f2=%x\n",
> +               (long)tb->pc, tb->size, flags1, flags2);
>      }
>  }
>
> +/* verify that all the pages have correct rights for code */
> +static void tb_page_check(void)
> +{
> +    qht_iter(&tcg_ctx.tb_ctx.htable, __tb_page_check, NULL);
> +}
> +
> +#endif
> +
>  static inline void tb_page_remove(TranslationBlock **ptb, TranslationBlock *tb)
>  {
>      TranslationBlock *tb1;
> @@ -973,7 +966,7 @@ void tb_phys_invalidate(TranslationBlock *tb, tb_page_addr_t page_addr)
>      /* remove the TB from the hash list */
>      phys_pc = tb->page_addr[0] + (tb->pc & ~TARGET_PAGE_MASK);
>      h = tb_hash_func(phys_pc, tb->pc, tb->flags);
> -    tb_hash_remove(&tcg_ctx.tb_ctx.tb_phys_hash[h], tb);
> +    qht_remove(&tcg_ctx.tb_ctx.htable, tb, h);
>
>      /* remove the TB from the page list */
>      if (tb->page_addr[0] != page_addr) {
> @@ -1472,13 +1465,10 @@ static void tb_link_page(TranslationBlock *tb, tb_page_addr_t phys_pc,
>                           tb_page_addr_t phys_page2)
>  {
>      unsigned int h;
> -    TranslationBlock **ptb;
>
>      /* add in the hash table */
>      h = tb_hash_func(phys_pc, tb->pc, tb->flags);
> -    ptb = &tcg_ctx.tb_ctx.tb_phys_hash[h];
> -    tb->phys_hash_next = *ptb;
> -    *ptb = tb;
> +    qht_insert(&tcg_ctx.tb_ctx.htable, tb, h);
>
>      /* add in the page list */
>      tb_alloc_page(tb, 0, phys_pc & TARGET_PAGE_MASK);

Apart from the static function names it looks good. I particularly like
the net loss of code thanks to using utility code ;-)

Reviewed-by: Alex Bennée <alex.bennee@linaro.org>

--
Alex Bennée

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 08/10] qht: QEMU's fast, resizable and scalable Hash Table
  2016-04-08 10:27   ` Alex Bennée
@ 2016-04-19 23:03     ` Emilio G. Cota
  0 siblings, 0 replies; 62+ messages in thread
From: Emilio G. Cota @ 2016-04-19 23:03 UTC (permalink / raw)
  To: Alex Bennée
  Cc: QEMU Developers, MTTCG Devel, Paolo Bonzini, Peter Crosthwaite,
	Richard Henderson, Peter Maydell, Sergey Fedorov

Hi Alex,

I'm sending a v3 in a few minutes. I've addressed all your comments there,
so I won't duplicate them here; please find inline my replies to some
questions you raised.

On Fri, Apr 08, 2016 at 11:27:19 +0100, Alex Bennée wrote:
> Emilio G. Cota <cota@braap.org> writes:
(snip)
> > +/* call only when there are no readers left */
> > +void qht_destroy(struct qht *ht);
> > +
> 
> What's the rationale for making the caller responsible for locking here?

Instead of embedding a mutex in struct qht, I decided to let callers
handle locking, since write accesses will most likely be part of larger
updates that require their own lock anyway.

> Are we going to be protected by tb_lock in the MTTCG case rather than a
> finer grained locking inside the qht?

Yes, this is an example of the external-lock-that-other-updates-require
that I refer to above.
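
To make the intended pattern concrete, the write side would look
roughly like the sketch below. This is just shorthand for the above,
not code from the series: the header path and the lock name are
assumptions, and in practice tb_lock (or whatever lock the caller
already holds around its updates) plays the role of ht_write_lock.
Readers never take it; lookups rely on the per-bucket seqlock.

    #include "qemu/thread.h"
    #include "qemu/qht.h"     /* assumed header name for this series */

    /* hypothetical external lock, set up elsewhere with qemu_mutex_init() */
    static QemuMutex ht_write_lock;

    static void ht_insert_locked(struct qht *ht, void *p, uint32_t hash)
    {
        /* writers serialize among themselves outside of qht */
        qemu_mutex_lock(&ht_write_lock);
        qht_insert(ht, p, hash);
        qemu_mutex_unlock(&ht_write_lock);
    }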

(snip)
> > +static inline
> > +void *__qht_lookup(struct qht_bucket *b, struct qht_bucket **far_bucket,
> > +                    int *pos, qht_lookup_func_t func, const void *userp,
> > +                    uint32_t hash)
> 
> Aside from the inline, is there any reason to keep such an implementation
> detail in the header? I certainly couldn't measure a difference after I
> moved them into qht.c.

I thought I had measured a small performance increase with the inline
in the past, but I couldn't replicate it, so I've moved this to qht.c in v3.

(snip)
> > +static inline void *qht_lookup(struct qht *ht, qht_lookup_func_t func,
> > +                                const void *userp, uint32_t hash)
> > +{
> > +    struct qht_bucket *far_bucket = NULL;
> > +    struct qht_bucket *b;
> > +    struct qht_map *map;
> > +    uint32_t version;
> > +    int pos = 0;
> > +    void *ret;
> > +
> > +    map = atomic_read(&ht->map);
> > +    /* paired with smp_wmb() before setting ht->map */
> > +    smp_rmb();
> > +    b = qht_map_to_bucket(map, hash);
> > +
> > +    do {
> > +        version = seqlock_read_begin(&b->sequence);
> > +        ret = __qht_lookup(b, &far_bucket, &pos, func, userp, hash);
> > +    } while (seqlock_read_retry(&b->sequence, version));
> 
> OK, I was slightly confused by this until I got further along. Maybe add
> some comments in the qht_bucket structure to point out that the seqlock is
> important when a bucket is at the head of the list.

There is a comment at the top of qht.c that says exactly that. I'd rather
not duplicate it elsewhere.

(snip)
> > +    for (i = 0; i < QHT_BUCKET_ENTRIES; i++) {
> > +        if (i == QHT_BUCKET_ENTRIES - 1) {
> > +            dest_hash = &orig->hashes[pos];
> > +            dest_p = &orig->pointers[pos];
> > +        } else {
> > +            dest_hash = &head->hashes[i + 1];
> > +            dest_p = &head->pointers[i + 1];
> > +        }
> > +        hash = *dest_hash;
> > +        p = *dest_p;
> > +
> > +        atomic_set(dest_hash, head->hashes[i]);
> > +        atomic_set(dest_p, head->pointers[i]);
> > +
> > +        atomic_set(&head->hashes[i], hash);
> > +        atomic_set(&head->pointers[i], p);
> > +    }
> 
> So the bucket that has just swapped the MRU entry for an evicted entry
> gets placed after head. Is there a chance we end up just bouncing the
> head->next bucket on each hit?

Given a certain sequence of hits, it could happen that head->next
gets updated every time, yes. This is of course unlikely, which is
why I think it's fine. An alternative would be to simply swap
head[last] with orig[pos], and leave it there. I can't measure
much of a difference between the two options.
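
For reference, that alternative is just the single swap below (a
fragment reusing the field names from the patch hunk above; not
necessarily what v3 ends up doing):

    uint32_t hash = orig->hashes[pos];
    void *p = orig->pointers[pos];

    atomic_set(&orig->hashes[pos], head->hashes[QHT_BUCKET_ENTRIES - 1]);
    atomic_set(&orig->pointers[pos], head->pointers[QHT_BUCKET_ENTRIES - 1]);
    atomic_set(&head->hashes[QHT_BUCKET_ENTRIES - 1], hash);
    atomic_set(&head->pointers[QHT_BUCKET_ENTRIES - 1], p);

That is, the hit lands in the head bucket's last slot and the evicted
entry takes its old place, with no shifting of the other head entries.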

(snip until end of patch)
> This looks good and I can see this being a useful utility function
> across QEMU. My main problem was following exactly what happens when
> entries are moved around for the MRU case. As users don't need a deep
> knowledge of the implementation details, this isn't a major issue,
> although a few more expansive comments or ASCII diagrams may help if
> you are feeling up to it ;-)

I added comments in v3 to address this.

Thanks a lot for your review!

		Emilio

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 09/10] qht: add test program
  2016-04-08 10:45   ` Alex Bennée
@ 2016-04-19 23:06     ` Emilio G. Cota
  2016-04-20  7:50       ` Alex Bennée
  0 siblings, 1 reply; 62+ messages in thread
From: Emilio G. Cota @ 2016-04-19 23:06 UTC (permalink / raw)
  To: Alex Bennée
  Cc: QEMU Developers, MTTCG Devel, Paolo Bonzini, Peter Crosthwaite,
	Richard Henderson, Peter Maydell, Sergey Fedorov

On Fri, Apr 08, 2016 at 11:45:41 +0100, Alex Bennée wrote:
(snip the entire patch)
> A couple of notes:
> 
>   - these should use the gtester boilerplate for reporting results

Done in v3.
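
For anyone following along, that boilerplate amounts to something like
this (a rough sketch with a made-up test path, not the actual v3 code):

    #include <glib.h>

    static void test_qht_insert_lookup(void)
    {
        /* exercise qht_insert()/qht_lookup() and g_assert() the results */
    }

    int main(int argc, char *argv[])
    {
        g_test_init(&argc, &argv, NULL);
        g_test_add_func("/qht/insert-lookup", test_qht_insert_lookup);
        return g_test_run();
    }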

>   - AFAICT they are not exercising the multi-element hashing we actually
>     use in the main code
>   - it would be nice to add a check on the bucket/map distribution to
>     defend against the algorithm being accidentally weakened by follow-up patches

I added tb hash chain info to 'info jit' to keep track of this. My goal
with the test program is to check that the hash table is correct; I'd
rather check performance with QEMU than with a made-up test, since
it is QEMU's performance that we care about.
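
(For reference, the chain statistics can then be read from the HMP
monitor, e.g.:

    $ qemu-system-arm ... -monitor stdio
    (qemu) info jit

The exact fields printed are whatever v3 adds to that command.)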

Thanks,

		Emilio

^ permalink raw reply	[flat|nested] 62+ messages in thread

* Re: [Qemu-devel] [PATCH 09/10] qht: add test program
  2016-04-19 23:06     ` Emilio G. Cota
@ 2016-04-20  7:50       ` Alex Bennée
  0 siblings, 0 replies; 62+ messages in thread
From: Alex Bennée @ 2016-04-20  7:50 UTC (permalink / raw)
  To: Emilio G. Cota
  Cc: QEMU Developers, MTTCG Devel, Paolo Bonzini, Peter Crosthwaite,
	Richard Henderson, Peter Maydell, Sergey Fedorov


Emilio G. Cota <cota@braap.org> writes:

> On Fri, Apr 08, 2016 at 11:45:41 +0100, Alex Bennée wrote:
> (snip the entire patch)
>> A couple of notes:
>>
>>   - these should use the gtester boilerplate for reporting results
>
> Done in v3.
>
>>   - AFAICT they are not exercising the multi-element hashing we actually
>>     use in the main code
>>   - it would be nice to add a check on the bucket/map distribution to
>>     defend against the algorithm being accidentally weakened by follow-up patches
>
> I added tb hash chain info to 'info jit' to keep track of this. My goal
> with the test program is to check that the hash table is correct; I'd
> rather check performance with QEMU than with a made-up test, since
> it is QEMU's performance that we care about.

Fair enough. We could do with more TCG-exercising test cases in make
check, but that's a different (and more difficult) problem.
>
> Thanks,
>
> 		Emilio


--
Alex Bennée

^ permalink raw reply	[flat|nested] 62+ messages in thread

end of thread, other threads:[~2016-04-20  7:50 UTC | newest]

Thread overview: 62+ messages
2016-04-05  5:30 [Qemu-devel] [PATCH 00/10] tb hash improvements Emilio G. Cota
2016-04-05  5:30 ` [Qemu-devel] [PATCH 01/10] translate-all: add missing fold of tb_ctx into tcg_ctx Emilio G. Cota
2016-04-05  8:49   ` Paolo Bonzini
2016-04-05  5:30 ` [Qemu-devel] [PATCH 02/10] compiler.h: add QEMU_CACHELINE + QEMU_ALIGN() + QEMU_CACHELINE_ALIGNED Emilio G. Cota
2016-04-05  7:57   ` Peter Maydell
2016-04-05 17:24     ` Emilio G. Cota
2016-04-05 18:01       ` Peter Maydell
2016-04-05 19:13         ` Emilio G. Cota
2016-04-05  8:49   ` Paolo Bonzini
2016-04-05 12:57   ` Lluís Vilanova
2016-04-05 12:58     ` Peter Maydell
2016-04-05 15:29       ` Paolo Bonzini
2016-04-05 16:23       ` Lluís Vilanova
2016-04-05 16:31         ` Richard Henderson
2016-04-05 16:56           ` Peter Maydell
2016-04-05 19:02             ` Lluís Vilanova
2016-04-05 19:15               ` Richard Henderson
2016-04-05 20:09                 ` Lluís Vilanova
2016-04-06 11:44                   ` Paolo Bonzini
2016-04-06 12:02                     ` Laurent Desnogues
2016-04-05  5:30 ` [Qemu-devel] [PATCH 03/10] seqlock: remove optional mutex Emilio G. Cota
2016-04-06  8:38   ` Alex Bennée
2016-04-05  5:30 ` [Qemu-devel] [PATCH 04/10] seqlock: rename write_lock/unlock to write_begin/end Emilio G. Cota
2016-04-06  8:42   ` Alex Bennée
2016-04-05  5:30 ` [Qemu-devel] [PATCH 05/10] include: add spinlock wrapper Emilio G. Cota
2016-04-05  8:51   ` Paolo Bonzini
2016-04-06 15:51     ` Alex Bennée
2016-04-05  5:30 ` [Qemu-devel] [PATCH 06/10] include: add xxhash.h Emilio G. Cota
2016-04-06 11:39   ` Alex Bennée
2016-04-06 22:59     ` Emilio G. Cota
2016-04-05  5:30 ` [Qemu-devel] [PATCH 07/10] tb hash: hash phys_pc, pc, and flags with xxhash Emilio G. Cota
2016-04-05 15:41   ` Richard Henderson
2016-04-05 15:48     ` Paolo Bonzini
2016-04-05 16:07       ` Richard Henderson
2016-04-05 19:40         ` Emilio G. Cota
2016-04-05 21:08           ` Richard Henderson
2016-04-06  0:52             ` Emilio G. Cota
2016-04-06 11:52               ` Paolo Bonzini
2016-04-06 17:44                 ` Emilio G. Cota
2016-04-06 18:23                   ` Paolo Bonzini
2016-04-06 18:27                     ` Richard Henderson
2016-04-07  0:37                     ` Emilio G. Cota
2016-04-07  8:46                       ` Paolo Bonzini
2016-04-05 16:33     ` Laurent Desnogues
2016-04-05 17:19       ` Richard Henderson
2016-04-06  6:06         ` Laurent Desnogues
2016-04-06 17:32           ` Emilio G. Cota
2016-04-06 17:42             ` Richard Henderson
2016-04-07  8:12               ` Laurent Desnogues
2016-04-05  5:30 ` [Qemu-devel] [PATCH 08/10] qht: QEMU's fast, resizable and scalable Hash Table Emilio G. Cota
2016-04-05  9:01   ` Paolo Bonzini
2016-04-05 15:50   ` Richard Henderson
2016-04-08 10:27   ` Alex Bennée
2016-04-19 23:03     ` Emilio G. Cota
2016-04-05  5:30 ` [Qemu-devel] [PATCH 09/10] qht: add test program Emilio G. Cota
2016-04-08 10:45   ` Alex Bennée
2016-04-19 23:06     ` Emilio G. Cota
2016-04-20  7:50       ` Alex Bennée
2016-04-05  5:30 ` [Qemu-devel] [PATCH 10/10] tb hash: track translated blocks with qht Emilio G. Cota
2016-04-08 12:39   ` Alex Bennée
2016-04-05  8:47 ` [Qemu-devel] [PATCH 00/10] tb hash improvements Alex Bennée
2016-04-05  9:01 ` Paolo Bonzini
