On Wed, 2016-08-17 at 19:17 +0200, Dario Faggioli wrote: > The last 4 patches, still for Credit2, are optimizations, either wrt > existing > code, or wrt new code introduced in this series. I've chosen to keep > them > separate to make reviewing/understanding new code easier. In fact, > although > they look pretty simple, the soft-affinity code was pretty complex > already, and > even these simple optimization, if done all at once, would have made > the > reviewer's life (unnecessary) tougher. > About this. I've run the benchmarks with and without these performance optimization patches, in order to assess their effect as good as I could. The baseline on top of which I was applying the series is different from the one used to produce the other numbers reported in the cover letter, so what's shown there and what I show here is not directly comparable (but that's not a problem). Given the nature of the improvements, I've run more iterations of each configuration of the benchmarks (i.e., 15 iterations instead of 5) to get more stable results. Here's my findings: ++++++++++++++++++++++++++++++++++ |    CREDIT1, for reference      | ++++++++++++++++++++++++++++++++++ |               | MAKEXEN IPERF  | |---------------|----------------| |no dom0 load   | 28.353  11.793 | |with dom0 load | 43.955  10.932*| ++++++++++++++++++++++++++++++++++ ++++++++++++++++++++++++++++++++++ |   CREDIT2, until patch 20      | ++++++++++++++++++++++++++++++++++ |               | MAKEXEN IPERF  | |---------------| ---------------| |no dom0 load   | 28.367  11.716 | |with dom0 load | 40.591  10.645 | ++++++++++++++++++++++++++++++++++ |+++++++++++++++++++++++++++++++++ |    CREDIT2, full series        | ++++++++++++++++++++++++++++++++++ |               | MAKEXEN IPERF  | |---------------|----------------| |no dom0 load   | 27.597* 12.059*| |with dom0 load | 39.706* 10.609 | |--------------------------------|  * marks the best results So:  - apart from a glitch on "IPERF with dom0 load", Credit2 with the     full series applied is confirmed to be the best. About the glitch:     - wrt the fact that Credit1 is better, we also have other evidences       that network throughput could be a bit of a weak spot of Credit2       versus Credit1 so far (although, we have to admit, they're       pretty close), and we already have ideas on how to try improve       the situation;     - wrt the role played by optimization patches, well, results are       basically the same.  - The performance optimization patches do have an (positive!) impact.  - In case of "no dom0 load, it's actually thanks to the optimization    patches that Credit2 beats Credit1. Regards, Dario -- <> (Raistlin Majere) ----------------------------------------------------------------- Dario Faggioli, Ph.D, http://about.me/dario.faggioli Senior Software Engineer, Citrix Systems R&D Ltd., Cambridge (UK)