* [PATCH 0/2] Separate NUMA statistics from zone statistics
From: Kemi Wang @ 2017-08-15  8:45 UTC (permalink / raw)
  To: Andrew Morton, Michal Hocko, Mel Gorman, Johannes Weiner
  Cc: Dave, Andi Kleen, Jesper Dangaard Brouer, Ying Huang, Aaron Lu,
	Tim Chen, Linux MM, Linux Kernel, Kemi Wang

Each page allocation updates a set of per-zone statistics with a call to
zone_statistics(). As discussed at the 2017 MM summit, these are a substantial
source of overhead in the page allocator and are very rarely consumed. The
overhead comes from cache bouncing: the zone counters (the NUMA-associated
counters) are updated in parallel during multi-threaded page allocation
(pointed out by Dave Hansen).

To mitigate this overhead, this patchset separates the NUMA statistics from
the zone statistics framework and raises the NUMA counter threshold to a
fixed size of 32765, because a small threshold greatly increases how often
the local per-cpu counter is folded into the global counter (suggested by
Ying Huang). The rationale is that these statistics counters don't need to
be read often, unlike other VM counters, so it's not a problem to use a
large threshold and make readers more expensive.
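
The effect of the threshold can be illustrated with a small userspace
sketch. This is not the code from the patches: the struct, the function
names and the CPU count are hypothetical stand-ins, and the real kernel
code uses per-cpu variables and atomics rather than a plain array. It only
shows how a larger threshold reduces the number of writes to the shared
counter.

```c
#include <assert.h>
#include <stdint.h>

#define NR_CPUS_SKETCH 4   /* hypothetical CPU count, for illustration only */

/* Hypothetical stand-in for a global counter plus its per-cpu deltas. */
struct numa_counter {
    long global;                     /* shared counter, costly to write */
    uint16_t local[NR_CPUS_SKETCH];  /* cheap per-cpu deltas */
    unsigned long global_updates;    /* number of writes to the shared counter */
};

/* Count one event on `cpu`; fold into the global counter only once the
 * local delta exceeds `threshold`. */
static void numa_counter_inc(struct numa_counter *c, int cpu, uint16_t threshold)
{
    assert(cpu >= 0 && cpu < NR_CPUS_SKETCH);
    uint16_t v = ++c->local[cpu];
    if (v > threshold) {
        c->global += v;
        c->local[cpu] = 0;
        c->global_updates++;
    }
}

/* Replay `events` increments on CPU 0 and report how many times the
 * shared counter had to be written. */
static unsigned long global_updates_for(unsigned long events, uint16_t threshold)
{
    struct numa_counter c = {0};
    for (unsigned long i = 0; i < events; i++)
        numa_counter_inc(&c, 0, threshold);
    return c.global_updates;
}
```

Under these assumptions, replaying one million increments on a single CPU
writes the shared counter 7936 times with a threshold of 125, but only 30
times with a threshold of 32765.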

With this patchset, we see a 26.6% drop in CPU cycles (537-->394, see below)
per single page allocation and reclaim on Jesper's page_bench03
benchmark. Meanwhile, this patchset keeps the same style of virtual memory
statistics with little end-user-visible effect (see the first patch for
details), except that the per-cpu NUMA counts (vm_numa_stat_diff[]) are
added to zone->vm_numa_stat[] when a user *reads* the value of a NUMA
counter, to eliminate deviation.
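
The read-side folding described above can be sketched as follows. Again,
this is a simplified userspace illustration, not the patch itself: the
struct and function names are hypothetical, only the field names
vm_numa_stat and vm_numa_stat_diff are borrowed from the description, and
locking and per-cpu access are left out.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define NR_CPUS_SKETCH 4   /* hypothetical CPU count, for illustration only */

/* Hypothetical stand-in for one NUMA counter of one zone. */
struct numa_stat_sketch {
    long vm_numa_stat;                           /* global value */
    uint16_t vm_numa_stat_diff[NR_CPUS_SKETCH];  /* pending per-cpu deltas */
};

/* Reader path: add every pending per-cpu delta to the global value, clear
 * the deltas, and return the now exact count. The reader pays for the
 * walk over all CPUs so that the allocation path does not have to. */
static long numa_stat_read_exact(struct numa_stat_sketch *s)
{
    assert(s != NULL);
    for (int cpu = 0; cpu < NR_CPUS_SKETCH; cpu++) {
        s->vm_numa_stat += s->vm_numa_stat_diff[cpu];
        s->vm_numa_stat_diff[cpu] = 0;
    }
    return s->vm_numa_stat;
}
```

With a global value of 100 and pending deltas of 1, 2, 3 and 4, the read
returns 110, and a second read returns the same value because the deltas
have already been folded.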

I ran concurrent single page allocation and reclaim with Jesper's
page_bench03 benchmark on a 2-socket Broadwell-based server (88 processors
with 126G memory), varying the threshold of the pcp counter.

Benchmark provided by Jesper Dangaard Brouer (loop count increased to 10000000):
https://github.com/netoptimizer/prototype-kernel/tree/master/kernel/mm/bench

   Threshold   CPU cycles    Throughput(88 threads)
      32        799         241760478
      64        640         301628829
      125       537         358906028 <==> system by default
      256       468         412397590
      512       428         450550704
      4096      399         482520943
      20000     394         489009617
      30000     395         488017817
      32765     394(-26.6%) 488932078(+36.2%) <==> with this patchset
      N/A       342(-36.3%) 562900157(+56.8%) <==> disable zone_statistics
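
The percentages in the table are relative to the default threshold of 125
(537 cycles, 358906028 throughput). A minimal helper for recomputing them,
with a hypothetical function name, could look like this:

```c
/* Relative change of `value` against `base`, in percent.
 * Negative means a drop, positive means a gain. */
static double percent_change(double base, double value)
{
    return (value - base) / base * 100.0;
}
```

For example, percent_change(537, 394) gives about -26.6 and
percent_change(358906028, 488932078) gives about +36.2, matching the row
for this patchset.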

Kemi Wang (2):
  mm: Change the call sites of numa statistics items
  mm: Update NUMA counter threshold size

 drivers/base/node.c    |  22 ++++---
 include/linux/mmzone.h |  25 +++++---
 include/linux/vmstat.h |  33 ++++++++++
 mm/page_alloc.c        |  10 +--
 mm/vmstat.c            | 162 +++++++++++++++++++++++++++++++++++++++++++++++--
 5 files changed, 227 insertions(+), 25 deletions(-)

-- 
2.7.4
