* [RFC PATCH 0/14] Parallel struct page initialisation v5r4
From: Mel Gorman @ 2015-04-22 17:07 UTC (permalink / raw)
  To: Linux-MM
  Cc: Nathan Zimmer, Dave Hansen, Waiman Long, Scott Norton,
	Daniel J Blueman, LKML, Mel Gorman

Changelog since v1
o Always initialise low zones
o Typo corrections
o Rename parallel mem init to parallel struct page init
o Rebase to 4.0

Struct page initialisation has been identified as one of the reasons why
large machines take a long time to boot. Patches were posted a long time
ago to defer initialisation until the memory was first used. That approach
was rejected on the grounds that it should not be necessary to hurt the
fast paths. This series reuses much of the work from that time but defers
the initialisation of memory to kswapd so that one thread per node
initialises the memory local to that node.
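
To make the shape of the series easier to follow, here is a rough sketch
of where the deferred work ends up running. It is illustrative only: the
helper name is hypothetical and the real hook is added by the later
patches in the series that touch mm/vmscan.c.

  /* Sketch only: each node's kswapd initialises its local deferred memmap */
  static int kswapd(void *p)
  {
  	pg_data_t *pgdat = (pg_data_t *)p;

  	/* Hypothetical helper: initialise struct pages deferred at boot */
  	deferred_init_memmap(pgdat->node_id);

  	/* ... the existing reclaim loop then continues unchanged ... */
  	return 0;
  }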

After applying the series and setting the appropriate Kconfig variable I
see this in the boot log on a 64G machine

[    7.383764] kswapd 0 initialised deferred memory in 188ms
[    7.404253] kswapd 1 initialised deferred memory in 208ms
[    7.411044] kswapd 3 initialised deferred memory in 216ms
[    7.411551] kswapd 2 initialised deferred memory in 216ms

On a 1TB machine, I see

[   11.913324] kswapd 0 initialised deferred memory in 1168ms
[   12.220011] kswapd 2 initialised deferred memory in 1476ms
[   12.245369] kswapd 3 initialised deferred memory in 1500ms
[   12.271680] kswapd 1 initialised deferred memory in 1528ms

Once booted, the machine appears to work as normal. Boot times were
measured from the time shutdown was called until ssh was available again.
In the 64G case, the boot time savings were negligible. On the 1TB
machine, the savings were 16 seconds.

It would be nice if people who have access to really large machines would
test this series and report back on whether the complexity is justified.

Patches are against 4.0.

 Documentation/kernel-parameters.txt |   6 +
 arch/ia64/mm/numa.c                 |  19 +-
 arch/x86/Kconfig                    |   1 +
 include/linux/memblock.h            |  18 ++
 include/linux/mm.h                  |   8 +-
 include/linux/mmzone.h              |  45 ++--
 init/main.c                         |   1 +
 mm/Kconfig                          |  28 +++
 mm/bootmem.c                        |   8 +-
 mm/internal.h                       |  23 +-
 mm/memblock.c                       |  34 ++-
 mm/mm_init.c                        |   9 +-
 mm/nobootmem.c                      |   7 +-
 mm/page_alloc.c                     | 408 +++++++++++++++++++++++++++++++-----
 mm/vmscan.c                         |   6 +-
 15 files changed, 514 insertions(+), 107 deletions(-)

-- 
2.1.2


* [PATCH 01/13] memblock: Introduce a for_each_reserved_mem_region iterator.
From: Mel Gorman @ 2015-04-22 17:07 UTC (permalink / raw)
  To: Linux-MM
  Cc: Nathan Zimmer, Dave Hansen, Waiman Long, Scott Norton,
	Daniel J Blueman, LKML, Mel Gorman

From: Robin Holt <holt@sgi.com>

As part of initializing struct pages in 2MiB chunks, we noticed that
at the end of free_all_bootmem() there was nothing that forced the
reserved/allocated 4KiB pages to be initialized.

This helper function will be used for that purpose.
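
As an illustration of the interface (not part of this patch), a caller
could walk the reserved regions like this; the loop variable and the
out-parameters match the declarations in the hunks below:

  phys_addr_t start, end;
  u64 i;

  /* Print the physical range of every reserved memblock region */
  for_each_reserved_mem_region(i, &start, &end)
  	pr_info("reserved: %pa..%pa\n", &start, &end);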

Signed-off-by: Robin Holt <holt@sgi.com>
Signed-off-by: Nate Zimmer <nzimmer@sgi.com>
Signed-off-by: Mel Gorman <mgorman@suse.de>
---
 include/linux/memblock.h | 18 ++++++++++++++++++
 mm/memblock.c            | 32 ++++++++++++++++++++++++++++++++
 2 files changed, 50 insertions(+)

diff --git a/include/linux/memblock.h b/include/linux/memblock.h
index e8cc45307f8f..3075e7673c54 100644
--- a/include/linux/memblock.h
+++ b/include/linux/memblock.h
@@ -93,6 +93,9 @@ void __next_mem_range_rev(u64 *idx, int nid, struct memblock_type *type_a,
 			  struct memblock_type *type_b, phys_addr_t *out_start,
 			  phys_addr_t *out_end, int *out_nid);
 
+void __next_reserved_mem_region(u64 *idx, phys_addr_t *out_start,
+			       phys_addr_t *out_end);
+
 /**
  * for_each_mem_range - iterate through memblock areas from type_a and not
  * included in type_b. Or just type_a if type_b is NULL.
@@ -132,6 +135,21 @@ void __next_mem_range_rev(u64 *idx, int nid, struct memblock_type *type_a,
 	     __next_mem_range_rev(&i, nid, type_a, type_b,		\
 				  p_start, p_end, p_nid))
 
+/**
+ * for_each_reserved_mem_region - iterate over all reserved memblock areas
+ * @i: u64 used as loop variable
+ * @p_start: ptr to phys_addr_t for start address of the range, can be %NULL
+ * @p_end: ptr to phys_addr_t for end address of the range, can be %NULL
+ *
+ * Walks over reserved areas of memblock. Available as soon as memblock
+ * is initialized.
+ */
+#define for_each_reserved_mem_region(i, p_start, p_end)			\
+	for (i = 0UL,							\
+	     __next_reserved_mem_region(&i, p_start, p_end);		\
+	     i != (u64)ULLONG_MAX;					\
+	     __next_reserved_mem_region(&i, p_start, p_end))
+
 #ifdef CONFIG_MOVABLE_NODE
 static inline bool memblock_is_hotpluggable(struct memblock_region *m)
 {
diff --git a/mm/memblock.c b/mm/memblock.c
index 252b77bdf65e..e0cc2d174f74 100644
--- a/mm/memblock.c
+++ b/mm/memblock.c
@@ -765,6 +765,38 @@ int __init_memblock memblock_clear_hotplug(phys_addr_t base, phys_addr_t size)
 }
 
 /**
+ * __next_reserved_mem_region - next function for for_each_reserved_region()
+ * @idx: pointer to u64 loop variable
+ * @out_start: ptr to phys_addr_t for start address of the region, can be %NULL
+ * @out_end: ptr to phys_addr_t for end address of the region, can be %NULL
+ *
+ * Iterate over all reserved memory regions.
+ */
+void __init_memblock __next_reserved_mem_region(u64 *idx,
+					   phys_addr_t *out_start,
+					   phys_addr_t *out_end)
+{
+	struct memblock_type *rsv = &memblock.reserved;
+
+	if (*idx >= 0 && *idx < rsv->cnt) {
+		struct memblock_region *r = &rsv->regions[*idx];
+		phys_addr_t base = r->base;
+		phys_addr_t size = r->size;
+
+		if (out_start)
+			*out_start = base;
+		if (out_end)
+			*out_end = base + size - 1;
+
+		*idx += 1;
+		return;
+	}
+
+	/* signal end of iteration */
+	*idx = ULLONG_MAX;
+}
+
+/**
  * __next__mem_range - next function for for_each_free_mem_range() etc.
  * @idx: pointer to u64 loop variable
  * @nid: node selector, %NUMA_NO_NODE for all nodes
-- 
2.1.2


* [PATCH 02/13] mm: meminit: Move page initialization into a separate function.
From: Mel Gorman @ 2015-04-22 17:07 UTC (permalink / raw)
  To: Linux-MM
  Cc: Nathan Zimmer, Dave Hansen, Waiman Long, Scott Norton,
	Daniel J Blueman, LKML, Mel Gorman

From: Robin Holt <holt@sgi.com>

Currently, memmap_init_zone() has all the smarts for initializing a single
page. A subset of this is required for parallel page initialisation and so
this patch breaks up the monolithic function in preparation.
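
For illustration, once the per-page work is factored out it can be reused
outside memmap_init_zone(), for example by a deferred initialiser later in
boot. This is a sketch only; the function name is hypothetical and the
zone index and nid are assumed to be known by the caller:

  /* Sketch: initialise an arbitrary deferred PFN range for one node */
  static void __meminit init_deferred_range(unsigned long start_pfn,
  		unsigned long end_pfn, unsigned long zone, int nid)
  {
  	unsigned long pfn;

  	for (pfn = start_pfn; pfn < end_pfn; pfn++)
  		__init_single_pfn(pfn, zone, nid);
  }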

Signed-off-by: Robin Holt <holt@sgi.com>
Signed-off-by: Nathan Zimmer <nzimmer@sgi.com>
Signed-off-by: Mel Gorman <mgorman@suse.de>
---
 mm/page_alloc.c | 79 +++++++++++++++++++++++++++++++++------------------------
 1 file changed, 46 insertions(+), 33 deletions(-)

diff --git a/mm/page_alloc.c b/mm/page_alloc.c
index 40e29429e7b0..fd7a6d09062d 100644
--- a/mm/page_alloc.c
+++ b/mm/page_alloc.c
@@ -778,6 +778,51 @@ static int free_tail_pages_check(struct page *head_page, struct page *page)
 	return 0;
 }
 
+static void __meminit __init_single_page(struct page *page, unsigned long pfn,
+				unsigned long zone, int nid)
+{
+	struct zone *z = &NODE_DATA(nid)->node_zones[zone];
+
+	set_page_links(page, zone, nid, pfn);
+	mminit_verify_page_links(page, zone, nid, pfn);
+	init_page_count(page);
+	page_mapcount_reset(page);
+	page_cpupid_reset_last(page);
+	SetPageReserved(page);
+
+	/*
+	 * Mark the block movable so that blocks are reserved for
+	 * movable at startup. This will force kernel allocations
+	 * to reserve their blocks rather than leaking throughout
+	 * the address space during boot when many long-lived
+	 * kernel allocations are made. Later some blocks near
+	 * the start are marked MIGRATE_RESERVE by
+	 * setup_zone_migrate_reserve()
+	 *
+	 * bitmap is created for zone's valid pfn range. but memmap
+	 * can be created for invalid pages (for alignment)
+	 * check here not to call set_pageblock_migratetype() against
+	 * pfn out of zone.
+	 */
+	if ((z->zone_start_pfn <= pfn)
+	    && (pfn < zone_end_pfn(z))
+	    && !(pfn & (pageblock_nr_pages - 1)))
+		set_pageblock_migratetype(page, MIGRATE_MOVABLE);
+
+	INIT_LIST_HEAD(&page->lru);
+#ifdef WANT_PAGE_VIRTUAL
+	/* The shift won't overflow because ZONE_NORMAL is below 4G. */
+	if (!is_highmem_idx(zone))
+		set_page_address(page, __va(pfn << PAGE_SHIFT));
+#endif
+}
+
+static void __meminit __init_single_pfn(unsigned long pfn, unsigned long zone,
+					int nid)
+{
+	return __init_single_page(pfn_to_page(pfn), pfn, zone, nid);
+}
+
 static bool free_pages_prepare(struct page *page, unsigned int order)
 {
 	bool compound = PageCompound(page);
@@ -4124,7 +4169,6 @@ static void setup_zone_migrate_reserve(struct zone *zone)
 void __meminit memmap_init_zone(unsigned long size, int nid, unsigned long zone,
 		unsigned long start_pfn, enum memmap_context context)
 {
-	struct page *page;
 	unsigned long end_pfn = start_pfn + size;
 	unsigned long pfn;
 	struct zone *z;
@@ -4145,38 +4189,7 @@ void __meminit memmap_init_zone(unsigned long size, int nid, unsigned long zone,
 			if (!early_pfn_in_nid(pfn, nid))
 				continue;
 		}
-		page = pfn_to_page(pfn);
-		set_page_links(page, zone, nid, pfn);
-		mminit_verify_page_links(page, zone, nid, pfn);
-		init_page_count(page);
-		page_mapcount_reset(page);
-		page_cpupid_reset_last(page);
-		SetPageReserved(page);
-		/*
-		 * Mark the block movable so that blocks are reserved for
-		 * movable at startup. This will force kernel allocations
-		 * to reserve their blocks rather than leaking throughout
-		 * the address space during boot when many long-lived
-		 * kernel allocations are made. Later some blocks near
-		 * the start are marked MIGRATE_RESERVE by
-		 * setup_zone_migrate_reserve()
-		 *
-		 * bitmap is created for zone's valid pfn range. but memmap
-		 * can be created for invalid pages (for alignment)
-		 * check here not to call set_pageblock_migratetype() against
-		 * pfn out of zone.
-		 */
-		if ((z->zone_start_pfn <= pfn)
-		    && (pfn < zone_end_pfn(z))
-		    && !(pfn & (pageblock_nr_pages - 1)))
-			set_pageblock_migratetype(page, MIGRATE_MOVABLE);
-
-		INIT_LIST_HEAD(&page->lru);
-#ifdef WANT_PAGE_VIRTUAL
-		/* The shift won't overflow because ZONE_NORMAL is below 4G. */
-		if (!is_highmem_idx(zone))
-			set_page_address(page, __va(pfn << PAGE_SHIFT));
-#endif
+		__init_single_pfn(pfn, zone, nid);
 	}
 }
 
-- 
2.1.2


* [PATCH 03/13] mm: meminit: Only set page reserved in the memblock region
From: Mel Gorman @ 2015-04-22 17:07 UTC (permalink / raw)
  To: Linux-MM
  Cc: Nathan Zimmer, Dave Hansen, Waiman Long, Scott Norton,
	Daniel J Blueman, LKML, Mel Gorman

From: Nathan Zimmer <nzimmer@sgi.com>

Currently each struct page is set as reserved when it is initialized.
This changes the flow to start with the reserved bit clear and to set
the bit only for pages that lie within a memblock reserved region.

Signed-off-by: Robin Holt <holt@sgi.com>
Signed-off-by: Nathan Zimmer <nzimmer@sgi.com>
---
 include/linux/mm.h |  2 ++
 mm/nobootmem.c     |  3 +++
 mm/page_alloc.c    | 11 ++++++++++-
 3 files changed, 15 insertions(+), 1 deletion(-)

diff --git a/include/linux/mm.h b/include/linux/mm.h
index 47a93928b90f..b6f82a31028a 100644
--- a/include/linux/mm.h
+++ b/include/linux/mm.h
@@ -1711,6 +1711,8 @@ extern void free_highmem_page(struct page *page);
 extern void adjust_managed_page_count(struct page *page, long count);
 extern void mem_init_print_info(const char *str);
 
+extern void reserve_bootmem_region(unsigned long start, unsigned long end);
+
 /* Free the reserved page into the buddy system, so it gets managed. */
 static inline void __free_reserved_page(struct page *page)
 {
diff --git a/mm/nobootmem.c b/mm/nobootmem.c
index 90b50468333e..396f9e450dc1 100644
--- a/mm/nobootmem.c
+++ b/mm/nobootmem.c
@@ -121,6 +121,9 @@ static unsigned long __init free_low_memory_core_early(void)
 
 	memblock_clear_hotplug(0, -1);
 
+	for_each_reserved_mem_region(i, &start, &end)
+		reserve_bootmem_region(start, end);
+
 	for_each_free_mem_range(i, NUMA_NO_NODE, &start, &end, NULL)
 		count += __free_memory_core(start, end);
 
diff --git a/mm/page_alloc.c b/mm/page_alloc.c
index fd7a6d09062d..2abb3b861e70 100644
--- a/mm/page_alloc.c
+++ b/mm/page_alloc.c
@@ -788,7 +788,6 @@ static void __meminit __init_single_page(struct page *page, unsigned long pfn,
 	init_page_count(page);
 	page_mapcount_reset(page);
 	page_cpupid_reset_last(page);
-	SetPageReserved(page);
 
 	/*
 	 * Mark the block movable so that blocks are reserved for
@@ -823,6 +822,16 @@ static void __meminit __init_single_pfn(unsigned long pfn, unsigned long zone,
 	return __init_single_page(pfn_to_page(pfn), pfn, zone, nid);
 }
 
+void reserve_bootmem_region(unsigned long start, unsigned long end)
+{
+	unsigned long start_pfn = PFN_DOWN(start);
+	unsigned long end_pfn = PFN_UP(end);
+
+	for (; start_pfn < end_pfn; start_pfn++)
+		if (pfn_valid(start_pfn))
+			SetPageReserved(pfn_to_page(start_pfn));
+}
+
 static bool free_pages_prepare(struct page *page, unsigned int order)
 {
 	bool compound = PageCompound(page);
-- 
2.1.2


* [PATCH 04/13] mm: page_alloc: Pass PFN to __free_pages_bootmem
From: Mel Gorman @ 2015-04-22 17:07 UTC (permalink / raw)
  To: Linux-MM
  Cc: Nathan Zimmer, Dave Hansen, Waiman Long, Scott Norton,
	Daniel J Blueman, LKML, Mel Gorman

__free_pages_bootmem prepares a page for release to the buddy allocator
and assumes that the struct page is initialised. Parallel initialisation
of struct pages defers initialisation, so __free_pages_bootmem can be
called for struct pages whose PFN cannot yet be derived from the struct
page itself. This patch passes the PFN to __free_pages_bootmem with no
other functional change.

Signed-off-by: Mel Gorman <mgorman@suse.de>
---
 mm/bootmem.c    | 8 ++++----
 mm/internal.h   | 3 ++-
 mm/memblock.c   | 2 +-
 mm/nobootmem.c  | 4 ++--
 mm/page_alloc.c | 3 ++-
 5 files changed, 11 insertions(+), 9 deletions(-)

diff --git a/mm/bootmem.c b/mm/bootmem.c
index 477be696511d..daf956bb4782 100644
--- a/mm/bootmem.c
+++ b/mm/bootmem.c
@@ -164,7 +164,7 @@ void __init free_bootmem_late(unsigned long physaddr, unsigned long size)
 	end = PFN_DOWN(physaddr + size);
 
 	for (; cursor < end; cursor++) {
-		__free_pages_bootmem(pfn_to_page(cursor), 0);
+		__free_pages_bootmem(pfn_to_page(cursor), cursor, 0);
 		totalram_pages++;
 	}
 }
@@ -210,7 +210,7 @@ static unsigned long __init free_all_bootmem_core(bootmem_data_t *bdata)
 		if (IS_ALIGNED(start, BITS_PER_LONG) && vec == ~0UL) {
 			int order = ilog2(BITS_PER_LONG);
 
-			__free_pages_bootmem(pfn_to_page(start), order);
+			__free_pages_bootmem(pfn_to_page(start), start, order);
 			count += BITS_PER_LONG;
 			start += BITS_PER_LONG;
 		} else {
@@ -220,7 +220,7 @@ static unsigned long __init free_all_bootmem_core(bootmem_data_t *bdata)
 			while (vec && cur != start) {
 				if (vec & 1) {
 					page = pfn_to_page(cur);
-					__free_pages_bootmem(page, 0);
+					__free_pages_bootmem(page, cur, 0);
 					count++;
 				}
 				vec >>= 1;
@@ -234,7 +234,7 @@ static unsigned long __init free_all_bootmem_core(bootmem_data_t *bdata)
 	pages = bootmem_bootmap_pages(pages);
 	count += pages;
 	while (pages--)
-		__free_pages_bootmem(page++, 0);
+		__free_pages_bootmem(page++, cur++, 0);
 
 	bdebug("nid=%td released=%lx\n", bdata - bootmem_node_data, count);
 
diff --git a/mm/internal.h b/mm/internal.h
index a96da5b0029d..76b605139c7a 100644
--- a/mm/internal.h
+++ b/mm/internal.h
@@ -155,7 +155,8 @@ __find_buddy_index(unsigned long page_idx, unsigned int order)
 }
 
 extern int __isolate_free_page(struct page *page, unsigned int order);
-extern void __free_pages_bootmem(struct page *page, unsigned int order);
+extern void __free_pages_bootmem(struct page *page, unsigned long pfn,
+					unsigned int order);
 extern void prep_compound_page(struct page *page, unsigned long order);
 #ifdef CONFIG_MEMORY_FAILURE
 extern bool is_free_buddy_page(struct page *page);
diff --git a/mm/memblock.c b/mm/memblock.c
index e0cc2d174f74..f3e97d8eeb5c 100644
--- a/mm/memblock.c
+++ b/mm/memblock.c
@@ -1334,7 +1334,7 @@ void __init __memblock_free_late(phys_addr_t base, phys_addr_t size)
 	end = PFN_DOWN(base + size);
 
 	for (; cursor < end; cursor++) {
-		__free_pages_bootmem(pfn_to_page(cursor), 0);
+		__free_pages_bootmem(pfn_to_page(cursor), cursor, 0);
 		totalram_pages++;
 	}
 }
diff --git a/mm/nobootmem.c b/mm/nobootmem.c
index 396f9e450dc1..bae652713ee5 100644
--- a/mm/nobootmem.c
+++ b/mm/nobootmem.c
@@ -77,7 +77,7 @@ void __init free_bootmem_late(unsigned long addr, unsigned long size)
 	end = PFN_DOWN(addr + size);
 
 	for (; cursor < end; cursor++) {
-		__free_pages_bootmem(pfn_to_page(cursor), 0);
+		__free_pages_bootmem(pfn_to_page(cursor), cursor, 0);
 		totalram_pages++;
 	}
 }
@@ -92,7 +92,7 @@ static void __init __free_pages_memory(unsigned long start, unsigned long end)
 		while (start + (1UL << order) > end)
 			order--;
 
-		__free_pages_bootmem(pfn_to_page(start), order);
+		__free_pages_bootmem(pfn_to_page(start), start, order);
 
 		start += (1UL << order);
 	}
diff --git a/mm/page_alloc.c b/mm/page_alloc.c
index 2abb3b861e70..0a0e0f280d87 100644
--- a/mm/page_alloc.c
+++ b/mm/page_alloc.c
@@ -886,7 +886,8 @@ static void __free_pages_ok(struct page *page, unsigned int order)
 	local_irq_restore(flags);
 }
 
-void __init __free_pages_bootmem(struct page *page, unsigned int order)
+void __init __free_pages_bootmem(struct page *page, unsigned long pfn,
+							unsigned int order)
 {
 	unsigned int nr_pages = 1 << order;
 	struct page *p = page;
-- 
2.1.2


* [PATCH 05/13] mm: meminit: Make __early_pfn_to_nid SMP-safe and introduce meminit_pfn_in_nid
From: Mel Gorman @ 2015-04-22 17:07 UTC (permalink / raw)
  To: Linux-MM
  Cc: Nathan Zimmer, Dave Hansen, Waiman Long, Scott Norton,
	Daniel J Blueman, LKML, Mel Gorman

The generic and arch-specific implementations of __early_pfn_to_nid()
use static variables to cache recent lookups. Without the cache, boot
times are much higher due to the excessive memblock lookups, but the
cache assumes that memory initialisation is single-threaded. Parallel
initialisation of struct pages will break that assumption, so this patch
makes __early_pfn_to_nid() SMP-safe by requiring the caller to supply a
cache of recent search information. early_pfn_to_nid() keeps the same
interface but is only safe to use early in boot because it relies on a
global static variable. meminit_pfn_in_nid() is an SMP-safe version for
which callers must maintain their own state.
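
A sketch of the intended SMP-safe calling pattern follows. The loop and
the pfn range are illustrative, but the cache structure and
meminit_pfn_in_nid() are as introduced by this patch; each initialising
thread keeps its own cache rather than sharing static state:

  struct mminit_pfnnid_cache nid_init_state = { };
  unsigned long pfn;

  for (pfn = start_pfn; pfn < end_pfn; pfn++) {
  	if (!meminit_pfn_in_nid(pfn, nid, &nid_init_state))
  		continue;
  	/* pfn belongs to this node, initialise its struct page */
  }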

Signed-off-by: Mel Gorman <mgorman@suse.de>
---
 arch/ia64/mm/numa.c    | 19 +++++++------------
 include/linux/mm.h     |  8 ++++++--
 include/linux/mmzone.h | 16 +++++++++++++++-
 mm/page_alloc.c        | 40 +++++++++++++++++++++++++---------------
 4 files changed, 53 insertions(+), 30 deletions(-)

diff --git a/arch/ia64/mm/numa.c b/arch/ia64/mm/numa.c
index ea21d4cad540..aa19b7ac8222 100644
--- a/arch/ia64/mm/numa.c
+++ b/arch/ia64/mm/numa.c
@@ -58,27 +58,22 @@ paddr_to_nid(unsigned long paddr)
  * SPARSEMEM to allocate the SPARSEMEM sectionmap on the NUMA node where
  * the section resides.
  */
-int __meminit __early_pfn_to_nid(unsigned long pfn)
+int __meminit __early_pfn_to_nid(unsigned long pfn,
+					struct mminit_pfnnid_cache *state)
 {
 	int i, section = pfn >> PFN_SECTION_SHIFT, ssec, esec;
-	/*
-	 * NOTE: The following SMP-unsafe globals are only used early in boot
-	 * when the kernel is running single-threaded.
-	 */
-	static int __meminitdata last_ssec, last_esec;
-	static int __meminitdata last_nid;
 
-	if (section >= last_ssec && section < last_esec)
-		return last_nid;
+	if (section >= state->last_start && section < state->last_end)
+		return state->last_nid;
 
 	for (i = 0; i < num_node_memblks; i++) {
 		ssec = node_memblk[i].start_paddr >> PA_SECTION_SHIFT;
 		esec = (node_memblk[i].start_paddr + node_memblk[i].size +
 			((1L << PA_SECTION_SHIFT) - 1)) >> PA_SECTION_SHIFT;
 		if (section >= ssec && section < esec) {
-			last_ssec = ssec;
-			last_esec = esec;
-			last_nid = node_memblk[i].nid;
+			state->last_start = ssec;
+			state->last_end = esec;
+			state->last_nid = node_memblk[i].nid;
 			return node_memblk[i].nid;
 		}
 	}
diff --git a/include/linux/mm.h b/include/linux/mm.h
index b6f82a31028a..3a4c9f72c080 100644
--- a/include/linux/mm.h
+++ b/include/linux/mm.h
@@ -1802,7 +1802,8 @@ extern void sparse_memory_present_with_active_regions(int nid);
 
 #if !defined(CONFIG_HAVE_MEMBLOCK_NODE_MAP) && \
     !defined(CONFIG_HAVE_ARCH_EARLY_PFN_TO_NID)
-static inline int __early_pfn_to_nid(unsigned long pfn)
+static inline int __early_pfn_to_nid(unsigned long pfn,
+					struct mminit_pfnnid_cache *state)
 {
 	return 0;
 }
@@ -1810,7 +1811,10 @@ static inline int __early_pfn_to_nid(unsigned long pfn)
 /* please see mm/page_alloc.c */
 extern int __meminit early_pfn_to_nid(unsigned long pfn);
 /* there is a per-arch backend function. */
-extern int __meminit __early_pfn_to_nid(unsigned long pfn);
+extern int __meminit __early_pfn_to_nid(unsigned long pfn,
+					struct mminit_pfnnid_cache *state);
+bool __meminit meminit_pfn_in_nid(unsigned long pfn, int node,
+					struct mminit_pfnnid_cache *state);
 #endif
 
 extern void set_dma_reserve(unsigned long new_dma_reserve);
diff --git a/include/linux/mmzone.h b/include/linux/mmzone.h
index 2782df47101e..a67b33e52dfe 100644
--- a/include/linux/mmzone.h
+++ b/include/linux/mmzone.h
@@ -1216,10 +1216,24 @@ void sparse_init(void);
 #define sparse_index_init(_sec, _nid)  do {} while (0)
 #endif /* CONFIG_SPARSEMEM */
 
+/*
+ * During memory init memblocks map pfns to nids. The search is expensive and
+ * this caches recent lookups. The implementation of __early_pfn_to_nid
+ * may treat start/end as pfns or sections.
+ */
+struct mminit_pfnnid_cache {
+	unsigned long last_start;
+	unsigned long last_end;
+	int last_nid;
+};
+
 #ifdef CONFIG_NODES_SPAN_OTHER_NODES
 bool early_pfn_in_nid(unsigned long pfn, int nid);
+bool meminit_pfn_in_nid(unsigned long pfn, int node,
+			struct mminit_pfnnid_cache *state);
 #else
-#define early_pfn_in_nid(pfn, nid)	(1)
+#define early_pfn_in_nid(pfn, nid)		(1)
+#define meminit_pfn_in_nid(pfn, nid, state)	(1)
 #endif
 
 #ifndef early_pfn_valid
diff --git a/mm/page_alloc.c b/mm/page_alloc.c
index 0a0e0f280d87..f556ed63b964 100644
--- a/mm/page_alloc.c
+++ b/mm/page_alloc.c
@@ -4457,39 +4457,41 @@ int __meminit init_currently_empty_zone(struct zone *zone,
 
 #ifdef CONFIG_HAVE_MEMBLOCK_NODE_MAP
 #ifndef CONFIG_HAVE_ARCH_EARLY_PFN_TO_NID
+
 /*
  * Required by SPARSEMEM. Given a PFN, return what node the PFN is on.
  */
-int __meminit __early_pfn_to_nid(unsigned long pfn)
+int __meminit __early_pfn_to_nid(unsigned long pfn,
+					struct mminit_pfnnid_cache *state)
 {
 	unsigned long start_pfn, end_pfn;
 	int nid;
-	/*
-	 * NOTE: The following SMP-unsafe globals are only used early in boot
-	 * when the kernel is running single-threaded.
-	 */
-	static unsigned long __meminitdata last_start_pfn, last_end_pfn;
-	static int __meminitdata last_nid;
 
-	if (last_start_pfn <= pfn && pfn < last_end_pfn)
-		return last_nid;
+	if (state->last_start <= pfn && pfn < state->last_end)
+		return state->last_nid;
 
 	nid = memblock_search_pfn_nid(pfn, &start_pfn, &end_pfn);
 	if (nid != -1) {
-		last_start_pfn = start_pfn;
-		last_end_pfn = end_pfn;
-		last_nid = nid;
+		state->last_start = start_pfn;
+		state->last_end = end_pfn;
+		state->last_nid = nid;
 	}
 
 	return nid;
 }
 #endif /* CONFIG_HAVE_ARCH_EARLY_PFN_TO_NID */
 
+struct __meminitdata mminit_pfnnid_cache global_init_state;
+
+/* Only safe to use early in boot when initialisation is single-threaded */
 int __meminit early_pfn_to_nid(unsigned long pfn)
 {
 	int nid;
 
-	nid = __early_pfn_to_nid(pfn);
+	/* The system will behave unpredictably otherwise */
+	BUG_ON(system_state != SYSTEM_BOOTING);
+
+	nid = __early_pfn_to_nid(pfn, &global_init_state);
 	if (nid >= 0)
 		return nid;
 	/* just returns 0 */
@@ -4497,15 +4499,23 @@ int __meminit early_pfn_to_nid(unsigned long pfn)
 }
 
 #ifdef CONFIG_NODES_SPAN_OTHER_NODES
-bool __meminit early_pfn_in_nid(unsigned long pfn, int node)
+bool __meminit meminit_pfn_in_nid(unsigned long pfn, int node,
+					struct mminit_pfnnid_cache *state)
 {
 	int nid;
 
-	nid = __early_pfn_to_nid(pfn);
+	nid = __early_pfn_to_nid(pfn, state);
 	if (nid >= 0 && nid != node)
 		return false;
 	return true;
 }
+
+/* Only safe to use early in boot when initialisation is single-threaded */
+bool __meminit early_pfn_in_nid(unsigned long pfn, int node)
+{
+	return meminit_pfn_in_nid(pfn, node, &global_init_state);
+}
+
 #endif
 
 /**
-- 
2.1.2


* [PATCH 06/13] mm: meminit: Inline some helper functions
From: Mel Gorman @ 2015-04-22 17:07 UTC (permalink / raw)
  To: Linux-MM
  Cc: Nathan Zimmer, Dave Hansen, Waiman Long, Scott Norton,
	Daniel J Blueman, LKML, Mel Gorman

early_pfn_in_nid() and meminit_pfn_in_nid() are small functions that are
unnecessarily visible outside memory initialisation. As well as the
unnecessary visibility, calling them out of line adds needless function
call overhead when initialising pages. This patch moves the helpers inline.

Signed-off-by: Mel Gorman <mgorman@suse.de>
---
 include/linux/mm.h     |  2 --
 include/linux/mmzone.h | 17 -----------
 mm/page_alloc.c        | 80 +++++++++++++++++++++++++++-----------------------
 3 files changed, 43 insertions(+), 56 deletions(-)

diff --git a/include/linux/mm.h b/include/linux/mm.h
index 3a4c9f72c080..a8a8b161fd65 100644
--- a/include/linux/mm.h
+++ b/include/linux/mm.h
@@ -1813,8 +1813,6 @@ extern int __meminit early_pfn_to_nid(unsigned long pfn);
 /* there is a per-arch backend function. */
 extern int __meminit __early_pfn_to_nid(unsigned long pfn,
 					struct mminit_pfnnid_cache *state);
-bool __meminit meminit_pfn_in_nid(unsigned long pfn, int node,
-					struct mminit_pfnnid_cache *state);
 #endif
 
 extern void set_dma_reserve(unsigned long new_dma_reserve);
diff --git a/include/linux/mmzone.h b/include/linux/mmzone.h
index a67b33e52dfe..f78ca65a9884 100644
--- a/include/linux/mmzone.h
+++ b/include/linux/mmzone.h
@@ -1036,14 +1036,6 @@ static inline struct zoneref *first_zones_zonelist(struct zonelist *zonelist,
 #include <asm/sparsemem.h>
 #endif
 
-#if !defined(CONFIG_HAVE_ARCH_EARLY_PFN_TO_NID) && \
-	!defined(CONFIG_HAVE_MEMBLOCK_NODE_MAP)
-static inline unsigned long early_pfn_to_nid(unsigned long pfn)
-{
-	return 0;
-}
-#endif
-
 #ifdef CONFIG_FLATMEM
 #define pfn_to_nid(pfn)		(0)
 #endif
@@ -1227,15 +1219,6 @@ struct mminit_pfnnid_cache {
 	int last_nid;
 };
 
-#ifdef CONFIG_NODES_SPAN_OTHER_NODES
-bool early_pfn_in_nid(unsigned long pfn, int nid);
-bool meminit_pfn_in_nid(unsigned long pfn, int node,
-			struct mminit_pfnnid_cache *state);
-#else
-#define early_pfn_in_nid(pfn, nid)		(1)
-#define meminit_pfn_in_nid(pfn, nid, state)	(1)
-#endif
-
 #ifndef early_pfn_valid
 #define early_pfn_valid(pfn)	(1)
 #endif
diff --git a/mm/page_alloc.c b/mm/page_alloc.c
index f556ed63b964..b148c9921740 100644
--- a/mm/page_alloc.c
+++ b/mm/page_alloc.c
@@ -907,6 +907,49 @@ void __init __free_pages_bootmem(struct page *page, unsigned long pfn,
 	__free_pages(page, order);
 }
 
+#if !defined(CONFIG_HAVE_ARCH_EARLY_PFN_TO_NID) && \
+        !defined(CONFIG_HAVE_MEMBLOCK_NODE_MAP)
+static inline unsigned long early_pfn_to_nid(unsigned long pfn)
+{
+	return 0;
+}
+#else
+/* Only safe to use early in boot when initialisation is single-threaded */
+struct __meminitdata mminit_pfnnid_cache global_init_state;
+int __meminit early_pfn_to_nid(unsigned long pfn)
+{
+	int nid;
+
+	/* The system will behave unpredictably otherwise */
+	BUG_ON(system_state != SYSTEM_BOOTING);
+
+	nid = __early_pfn_to_nid(pfn, &global_init_state);
+	if (nid >= 0)
+		return nid;
+	/* just returns 0 */
+	return 0;
+}
+#endif
+
+#ifdef CONFIG_NODES_SPAN_OTHER_NODES
+static inline bool __meminit meminit_pfn_in_nid(unsigned long pfn, int node,
+					struct mminit_pfnnid_cache *state)
+{
+	int nid;
+
+	nid = __early_pfn_to_nid(pfn, state);
+	if (nid >= 0 && nid != node)
+		return false;
+	return true;
+}
+
+/* Only safe to use early in boot when initialisation is single-threaded */
+static inline bool __meminit early_pfn_in_nid(unsigned long pfn, int node)
+{
+	return meminit_pfn_in_nid(pfn, node, &global_init_state);
+}
+#endif
+
 #ifdef CONFIG_CMA
 /* Free whole pageblock and set its migration type to MIGRATE_CMA. */
 void __init init_cma_reserved_pageblock(struct page *page)
@@ -4481,43 +4524,6 @@ int __meminit __early_pfn_to_nid(unsigned long pfn,
 }
 #endif /* CONFIG_HAVE_ARCH_EARLY_PFN_TO_NID */
 
-struct __meminitdata mminit_pfnnid_cache global_init_state;
-
-/* Only safe to use early in boot when initialisation is single-threaded */
-int __meminit early_pfn_to_nid(unsigned long pfn)
-{
-	int nid;
-
-	/* The system will behave unpredictably otherwise */
-	BUG_ON(system_state != SYSTEM_BOOTING);
-
-	nid = __early_pfn_to_nid(pfn, &global_init_state);
-	if (nid >= 0)
-		return nid;
-	/* just returns 0 */
-	return 0;
-}
-
-#ifdef CONFIG_NODES_SPAN_OTHER_NODES
-bool __meminit meminit_pfn_in_nid(unsigned long pfn, int node,
-					struct mminit_pfnnid_cache *state)
-{
-	int nid;
-
-	nid = __early_pfn_to_nid(pfn, state);
-	if (nid >= 0 && nid != node)
-		return false;
-	return true;
-}
-
-/* Only safe to use early in boot when initialisation is single-threaded */
-bool __meminit early_pfn_in_nid(unsigned long pfn, int node)
-{
-	return meminit_pfn_in_nid(pfn, node, &global_init_state);
-}
-
-#endif
-
 /**
  * free_bootmem_with_active_regions - Call memblock_free_early_nid for each active range
  * @nid: The node to free memory on. If MAX_NUMNODES, all nodes are freed.
-- 
2.1.2


^ permalink raw reply related	[flat|nested] 48+ messages in thread

* [PATCH 07/13] mm: meminit: Only initialise a subset of struct pages if CONFIG_DEFERRED_STRUCT_PAGE_INIT is set
  2015-04-22 17:07 ` Mel Gorman
@ 2015-04-22 17:07   ` Mel Gorman
  -1 siblings, 0 replies; 48+ messages in thread
From: Mel Gorman @ 2015-04-22 17:07 UTC (permalink / raw)
  To: Linux-MM
  Cc: Nathan Zimmer, Dave Hansen, Waiman Long, Scott Norton,
	Daniel J Blueman, LKML, Mel Gorman

This patch initialises all low memory struct pages and 2G of the highest zone
on each node during memory initialisation if CONFIG_DEFERRED_STRUCT_PAGE_INIT
is set. That config option cannot be set yet; it becomes selectable in a later
patch.  Parallel initialisation of struct pages depends on some features from
memory hotplug and it is necessary to alter section annotations.
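
For reference (an illustrative calculation, assuming 4K pages so that
PAGE_SHIFT == 12), the threshold used in update_defer_init() below is
2UL << (30 - PAGE_SHIFT) == 524288 pages, i.e. 2G of the highest zone is
initialised per node before the remainder is deferred.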

Signed-off-by: Mel Gorman <mgorman@suse.de>
---
 include/linux/mmzone.h |  8 +++++
 mm/internal.h          |  8 +++++
 mm/page_alloc.c        | 81 ++++++++++++++++++++++++++++++++++++++++++++++++--
 3 files changed, 94 insertions(+), 3 deletions(-)

diff --git a/include/linux/mmzone.h b/include/linux/mmzone.h
index f78ca65a9884..821f5000dec9 100644
--- a/include/linux/mmzone.h
+++ b/include/linux/mmzone.h
@@ -762,6 +762,14 @@ typedef struct pglist_data {
 	/* Number of pages migrated during the rate limiting time interval */
 	unsigned long numabalancing_migrate_nr_pages;
 #endif
+
+#ifdef CONFIG_DEFERRED_STRUCT_PAGE_INIT
+	/*
+	 * If memory initialisation on large machines is deferred then this
+	 * is the first PFN that needs to be initialised.
+	 */
+	unsigned long first_deferred_pfn;
+#endif /* CONFIG_DEFERRED_STRUCT_PAGE_INIT */
 } pg_data_t;
 
 #define node_present_pages(nid)	(NODE_DATA(nid)->node_present_pages)
diff --git a/mm/internal.h b/mm/internal.h
index 76b605139c7a..4a73f74846bd 100644
--- a/mm/internal.h
+++ b/mm/internal.h
@@ -385,6 +385,14 @@ static inline void mminit_verify_zonelist(void)
 }
 #endif /* CONFIG_DEBUG_MEMORY_INIT */
 
+#ifdef CONFIG_DEFERRED_STRUCT_PAGE_INIT
+#define __defermem_init __meminit
+#define __defer_init    __meminit
+#else
+#define __defermem_init
+#define __defer_init __init
+#endif
+
 /* mminit_validate_memmodel_limits is independent of CONFIG_DEBUG_MEMORY_INIT */
 #if defined(CONFIG_SPARSEMEM)
 extern void mminit_validate_memmodel_limits(unsigned long *start_pfn,
diff --git a/mm/page_alloc.c b/mm/page_alloc.c
index b148c9921740..2d649e0a1f9e 100644
--- a/mm/page_alloc.c
+++ b/mm/page_alloc.c
@@ -235,6 +235,67 @@ EXPORT_SYMBOL(nr_online_nodes);
 
 int page_group_by_mobility_disabled __read_mostly;
 
+#ifdef CONFIG_DEFERRED_STRUCT_PAGE_INIT
+static inline void reset_deferred_meminit(pg_data_t *pgdat)
+{
+	pgdat->first_deferred_pfn = ULONG_MAX;
+}
+
+/* Returns true if the struct page for the pfn is uninitialised */
+static inline bool __defermem_init early_page_uninitialised(unsigned long pfn)
+{
+	int nid = early_pfn_to_nid(pfn);
+
+	if (pfn >= NODE_DATA(nid)->first_deferred_pfn)
+		return true;
+
+	return false;
+}
+
+/*
+ * Returns false when the remaining initialisation should be deferred until
+ * later in the boot cycle when it can be parallelised.
+ */
+static inline bool update_defer_init(pg_data_t *pgdat,
+				unsigned long pfn, unsigned long zone_end,
+				unsigned long *nr_initialised)
+{
+	if (!deferred_mem_init_enabled)
+		return true;
+
+	/* Always populate low zones for address-constrained allocations */
+	if (zone_end < pgdat_end_pfn(pgdat))
+		return true;
+
+	/* Initialise at least 2G of the highest zone */
+	(*nr_initialised)++;
+	if (*nr_initialised > (2UL << (30 - PAGE_SHIFT)) &&
+	    (pfn & (PAGES_PER_SECTION - 1)) == 0) {
+		pgdat->first_deferred_pfn = pfn;
+		return false;
+	}
+
+	return true;
+}
+#else
+static inline void reset_deferred_meminit(pg_data_t *pgdat)
+{
+}
+
+static inline bool early_page_uninitialised(unsigned long pfn)
+{
+	return false;
+}
+
+static inline bool update_defer_init(pg_data_t *pgdat,
+				unsigned long pfn, unsigned long zone_end,
+				unsigned long *nr_initialised)
+{
+	return true;
+}
+#endif
+
+
 void set_pageblock_migratetype(struct page *page, int migratetype)
 {
 	if (unlikely(page_group_by_mobility_disabled &&
@@ -886,8 +947,8 @@ static void __free_pages_ok(struct page *page, unsigned int order)
 	local_irq_restore(flags);
 }
 
-void __init __free_pages_bootmem(struct page *page, unsigned long pfn,
-							unsigned int order)
+static void __defer_init __free_pages_boot_core(struct page *page,
+					unsigned long pfn, unsigned int order)
 {
 	unsigned int nr_pages = 1 << order;
 	struct page *p = page;
@@ -950,6 +1011,14 @@ static inline bool __meminit early_pfn_in_nid(unsigned long pfn, int node)
 }
 #endif
 
+void __defer_init __free_pages_bootmem(struct page *page, unsigned long pfn,
+							unsigned int order)
+{
+	if (early_page_uninitialised(pfn))
+		return;
+	return __free_pages_boot_core(page, pfn, order);
+}
+
 #ifdef CONFIG_CMA
 /* Free whole pageblock and set its migration type to MIGRATE_CMA. */
 void __init init_cma_reserved_pageblock(struct page *page)
@@ -4222,14 +4291,16 @@ static void setup_zone_migrate_reserve(struct zone *zone)
 void __meminit memmap_init_zone(unsigned long size, int nid, unsigned long zone,
 		unsigned long start_pfn, enum memmap_context context)
 {
+	pg_data_t *pgdat = NODE_DATA(nid);
 	unsigned long end_pfn = start_pfn + size;
 	unsigned long pfn;
 	struct zone *z;
+	unsigned long nr_initialised = 0;
 
 	if (highest_memmap_pfn < end_pfn - 1)
 		highest_memmap_pfn = end_pfn - 1;
 
-	z = &NODE_DATA(nid)->node_zones[zone];
+	z = &pgdat->node_zones[zone];
 	for (pfn = start_pfn; pfn < end_pfn; pfn++) {
 		/*
 		 * There can be holes in boot-time mem_map[]s
@@ -4241,6 +4312,9 @@ void __meminit memmap_init_zone(unsigned long size, int nid, unsigned long zone,
 				continue;
 			if (!early_pfn_in_nid(pfn, nid))
 				continue;
+			if (!update_defer_init(pgdat, pfn, end_pfn,
+						&nr_initialised))
+				break;
 		}
 		__init_single_pfn(pfn, zone, nid);
 	}
@@ -5042,6 +5116,7 @@ void __paginginit free_area_init_node(int nid, unsigned long *zones_size,
 	/* pg_data_t should be reset to zero when it's allocated */
 	WARN_ON(pgdat->nr_zones || pgdat->classzone_idx);
 
+	reset_deferred_meminit(pgdat);
 	pgdat->node_id = nid;
 	pgdat->node_start_pfn = node_start_pfn;
 #ifdef CONFIG_HAVE_MEMBLOCK_NODE_MAP
-- 
2.1.2


^ permalink raw reply related	[flat|nested] 48+ messages in thread

* [PATCH 08/13] mm: meminit: Initialise remaining struct pages in parallel with kswapd
  2015-04-22 17:07 ` Mel Gorman
@ 2015-04-22 17:07   ` Mel Gorman
  -1 siblings, 0 replies; 48+ messages in thread
From: Mel Gorman @ 2015-04-22 17:07 UTC (permalink / raw)
  To: Linux-MM
  Cc: Nathan Zimmer, Dave Hansen, Waiman Long, Scott Norton,
	Daniel J Blueman, LKML, Mel Gorman

Only a subset of struct pages are initialised at the moment. When this patch
is applied, kswapd initialises the remaining struct pages in parallel. This
should make boot faster by spreading the work to multiple CPUs and by
initialising data that is local to the CPU.  The user-visible effect on large
machines is that free memory will appear to increase rapidly early in the
lifetime of the system until kswapd reports in the kernel log that all memory
is initialised.  Once initialised, there should be no other user-visible
effects.
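
The parallelism comes from the fact that one kswapd thread runs per node with
memory and each initialises only its own node's pages. A minimal sketch of the
startup flow (illustrative, not taken verbatim from the kernel) is:

	int nid;

	/* Illustrative sketch only: one kswapd per node does local init */
	for_each_node_state(nid, N_MEMORY)
		kswapd_run(nid);	/* kswapd() then calls deferred_init_memmap(nid) */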

Signed-off-by: Mel Gorman <mgorman@suse.de>
---
 mm/internal.h   |   6 +++
 mm/mm_init.c    |   1 +
 mm/page_alloc.c | 116 ++++++++++++++++++++++++++++++++++++++++++++++++++++++--
 mm/vmscan.c     |   6 ++-
 4 files changed, 123 insertions(+), 6 deletions(-)

diff --git a/mm/internal.h b/mm/internal.h
index 4a73f74846bd..2c4057140bec 100644
--- a/mm/internal.h
+++ b/mm/internal.h
@@ -388,9 +388,15 @@ static inline void mminit_verify_zonelist(void)
 #ifdef CONFIG_DEFERRED_STRUCT_PAGE_INIT
 #define __defermem_init __meminit
 #define __defer_init    __meminit
+
+void deferred_init_memmap(int nid);
 #else
 #define __defermem_init
 #define __defer_init __init
+
+static inline void deferred_init_memmap(int nid)
+{
+}
 #endif
 
 /* mminit_validate_memmodel_limits is independent of CONFIG_DEBUG_MEMORY_INIT */
diff --git a/mm/mm_init.c b/mm/mm_init.c
index 5f420f7fafa1..28fbf87b20aa 100644
--- a/mm/mm_init.c
+++ b/mm/mm_init.c
@@ -11,6 +11,7 @@
 #include <linux/export.h>
 #include <linux/memory.h>
 #include <linux/notifier.h>
+#include <linux/sched.h>
 #include "internal.h"
 
 #ifdef CONFIG_DEBUG_MEMORY_INIT
diff --git a/mm/page_alloc.c b/mm/page_alloc.c
index 2d649e0a1f9e..dfe63a3c3816 100644
--- a/mm/page_alloc.c
+++ b/mm/page_alloc.c
@@ -252,6 +252,14 @@ static inline bool __defermem_init early_page_uninitialised(unsigned long pfn)
 	return false;
 }
 
+static inline bool early_page_nid_uninitialised(unsigned long pfn, int nid)
+{
+	if (pfn >= NODE_DATA(nid)->first_deferred_pfn)
+		return true;
+
+	return false;
+}
+
 /*
  * Returns false when the remaining initialisation should be deferred until
  * later in the boot cycle when it can be parallelised.
@@ -287,6 +295,11 @@ static inline bool early_page_uninitialised(unsigned long pfn)
 	return false;
 }
 
+static inline bool early_page_nid_uninitialised(unsigned long pfn, int nid)
+{
+	return false;
+}
+
 static inline bool update_defer_init(pg_data_t *pgdat,
 				unsigned long pfn, unsigned long zone_end,
 				unsigned long *nr_initialised)
@@ -883,14 +896,45 @@ static void __meminit __init_single_pfn(unsigned long pfn, unsigned long zone,
 	return __init_single_page(pfn_to_page(pfn), pfn, zone, nid);
 }
 
-void reserve_bootmem_region(unsigned long start, unsigned long end)
+#ifdef CONFIG_DEFERRED_STRUCT_PAGE_INIT
+static void init_reserved_page(unsigned long pfn)
+{
+	pg_data_t *pgdat;
+	int nid, zid;
+
+	if (!early_page_uninitialised(pfn))
+		return;
+
+	nid = early_pfn_to_nid(pfn);
+	pgdat = NODE_DATA(nid);
+
+	for (zid = 0; zid < MAX_NR_ZONES; zid++) {
+		struct zone *zone = &pgdat->node_zones[zid];
+
+		if (pfn >= zone->zone_start_pfn && pfn < zone_end_pfn(zone))
+			break;
+	}
+	__init_single_pfn(pfn, zid, nid);
+}
+#else
+static inline void init_reserved_page(unsigned long pfn)
+{
+}
+#endif /* CONFIG_DEFERRED_STRUCT_PAGE_INIT */
+
+void __meminit reserve_bootmem_region(unsigned long start, unsigned long end)
 {
 	unsigned long start_pfn = PFN_DOWN(start);
 	unsigned long end_pfn = PFN_UP(end);
 
-	for (; start_pfn < end_pfn; start_pfn++)
-		if (pfn_valid(start_pfn))
-			SetPageReserved(pfn_to_page(start_pfn));
+	for (; start_pfn < end_pfn; start_pfn++) {
+		if (pfn_valid(start_pfn)) {
+			struct page *page = pfn_to_page(start_pfn);
+
+			init_reserved_page(start_pfn);
+			SetPageReserved(page);
+		}
+	}
 }
 
 static bool free_pages_prepare(struct page *page, unsigned int order)
@@ -1019,6 +1063,67 @@ void __defer_init __free_pages_bootmem(struct page *page, unsigned long pfn,
 	return __free_pages_boot_core(page, pfn, order);
 }
 
+#ifdef CONFIG_DEFERRED_STRUCT_PAGE_INIT
+/* Initialise remaining memory on a node */
+void __defermem_init deferred_init_memmap(int nid)
+{
+	unsigned long start = jiffies;
+	struct mminit_pfnnid_cache nid_init_state = { };
+
+	pg_data_t *pgdat = NODE_DATA(nid);
+	int zid;
+	unsigned long first_init_pfn = pgdat->first_deferred_pfn;
+
+	if (first_init_pfn == ULONG_MAX)
+		return;
+
+	/* Sanity check boundaries */
+	BUG_ON(pgdat->first_deferred_pfn < pgdat->node_start_pfn);
+	BUG_ON(pgdat->first_deferred_pfn > pgdat_end_pfn(pgdat));
+	pgdat->first_deferred_pfn = ULONG_MAX;
+
+	for (zid = 0; zid < MAX_NR_ZONES; zid++) {
+		struct zone *zone = pgdat->node_zones + zid;
+		unsigned long walk_start, walk_end;
+		int i;
+
+		for_each_mem_pfn_range(i, nid, &walk_start, &walk_end, NULL) {
+			unsigned long pfn, end_pfn;
+
+			end_pfn = min(walk_end, zone_end_pfn(zone));
+			pfn = first_init_pfn;
+			if (pfn < walk_start)
+				pfn = walk_start;
+			if (pfn < zone->zone_start_pfn)
+				pfn = zone->zone_start_pfn;
+
+			for (; pfn < end_pfn; pfn++) {
+				struct page *page;
+
+				if (!pfn_valid(pfn))
+					continue;
+
+				if (!meminit_pfn_in_nid(pfn, nid, &nid_init_state))
+					continue;
+
+				if (page->flags) {
+					VM_BUG_ON(page_zone(page) != zone);
+					continue;
+				}
+
+				__init_single_page(page, pfn, zid, nid);
+				__free_pages_boot_core(page, pfn, 0);
+				cond_resched();
+			}
+			first_init_pfn = max(end_pfn, first_init_pfn);
+		}
+	}
+
+	pr_info("kswapd %d initialised deferred memory in %ums\n", nid,
+					jiffies_to_msecs(jiffies - start));
+}
+#endif /* CONFIG_DEFERRED_STRUCT_PAGE_INIT */
+
 #ifdef CONFIG_CMA
 /* Free whole pageblock and set its migration type to MIGRATE_CMA. */
 void __init init_cma_reserved_pageblock(struct page *page)
@@ -4229,6 +4334,9 @@ static void setup_zone_migrate_reserve(struct zone *zone)
 	zone->nr_migrate_reserve_block = reserve;
 
 	for (pfn = start_pfn; pfn < end_pfn; pfn += pageblock_nr_pages) {
+		if (!early_page_nid_uninitialised(pfn, zone_to_nid(zone)))
+			return;
+
 		if (!pfn_valid(pfn))
 			continue;
 		page = pfn_to_page(pfn);
diff --git a/mm/vmscan.c b/mm/vmscan.c
index 5e8eadd71bac..c4895d26d036 100644
--- a/mm/vmscan.c
+++ b/mm/vmscan.c
@@ -3348,7 +3348,7 @@ static void kswapd_try_to_sleep(pg_data_t *pgdat, int order, int classzone_idx)
  * If there are applications that are active memory-allocators
  * (most normal use), this basically shouldn't matter.
  */
-static int kswapd(void *p)
+static int __defermem_init kswapd(void *p)
 {
 	unsigned long order, new_order;
 	unsigned balanced_order;
@@ -3383,6 +3383,8 @@ static int kswapd(void *p)
 	tsk->flags |= PF_MEMALLOC | PF_SWAPWRITE | PF_KSWAPD;
 	set_freezable();
 
+	deferred_init_memmap(pgdat->node_id);
+
 	order = new_order = 0;
 	balanced_order = 0;
 	classzone_idx = new_classzone_idx = pgdat->nr_zones - 1;
@@ -3538,7 +3540,7 @@ static int cpu_callback(struct notifier_block *nfb, unsigned long action,
  * This kswapd start function will be called by init and node-hot-add.
  * On node-hot-add, kswapd will moved to proper cpus if cpus are hot-added.
  */
-int kswapd_run(int nid)
+int __defermem_init kswapd_run(int nid)
 {
 	pg_data_t *pgdat = NODE_DATA(nid);
 	int ret = 0;
-- 
2.1.2


^ permalink raw reply related	[flat|nested] 48+ messages in thread

* [PATCH 09/13] mm: meminit: Minimise number of pfn->page lookups during initialisation
  2015-04-22 17:07 ` Mel Gorman
@ 2015-04-22 17:07   ` Mel Gorman
  -1 siblings, 0 replies; 48+ messages in thread
From: Mel Gorman @ 2015-04-22 17:07 UTC (permalink / raw)
  To: Linux-MM
  Cc: Nathan Zimmer, Dave Hansen, Waiman Long, Scott Norton,
	Daniel J Blueman, LKML, Mel Gorman

Deferred struct page initialisation uses pfn_to_page() for every PFN, which
is unnecessary. This patch minimises the number of lookups and scheduler
checks.
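
A short note on why this is safe (illustrative, not from the patch): within a
MAX_ORDER-aligned block that passes pfn_valid(), the struct pages backing the
PFNs are contiguous in the memmap, so the pointer can simply be advanced:

	/* Illustrative sketch of the lookup-avoidance pattern used below */
	if (page && (pfn & (MAX_ORDER_NR_PAGES - 1)) != 0)
		page++;				/* same block: step the pointer */
	else
		page = pfn_to_page(pfn);	/* new block: one lookup */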

Signed-off-by: Mel Gorman <mgorman@suse.de>
---
 mm/page_alloc.c | 29 ++++++++++++++++++++++++-----
 1 file changed, 24 insertions(+), 5 deletions(-)

diff --git a/mm/page_alloc.c b/mm/page_alloc.c
index dfe63a3c3816..839e4c73ce6d 100644
--- a/mm/page_alloc.c
+++ b/mm/page_alloc.c
@@ -1089,6 +1089,7 @@ void __defermem_init deferred_init_memmap(int nid)
 
 		for_each_mem_pfn_range(i, nid, &walk_start, &walk_end, NULL) {
 			unsigned long pfn, end_pfn;
+			struct page *page = NULL;
 
 			end_pfn = min(walk_end, zone_end_pfn(zone));
 			pfn = first_init_pfn;
@@ -1098,13 +1099,32 @@ void __defermem_init deferred_init_memmap(int nid)
 				pfn = zone->zone_start_pfn;
 
 			for (; pfn < end_pfn; pfn++) {
-				struct page *page;
-
-				if (!pfn_valid(pfn))
+				if (!pfn_valid_within(pfn))
 					continue;
 
-				if (!meminit_pfn_in_nid(pfn, nid, &nid_init_state))
+				/*
+				 * Ensure pfn_valid is checked every
+				 * MAX_ORDER_NR_PAGES for memory holes
+				 */
+				if ((pfn & (MAX_ORDER_NR_PAGES - 1)) == 0) {
+					if (!pfn_valid(pfn)) {
+						page = NULL;
+						continue;
+					}
+				}
+
+				if (!meminit_pfn_in_nid(pfn, nid, &nid_init_state)) {
+					page = NULL;
 					continue;
+				}
+
+				/* Minimise pfn page lookups and scheduler checks */
+				if (page && (pfn & (MAX_ORDER_NR_PAGES - 1)) != 0) {
+					page++;
+				} else {
+					page = pfn_to_page(pfn);
+					cond_resched();
+				}
 
 				if (page->flags) {
 					VM_BUG_ON(page_zone(page) != zone);
@@ -1113,7 +1133,6 @@ void __defermem_init deferred_init_memmap(int nid)
 
 				__init_single_page(page, pfn, zid, nid);
 				__free_pages_boot_core(page, pfn, 0);
-				cond_resched();
 			}
 			first_init_pfn = max(end_pfn, first_init_pfn);
 		}
-- 
2.1.2


^ permalink raw reply related	[flat|nested] 48+ messages in thread

* [PATCH 10/13] x86: mm: Enable deferred struct page initialisation on x86-64
  2015-04-22 17:07 ` Mel Gorman
@ 2015-04-22 17:07   ` Mel Gorman
  -1 siblings, 0 replies; 48+ messages in thread
From: Mel Gorman @ 2015-04-22 17:07 UTC (permalink / raw)
  To: Linux-MM
  Cc: Nathan Zimmer, Dave Hansen, Waiman Long, Scott Norton,
	Daniel J Blueman, LKML, Mel Gorman

This patch adds the Kconfig logic to enable deferred struct page
initialisation on x86-64 when NUMA is enabled. Other architectures may enable
it on a case-by-case basis after auditing early_pfn_to_nid() and testing.
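
Based on the early_param() handler added below, the Kconfig default can also
be overridden from the boot command line, for example:

	defer_meminit=enable
	defer_meminit=disable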

Signed-off-by: Mel Gorman <mgorman@suse.de>
---
 Documentation/kernel-parameters.txt |  6 ++++++
 arch/x86/Kconfig                    |  1 +
 include/linux/mmzone.h              | 14 ++++++++++++++
 init/main.c                         |  1 +
 mm/Kconfig                          | 28 ++++++++++++++++++++++++++++
 mm/page_alloc.c                     | 21 +++++++++++++++++++++
 6 files changed, 71 insertions(+)

diff --git a/Documentation/kernel-parameters.txt b/Documentation/kernel-parameters.txt
index bfcb1a62a7b4..e7c6f7486214 100644
--- a/Documentation/kernel-parameters.txt
+++ b/Documentation/kernel-parameters.txt
@@ -807,6 +807,12 @@ bytes respectively. Such letter suffixes can also be entirely omitted.
 
 	debug_objects	[KNL] Enable object debugging
 
+	defer_meminit=	[KNL,X86] Enable or disable deferred struct page init.
+			Large machines may take a long time to initialise
+			memory management structures. If enabled then only a
+			subset of struct pages is initialised at boot and
+			kswapd initialises the rest in parallel.
+
 	no_debug_objects
 			[KNL] Disable object debugging
 
diff --git a/arch/x86/Kconfig b/arch/x86/Kconfig
index b7d31ca55187..d15d74a052d5 100644
--- a/arch/x86/Kconfig
+++ b/arch/x86/Kconfig
@@ -32,6 +32,7 @@ config X86
 	select HAVE_UNSTABLE_SCHED_CLOCK
 	select ARCH_SUPPORTS_NUMA_BALANCING if X86_64
 	select ARCH_SUPPORTS_INT128 if X86_64
+	select ARCH_SUPPORTS_DEFERRED_STRUCT_PAGE_INIT if X86_64 && NUMA
 	select HAVE_IDE
 	select HAVE_OPROFILE
 	select HAVE_PCSPKR_PLATFORM
diff --git a/include/linux/mmzone.h b/include/linux/mmzone.h
index 821f5000dec9..8ac074db364f 100644
--- a/include/linux/mmzone.h
+++ b/include/linux/mmzone.h
@@ -822,6 +822,20 @@ static inline struct zone *lruvec_zone(struct lruvec *lruvec)
 #endif
 }
 
+
+#ifdef CONFIG_DEFERRED_STRUCT_PAGE_INIT
+extern bool deferred_mem_init_enabled;
+static inline void setup_deferred_meminit(void)
+{
+	if (IS_ENABLED(CONFIG_DEFERRED_STRUCT_PAGE_INIT_DEFAULT_ENABLED))
+		deferred_mem_init_enabled = true;
+}
+#else
+static inline void setup_deferred_meminit(void)
+{
+}
+#endif /* CONFIG_DEFERRED_STRUCT_PAGE_INIT */
+
 #ifdef CONFIG_HAVE_MEMORY_PRESENT
 void memory_present(int nid, unsigned long start, unsigned long end);
 #else
diff --git a/init/main.c b/init/main.c
index 6f0f1c5ff8cc..f339d37a43e8 100644
--- a/init/main.c
+++ b/init/main.c
@@ -506,6 +506,7 @@ asmlinkage __visible void __init start_kernel(void)
 	boot_init_stack_canary();
 
 	cgroup_init_early();
+	setup_deferred_meminit();
 
 	local_irq_disable();
 	early_boot_irqs_disabled = true;
diff --git a/mm/Kconfig b/mm/Kconfig
index a03131b6ba8e..87a4535e0df4 100644
--- a/mm/Kconfig
+++ b/mm/Kconfig
@@ -629,3 +629,31 @@ config MAX_STACK_SIZE_MB
 	  changed to a smaller value in which case that is used.
 
 	  A sane initial value is 80 MB.
+
+# For architectures that support deferred memory initialisation
+config ARCH_SUPPORTS_DEFERRED_STRUCT_PAGE_INIT
+	bool
+
+config DEFERRED_STRUCT_PAGE_INIT
+	bool "Defer initialisation of struct pages to kswapd"
+	default n
+	depends on ARCH_SUPPORTS_DEFERRED_STRUCT_PAGE_INIT
+	depends on MEMORY_HOTPLUG
+	help
+	  Ordinarily all struct pages are initialised during early boot in a
+	  single thread. On very large machines this can take a considerable
+	  amount of time. If this option is set, large machines will bring up
+	  a subset of memmap at boot and then initialise the rest in parallel
+	  when kswapd starts. This has a potential performance impact on
+	  processes running early in the lifetime of the system until kswapd
+	  finishes the initialisation.
+
+config DEFERRED_STRUCT_PAGE_INIT_DEFAULT_ENABLED
+	bool "Automatically enable deferred struct page initialisation"
+	default y
+	depends on DEFERRED_STRUCT_PAGE_INIT
+	help
+	  If set, struct page initialisation will be deferred by default on
+	  large memory configurations. If DEFERRED_STRUCT_PAGE_INIT is set
+	  then it is a reasonable default to enable this too. Users may need
+	  to disable this when allocating huge pages from the command line.
diff --git a/mm/page_alloc.c b/mm/page_alloc.c
index 839e4c73ce6d..6b2f6c21b70f 100644
--- a/mm/page_alloc.c
+++ b/mm/page_alloc.c
@@ -236,6 +236,8 @@ EXPORT_SYMBOL(nr_online_nodes);
 int page_group_by_mobility_disabled __read_mostly;
 
 #ifdef CONFIG_DEFERRED_STRUCT_PAGE_INIT
+bool __meminitdata deferred_mem_init_enabled;
+
 static inline void reset_deferred_meminit(pg_data_t *pgdat)
 {
 	pgdat->first_deferred_pfn = ULONG_MAX;
@@ -285,6 +287,25 @@ static inline bool update_defer_init(pg_data_t *pgdat,
 
 	return true;
 }
+
+static int __init setup_deferred_mem_init(char *str)
+{
+	if (!str)
+		return -1;
+
+	if (!strcmp(str, "enable")) {
+		deferred_mem_init_enabled = true;
+	} else if (!strcmp(str, "disable")) {
+		deferred_mem_init_enabled = false;
+	} else {
+		pr_warn("Unable to parse defer_meminit=\n");
+		return -1;
+	}
+
+	return 0;
+}
+
+early_param("defer_meminit", setup_deferred_mem_init);
 #else
 static inline void reset_deferred_meminit(pg_data_t *pgdat)
 {
-- 
2.1.2


^ permalink raw reply related	[flat|nested] 48+ messages in thread

* [PATCH 11/13] mm: meminit: Free pages in large chunks where possible
  2015-04-22 17:07 ` Mel Gorman
@ 2015-04-22 17:07   ` Mel Gorman
  -1 siblings, 0 replies; 48+ messages in thread
From: Mel Gorman @ 2015-04-22 17:07 UTC (permalink / raw)
  To: Linux-MM
  Cc: Nathan Zimmer, Dave Hansen, Waiman Long, Scott Norton,
	Daniel J Blueman, LKML, Mel Gorman

Parallel struct page initialisation frees pages one at a time. Try to free
pages as single large (MAX_ORDER) blocks where possible.
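
As a rough illustration of the saving (assuming the common x86-64 defaults of
the time, MAX_ORDER == 11 and 4K pages), MAX_ORDER_NR_PAGES is
1 << (MAX_ORDER - 1) == 1024, so deferred_free_range() below releases a fully
aligned run with a single order-(MAX_ORDER-1) free covering 4M instead of
1024 separate single-page frees.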

Signed-off-by: Mel Gorman <mgorman@suse.de>
---
 mm/page_alloc.c | 46 +++++++++++++++++++++++++++++++++++++++++-----
 1 file changed, 41 insertions(+), 5 deletions(-)

diff --git a/mm/page_alloc.c b/mm/page_alloc.c
index 6b2f6c21b70f..8d3fd13a09c9 100644
--- a/mm/page_alloc.c
+++ b/mm/page_alloc.c
@@ -1085,6 +1085,20 @@ void __defer_init __free_pages_bootmem(struct page *page, unsigned long pfn,
 }
 
 #ifdef CONFIG_DEFERRED_STRUCT_PAGE_INIT
+void __defermem_init deferred_free_range(struct page *page, unsigned long pfn,
+					int nr_pages)
+{
+	int i;
+
+	if (nr_pages == MAX_ORDER_NR_PAGES && (pfn & (MAX_ORDER_NR_PAGES-1)) == 0) {
+		__free_pages_boot_core(page, pfn, MAX_ORDER-1);
+		return;
+	}
+
+	for (i = 0; i < nr_pages; i++, page++, pfn++)
+		__free_pages_boot_core(page, pfn, 0);
+}
+
 /* Initialise remaining memory on a node */
 void __defermem_init deferred_init_memmap(int nid)
 {
@@ -1111,6 +1125,9 @@ void __defermem_init deferred_init_memmap(int nid)
 		for_each_mem_pfn_range(i, nid, &walk_start, &walk_end, NULL) {
 			unsigned long pfn, end_pfn;
 			struct page *page = NULL;
+			struct page *free_base_page = NULL;
+			unsigned long free_base_pfn = 0;
+			int nr_to_free = 0;
 
 			end_pfn = min(walk_end, zone_end_pfn(zone));
 			pfn = first_init_pfn;
@@ -1121,7 +1138,7 @@ void __defermem_init deferred_init_memmap(int nid)
 
 			for (; pfn < end_pfn; pfn++) {
 				if (!pfn_valid_within(pfn))
-					continue;
+					goto free_range;
 
 				/*
 				 * Ensure pfn_valid is checked every
@@ -1130,30 +1147,49 @@ void __defermem_init deferred_init_memmap(int nid)
 				if ((pfn & (MAX_ORDER_NR_PAGES - 1)) == 0) {
 					if (!pfn_valid(pfn)) {
 						page = NULL;
-						continue;
+						goto free_range;
 					}
 				}
 
 				if (!meminit_pfn_in_nid(pfn, nid, &nid_init_state)) {
 					page = NULL;
-					continue;
+					goto free_range;
 				}
 
 				/* Minimise pfn page lookups and scheduler checks */
 				if (page && (pfn & (MAX_ORDER_NR_PAGES - 1)) != 0) {
 					page++;
 				} else {
+					deferred_free_range(free_base_page,
+							free_base_pfn, nr_to_free);
+					free_base_page = NULL;
+					free_base_pfn = nr_to_free = 0;
+
 					page = pfn_to_page(pfn);
 					cond_resched();
 				}
 
 				if (page->flags) {
 					VM_BUG_ON(page_zone(page) != zone);
-					continue;
+					goto free_range;
 				}
 
 				__init_single_page(page, pfn, zid, nid);
-				__free_pages_boot_core(page, pfn, 0);
+				if (!free_base_page) {
+					free_base_page = page;
+					free_base_pfn = pfn;
+					nr_to_free = 0;
+				}
+				nr_to_free++;
+
+				/* Where possible, batch up pages for a single free */
+				continue;
+free_range:
+				/* Free the current block of pages to allocator */
+				if (free_base_page)
+					deferred_free_range(free_base_page, free_base_pfn, nr_to_free);
+				free_base_page = NULL;
+				free_base_pfn = nr_to_free = 0;
 			}
 			first_init_pfn = max(end_pfn, first_init_pfn);
 		}
-- 
2.1.2


^ permalink raw reply related	[flat|nested] 48+ messages in thread

* [PATCH 12/13] mm: meminit: Reduce number of times pageblocks are set during struct page init
  2015-04-22 17:07 ` Mel Gorman
@ 2015-04-22 17:07   ` Mel Gorman
  -1 siblings, 0 replies; 48+ messages in thread
From: Mel Gorman @ 2015-04-22 17:07 UTC (permalink / raw)
  To: Linux-MM
  Cc: Nathan Zimmer, Dave Hansen, Waiman Long, Scott Norton,
	Daniel J Blueman, LKML, Mel Gorman

During parallel struct page initialisation, ranges are unnecessarily
checked for every PFN, which increases boot times. This patch alters when
the ranges are checked.

Signed-off-by: Mel Gorman <mgorman@suse.de>
---
 mm/page_alloc.c | 45 +++++++++++++++++++++++----------------------
 1 file changed, 23 insertions(+), 22 deletions(-)

diff --git a/mm/page_alloc.c b/mm/page_alloc.c
index 8d3fd13a09c9..945d56667b61 100644
--- a/mm/page_alloc.c
+++ b/mm/page_alloc.c
@@ -876,33 +876,12 @@ static int free_tail_pages_check(struct page *head_page, struct page *page)
 static void __meminit __init_single_page(struct page *page, unsigned long pfn,
 				unsigned long zone, int nid)
 {
-	struct zone *z = &NODE_DATA(nid)->node_zones[zone];
-
 	set_page_links(page, zone, nid, pfn);
 	mminit_verify_page_links(page, zone, nid, pfn);
 	init_page_count(page);
 	page_mapcount_reset(page);
 	page_cpupid_reset_last(page);
 
-	/*
-	 * Mark the block movable so that blocks are reserved for
-	 * movable at startup. This will force kernel allocations
-	 * to reserve their blocks rather than leaking throughout
-	 * the address space during boot when many long-lived
-	 * kernel allocations are made. Later some blocks near
-	 * the start are marked MIGRATE_RESERVE by
-	 * setup_zone_migrate_reserve()
-	 *
-	 * bitmap is created for zone's valid pfn range. but memmap
-	 * can be created for invalid pages (for alignment)
-	 * check here not to call set_pageblock_migratetype() against
-	 * pfn out of zone.
-	 */
-	if ((z->zone_start_pfn <= pfn)
-	    && (pfn < zone_end_pfn(z))
-	    && !(pfn & (pageblock_nr_pages - 1)))
-		set_pageblock_migratetype(page, MIGRATE_MOVABLE);
-
 	INIT_LIST_HEAD(&page->lru);
 #ifdef WANT_PAGE_VIRTUAL
 	/* The shift won't overflow because ZONE_NORMAL is below 4G. */
@@ -1091,6 +1070,7 @@ void __defermem_init deferred_free_range(struct page *page, unsigned long pfn,
 	int i;
 
 	if (nr_pages == MAX_ORDER_NR_PAGES && (pfn & (MAX_ORDER_NR_PAGES-1)) == 0) {
+		set_pageblock_migratetype(page, MIGRATE_MOVABLE);
 		__free_pages_boot_core(page, pfn, MAX_ORDER-1);
 		return;
 	}
@@ -4500,7 +4480,28 @@ void __meminit memmap_init_zone(unsigned long size, int nid, unsigned long zone,
 						&nr_initialised))
 				break;
 		}
-		__init_single_pfn(pfn, zone, nid);
+
+		/*
+		 * Mark the block movable so that blocks are reserved for
+		 * movable at startup. This will force kernel allocations
+		 * to reserve their blocks rather than leaking throughout
+		 * the address space during boot when many long-lived
+		 * kernel allocations are made. Later some blocks near
+		 * the start are marked MIGRATE_RESERVE by
+		 * setup_zone_migrate_reserve()
+		 *
+		 * bitmap is created for zone's valid pfn range. but memmap
+		 * can be created for invalid pages (for alignment)
+		 * check here not to call set_pageblock_migratetype() against
+		 * pfn out of zone.
+		 */
+		if (!(pfn & (pageblock_nr_pages - 1))) {
+			struct page *page = pfn_to_page(pfn);
+			set_pageblock_migratetype(page, MIGRATE_MOVABLE);
+			__init_single_page(page, pfn, zone, nid);
+		} else {
+			__init_single_pfn(pfn, zone, nid);
+		}
 	}
 }
 
-- 
2.1.2
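
In effect, the once-per-pageblock work (set_pageblock_migratetype()) is now
keyed off a cheap alignment test in memmap_init_zone(), and done in
deferred_free_range() when a full aligned block of deferred pages is freed,
instead of a zone range check made for every single PFN inside
__init_single_page(). Assuming typical 4K pages and 512-page (2MB)
pageblocks, that is one pageblock update per 512 pages initialised. The
sketch below is a self-contained illustration of the alignment-test pattern
only; PAGEBLOCK_NR_PAGES and mark_block_movable() are illustrative
stand-ins, not kernel interfaces.

#include <stdio.h>

#define PAGEBLOCK_NR_PAGES 512UL	/* 2MB pageblocks with 4K pages; illustrative */

/* Stand-in for the once-per-pageblock work (e.g. marking the block movable). */
static void mark_block_movable(unsigned long pfn)
{
	printf("pageblock starting at pfn %lu marked movable\n", pfn);
}

int main(void)
{
	unsigned long start_pfn = 4096;
	unsigned long end_pfn = start_pfn + 4 * PAGEBLOCK_NR_PAGES;
	unsigned long block_updates = 0;

	for (unsigned long pfn = start_pfn; pfn < end_pfn; pfn++) {
		/* A cheap alignment test replaces a per-pfn zone range check. */
		if (!(pfn & (PAGEBLOCK_NR_PAGES - 1))) {
			mark_block_movable(pfn);
			block_updates++;
		}
		/* ...per-page struct page initialisation happens here... */
	}
	printf("%lu pfns walked, %lu pageblock updates\n",
	       end_pfn - start_pfn, block_updates);
	return 0;
}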


^ permalink raw reply related	[flat|nested] 48+ messages in thread

* [PATCH 13/13] mm: meminit: Remove mminit_verify_page_links
  2015-04-22 17:07 ` Mel Gorman
@ 2015-04-22 17:07   ` Mel Gorman
  -1 siblings, 0 replies; 48+ messages in thread
From: Mel Gorman @ 2015-04-22 17:07 UTC (permalink / raw)
  To: Linux-MM
  Cc: Nathan Zimmer, Dave Hansen, Waiman Long, Scott Norton,
	Daniel J Blueman, LKML, Mel Gorman

mminit_verify_page_links() is an extremely paranoid check that was introduced
when memory initialisation was being heavily reworked. Profiles indicated
that up to 10% of parallel memory initialisation was spent on checking
this for every page. The cost could be reduced but in practice this check
only found problems very early during the initialisation rewrite and has
found nothing since. This patch removes an expensive unnecessary check.

Signed-off-by: Mel Gorman <mgorman@suse.de>
---
 mm/internal.h   | 8 --------
 mm/mm_init.c    | 8 --------
 mm/page_alloc.c | 1 -
 3 files changed, 17 deletions(-)

diff --git a/mm/internal.h b/mm/internal.h
index 2c4057140bec..c73ad248f8f4 100644
--- a/mm/internal.h
+++ b/mm/internal.h
@@ -360,10 +360,7 @@ do { \
 } while (0)
 
 extern void mminit_verify_pageflags_layout(void);
-extern void mminit_verify_page_links(struct page *page,
-		enum zone_type zone, unsigned long nid, unsigned long pfn);
 extern void mminit_verify_zonelist(void);
-
 #else
 
 static inline void mminit_dprintk(enum mminit_level level,
@@ -375,11 +372,6 @@ static inline void mminit_verify_pageflags_layout(void)
 {
 }
 
-static inline void mminit_verify_page_links(struct page *page,
-		enum zone_type zone, unsigned long nid, unsigned long pfn)
-{
-}
-
 static inline void mminit_verify_zonelist(void)
 {
 }
diff --git a/mm/mm_init.c b/mm/mm_init.c
index 28fbf87b20aa..fdadf918de76 100644
--- a/mm/mm_init.c
+++ b/mm/mm_init.c
@@ -131,14 +131,6 @@ void __init mminit_verify_pageflags_layout(void)
 	BUG_ON(or_mask != add_mask);
 }
 
-void __meminit mminit_verify_page_links(struct page *page, enum zone_type zone,
-			unsigned long nid, unsigned long pfn)
-{
-	BUG_ON(page_to_nid(page) != nid);
-	BUG_ON(page_zonenum(page) != zone);
-	BUG_ON(page_to_pfn(page) != pfn);
-}
-
 static __init int set_mminit_loglevel(char *str)
 {
 	get_option(&str, &mminit_loglevel);
diff --git a/mm/page_alloc.c b/mm/page_alloc.c
index 945d56667b61..cb84d67ae9ba 100644
--- a/mm/page_alloc.c
+++ b/mm/page_alloc.c
@@ -877,7 +877,6 @@ static void __meminit __init_single_page(struct page *page, unsigned long pfn,
 				unsigned long zone, int nid)
 {
 	set_page_links(page, zone, nid, pfn);
-	mminit_verify_page_links(page, zone, nid, pfn);
 	init_page_count(page);
 	page_mapcount_reset(page);
 	page_cpupid_reset_last(page);
-- 
2.1.2


^ permalink raw reply related	[flat|nested] 48+ messages in thread

* Re: [RFC PATCH 0/14] Parallel struct page initialisation v5r4
  2015-04-22 17:07 ` Mel Gorman
@ 2015-04-22 17:11   ` Mel Gorman
  -1 siblings, 0 replies; 48+ messages in thread
From: Mel Gorman @ 2015-04-22 17:11 UTC (permalink / raw)
  To: Linux-MM
  Cc: Nathan Zimmer, Dave Hansen, Waiman Long, Scott Norton,
	Daniel J Blueman, LKML

The 0/14 is a mistake. There are only 13 patches.

-- 
Mel Gorman
SUSE Labs

^ permalink raw reply	[flat|nested] 48+ messages in thread

* Re: [PATCH 10/13] x86: mm: Enable deferred struct page initialisation on x86-64
  2015-04-22 17:07   ` Mel Gorman
@ 2015-04-22 23:45     ` Andrew Morton
  -1 siblings, 0 replies; 48+ messages in thread
From: Andrew Morton @ 2015-04-22 23:45 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Linux-MM, Nathan Zimmer, Dave Hansen, Waiman Long, Scott Norton,
	Daniel J Blueman, LKML

On Wed, 22 Apr 2015 18:07:50 +0100 Mel Gorman <mgorman@suse.de> wrote:

> --- a/arch/x86/Kconfig
> +++ b/arch/x86/Kconfig
> @@ -32,6 +32,7 @@ config X86
>  	select HAVE_UNSTABLE_SCHED_CLOCK
>  	select ARCH_SUPPORTS_NUMA_BALANCING if X86_64
>  	select ARCH_SUPPORTS_INT128 if X86_64
> +	select ARCH_SUPPORTS_DEFERRED_STRUCT_PAGE_INIT if X86_64 && NUMA

Put this in the "config X86_64" section and skip the "X86_64 &&"?

Can we omit the whole defer_meminit= thing and permanently enable the
feature?  That's simpler, provides better test coverage and is, we
hope, faster.

And can this be used on non-NUMA?  Presumably that won't speed things
up any if we're bandwidth limited but again it's simpler and provides
better coverage.

^ permalink raw reply	[flat|nested] 48+ messages in thread

* Re: [PATCH 10/13] x86: mm: Enable deferred struct page initialisation on x86-64
  2015-04-22 23:45     ` Andrew Morton
@ 2015-04-23  9:23       ` Mel Gorman
  -1 siblings, 0 replies; 48+ messages in thread
From: Mel Gorman @ 2015-04-23  9:23 UTC (permalink / raw)
  To: Andrew Morton
  Cc: Linux-MM, Nathan Zimmer, Dave Hansen, Waiman Long, Scott Norton,
	Daniel J Blueman, LKML

On Wed, Apr 22, 2015 at 04:45:00PM -0700, Andrew Morton wrote:
> On Wed, 22 Apr 2015 18:07:50 +0100 Mel Gorman <mgorman@suse.de> wrote:
> 
> > --- a/arch/x86/Kconfig
> > +++ b/arch/x86/Kconfig
> > @@ -32,6 +32,7 @@ config X86
> >  	select HAVE_UNSTABLE_SCHED_CLOCK
> >  	select ARCH_SUPPORTS_NUMA_BALANCING if X86_64
> >  	select ARCH_SUPPORTS_INT128 if X86_64
> > +	select ARCH_SUPPORTS_DEFERRED_STRUCT_PAGE_INIT if X86_64 && NUMA
> 
> Put this in the "config X86_64" section and skip the "X86_64 &&"?
> 

Done.

> Can we omit the whole defer_meminit= thing and permanently enable the
> feature?  That's simpler, provides better test coverage and is, we
> hope, faster.
> 

Yes. The intent was to have a workaround if there were any failures like
Waiman's vmalloc failures in an earlier version but they are bugs that
should be fixed.

> And can this be used on non-NUMA?  Presumably that won't speed things
> up any if we're bandwidth limited but again it's simpler and provides
> better coverage.

Nothing prevents it. There is less opportunity for parallelism but
improving coverage is desirable.

-- 
Mel Gorman
SUSE Labs

^ permalink raw reply	[flat|nested] 48+ messages in thread

* Re: [PATCH 10/13] x86: mm: Enable deferred struct page initialisation on x86-64
  2015-04-23  9:23       ` Mel Gorman
@ 2015-04-24 14:35         ` Waiman Long
  -1 siblings, 0 replies; 48+ messages in thread
From: Waiman Long @ 2015-04-24 14:35 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Andrew Morton, Linux-MM, Nathan Zimmer, Dave Hansen,
	Scott Norton, Daniel J Blueman, LKML

On 04/23/2015 05:23 AM, Mel Gorman wrote:
> On Wed, Apr 22, 2015 at 04:45:00PM -0700, Andrew Morton wrote:
>> On Wed, 22 Apr 2015 18:07:50 +0100 Mel Gorman<mgorman@suse.de>  wrote:
>>
>>> --- a/arch/x86/Kconfig
>>> +++ b/arch/x86/Kconfig
>>> @@ -32,6 +32,7 @@ config X86
>>>   	select HAVE_UNSTABLE_SCHED_CLOCK
>>>   	select ARCH_SUPPORTS_NUMA_BALANCING if X86_64
>>>   	select ARCH_SUPPORTS_INT128 if X86_64
>>> +	select ARCH_SUPPORTS_DEFERRED_STRUCT_PAGE_INIT if X86_64&&  NUMA
>> Put this in the "config X86_64" section and skip the "X86_64&&"?
>>
> Done.
>
>> Can we omit the whole defer_meminit= thing and permanently enable the
>> feature?  That's simpler, provides better test coverage and is, we
>> hope, faster.
>>
> Yes. The intent was to have a workaround if there were any failures like
> Waiman's vmalloc failures in an earlier version but they are bugs that
> should be fixed.
>
>> And can this be used on non-NUMA?  Presumably that won't speed things
>> up any if we're bandwidth limited but again it's simpler and provides
>> better coverage.
> Nothing prevents it. There is less opportunity for parallelism but
> improving coverage is desirable.
>

Memory access latency can be more than double for local vs. remote node 
memory. Bandwidth can also be much lower depending on what kind of 
interconnect is between the 2 nodes. So it is better to do it in a 
NUMA-aware way. Within a NUMA node, however, we can split the memory 
initialization to 2 or more local CPUs if the memory size is big enough.

Cheers,
Longman

^ permalink raw reply	[flat|nested] 48+ messages in thread

* Re: [PATCH 10/13] x86: mm: Enable deferred struct page initialisation on x86-64
  2015-04-24 14:35         ` Waiman Long
@ 2015-04-24 15:20           ` Mel Gorman
  -1 siblings, 0 replies; 48+ messages in thread
From: Mel Gorman @ 2015-04-24 15:20 UTC (permalink / raw)
  To: Waiman Long
  Cc: Andrew Morton, Linux-MM, Nathan Zimmer, Dave Hansen,
	Scott Norton, Daniel J Blueman, LKML

On Fri, Apr 24, 2015 at 10:35:49AM -0400, Waiman Long wrote:
> On 04/23/2015 05:23 AM, Mel Gorman wrote:
> >On Wed, Apr 22, 2015 at 04:45:00PM -0700, Andrew Morton wrote:
> >>On Wed, 22 Apr 2015 18:07:50 +0100 Mel Gorman<mgorman@suse.de>  wrote:
> >>
> >>>--- a/arch/x86/Kconfig
> >>>+++ b/arch/x86/Kconfig
> >>>@@ -32,6 +32,7 @@ config X86
> >>>  	select HAVE_UNSTABLE_SCHED_CLOCK
> >>>  	select ARCH_SUPPORTS_NUMA_BALANCING if X86_64
> >>>  	select ARCH_SUPPORTS_INT128 if X86_64
> >>>+	select ARCH_SUPPORTS_DEFERRED_STRUCT_PAGE_INIT if X86_64&&  NUMA
> >>Put this in the "config X86_64" section and skip the "X86_64&&"?
> >>
> >Done.
> >
> >>Can we omit the whole defer_meminit= thing and permanently enable the
> >>feature?  That's simpler, provides better test coverage and is, we
> >>hope, faster.
> >>
> >Yes. The intent was to have a workaround if there were any failures like
> >Waiman's vmalloc failures in an earlier version but they are bugs that
> >should be fixed.
> >
> >>And can this be used on non-NUMA?  Presumably that won't speed things
> >>up any if we're bandwidth limited but again it's simpler and provides
> >>better coverage.
> >Nothing prevents it. There is less opportunity for parallelism but
> >improving coverage is desirable.
> >
> 
> Memory access latency can be more than double for local vs. remote
> node memory. Bandwidth can also be much lower depending on what kind
> of interconnect is between the 2 nodes. So it is better to do it in
> a NUMA-aware way.

I do not believe that is what he was asking. He was asking if we could
defer memory initialisation even when there is only one node. It does not
gain much in terms of boot times but it improves testing coverage.

> Within a NUMA node, however, we can split the
> memory initialization to 2 or more local CPUs if the memory size is
> big enough.
> 

I considered it but discarded the idea. It'd be more complex to setup and
the two CPUs could simply end up contending on the same memory bus as
well as contending on zone->lock.

-- 
Mel Gorman
SUSE Labs

^ permalink raw reply	[flat|nested] 48+ messages in thread

* Re: [PATCH 10/13] x86: mm: Enable deferred struct page initialisation on x86-64
  2015-04-24 15:20           ` Mel Gorman
@ 2015-04-24 19:04             ` Waiman Long
  -1 siblings, 0 replies; 48+ messages in thread
From: Waiman Long @ 2015-04-24 19:04 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Andrew Morton, Linux-MM, Nathan Zimmer, Dave Hansen,
	Scott Norton, Daniel J Blueman, LKML

On 04/24/2015 11:20 AM, Mel Gorman wrote:
> On Fri, Apr 24, 2015 at 10:35:49AM -0400, Waiman Long wrote:
>> On 04/23/2015 05:23 AM, Mel Gorman wrote:
>>> On Wed, Apr 22, 2015 at 04:45:00PM -0700, Andrew Morton wrote:
>>>> On Wed, 22 Apr 2015 18:07:50 +0100 Mel Gorman<mgorman@suse.de>   wrote:
>>>>
>>>>> --- a/arch/x86/Kconfig
>>>>> +++ b/arch/x86/Kconfig
>>>>> @@ -32,6 +32,7 @@ config X86
>>>>>   	select HAVE_UNSTABLE_SCHED_CLOCK
>>>>>   	select ARCH_SUPPORTS_NUMA_BALANCING if X86_64
>>>>>   	select ARCH_SUPPORTS_INT128 if X86_64
>>>>> +	select ARCH_SUPPORTS_DEFERRED_STRUCT_PAGE_INIT if X86_64&&   NUMA
>>>> Put this in the "config X86_64" section and skip the "X86_64&&"?
>>>>
>>> Done.
>>>
>>>> Can we omit the whole defer_meminit= thing and permanently enable the
>>>> feature?  That's simpler, provides better test coverage and is, we
>>>> hope, faster.
>>>>
>>> Yes. The intent was to have a workaround if there were any failures like
>>> Waiman's vmalloc failures in an earlier version but they are bugs that
>>> should be fixed.
>>>
>>>> And can this be used on non-NUMA?  Presumably that won't speed things
>>>> up any if we're bandwidth limited but again it's simpler and provides
>>>> better coverage.
>>> Nothing prevents it. There is less opportunity for parallelism but
>>> improving coverage is desirable.
>>>
>> Memory access latency can be more than double for local vs. remote
>> node memory. Bandwidth can also be much lower depending on what kind
>> of interconnect is between the 2 nodes. So it is better to do it in
>> a NUMA-aware way.
> I do not believe that is what he was asking. He was asking if we could
> defer memory initialisation even when there is only one node. It does not
> gain much in terms of boot times but it improves testing coverage.

Thanks for the clarification.

>> Within a NUMA node, however, we can split the
>> memory initialization to 2 or more local CPUs if the memory size is
>> big enough.
>>
> I considered it but discarded the idea. It'd be more complex to setup and
> the two CPUs could simply end up contending on the same memory bus as
> well as contending on zone->lock.
>

I don't think we need that now. However, we may have to consider this 
when one day even a single node can have TBs of memory unless we move to 
a page size larger than 4k.

Cheers,
Longman

^ permalink raw reply	[flat|nested] 48+ messages in thread

* Re: [PATCH 10/13] x86: mm: Enable deferred struct page initialisation on x86-64
  2015-04-24 19:04             ` Waiman Long
@ 2015-04-25 17:28               ` Mel Gorman
  -1 siblings, 0 replies; 48+ messages in thread
From: Mel Gorman @ 2015-04-25 17:28 UTC (permalink / raw)
  To: Waiman Long
  Cc: Andrew Morton, Linux-MM, Nathan Zimmer, Dave Hansen,
	Scott Norton, Daniel J Blueman, LKML

On Fri, Apr 24, 2015 at 03:04:27PM -0400, Waiman Long wrote:
> >>Within a NUMA node, however, we can split the
> >>memory initialization to 2 or more local CPUs if the memory size is
> >>big enough.
> >>
> >I considered it but discarded the idea. It'd be more complex to setup and
> >the two CPUs could simply end up contending on the same memory bus as
> >well as contending on zone->lock.
> >
> 
> I don't think we need that now. However, we may have to consider
> this when one day even a single node can have TBs of memory unless
> we move to a page size larger than 4k.
> 

We'll cross that bridge when we come to it. I suspect there is more room
for improvement in the initialisation that would be worth trying before
resorting to more threads. With more threads there is a risk that we hit
memory bus contention and a high risk that it actually is worse due to
contending on zone->lock when freeing the pages.

In the meantime, do you mind updating the before/after figures for your
test machine with this series please?

-- 
Mel Gorman
SUSE Labs

^ permalink raw reply	[flat|nested] 48+ messages in thread

* Re: [PATCH 10/13] x86: mm: Enable deferred struct page initialisation on x86-64
  2015-04-25 17:28               ` Mel Gorman
@ 2015-04-27 20:07                 ` Waiman Long
  -1 siblings, 0 replies; 48+ messages in thread
From: Waiman Long @ 2015-04-27 20:07 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Andrew Morton, Linux-MM, Nathan Zimmer, Dave Hansen,
	Scott Norton, Daniel J Blueman, LKML

On 04/25/2015 01:28 PM, Mel Gorman wrote:
> On Fri, Apr 24, 2015 at 03:04:27PM -0400, Waiman Long wrote:
>>>> Within a NUMA node, however, we can split the
>>>> memory initialization to 2 or more local CPUs if the memory size is
>>>> big enough.
>>>>
>>> I considered it but discarded the idea. It'd be more complex to setup and
>>> the two CPUs could simply end up contending on the same memory bus as
>>> well as contending on zone->lock.
>>>
>> I don't think we need that now. However, we may have to consider
>> this when one day even a single node can have TBs of memory unless
>> we move to a page size larger than 4k.
>>
> We'll cross that bridge when we come to it. I suspect there is more room
> for improvement in the initialisation that would be worth trying before
> resorting to more threads. With more threads there is a risk that we hit
> memory bus contention and a high risk that it actually is worse due to
> contending on zone->lock when freeing the pages.
>
> In the meantime, do you mind updating the before/after figures for your
> test machine with this series please?
>

I will test the latest patch once I get my hands on a 12TB machine.

Cheers,
Longman

^ permalink raw reply	[flat|nested] 48+ messages in thread

* [PATCH 10/13] x86: mm: Enable deferred struct page initialisation on x86-64
  2015-04-28 14:36 [PATCH 0/13] Parallel struct page initialisation v4 Mel Gorman
@ 2015-04-28 14:37   ` Mel Gorman
  0 siblings, 0 replies; 48+ messages in thread
From: Mel Gorman @ 2015-04-28 14:37 UTC (permalink / raw)
  To: Andrew Morton
  Cc: Nathan Zimmer, Dave Hansen, Waiman Long, Scott Norton,
	Daniel J Blueman, Linux-MM, LKML, Mel Gorman

Subject says it all. Other architectures may enable on a case-by-case
basis after auditing early_pfn_to_nid and testing.

Signed-off-by: Mel Gorman <mgorman@suse.de>
---
 arch/x86/Kconfig | 1 +
 1 file changed, 1 insertion(+)

diff --git a/arch/x86/Kconfig b/arch/x86/Kconfig
index b7d31ca55187..1beff8a8fbc9 100644
--- a/arch/x86/Kconfig
+++ b/arch/x86/Kconfig
@@ -18,6 +18,7 @@ config X86_64
 	select X86_DEV_DMA_OPS
 	select ARCH_USE_CMPXCHG_LOCKREF
 	select HAVE_LIVEPATCH
+	select ARCH_SUPPORTS_DEFERRED_STRUCT_PAGE_INIT
 
 ### Arch settings
 config X86
-- 
2.3.5


^ permalink raw reply related	[flat|nested] 48+ messages in thread

* [PATCH 10/13] x86: mm: Enable deferred struct page initialisation on x86-64
  2015-04-23 10:33 [PATCH 0/13] Parallel struct page initialisation v3 Mel Gorman
@ 2015-04-23 10:33   ` Mel Gorman
  0 siblings, 0 replies; 48+ messages in thread
From: Mel Gorman @ 2015-04-23 10:33 UTC (permalink / raw)
  To: Linux-MM
  Cc: Nathan Zimmer, Dave Hansen, Waiman Long, Scott Norton,
	Daniel J Blueman, Andrew Morton, LKML, Mel Gorman

Subject says it all. Other architectures may enable on a case-by-case
basis after auditing early_pfn_to_nid and testing.

Signed-off-by: Mel Gorman <mgorman@suse.de>
---
 arch/x86/Kconfig | 1 +
 1 file changed, 1 insertion(+)

diff --git a/arch/x86/Kconfig b/arch/x86/Kconfig
index b7d31ca55187..1beff8a8fbc9 100644
--- a/arch/x86/Kconfig
+++ b/arch/x86/Kconfig
@@ -18,6 +18,7 @@ config X86_64
 	select X86_DEV_DMA_OPS
 	select ARCH_USE_CMPXCHG_LOCKREF
 	select HAVE_LIVEPATCH
+	select ARCH_SUPPORTS_DEFERRED_STRUCT_PAGE_INIT
 
 ### Arch settings
 config X86
-- 
2.3.5


^ permalink raw reply related	[flat|nested] 48+ messages in thread

Thread overview: 48+ messages
2015-04-22 17:07 [RFC PATCH 0/14] Parallel struct page initialisation v5r4 Mel Gorman
2015-04-22 17:07 ` Mel Gorman
2015-04-22 17:07 ` [PATCH 01/13] memblock: Introduce a for_each_reserved_mem_region iterator Mel Gorman
2015-04-22 17:07   ` Mel Gorman
2015-04-22 17:07 ` [PATCH 02/13] mm: meminit: Move page initialization into a separate function Mel Gorman
2015-04-22 17:07   ` Mel Gorman
2015-04-22 17:07 ` [PATCH 03/13] mm: meminit: Only set page reserved in the memblock region Mel Gorman
2015-04-22 17:07   ` Mel Gorman
2015-04-22 17:07 ` [PATCH 04/13] mm: page_alloc: Pass PFN to __free_pages_bootmem Mel Gorman
2015-04-22 17:07   ` Mel Gorman
2015-04-22 17:07 ` [PATCH 05/13] mm: meminit: Make __early_pfn_to_nid SMP-safe and introduce meminit_pfn_in_nid Mel Gorman
2015-04-22 17:07   ` Mel Gorman
2015-04-22 17:07 ` [PATCH 06/13] mm: meminit: Inline some helper functions Mel Gorman
2015-04-22 17:07   ` Mel Gorman
2015-04-22 17:07 ` [PATCH 07/13] mm: meminit: Only a subset of struct pages if CONFIG_DEFERRED_STRUCT_PAGE_INIT is set Mel Gorman
2015-04-22 17:07   ` Mel Gorman
2015-04-22 17:07 ` [PATCH 08/13] mm: meminit: Initialise remaining struct pages in parallel with kswapd Mel Gorman
2015-04-22 17:07   ` Mel Gorman
2015-04-22 17:07 ` [PATCH 09/13] mm: meminit: Minimise number of pfn->page lookups during initialisation Mel Gorman
2015-04-22 17:07   ` Mel Gorman
2015-04-22 17:07 ` [PATCH 10/13] x86: mm: Enable deferred struct page initialisation on x86-64 Mel Gorman
2015-04-22 17:07   ` Mel Gorman
2015-04-22 23:45   ` Andrew Morton
2015-04-22 23:45     ` Andrew Morton
2015-04-23  9:23     ` Mel Gorman
2015-04-23  9:23       ` Mel Gorman
2015-04-24 14:35       ` Waiman Long
2015-04-24 14:35         ` Waiman Long
2015-04-24 15:20         ` Mel Gorman
2015-04-24 15:20           ` Mel Gorman
2015-04-24 19:04           ` Waiman Long
2015-04-24 19:04             ` Waiman Long
2015-04-25 17:28             ` Mel Gorman
2015-04-25 17:28               ` Mel Gorman
2015-04-27 20:07               ` Waiman Long
2015-04-27 20:07                 ` Waiman Long
2015-04-22 17:07 ` [PATCH 11/13] mm: meminit: Free pages in large chunks where possible Mel Gorman
2015-04-22 17:07   ` Mel Gorman
2015-04-22 17:07 ` [PATCH 12/13] mm: meminit: Reduce number of times pageblocks are set during struct page init Mel Gorman
2015-04-22 17:07   ` Mel Gorman
2015-04-22 17:07 ` [PATCH 13/13] mm: meminit: Remove mminit_verify_page_links Mel Gorman
2015-04-22 17:07   ` Mel Gorman
2015-04-22 17:11 ` [RFC PATCH 0/14] Parallel struct page initialisation v5r4 Mel Gorman
2015-04-22 17:11   ` Mel Gorman
2015-04-23 10:33 [PATCH 0/13] Parallel struct page initialisation v3 Mel Gorman
2015-04-23 10:33 ` [PATCH 10/13] x86: mm: Enable deferred struct page initialisation on x86-64 Mel Gorman
2015-04-23 10:33   ` Mel Gorman
2015-04-28 14:36 [PATCH 0/13] Parallel struct page initialisation v4 Mel Gorman
2015-04-28 14:37 ` [PATCH 10/13] x86: mm: Enable deferred struct page initialisation on x86-64 Mel Gorman
2015-04-28 14:37   ` Mel Gorman
