linux-kernel.vger.kernel.org archive mirror
* BUG: mm, numa: test segfaults, only when NUMA balancing is on
@ 2013-10-16 15:54 Alex Thorlton
  2013-10-17 11:30 ` Bob Liu
                   ` (2 more replies)
  0 siblings, 3 replies; 18+ messages in thread
From: Alex Thorlton @ 2013-10-16 15:54 UTC (permalink / raw)
  To: linux-mm; +Cc: linux-kernel

Hi guys,

I ran into a bug a week or so ago that I believe is related to NUMA
balancing, but I'm having a tough time tracking down exactly what is
causing it.  When running with the following configuration options
set:

CONFIG_ARCH_SUPPORTS_NUMA_BALANCING=y
CONFIG_NUMA_BALANCING_DEFAULT_ENABLED=y
CONFIG_NUMA_BALANCING=y
# CONFIG_HUGETLBFS is not set
# CONFIG_HUGETLB_PAGE is not set

I get intermittent segfaults when running the memscale test that we've
been using to test some of the THP changes.  Here's a link to the test:

ftp://shell.sgi.com/collect/memscale/
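For anyone trying to reproduce this, here is a minimal sketch of checking a
kernel config file against the options listed above.  The function name
check_numa_config is hypothetical, and where the config file lives is an
assumption about your distribution (/boot/config-$(uname -r) is a common
location, but not a universal one).

```shell
# check_numa_config FILE  (hypothetical helper, not part of memscale)
# Prints "match" if FILE enables NUMA balancing by default and leaves
# CONFIG_HUGETLBFS and CONFIG_HUGETLB_PAGE unset, otherwise "differs".
check_numa_config() {
    cfg=$1
    if grep -q '^CONFIG_NUMA_BALANCING=y$' "$cfg" &&
       grep -q '^CONFIG_NUMA_BALANCING_DEFAULT_ENABLED=y$' "$cfg" &&
       ! grep -q '^CONFIG_HUGETLBFS=' "$cfg" &&
       ! grep -q '^CONFIG_HUGETLB_PAGE=' "$cfg"; then
        echo match
    else
        echo differs
    fi
}
```

It would be called as check_numa_config /boot/config-$(uname -r), assuming
the config is installed at that path.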

I typically run the test with a line similar to this:

./thp_memscale -C 0 -m 0 -c <cores> -b <memory>

Where <cores> is the number of cores to spawn threads on, and <memory>
is the amount of memory to reserve from each core.  The <memory> field
can accept values like 512m or 1g, etc.  I typically run with 256 cores
and 512m, though I think the problem should be reproducible on anything
with 128+ cores.

The test never seems to have any problems when running with hugetlbfs
on and NUMA balancing off, but it segfaults every once in a while with
the config options above.  The segfaults become more frequent as the
core count goes up: about 50% of the runs at 256 cores, and almost
every run at 512 cores.  The lowest core count at which I've seen a
segfault is 128, though it seems to be rare at that count.
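
Since the failure is intermittent, a failure rate like the ones above is
easiest to measure with a loop.  The following is a rough sketch only: the
function name count_segfaults is hypothetical, and it assumes the test
process itself is killed by SIGSEGV, so that the shell reports exit status
139 (128 + 11), as bash and dash do.

```shell
# count_segfaults RUNS CMD [ARGS...]  (hypothetical helper)
# Runs CMD the given number of times and prints how many of those runs
# ended with exit status 139, i.e. were terminated by SIGSEGV.
count_segfaults() {
    runs=$1
    shift
    segv=0
    i=0
    while [ "$i" -lt "$runs" ]; do
        # Discard the command's own output; only the exit status matters.
        "$@" >/dev/null 2>&1
        status=$?
        if [ "$status" -eq 139 ]; then
            segv=$((segv + 1))
        fi
        i=$((i + 1))
    done
    echo "$segv"
}
```

For example, count_segfaults 20 ./thp_memscale -C 0 -m 0 -c 256 -b 512m
would print how many of 20 runs at 256 cores segfaulted.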

At this point, I'm not familiar enough with the NUMA balancing code to
know what could be causing this, and we don't typically run with NUMA
balancing on, so I don't see this in my everyday testing, but I felt
that it was definitely worth bringing up.

If anybody has any ideas of where I could poke around to find a
solution, please let me know.

- Alex

Thread overview: 18+ messages
2013-10-16 15:54 BUG: mm, numa: test segfaults, only when NUMA balancing is on Alex Thorlton
2013-10-17 11:30 ` Bob Liu
2013-10-18  0:33   ` Alex Thorlton
2013-11-04 14:58 ` Mel Gorman
2013-11-04 20:03   ` Alex Thorlton
2013-11-06 13:10     ` Mel Gorman
2013-11-07 21:48       ` Alex Thorlton
2013-11-08 11:20         ` Mel Gorman
2013-11-08 14:08           ` Mel Gorman
2013-11-08 22:13           ` Alex Thorlton
2013-11-12 21:29             ` Alex Thorlton
2013-11-15  0:09               ` Mel Gorman
2013-11-15 14:45                 ` Mel Gorman
2013-11-22 21:28                   ` Alex Thorlton
2013-11-22 23:05                     ` Mel Gorman
2013-11-23  0:09                       ` Mel Gorman
2013-11-27 23:58                         ` Alex Thorlton
2013-11-07 21:52 ` Alex Thorlton
