From mboxrd@z Thu Jan 1 00:00:00 1970
From: Mel Gorman
Subject: Re: [PATCH v5 7/8] mm: Only IPI CPUs to drain local pages if they exist
Date: Thu, 5 Jan 2012 22:31:06 +0000
Message-ID: <20120105223106.GG27881@csn.ul.ie>
References: <1325499859-2262-1-git-send-email-gilad@benyossef.com>
 <1325499859-2262-8-git-send-email-gilad@benyossef.com>
 <4F033EC9.4050909@gmail.com>
 <20120105142017.GA27881@csn.ul.ie>
 <20120105144011.GU11810@n2100.arm.linux.org.uk>
 <20120105161739.GD27881@csn.ul.ie>
 <20120105140645.42498cdd.akpm@linux-foundation.org>
Mime-Version: 1.0
Content-Type: text/plain; charset=iso-8859-15
Cc: Russell King - ARM Linux, KOSAKI Motohiro, Gilad Ben-Yossef,
 linux-kernel@vger.kernel.org, Chris Metcalf, Peter Zijlstra,
 Frederic Weisbecker, linux-mm@kvack.org, Pekka Enberg, Matt Mackall,
 Sasha Levin, Rik van Riel, Andi Kleen, Alexander Viro,
 linux-fsdevel@vger.kernel.org, Avi Kivity
To: Andrew Morton
Content-Disposition: inline
In-Reply-To: <20120105140645.42498cdd.akpm@linux-foundation.org>
Sender: owner-linux-mm@kvack.org
List-Id: linux-fsdevel.vger.kernel.org

On Thu, Jan 05, 2012 at 02:06:45PM -0800, Andrew Morton wrote:
> On Thu, 5 Jan 2012 16:17:39 +0000
> Mel Gorman wrote:
>
> > mm: page allocator: Guard against CPUs going offline while draining per-cpu page lists
> >
> > While running a CPU hotplug stress test under memory pressure, I
> > saw cases where under enough stress the machine would halt although
> > it required a machine with 8 cores and plenty of memory. I think the
> > problems may be related.
>
> When we first implemented them, the percpu pages in the page allocator
> were of really really marginal benefit. I didn't merge the patches at
> all for several cycles, and it was eventually a 49/51 decision.
>
> So I suggest that our approach to solving this particular problem
> should be to nuke the whole thing, then see if that caused any
> observable problems.
> If it did, can we solve those problems by means
> other than bringing the dang things back?
>

Sounds drastic. It would be less controversial to replace this patch
with a version that calls get_online_cpus() in drain_all_pages() but
removes the call to drain_all_pages() from the page allocator, on the
grounds that it is not safe against CPU hotplug, and to hell with the
slightly elevated allocation failure rates and stalls. That would avoid
the try_get_online_cpus() crappiness and be less complex.

If you really want to consider deleting the per-cpu allocator, maybe
it could be an LSF/MM topic? Personally I would be wary of deleting it,
but mostly because I lack regular access to the type of hardware needed
to evaluate whether it is safe to remove or not. Minimally, removing
the per-cpu allocator could make the zone lock very hot, even though
slub probably makes it very hot already.

--
Mel Gorman
SUSE Labs