From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1756040AbdCGSzG (ORCPT ); Tue, 7 Mar 2017 13:55:06 -0500 Received: from gum.cmpxchg.org ([85.214.110.215]:39148 "EHLO gum.cmpxchg.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751088AbdCGSzE (ORCPT ); Tue, 7 Mar 2017 13:55:04 -0500 Date: Tue, 7 Mar 2017 11:56:31 -0500 From: Johannes Weiner To: Michal Hocko Cc: Minchan Kim , Andrew Morton , Jia He , Mel Gorman , linux-mm@kvack.org, linux-kernel@vger.kernel.org, kernel-team@fb.com, Vlastimil Babka Subject: Re: [PATCH 1/9] mm: fix 100% CPU kswapd busyloop on unreclaimable nodes Message-ID: <20170307165631.GA21425@cmpxchg.org> References: <20170228214007.5621-1-hannes@cmpxchg.org> <20170228214007.5621-2-hannes@cmpxchg.org> <20170303012609.GA3394@bbox> <20170303075954.GA31499@dhcp22.suse.cz> <20170306013740.GA8779@bbox> <20170306162410.GB2090@cmpxchg.org> <20170307101702.GD28642@dhcp22.suse.cz> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20170307101702.GD28642@dhcp22.suse.cz> User-Agent: Mutt/1.7.2 (2016-11-26) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Mar 07, 2017 at 11:17:02AM +0100, Michal Hocko wrote: > On Mon 06-03-17 11:24:10, Johannes Weiner wrote: > > @@ -3271,7 +3271,8 @@ static int balance_pgdat(pg_data_t *pgdat, int order, int classzone_idx) > > * Raise priority if scanning rate is too low or there was no > > * progress in reclaiming pages > > */ > > - if (raise_priority || !sc.nr_reclaimed) > > + nr_reclaimed = sc.nr_reclaimed - nr_reclaimed; > > + if (raise_priority || !nr_reclaimed) > > sc.priority--; > > } while (sc.priority >= 1); > > > > I would rather not play with the sc state here. From a quick look at > least > /* > * Fragmentation may mean that the system cannot be rebalanced for > * high-order allocations. If twice the allocation size has been > * reclaimed then recheck watermarks only at order-0 to prevent > * excessive reclaim. Assume that a process requested a high-order > * can direct reclaim/compact. > */ > if (sc->order && sc->nr_reclaimed >= compact_gap(sc->order)) > sc->order = 0; > > does rely on the value. Wouldn't something like the following be safer? Well, what behavior is correct, though? This check looks like an argument *against* resetting sc.nr_reclaimed. If kswapd is woken up for a higher order, this check sets a reclaim cutoff beyond which it should give up on the order and balance for 0. That's on the scope of the kswapd invocation. Applying this threshold to the outcome of just the preceeding priority seems like a mistake. Mel? Vlastimil?