From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751423AbdEBUl2 (ORCPT ); Tue, 2 May 2017 16:41:28 -0400 Received: from mail-pf0-f173.google.com ([209.85.192.173]:35671 "EHLO mail-pf0-f173.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750710AbdEBUlZ (ORCPT ); Tue, 2 May 2017 16:41:25 -0400 Date: Tue, 2 May 2017 13:41:23 -0700 (PDT) From: David Rientjes X-X-Sender: rientjes@chino.kir.corp.google.com To: Michal Hocko cc: Andrew Morton , Minchan Kim , Johannes Weiner , Mel Gorman , linux-kernel@vger.kernel.org, linux-mm@kvack.org Subject: Re: [patch v2] mm, vmscan: avoid thrashing anon lru when free + file is low In-Reply-To: <20170502080246.GD14593@dhcp22.suse.cz> Message-ID: References: <20170418013659.GD21354@bbox> <20170419001405.GA13364@bbox> <20170420060904.GA3720@bbox> <20170502080246.GD14593@dhcp22.suse.cz> User-Agent: Alpine 2.10 (DEB 1266 2009-07-14) MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, 2 May 2017, Michal Hocko wrote: > I have already asked and my questions were ignored. So let me ask again > and hopefuly not get ignored this time. So Why do we need a different > criterion on anon pages than file pages? The preference in get_scan_count() as already implemented is to reclaim from file pages if there is enough memory on the inactive list to reclaim. That is unchanged with this patch. > I do agree that blindly > scanning anon pages when file pages are low is very suboptimal but this > adds yet another heuristic without _any_ numbers. Why cannot we simply > treat anon and file pages equally? Something like the following > > if (pgdatfile + pgdatanon + pgdatfree > 2*total_high_wmark) { > scan_balance = SCAN_FILE; > if (pgdatfile < pgdatanon) > scan_balance = SCAN_ANON; > goto out; > } > This would be substantially worse than the current code because it thrashes the anon lru when anon out numbers file pages rather than at the point we fall under the high watermarks for all eligible zones. If you tested your suggestion, you could see gigabytes of memory left untouched on the file lru. Anonymous memory is more probable to be part of the working set. > Also it would help to describe the workload which can trigger this > behavior so that we can compare numbers before and after this patch. Any workload that fills system RAM with anonymous memory that cannot be reclaimed will thrash the anon lru without this patch.