From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path:
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand
	id S1752630AbcGKXCB (ORCPT ); Mon, 11 Jul 2016 19:02:01 -0400
Received: from mail-pa0-f43.google.com ([209.85.220.43]:35051 "EHLO
	mail-pa0-f43.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org
	with ESMTP id S1750787AbcGKXB7 (ORCPT ); Mon, 11 Jul 2016 19:01:59 -0400
Date: Mon, 11 Jul 2016 16:01:52 -0700 (PDT)
From: David Rientjes
X-X-Sender: rientjes@chino.kir.corp.google.com
To: Joonsoo Kim
cc: Vlastimil Babka , Andrew Morton , Mel Gorman , linux-mm@kvack.org,
	linux-kernel@vger.kernel.org
Subject: Re: [patch] mm, compaction: make sure freeing scanner isn't
	persistently expensive
In-Reply-To: <20160630073158.GA30114@js1304-P5Q-DELUXE>
Message-ID:
References: <6685fe19-753d-7d76-aced-3bb071d7c81d@suse.cz>
	<20160630073158.GA30114@js1304-P5Q-DELUXE>
User-Agent: Alpine 2.10 (DEB 1266 2009-07-14)
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
Sender: linux-kernel-owner@vger.kernel.org
List-ID:
X-Mailing-List: linux-kernel@vger.kernel.org

On Thu, 30 Jun 2016, Joonsoo Kim wrote:

> We need to find a root cause of this problem, first.
>
> I guess that this problem would happen when isolate_freepages_block()
> early stop due to watermark check (if your patch is applied to your
> kernel). If scanner meets, cached pfn will be reset and your patch
> doesn't have any effect. So, I guess that scanner doesn't meet.
>

If the scanners meet, we should rely on deferred compaction to suppress
further attempts in the near future.  This is outside the scope of this
fix.

> We enter the compaction with enough free memory so stop in
> isolate_freepages_block() should be unlikely event but your number
> shows that it happens frequently?
>

It's not the only reason why freepages will be returned to the buddy
allocator: if locks become contended because we are spending too much
time compacting memory, we can persistently get free pages returned to
the end of the zone and then repeatedly iterate >100GB of memory on every
call to isolate_freepages(), which makes its own contended checks fire
more often.

This patch is only an attempt to prevent lengthy iterations when we have
recently scanned the memory and found freepages to not be isolatable.

> In addition, I worry that your previous patch that makes
> isolate_freepages_block() stop when watermark doesn't meet would cause
> compaction non-progress. Amount of free memory can be flutuated so
> watermark fail would be temporaral. We need to break compaction in
> this case? It would decrease compaction success rate if there is a
> memory hogger in parallel. Any idea?
>

In my opinion, which I think is quite well known by now, the compaction
freeing scanner shouldn't be checking _any_ watermark.  The end result is
that we're migrating memory, not allocating additional memory;
determining if compaction should be done is best left lower on the stack.
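
To make the cost argument above concrete, here is a toy userspace model
of it.  It is only a sketch and not kernel code: struct toy_zone,
toy_isolate_freepages(), toy_run() and NR_BLOCKS are hypothetical names
invented for this illustration, and the model assumes that a zone is a
flat array of pageblocks and that the only usable free block sits near
the bottom of the zone.  It counts how many pageblocks the freeing
scanner visits when its cached position is pushed back to the end of the
zone on every call, versus when the cached position is kept.

```c
#include <stdbool.h>
#include <stddef.h>

/* Toy model only: every name here is hypothetical, not a kernel symbol. */
#define NR_BLOCKS 1024

struct toy_zone {
	bool isolatable[NR_BLOCKS];	/* block has free pages we can take */
	size_t cached_free_block;	/* where the freeing scanner resumes */
};

/*
 * Walk downward from the cached position until an isolatable block is
 * found or the migration scanner's block is reached.  Returns the number
 * of pageblocks visited.  reset_to_end models the cached position being
 * moved back to the end of the zone before the scan starts.
 */
static size_t toy_isolate_freepages(struct toy_zone *z, size_t migrate_block,
				    bool reset_to_end)
{
	size_t visited = 0;
	size_t b;

	if (reset_to_end)
		z->cached_free_block = NR_BLOCKS - 1;

	b = z->cached_free_block;
	while (b > migrate_block) {
		visited++;
		if (z->isolatable[b])
			break;
		b--;
	}
	z->cached_free_block = b;	/* remember where we stopped */
	return visited;
}

/* Total pageblocks visited over a number of back-to-back calls. */
static size_t toy_run(size_t calls, bool reset_to_end)
{
	struct toy_zone z = { .cached_free_block = NR_BLOCKS - 1 };
	size_t total = 0;
	size_t i;

	z.isolatable[8] = true;	/* assumption: one usable block, low in the zone */
	for (i = 0; i < calls; i++)
		total += toy_isolate_freepages(&z, 0, reset_to_end);
	return total;
}
```

Under these assumptions, toy_run(10, true) visits 10160 pageblocks
because every call rescans from the end of the zone, while
toy_run(10, false) visits 1025 because only the first call pays for the
full walk.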