From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752743Ab2A2AvA (ORCPT ); Sat, 28 Jan 2012 19:51:00 -0500 Received: from mx1.redhat.com ([209.132.183.28]:61889 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752024Ab2A2Au7 (ORCPT ); Sat, 28 Jan 2012 19:50:59 -0500 Message-ID: <4F2497DC.2040405@redhat.com> Date: Sat, 28 Jan 2012 19:50:36 -0500 From: Rik van Riel User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:9.0) Gecko/20111222 Thunderbird/9.0 MIME-Version: 1.0 To: James Bottomley CC: Dan Magenheimer , Andrew Morton , Dave Hansen , linux-kernel@vger.kernel.org, linux-mm@kvack.org, Konrad Wilk , Seth Jennings , Nitin Gupta , Nebojsa Trpkovic , minchan@kernel.org, KAMEZAWA Hiroyuki , Chris Mason , lsf-pc@lists.linux-foundation.org Subject: Re: [PATCH] mm: implement WasActive page flag (for improving cleancache) References: <4F218D36.2060308@linux.vnet.ibm.com> <9fcd06f5-360e-4542-9fbb-f8c7efb28cb6@default> <20120126163150.31a8688f.akpm@linux-foundation.org> <20120126171548.2c85dd44.akpm@linux-foundation.org> <7198bfb3-1e32-40d3-8601-d88aed7aabd8@default> <1327671787.2977.17.camel@dabdike.int.hansenpartnership.com> <3ac611ee-8830-41bd-8464-6867da701948@default> <1327686876.2977.37.camel@dabdike.int.hansenpartnership.com> <9813c0cd-0335-4994-b734-e9fc7872c0cb@default> <1327700951.2977.78.camel@dabdike.int.hansenpartnership.com> In-Reply-To: <1327700951.2977.78.camel@dabdike.int.hansenpartnership.com> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 01/27/2012 04:49 PM, James Bottomley wrote: > So here, I was just saying your desire to store more data in the page > table and expand the page flags looks complex. > > Perhaps we do have a fundamental misunderstanding: For readahead, I > don't really care about the referenced part. referenced just means > pointed to by one or more vmas and active means pointed to by two or > more vmas (unless executable in which case it's one). That is not at all what "referenced" means everywhere else in the VM. If you write theories on what Dan should use, it would help if you limited yourself to stuff the VM provides and/or could provide :) > What I think we care about for readahead is accessed. This means a page > that got touched regardless of how many references it has. An > unaccessed unaged RA page is a less good candidate for reclaim because > it should soon be accessed (under the RA heuristics) than an accessed RA > page. Obviously if the heuristics misfire, we end up with futile RA > pages, which we read in expecting to be accessed, but which in fact > never were (so an unaccessed aged RA page) and need to be evicted. > > But for me, perhaps it's enough to put unaccessed RA pages into the > active list on instantiation and then actually put them in the inactive > list when they're accessed That is an absolutely terrible idea for many obvious reasons. Having readahead pages displace the working set wholesale is the absolute last thing we want. > I'm less clear on why you think a WasActive() flag is needed. I think > you mean a member of the inactive list that was at some point previously > active. > Um, that's complex. Doesn't your inactive-C list really just identify > pages that were shared but have sunk in the LRU lists due to lack of > use? Nope. Pages that are not mapped can still end up on the active list, by virtue of getting accessed multiple times in a "short" period of time (the residence on the inactive list). We want to cache frequently accessed pages with preference over streaming IO data that gets accessed infrequently. -- All rights reversed