All of lore.kernel.org
 help / color / mirror / Atom feed
From: Shaohua Li <shaohua.li@intel.com>
To: Andrew Morton <akpm@linux-foundation.org>
Cc: KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com>,
	linux-mm <linux-mm@kvack.org>, Andi Kleen <andi@firstfloor.org>,
	"Wu, Fengguang" <fengguang.wu@intel.com>,
	"minchan.kim@gmail.com" <minchan.kim@gmail.com>
Subject: Re: [RFC]mm: batch activate_page() to reduce lock contention
Date: Fri, 27 Aug 2010 16:17:50 +0800	[thread overview]
Message-ID: <1282897070.30698.5.camel@sli10-conroe.sh.intel.com> (raw)
In-Reply-To: <20100826143052.f079e43c.akpm@linux-foundation.org>

On Fri, 2010-08-27 at 05:30 +0800, Andrew Morton wrote:
> On Thu, 26 Aug 2010 15:59:10 +0800
> Shaohua Li <shaohua.li@intel.com> wrote:
> 
> > On Thu, Aug 26, 2010 at 04:03:18AM +0800, Andrew Morton wrote:
> > > On Fri, 6 Aug 2010 11:08:05 +0800
> > > Shaohua Li <shaohua.li@intel.com> wrote:
> > > 
> > > > Subject: mm: batch activate_page() to reduce lock contention
> > > 
> > ...
> >
> > > This function is pretty bizarre.  It really really needs some comments
> > > explaining what it's doing and most especially *why* it's doing it.
> > > 
> > > It's a potential O(n*nr_zones) search (I think)!  We demand proof that
> > > it's worthwhile!
> > > 
> > > Yes, if the pagevec is filled with pages from different zones then it
> > > will reduce the locking frequency.  But in the common case where the
> > > pagevec has pages all from the same zone, or has contiguous runs of
> > > pages from different zones then all that extra bitmap fiddling gained
> > > us nothing.
> > > 
> > > (I think the search could be made more efficient by advancing `i' when
> > > we first see last_zone!=page_zone(page), but that'd just make the code
> > > even worse).
> > Thanks for pointing this out. Then we can simplify things a little bit.
> > the 144 bytes footprint is because of this too, then we can remove it.
> 
> ok..
> 
> > > 
> > > There's a downside/risk to this code.  A billion years ago I found
> > > that it was pretty important that if we're going to batch pages in this
> > > manner, it's important that ALL pages be batched via the same means. 
> > > If 99% of the pages go through the pagevec and 1% of pages bypass the
> > > pagevec, the LRU order gets scrambled and we can end up causing
> > > additional disk seeks when the time comes to write things out.  The
> > > effect was measurable.
> > > 
> > > And lo, putback_lru_pages() (at least) bypasses your new pagevecs,
> > > potentially scrambling the LRU ordering.  Admittedly, if we're putting
> > > back unreclaimable pages in there, the LRU is probably already pretty
> > > scrambled.  But that's just a guess.
> > ok, we can drain the pagevecs in putback_lru_pages() or add active page
> > to the new pagevecs.
> 
> The latter I guess?
hi,
looks the lru_add_pvecs pagevecs is bypassed too in putback_lru_pages().
Assume the bypass doesn't has obvious impact? each pagevec stores 14
pages, it should be < 1/1000 total memory in typical systems. so I
wonder if we really need handle the active page pagevecs bypass.

Thanks,
Shaohua

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

  reply	other threads:[~2010-08-27  8:17 UTC|newest]

Thread overview: 20+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-07-20  7:18 [RFC]mm: batch activate_page() to reduce lock contention Shaohua Li
2010-07-21 16:06 ` Minchan Kim
2010-07-22  0:27   ` Shaohua Li
2010-07-22  1:08     ` Minchan Kim
2010-07-22  5:17       ` Shaohua Li
2010-07-22 12:28         ` Minchan Kim
2010-07-23  8:12         ` Wu Fengguang
2010-07-23  8:14           ` Wu Fengguang
2010-07-22 23:49 ` Andrew Morton
2010-07-23 15:10 ` KOSAKI Motohiro
2010-07-23 15:25   ` Andi Kleen
2010-07-23 18:06     ` KOSAKI Motohiro
2010-07-26  5:08   ` Shaohua Li
2010-08-05 21:07     ` Andrew Morton
2010-08-06  3:08       ` Shaohua Li
2010-08-25 20:03         ` Andrew Morton
2010-08-26  7:59           ` Shaohua Li
2010-08-26 21:30             ` Andrew Morton
2010-08-27  8:17               ` Shaohua Li [this message]
2010-09-03 21:12                 ` Andrew Morton

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=1282897070.30698.5.camel@sli10-conroe.sh.intel.com \
    --to=shaohua.li@intel.com \
    --cc=akpm@linux-foundation.org \
    --cc=andi@firstfloor.org \
    --cc=fengguang.wu@intel.com \
    --cc=kosaki.motohiro@jp.fujitsu.com \
    --cc=linux-mm@kvack.org \
    --cc=minchan.kim@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.