From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S935095Ab1ESV61 (ORCPT ); Thu, 19 May 2011 17:58:27 -0400 Received: from mga09.intel.com ([134.134.136.24]:17849 "EHLO mga09.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S935005Ab1ESVyr (ORCPT ); Thu, 19 May 2011 17:54:47 -0400 X-ExtLoop1: 1 Message-Id: <20110519214744.929464031@intel.com> User-Agent: quilt/0.48-1 Date: Fri, 20 May 2011 05:45:35 +0800 From: Wu Fengguang To: Andrew Morton cc: Jan Kara , Dave Chinner , Mel Gorman , Itaru Kitayama , Wu Fengguang cc: Christoph Hellwig cc: Cc: LKML Subject: [PATCH 05/18] writeback: the kupdate expire timestamp should be a moving target References: <20110519214530.939830917@intel.com> Content-Disposition: inline; filename=writeback-moving-dirty-expired.patch Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Dynamically compute the dirty expire timestamp at queue_io() time. writeback_control.older_than_this used to be determined at entrance to the kupdate writeback work. This _static_ timestamp may go stale if the kupdate work runs on and on. The flusher may then stuck with some old busy inodes, never considering newly expired inodes thereafter. This has two possible problems: - It is unfair for a large dirty inode to delay (for a long time) the writeback of small dirty inodes. - As time goes by, the large and busy dirty inode may contain only _freshly_ dirtied pages. Ignoring newly expired dirty inodes risks delaying the expired dirty pages to the end of LRU lists, triggering the evil pageout(). Nevertheless this patch merely addresses part of the problem. v2: keep policy changes inside wb_writeback() and keep the wbc.older_than_this visibility as suggested by Dave. CC: Dave Chinner Acked-by: Jan Kara Acked-by: Mel Gorman Signed-off-by: Itaru Kitayama Signed-off-by: Wu Fengguang --- fs/fs-writeback.c | 11 ++++++----- 1 file changed, 6 insertions(+), 5 deletions(-) --- linux-next.orig/fs/fs-writeback.c 2011-05-20 05:11:28.000000000 +0800 +++ linux-next/fs/fs-writeback.c 2011-05-20 05:11:29.000000000 +0800 @@ -672,11 +672,6 @@ static long wb_writeback(struct bdi_writ long write_chunk = MAX_WRITEBACK_PAGES; struct inode *inode; - if (wbc.for_kupdate) { - wbc.older_than_this = &oldest_jif; - oldest_jif = jiffies - - msecs_to_jiffies(dirty_expire_interval * 10); - } if (!wbc.range_cyclic) { wbc.range_start = 0; wbc.range_end = LLONG_MAX; @@ -723,6 +718,12 @@ static long wb_writeback(struct bdi_writ if (work->for_background && !over_bground_thresh()) break; + if (work->for_kupdate) { + oldest_jif = jiffies - + msecs_to_jiffies(dirty_expire_interval * 10); + wbc.older_than_this = &oldest_jif; + } + wbc.more_io = 0; wbc.nr_to_write = write_chunk; wbc.pages_skipped = 0;