From mboxrd@z Thu Jan 1 00:00:00 1970 From: Pekka Enberg Subject: Re: [patch 3/5] mm: try to distribute dirty pages fairly across zones Date: Fri, 30 Sep 2011 10:35:25 +0300 Message-ID: References: <1317367044-475-1-git-send-email-jweiner@redhat.com> <1317367044-475-4-git-send-email-jweiner@redhat.com> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Cc: Andrew Morton , Mel Gorman , Christoph Hellwig , Dave Chinner , Wu Fengguang , Jan Kara , Rik van Riel , Minchan Kim , Chris Mason , "Theodore Ts'o" , Andreas Dilger , Shaohua Li , xfs@oss.sgi.com, linux-btrfs@vger.kernel.org, linux-ext4@vger.kernel.org, linux-mm@kvack.org, linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org To: Johannes Weiner Return-path: In-Reply-To: <1317367044-475-4-git-send-email-jweiner@redhat.com> List-ID: Hi Johannes! On Fri, Sep 30, 2011 at 10:17 AM, Johannes Weiner wrot= e: > But there is a flaw in that we have a zoned page allocator which does > not care about the global state but rather the state of individual > memory zones. =A0And right now there is nothing that prevents one zone > from filling up with dirty pages while other zones are spared, which > frequently leads to situations where kswapd, in order to restore the > watermark of free pages, does indeed have to write pages from that > zone's LRU list. =A0This can interfere so badly with IO from the flusher > threads that major filesystems (btrfs, xfs, ext4) mostly ignore write > requests from reclaim already, taking away the VM's only possibility > to keep such a zone balanced, aside from hoping the flushers will soon > clean pages from that zone. The obvious question is: how did you test this? Can you share the results? Pekka -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@kvack.org. For more info on Linux MM, see: http://www.linux-mm.org/ . Fight unfair telecom internet charges in Canada: sign http://stopthemeter.ca/ Don't email: email@kvack.org From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1757007Ab1I3Hfb (ORCPT ); Fri, 30 Sep 2011 03:35:31 -0400 Received: from mail-iy0-f174.google.com ([209.85.210.174]:54936 "EHLO mail-iy0-f174.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753513Ab1I3HfZ convert rfc822-to-8bit (ORCPT ); Fri, 30 Sep 2011 03:35:25 -0400 MIME-Version: 1.0 In-Reply-To: <1317367044-475-4-git-send-email-jweiner@redhat.com> References: <1317367044-475-1-git-send-email-jweiner@redhat.com> <1317367044-475-4-git-send-email-jweiner@redhat.com> Date: Fri, 30 Sep 2011 10:35:25 +0300 X-Google-Sender-Auth: XFTpttcEZHJ8iRYqEzINQIqWS8U Message-ID: Subject: Re: [patch 3/5] mm: try to distribute dirty pages fairly across zones From: Pekka Enberg To: Johannes Weiner Cc: Andrew Morton , Mel Gorman , Christoph Hellwig , Dave Chinner , Wu Fengguang , Jan Kara , Rik van Riel , Minchan Kim , Chris Mason , "Theodore Ts'o" , Andreas Dilger , Shaohua Li , xfs@oss.sgi.com, linux-btrfs@vger.kernel.org, linux-ext4@vger.kernel.org, linux-mm@kvack.org, linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 8BIT Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Johannes! On Fri, Sep 30, 2011 at 10:17 AM, Johannes Weiner wrote: > But there is a flaw in that we have a zoned page allocator which does > not care about the global state but rather the state of individual > memory zones.  And right now there is nothing that prevents one zone > from filling up with dirty pages while other zones are spared, which > frequently leads to situations where kswapd, in order to restore the > watermark of free pages, does indeed have to write pages from that > zone's LRU list.  This can interfere so badly with IO from the flusher > threads that major filesystems (btrfs, xfs, ext4) mostly ignore write > requests from reclaim already, taking away the VM's only possibility > to keep such a zone balanced, aside from hoping the flushers will soon > clean pages from that zone. The obvious question is: how did you test this? Can you share the results? Pekka From mboxrd@z Thu Jan 1 00:00:00 1970 From: Pekka Enberg Subject: Re: [patch 3/5] mm: try to distribute dirty pages fairly across zones Date: Fri, 30 Sep 2011 10:35:25 +0300 Message-ID: References: <1317367044-475-1-git-send-email-jweiner@redhat.com> <1317367044-475-4-git-send-email-jweiner@redhat.com> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Cc: Andrew Morton , Mel Gorman , Christoph Hellwig , Dave Chinner , Wu Fengguang , Jan Kara , Rik van Riel , Minchan Kim , Chris Mason , "Theodore Ts'o" , Andreas Dilger , Shaohua Li , xfs@oss.sgi.com, linux-btrfs@vger.kernel.org, linux-ext4@vger.kernel.org, linux-mm@kvack.org, linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org To: Johannes Weiner Return-path: In-Reply-To: <1317367044-475-4-git-send-email-jweiner@redhat.com> Sender: owner-linux-mm@kvack.org List-Id: linux-fsdevel.vger.kernel.org Hi Johannes! On Fri, Sep 30, 2011 at 10:17 AM, Johannes Weiner wrot= e: > But there is a flaw in that we have a zoned page allocator which does > not care about the global state but rather the state of individual > memory zones. =A0And right now there is nothing that prevents one zone > from filling up with dirty pages while other zones are spared, which > frequently leads to situations where kswapd, in order to restore the > watermark of free pages, does indeed have to write pages from that > zone's LRU list. =A0This can interfere so badly with IO from the flusher > threads that major filesystems (btrfs, xfs, ext4) mostly ignore write > requests from reclaim already, taking away the VM's only possibility > to keep such a zone balanced, aside from hoping the flushers will soon > clean pages from that zone. The obvious question is: how did you test this? Can you share the results? Pekka -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@kvack.org. For more info on Linux MM, see: http://www.linux-mm.org/ . Fight unfair telecom internet charges in Canada: sign http://stopthemeter.ca/ Don't email: email@kvack.org From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from cuda.sgi.com (cuda3.sgi.com [192.48.176.15]) by oss.sgi.com (8.14.3/8.14.3/SuSE Linux 0.8) with ESMTP id p8U7ZRBY247293 for ; Fri, 30 Sep 2011 02:35:27 -0500 Received: from mail-iy0-f181.google.com (localhost [127.0.0.1]) by cuda.sgi.com (Spam Firewall) with ESMTP id A1DE01C56C05 for ; Fri, 30 Sep 2011 00:35:25 -0700 (PDT) Received: from mail-iy0-f181.google.com (mail-iy0-f181.google.com [209.85.210.181]) by cuda.sgi.com with ESMTP id sU8bTCDu8ODZEy5R for ; Fri, 30 Sep 2011 00:35:25 -0700 (PDT) Received: by iaeo4 with SMTP id o4so1917811iae.26 for ; Fri, 30 Sep 2011 00:35:25 -0700 (PDT) MIME-Version: 1.0 In-Reply-To: <1317367044-475-4-git-send-email-jweiner@redhat.com> References: <1317367044-475-1-git-send-email-jweiner@redhat.com> <1317367044-475-4-git-send-email-jweiner@redhat.com> Date: Fri, 30 Sep 2011 10:35:25 +0300 Message-ID: Subject: Re: [patch 3/5] mm: try to distribute dirty pages fairly across zones From: Pekka Enberg List-Id: XFS Filesystem from SGI List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable Sender: xfs-bounces@oss.sgi.com Errors-To: xfs-bounces@oss.sgi.com To: Johannes Weiner Cc: Rik van Riel , linux-ext4@vger.kernel.org, Jan Kara , linux-btrfs@vger.kernel.org, linux-kernel@vger.kernel.org, xfs@oss.sgi.com, Christoph Hellwig , linux-mm@kvack.org, Andreas Dilger , Mel Gorman , Shaohua Li , linux-fsdevel@vger.kernel.org, Theodore Ts'o , Andrew Morton , Wu Fengguang , Chris Mason , Minchan Kim Hi Johannes! On Fri, Sep 30, 2011 at 10:17 AM, Johannes Weiner wrot= e: > But there is a flaw in that we have a zoned page allocator which does > not care about the global state but rather the state of individual > memory zones. =A0And right now there is nothing that prevents one zone > from filling up with dirty pages while other zones are spared, which > frequently leads to situations where kswapd, in order to restore the > watermark of free pages, does indeed have to write pages from that > zone's LRU list. =A0This can interfere so badly with IO from the flusher > threads that major filesystems (btrfs, xfs, ext4) mostly ignore write > requests from reclaim already, taking away the VM's only possibility > to keep such a zone balanced, aside from hoping the flushers will soon > clean pages from that zone. The obvious question is: how did you test this? Can you share the results? Pekka _______________________________________________ xfs mailing list xfs@oss.sgi.com http://oss.sgi.com/mailman/listinfo/xfs