From mboxrd@z Thu Jan 1 00:00:00 1970 From: Mel Gorman Subject: Re: [PATCH] Revert "libceph: use memalloc flags for net IO" Date: Tue, 7 Apr 2015 16:41:02 +0100 Message-ID: <20150407154102.GA4701@suse.de> References: <1428414024-47769-1-git-send-email-idryomov@gmail.com> Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-15 Return-path: Received: from cantor2.suse.de ([195.135.220.15]:54728 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752989AbbDGPlH (ORCPT ); Tue, 7 Apr 2015 11:41:07 -0400 Content-Disposition: inline In-Reply-To: <1428414024-47769-1-git-send-email-idryomov@gmail.com> Sender: ceph-devel-owner@vger.kernel.org List-ID: To: Ilya Dryomov Cc: ceph-devel@vger.kernel.org, Mike Christie , Sage Weil On Tue, Apr 07, 2015 at 04:40:24PM +0300, Ilya Dryomov wrote: > This reverts commit 89baaa570ab0b476db09408d209578cfed700e9f. > > Dirty page throttling should be sufficient for us in the general case > so there is no need to use __GFP_MEMALLOC - it would be needed only in > the swap-over-rbd case, which we currently don't support. (It would > probably take approximately the commit that is being reverted to add > that support, but we would also need the "swap" option to distinguish > from the general case and make sure swap ceph_client-s aren't shared > with anything else.) See ceph-devel threads [1] and [2] for the > details of why enabling pfmemalloc reserves for all cases is a bad > thing. > > On top of potential system lockups related to drained emergency > reserves, this turned out to cause ceph lockups in case peers are on > the same host and communicating via loopback due to sk_filter() > dropping pfmemalloc skbs on the receiving side because the receiving > loopback socket is not tagged with SOCK_MEMALLOC. > > [1] "SOCK_MEMALLOC vs loopback" > http://www.spinics.net/lists/ceph-devel/msg22998.html > [2] "[PATCH] libceph: don't set memalloc flags in loopback case" > http://www.spinics.net/lists/ceph-devel/msg23392.html > > Conflicts: > net/ceph/messenger.c [ context: tcp_nodelay option ] > > Cc: Mike Christie > Cc: Mel Gorman > Cc: Sage Weil > Cc: stable@vger.kernel.org # 3.18+, needs backporting > Signed-off-by: Ilya Dryomov Acked-by: Mel Gorman -- Mel Gorman SUSE Labs