From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932322AbaGSQgh (ORCPT ); Sat, 19 Jul 2014 12:36:37 -0400 Received: from mail-ig0-f176.google.com ([209.85.213.176]:57348 "EHLO mail-ig0-f176.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S932075AbaGSQgf (ORCPT ); Sat, 19 Jul 2014 12:36:35 -0400 MIME-Version: 1.0 In-Reply-To: References: <1402655819-14325-1-git-send-email-dh.herrmann@gmail.com> <1402655819-14325-7-git-send-email-dh.herrmann@gmail.com> Date: Sat, 19 Jul 2014 18:36:34 +0200 Message-ID: Subject: Re: [RFC v3 6/7] shm: wait for pins to be released when sealing From: David Herrmann To: Hugh Dickins Cc: linux-kernel , Michael Kerrisk , Ryan Lortie , Linus Torvalds , Andrew Morton , linux-mm , linux-fsdevel , Linux API , Greg Kroah-Hartman , John Stultz , Lennart Poettering , Daniel Mack , Kay Sievers , Tony Battersby , Andy Lutomirski Content-Type: text/plain; charset=UTF-8 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi On Wed, Jul 16, 2014 at 12:09 PM, Hugh Dickins wrote: > On Fri, 13 Jun 2014, David Herrmann wrote: > >> We currently fail setting SEAL_WRITE in case there're pending page >> references. This patch extends the pin-tests to wait up to 150ms for all >> references to be dropped. This is still not perfect in that it doesn't >> account for harmless read-only pins, but it's much better than a hard >> failure. >> >> Signed-off-by: David Herrmann > > Right, I didn't look through the patch itself, just compared the result > with what I sent. Okay, you prefer to separate out shmem_tag_pins(). The main reason why I split both is to avoid goto-label "restart" and "restart2". > Yes, it looks fine. There's just one change I'd like at this stage, > something I realized shortly after sending the code fragment: please > add a call to lru_add_drain() at the head of shmem_tag_pins(). The > reason being that lru_add_drain() is local to the cpu, so cheap, and > in many cases will bring down all the raised refcounts right then. > > Whereas lru_add_drain_all() in the first scan of shmem_wait_for_pins() > is much more expensive, involving inter-processor interrupts to do > that on all cpus: it is appropriate to call it at that point, but we > really ought to try the cheaper lru_add_drain() at the earlier stage. I added an lru_add_drain_all() to my shmem_test_pins() function in Patch 2/7. This patch dropped it again as your wait_for_pins() already included it and it's quite expensive. But yes, the local lru_add_drain() makes perfect sense. Fixed! Thanks David > I would also like never to embark on this scan of the radix_tree > and wait for pins, if the pages were never given out in a VM_SHARED > mapping - or is that unrealistic, because every memfd is read-write, > and typical initialization expected to be by mmap() rather than write()? > But anyway, you're quite right not to get into that at this stage: > it's best left as an optimization once the basics are safely in. From mboxrd@z Thu Jan 1 00:00:00 1970 From: David Herrmann Subject: Re: [RFC v3 6/7] shm: wait for pins to be released when sealing Date: Sat, 19 Jul 2014 18:36:34 +0200 Message-ID: References: <1402655819-14325-1-git-send-email-dh.herrmann@gmail.com> <1402655819-14325-7-git-send-email-dh.herrmann@gmail.com> Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Cc: linux-kernel , Michael Kerrisk , Ryan Lortie , Linus Torvalds , Andrew Morton , linux-mm , linux-fsdevel , Linux API , Greg Kroah-Hartman , John Stultz , Lennart Poettering , Daniel Mack , Kay Sievers , Tony Battersby , Andy Lutomirski To: Hugh Dickins Return-path: In-Reply-To: Sender: owner-linux-mm@kvack.org List-Id: linux-fsdevel.vger.kernel.org Hi On Wed, Jul 16, 2014 at 12:09 PM, Hugh Dickins wrote: > On Fri, 13 Jun 2014, David Herrmann wrote: > >> We currently fail setting SEAL_WRITE in case there're pending page >> references. This patch extends the pin-tests to wait up to 150ms for all >> references to be dropped. This is still not perfect in that it doesn't >> account for harmless read-only pins, but it's much better than a hard >> failure. >> >> Signed-off-by: David Herrmann > > Right, I didn't look through the patch itself, just compared the result > with what I sent. Okay, you prefer to separate out shmem_tag_pins(). The main reason why I split both is to avoid goto-label "restart" and "restart2". > Yes, it looks fine. There's just one change I'd like at this stage, > something I realized shortly after sending the code fragment: please > add a call to lru_add_drain() at the head of shmem_tag_pins(). The > reason being that lru_add_drain() is local to the cpu, so cheap, and > in many cases will bring down all the raised refcounts right then. > > Whereas lru_add_drain_all() in the first scan of shmem_wait_for_pins() > is much more expensive, involving inter-processor interrupts to do > that on all cpus: it is appropriate to call it at that point, but we > really ought to try the cheaper lru_add_drain() at the earlier stage. I added an lru_add_drain_all() to my shmem_test_pins() function in Patch 2/7. This patch dropped it again as your wait_for_pins() already included it and it's quite expensive. But yes, the local lru_add_drain() makes perfect sense. Fixed! Thanks David > I would also like never to embark on this scan of the radix_tree > and wait for pins, if the pages were never given out in a VM_SHARED > mapping - or is that unrealistic, because every memfd is read-write, > and typical initialization expected to be by mmap() rather than write()? > But anyway, you're quite right not to get into that at this stage: > it's best left as an optimization once the basics are safely in. -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@kvack.org. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: email@kvack.org