From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-16.6 required=3.0 tests=BAYES_00,DKIM_INVALID, DKIM_SIGNED,HEADER_FROM_DIFFERENT_DOMAINS,INCLUDES_CR_TRAILER,INCLUDES_PATCH, MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS,URIBL_BLOCKED,USER_AGENT_GIT autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id CC5DDC433E0 for ; Fri, 5 Mar 2021 04:23:46 +0000 (UTC) Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by mail.kernel.org (Postfix) with ESMTP id 6F4B865009 for ; Fri, 5 Mar 2021 04:23:46 +0000 (UTC) DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org 6F4B865009 Authentication-Results: mail.kernel.org; dmarc=none (p=none dis=none) header.from=infradead.org Authentication-Results: mail.kernel.org; spf=pass smtp.mailfrom=owner-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix) id 12A466B000C; Thu, 4 Mar 2021 23:23:46 -0500 (EST) Received: by kanga.kvack.org (Postfix, from userid 40) id 0DB736B000D; Thu, 4 Mar 2021 23:23:46 -0500 (EST) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id EE5EE6B000E; Thu, 4 Mar 2021 23:23:45 -0500 (EST) X-Delivered-To: linux-mm@kvack.org Received: from forelay.hostedemail.com (smtprelay0231.hostedemail.com [216.40.44.231]) by kanga.kvack.org (Postfix) with ESMTP id D2FF86B000C for ; Thu, 4 Mar 2021 23:23:45 -0500 (EST) Received: from smtpin28.hostedemail.com (10.5.19.251.rfc1918.com [10.5.19.251]) by forelay01.hostedemail.com (Postfix) with ESMTP id 8F970180ACC44 for ; Fri, 5 Mar 2021 04:23:45 +0000 (UTC) X-FDA: 77884527210.28.769B885 Received: from casper.infradead.org (casper.infradead.org [90.155.50.34]) by imf29.hostedemail.com (Postfix) with ESMTP id 13251D6 for ; Fri, 5 Mar 2021 04:23:43 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=infradead.org; s=casper.20170209; h=Content-Transfer-Encoding:MIME-Version: References:In-Reply-To:Message-Id:Date:Subject:Cc:To:From:Sender:Reply-To: Content-Type:Content-ID:Content-Description; bh=GNVeNTjh8qSM6taS0KFQZnAh6GuTzP3Y17/7pIz0/00=; b=TtnXDCNWldliiyBuYtu5pgwJ3J 6A4Fr7benmulwG2DvO+Q+TKu17CVi0sK/di9mGSFp7zNPuuqFljH4d3J8+ZDFlHh+lbdjSnclzTmq m2AG0Rk0dlKX7oCxQ2HK0vxk/RP8JBUCii22UFO7oj1lXJqt45ezcpAvYWrc9C7ITWGwLfqvzG3dd lNCQuVgZk/RMWV0WxlkClhXYZBi2EDZ7fTXSbAuOiF7rDXtND/aEoSpSWqETFl142UKZ3y1yY/Tf4 QVjIp8iR0Fcx+smd6E2jTMAKnU+0N8cDm2VTu/+NXtO1WKNcIR1xTvHqGKi/14MMh3uZrPEFZjBkR Ec+q3odQ==; Received: from willy by casper.infradead.org with local (Exim 4.94 #2 (Red Hat Linux)) id 1lI1zM-00A3lz-I3; Fri, 05 Mar 2021 04:23:01 +0000 From: "Matthew Wilcox (Oracle)" To: linux-mm@kvack.org Cc: "Matthew Wilcox (Oracle)" , linux-kernel@vger.kernel.org, linux-fsdevel@vger.kernel.org Subject: [PATCH v4 13/25] mm/filemap: Add lock_folio Date: Fri, 5 Mar 2021 04:18:49 +0000 Message-Id: <20210305041901.2396498-14-willy@infradead.org> X-Mailer: git-send-email 2.29.2 In-Reply-To: <20210305041901.2396498-1-willy@infradead.org> References: <20210305041901.2396498-1-willy@infradead.org> MIME-Version: 1.0 X-Rspamd-Server: rspam03 X-Rspamd-Queue-Id: 13251D6 X-Stat-Signature: w6a5m8hz6y1nanmtcur8xxoqq81r1zgb Received-SPF: none (infradead.org>: No applicable sender policy available) receiver=imf29; identity=mailfrom; envelope-from=""; helo=casper.infradead.org; client-ip=90.155.50.34 X-HE-DKIM-Result: pass/pass X-HE-Tag: 1614918223-590709 Content-Transfer-Encoding: quoted-printable X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: This is like lock_page() but for use by callers who know they have a foli= o. Convert __lock_page() to be __lock_folio(). This saves one call to compound_head() per contended call to lock_page(). Saves 362 bytes of text; mostly from improved register allocation and inlining decisions. __lock_folio is 59 bytes while __lock_page was 79. Signed-off-by: Matthew Wilcox (Oracle) --- include/linux/pagemap.h | 24 +++++++++++++++++++----- mm/filemap.c | 29 +++++++++++++++-------------- 2 files changed, 34 insertions(+), 19 deletions(-) diff --git a/include/linux/pagemap.h b/include/linux/pagemap.h index a34cf531c100..034e41256340 100644 --- a/include/linux/pagemap.h +++ b/include/linux/pagemap.h @@ -635,7 +635,7 @@ static inline bool wake_page_match(struct wait_page_q= ueue *wait_page, return true; } =20 -extern void __lock_page(struct page *page); +void __lock_folio(struct folio *folio); extern int __lock_page_killable(struct page *page); extern int __lock_page_async(struct page *page, struct wait_page_queue *= wait); extern int __lock_page_or_retry(struct page *page, struct mm_struct *mm, @@ -643,13 +643,24 @@ extern int __lock_page_or_retry(struct page *page, = struct mm_struct *mm, void unlock_page(struct page *page); void unlock_folio(struct folio *folio); =20 +static inline bool trylock_folio(struct folio *folio) +{ + return likely(!test_and_set_bit_lock(PG_locked, folio_flags(folio))); +} + /* * Return true if the page was successfully locked */ static inline int trylock_page(struct page *page) { - page =3D compound_head(page); - return (likely(!test_and_set_bit_lock(PG_locked, &page->flags))); + return trylock_folio(page_folio(page)); +} + +static inline void lock_folio(struct folio *folio) +{ + might_sleep(); + if (!trylock_folio(folio)) + __lock_folio(folio); } =20 /* @@ -657,9 +668,12 @@ static inline int trylock_page(struct page *page) */ static inline void lock_page(struct page *page) { + struct folio *folio; might_sleep(); - if (!trylock_page(page)) - __lock_page(page); + + folio =3D page_folio(page); + if (!trylock_folio(folio)) + __lock_folio(folio); } =20 /* diff --git a/mm/filemap.c b/mm/filemap.c index 9960ef1b2758..3e3e3c666b94 100644 --- a/mm/filemap.c +++ b/mm/filemap.c @@ -1160,7 +1160,7 @@ static void wake_up_page(struct page *page, int bit= ) */ enum behavior { EXCLUSIVE, /* Hold ref to page and take the bit when woken, like - * __lock_page() waiting on then setting PG_locked. + * __lock_folio() waiting on then setting PG_locked. */ SHARED, /* Hold ref to page and check the bit when woken, like * wait_on_page_writeback() waiting on PG_writeback. @@ -1488,17 +1488,16 @@ void page_endio(struct page *page, bool is_write,= int err) EXPORT_SYMBOL_GPL(page_endio); =20 /** - * __lock_page - get a lock on the page, assuming we need to sleep to ge= t it - * @__page: the page to lock + * __lock_folio - Get a lock on the folio, assuming we need to sleep to = get it. + * @folio: The folio to lock */ -void __lock_page(struct page *__page) +void __lock_folio(struct folio *folio) { - struct page *page =3D compound_head(__page); - wait_queue_head_t *q =3D page_waitqueue(page); - wait_on_page_bit_common(q, page, PG_locked, TASK_UNINTERRUPTIBLE, + wait_queue_head_t *q =3D page_waitqueue(&folio->page); + wait_on_page_bit_common(q, &folio->page, PG_locked, TASK_UNINTERRUPTIBL= E, EXCLUSIVE); } -EXPORT_SYMBOL(__lock_page); +EXPORT_SYMBOL(__lock_folio); =20 int __lock_page_killable(struct page *__page) { @@ -1573,10 +1572,10 @@ int __lock_page_or_retry(struct page *page, struc= t mm_struct *mm, return 0; } } else { - __lock_page(page); + __lock_folio(page_folio(page)); } - return 1; =20 + return 1; } =20 /** @@ -2720,7 +2719,9 @@ loff_t mapping_seek_hole_data(struct address_space = *mapping, loff_t start, static int lock_page_maybe_drop_mmap(struct vm_fault *vmf, struct page *= page, struct file **fpin) { - if (trylock_page(page)) + struct folio *folio =3D page_folio(page); + + if (trylock_folio(folio)) return 1; =20 /* @@ -2733,7 +2734,7 @@ static int lock_page_maybe_drop_mmap(struct vm_faul= t *vmf, struct page *page, =20 *fpin =3D maybe_unlock_mmap_for_io(vmf, *fpin); if (vmf->flags & FAULT_FLAG_KILLABLE) { - if (__lock_page_killable(page)) { + if (__lock_page_killable(&folio->page)) { /* * We didn't have the right flags to drop the mmap_lock, * but all fault_handlers only check for fatal signals @@ -2745,11 +2746,11 @@ static int lock_page_maybe_drop_mmap(struct vm_fa= ult *vmf, struct page *page, return 0; } } else - __lock_page(page); + __lock_folio(folio); + return 1; } =20 - /* * Synchronous readahead happens when we don't even find a page in the p= age * cache at all. We don't want to perform IO under the mmap sem, so if = we have --=20 2.30.0