From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-8.7 required=3.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,HEADER_FROM_DIFFERENT_DOMAINS,INCLUDES_PATCH,MAILING_LIST_MULTI, SPF_HELO_NONE,SPF_PASS,URIBL_BLOCKED autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 62EF9C4361B for ; Thu, 10 Dec 2020 12:14:58 +0000 (UTC) Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by mail.kernel.org (Postfix) with ESMTP id D172523380 for ; Thu, 10 Dec 2020 12:14:57 +0000 (UTC) DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org D172523380 Authentication-Results: mail.kernel.org; dmarc=fail (p=none dis=none) header.from=bytedance.com Authentication-Results: mail.kernel.org; spf=pass smtp.mailfrom=owner-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix) id 1FBFA6B0072; Thu, 10 Dec 2020 07:14:57 -0500 (EST) Received: by kanga.kvack.org (Postfix, from userid 40) id 1AA8E6B0073; Thu, 10 Dec 2020 07:14:57 -0500 (EST) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 072C36B0074; Thu, 10 Dec 2020 07:14:57 -0500 (EST) X-Delivered-To: linux-mm@kvack.org Received: from forelay.hostedemail.com (smtprelay0012.hostedemail.com [216.40.44.12]) by kanga.kvack.org (Postfix) with ESMTP id DDDFD6B0072 for ; Thu, 10 Dec 2020 07:14:56 -0500 (EST) Received: from smtpin24.hostedemail.com (10.5.19.251.rfc1918.com [10.5.19.251]) by forelay03.hostedemail.com (Postfix) with ESMTP id A36F38249980 for ; Thu, 10 Dec 2020 12:14:56 +0000 (UTC) X-FDA: 77577266592.24.toe10_0116a73273f8 Received: from filter.hostedemail.com (10.5.16.251.rfc1918.com [10.5.16.251]) by smtpin24.hostedemail.com (Postfix) with ESMTP id 8410D1A4A7 for ; Thu, 10 Dec 2020 12:14:56 +0000 (UTC) X-HE-Tag: toe10_0116a73273f8 X-Filterd-Recvd-Size: 5873 Received: from mail-pf1-f196.google.com (mail-pf1-f196.google.com [209.85.210.196]) by imf39.hostedemail.com (Postfix) with ESMTP for ; Thu, 10 Dec 2020 12:14:55 +0000 (UTC) Received: by mail-pf1-f196.google.com with SMTP id w6so3899791pfu.1 for ; Thu, 10 Dec 2020 04:14:55 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=bytedance-com.20150623.gappssmtp.com; s=20150623; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=QxFGFJsxZYls/I5RHSRMKxF31Bhdbu1ViR2qIipRD4o=; b=pyeUveG5izDmcPi3YPsdLwxm0UGDjaVIvqkfLtY9ehokEsyj0koOU+oO137aCVQe1n S9HbuVil3IFRqdfB7r2OcFUWgLNJW+pv/J+/Keq+c6ZggD153zxP6abVtB9FFrULXeR2 jUAGtgQeJeZNEyEgghMJ7bReovymKtcyGa5Jh7/AyHNE6JX+aJWPEVd5B78XNf0VWh9q e2zxpabuIfdVa1n0VaeTS3ZCHPLpW0+1egqVx2c63LJbOKyXdERXCiy5MMkbEci2BvI3 LmqDjXkDLvHx19/lRyQ5jdx29rjQnuu+X6pw/+sFi6+hSszzz4+h8lWuSzqjDIkkm4bU yRgQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=QxFGFJsxZYls/I5RHSRMKxF31Bhdbu1ViR2qIipRD4o=; b=B2cxaITGJjYfdZamqHaZrWTahle1rWqj1c3H0eUlmpJv8HjdwouLSu0tqQmzk2qHt9 gTlUm//Cc2QqLST4a0YkQHfawJ6rZvn/pawM5Qm4PEFlaOl5mRnx6P2EOrzcEYVv+zep HdrH4uTG7d5+6GW7cdQaouKLFwdQ6vc4K30ETfWQGz4t2IZOauepfwcLOLYEvShRQFVn YWRl92KKfhENWoRbcQBhMTf3/fUhiLlVyoqeCHX4JdgLTMla2eXEJgCiT0ECIe/gBEX0 VEB5yDZOek+xv49JXQd+k91t4+k6wwTJ3LRyH6gpgfxX5t1dUCZcLuknWxtb/zaOWIHv xYig== X-Gm-Message-State: AOAM5326NfxZJqufFkHT0Ulv51q90vhYjKcYvX30u8Pj1ehBX4dAuJxD WZ1p9+uCPdvFUaXJBN2+AODO8mZD3wGV4ai46YwfxA== X-Google-Smtp-Source: ABdhPJw0Q3sbE5ilZgEAGeKR/BMorRwGHmKVqeOb9m/6WNpEGR39LDL8qXPIi5AGsXa7U+Z8+8NR0X+NUb+G4ihOY1Q= X-Received: by 2002:a63:c15:: with SMTP id b21mr6294519pgl.341.1607602494485; Thu, 10 Dec 2020 04:14:54 -0800 (PST) MIME-Version: 1.0 References: <20201210035526.38938-1-songmuchun@bytedance.com> <20201210035526.38938-13-songmuchun@bytedance.com> <375d6bad6bb37e3626f71bfabc20b384@suse.de> In-Reply-To: <375d6bad6bb37e3626f71bfabc20b384@suse.de> From: Muchun Song Date: Thu, 10 Dec 2020 20:14:18 +0800 Message-ID: Subject: Re: [External] Re: [PATCH v8 12/12] mm/hugetlb: Optimize the code with the help of the compiler To: Oscar Salvador Cc: Jonathan Corbet , Mike Kravetz , Thomas Gleixner , mingo@redhat.com, bp@alien8.de, x86@kernel.org, hpa@zytor.com, dave.hansen@linux.intel.com, luto@kernel.org, Peter Zijlstra , viro@zeniv.linux.org.uk, Andrew Morton , paulmck@kernel.org, mchehab+huawei@kernel.org, pawan.kumar.gupta@linux.intel.com, Randy Dunlap , oneukum@suse.com, anshuman.khandual@arm.com, jroedel@suse.de, Mina Almasry , David Rientjes , Matthew Wilcox , Michal Hocko , "Song Bao Hua (Barry Song)" , David Hildenbrand , Xiongchun duan , linux-doc@vger.kernel.org, LKML , Linux Memory Management List , linux-fsdevel Content-Type: text/plain; charset="UTF-8" X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: On Thu, Dec 10, 2020 at 7:39 PM Oscar Salvador wrote: > > On 2020-12-10 04:55, Muchun Song wrote: > > We cannot optimize if a "struct page" crosses page boundaries. If > > it is true, we can optimize the code with the help of a compiler. > > When free_vmemmap_pages_per_hpage() returns zero, most functions are > > optimized by the compiler. > > As I said earlier, I would squash this patch with patch#10 and > remove the !is_power_of_2 check in hugetlb_vmemmap_init and leave > only the check for the boot parameter. > That should be enough. Yeah, you are right. I just want the compiler to do optimization. > > > static inline bool is_hugetlb_free_vmemmap_enabled(void) > > { > > - return hugetlb_free_vmemmap_enabled; > > + return hugetlb_free_vmemmap_enabled && > > + is_power_of_2(sizeof(struct page)); > > Why? hugetlb_free_vmemmap_enabled can only become true > if the is_power_of_2 check succeeds in early_hugetlb_free_vmemmap_param. > The "is_power_of_2" check here can go. > > > diff --git a/mm/hugetlb_vmemmap.h b/mm/hugetlb_vmemmap.h > > index 0a1c0d33a316..5f5e90c81cd2 100644 > > --- a/mm/hugetlb_vmemmap.h > > +++ b/mm/hugetlb_vmemmap.h > > @@ -21,7 +21,7 @@ void free_huge_page_vmemmap(struct hstate *h, struct > > page *head); > > */ > > static inline unsigned int free_vmemmap_pages_per_hpage(struct hstate > > *h) > > { > > - return h->nr_free_vmemmap_pages; > > + return h->nr_free_vmemmap_pages && is_power_of_2(sizeof(struct > > page)); > > If hugetlb_free_vmemmap_enabled is false, hugetlb_vmemmap_init() leaves > h->nr_free_vmemmap_pages unset to 0, so no need for the is_power_of_2 > check here. Yeah, you are right. But if we do this check can make the code simple. For example, here is a code snippet. void func(void) { if (free_vmemmap_pages_per_hpage()) return; /* Do something */ } With this patch, the func will be optimized to null when is_power_of_2 returns false. void func(void) { } Without this patch, the compiler cannot do this optimization. Thanks. > > > -- > Oscar Salvador > SUSE L3 -- Yours, Muchun