Subject: Re: [PATCH V15] mm/debug: Add tests validating architecture page table helpers
To: Christophe Leroy, Qian Cai
Cc: linux-mm@kvack.org, Andrew Morton, Mike Rapoport, Vineet Gupta,
 Catalin Marinas, Will Deacon, Benjamin Herrenschmidt, Paul Mackerras,
 Michael Ellerman, Heiko Carstens, Vasily Gorbik, Christian Borntraeger,
 Thomas Gleixner, Ingo Molnar, Borislav Petkov, "H. Peter Anvin",
 "Kirill A . Shutemov", Paul Walmsley, Palmer Dabbelt,
 linux-snps-arc@lists.infradead.org, linux-arm-kernel@lists.infradead.org,
 linuxppc-dev@lists.ozlabs.org, linux-s390@vger.kernel.org,
 linux-riscv@lists.infradead.org, x86@kernel.org, linux-arch@vger.kernel.org,
 linux-kernel@vger.kernel.org
References: <61250cdc-f80b-2e50-5168-2ec67ec6f1e6@arm.com>
From: Anshuman Khandual
Message-ID: <2d950d8c-4b23-741e-591f-e22e857c0755@arm.com>
Date: Tue, 10 Mar 2020 08:01:51 +0530

On 03/07/2020 12:35 PM, Christophe Leroy wrote:
>
> On 07/03/2020 at 01:56, Anshuman Khandual wrote:
>>
>> On 03/07/2020 06:04 AM, Qian Cai wrote:
>>>
>>>> On Mar 6, 2020, at 7:03 PM, Anshuman Khandual wrote:
>>>>
>>>> Hmm, set_pte_at() function is not preferred here for these tests.
>>>> The idea is to avoid or at least minimize TLB/cache flushes triggered from
>>>> these sort of 'static' tests. set_pte_at() is platform provided and might
>>>> trigger these flushes or some other platform specific synchronization. Just
>>>
>>> Why is that important for this debugging option?
>>
>> The primary reason is to avoid TLB/cache flush instructions on the system
>> during these tests, which only involve transforming different page table
>> level entries through helpers. Unless really necessary, why should it
>> emit any TLB/cache flush instructions?
>
> What's the problem with those flushes?
>
>>
>>>
>>>> wondering is there a specific reason with respect to the soft lock up problem
>>>> making it necessary to use set_pte_at() rather than a simple WRITE_ONCE()?
>>>
>>> Looking at the s390 version of set_pte_at(), it has this comment,
>>> vmaddr);
>>>
>>> /*
>>>  * Certain architectures need to do special things when PTEs
>>>  * within a page table are directly modified.  Thus, the following
>>>  * hook is made available.
>>>  */
>>>
>>> I can only guess that powerpc could be the same here.
>>
>> This comment is present in multiple platforms while defining set_pte_at().
>> Is 'barrier()' alone not good enough here? Otherwise, what exactly does
>> set_pte_at() do, compared to WRITE_ONCE(), that avoids the soft lock up?
>> Just trying to understand.
>>
>
>
> Argh! I didn't realise that you were writing directly into the page tables.
> When it works, that's only by chance I guess.
>
> To properly set the page table entries, set_pte_at() has to be used:
> - On powerpc 8xx, with 16k pages, the page table entry must be copied four
>   times. set_pte_at() does it, WRITE_ONCE() doesn't.
> - On powerpc book3s/32 (hash MMU), the flag _PAGE_HASHPTE must be preserved
>   across writes. set_pte_at() preserves it, WRITE_ONCE() doesn't.
>
> set_pte_at() also does a few other mandatory things, like calling pte_mkpte()
>
> So, the WRITE_ONCE() must definitely become a set_pte_at()

Sure, will do. These are part of the clear tests, which populate a given
entry with a non-zero value before clearing it and testing it with pxx_none().
In that context, WRITE_ONCE() seemed sufficient. But pte_clear() might be
closely tied to a proper page table entry update, and hence a preceding
set_pte_at() will be better.

There are still more WRITE_ONCE() calls for other page table levels in these
clear tests. set_pmd_at() and set_pud_at() are only defined on platforms that
support (and enable) THP and PUD based THP respectively. Hence they cannot be
used for the clear tests, because the remaining helpers pmd_clear(),
pud_clear(), p4d_clear() and pgd_clear() still need to be validated with or
without THP support and enablement. We should just leave all other
WRITE_ONCE() instances unchanged. Please correct me if I am missing something
here.

>
> Christophe
>
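
For readers following the thread, the two powerpc points quoted above (the entry replicated four times on 8xx with 16k pages, and _PAGE_HASHPTE preserved on book3s/32) can be illustrated with a small userspace model. This is only a sketch and not kernel code: every name in it (model_pte_t, MODEL_PAGE_HASHPTE, model_raw_write, model_set_pte_at, model_copies_consistent) is hypothetical, the bit value is arbitrary, and the two behaviours, which belong to different platforms in the thread, are merged into one helper purely for brevity.

```c
/*
 * Userspace model only, NOT kernel code. All identifiers here are
 * hypothetical stand-ins chosen for illustration.
 */
#include <assert.h>
#include <stdint.h>

#define MODEL_COPIES       4     /* models "entry must be copied four times" */
#define MODEL_PAGE_HASHPTE 0x2u  /* arbitrary stand-in for a bit that must survive writes */

typedef uint32_t model_pte_t;

/* Models a plain store: only the first slot changes, every bit is overwritten. */
void model_raw_write(model_pte_t *slot, model_pte_t val)
{
    slot[0] = val;
}

/* Models a set_pte_at()-like helper: keeps the tracked bit, fills every copy. */
void model_set_pte_at(model_pte_t *slot, model_pte_t val)
{
    model_pte_t keep = slot[0] & MODEL_PAGE_HASHPTE;
    int i;

    for (i = 0; i < MODEL_COPIES; i++)
        slot[i] = (val & ~MODEL_PAGE_HASHPTE) | keep;
}

/* Returns 1 when all copies of the entry hold the same value, 0 otherwise. */
int model_copies_consistent(const model_pte_t *slot)
{
    int i;

    for (i = 1; i < MODEL_COPIES; i++)
        if (slot[i] != slot[0])
            return 0;
    return 1;
}
```

In this model, a raw write over an entry that already carries the tracked bit leaves the four copies disagreeing and drops the bit, while the helper keeps both invariants. That is the sense in which a direct store can appear to work "only by chance".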