All of lore.kernel.org
 help / color / mirror / Atom feed
From: Steven Price <steven.price@arm.com>
To: Mike Rapoport <rppt@linux.ibm.com>
Cc: "Mark Rutland" <Mark.Rutland@arm.com>,
	"Dave Hansen" <dave.hansen@linux.intel.com>,
	"Arnd Bergmann" <arnd@arndb.de>,
	"Ard Biesheuvel" <ard.biesheuvel@linaro.org>,
	"Peter Zijlstra" <peterz@infradead.org>,
	"Catalin Marinas" <catalin.marinas@arm.com>,
	x86@kernel.org, "Will Deacon" <will.deacon@arm.com>,
	linux-kernel@vger.kernel.org, linux-mm@kvack.org,
	"Jérôme Glisse" <jglisse@redhat.com>,
	"Ingo Molnar" <mingo@redhat.com>,
	"Borislav Petkov" <bp@alien8.de>,
	"Andy Lutomirski" <luto@kernel.org>,
	"H. Peter Anvin" <hpa@zytor.com>,
	"James Morse" <james.morse@arm.com>,
	"Thomas Gleixner" <tglx@linutronix.de>,
	linux-arm-kernel@lists.infradead.org, "Liang,
	Kan" <kan.liang@linux.intel.com>
Subject: Re: [PATCH v5 10/19] mm: pagewalk: Add p4d_entry() and pgd_entry()
Date: Fri, 22 Mar 2019 10:37:14 +0000	[thread overview]
Message-ID: <f3ec9d74-f578-117a-6529-469089e46788@arm.com> (raw)
In-Reply-To: <20190322102930.GA24367@rapoport-lnx>

On 22/03/2019 10:29, Mike Rapoport wrote:
> On Fri, Mar 22, 2019 at 10:11:59AM +0000, Steven Price wrote:
>> On 21/03/2019 21:15, Mike Rapoport wrote:
>>> On Thu, Mar 21, 2019 at 02:19:44PM +0000, Steven Price wrote:
>>>> pgd_entry() and pud_entry() were removed by commit 0b1fbfe50006c410
>>>> ("mm/pagewalk: remove pgd_entry() and pud_entry()") because there were
>>>> no users. We're about to add users so reintroduce them, along with
>>>> p4d_entry() as we now have 5 levels of tables.
>>>>
>>>> Note that commit a00cc7d9dd93d66a ("mm, x86: add support for
>>>> PUD-sized transparent hugepages") already re-added pud_entry() but with
>>>> different semantics to the other callbacks. Since there have never
>>>> been upstream users of this, revert the semantics back to match the
>>>> other callbacks. This means pud_entry() is called for all entries, not
>>>> just transparent huge pages.
>>>>
>>>> Signed-off-by: Steven Price <steven.price@arm.com>
>>>> ---
>>>>  include/linux/mm.h |  9 ++++++---
>>>>  mm/pagewalk.c      | 27 ++++++++++++++++-----------
>>>>  2 files changed, 22 insertions(+), 14 deletions(-)
>>>>
>>>> diff --git a/include/linux/mm.h b/include/linux/mm.h
>>>> index 76769749b5a5..2983f2396a72 100644
>>>> --- a/include/linux/mm.h
>>>> +++ b/include/linux/mm.h
>>>> @@ -1367,10 +1367,9 @@ void unmap_vmas(struct mmu_gather *tlb, struct vm_area_struct *start_vma,
>>>>
>>>>  /**
>>>>   * mm_walk - callbacks for walk_page_range
>>>> + * @pgd_entry: if set, called for each non-empty PGD (top-level) entry
>>>> + * @p4d_entry: if set, called for each non-empty P4D (1st-level) entry
>>>
>>> IMHO, p4d implies the 4th level :)
>>
>> You have a good point there... I was simply working back from the
>> existing definitions (below) of PTE:4th, PMD:3rd, PUD:2nd. But it's
>> already somewhat broken by PGD:0th and my cop-out was calling it "top".
>>
>>> I think it would make more sense to start counting from PTE rather than
>>> from PGD. Then it would be consistent across architectures with fewer
>>> levels.
>>
>> It would also be the opposite way round to architectures such as Arm
>> which number their levels, for example [1] refers to levels 0-3 (with 3
>> being PTE in Linux terms).
> 
> By consistent I meant that for architectures with fewer levels we won't be
> describing PTE as level 4 when the architecture only has 2 levels.

Ah I see, although we've apparently been doing that for over a decade
already[2] :)

[2]
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=e6473092bd9116583ce9ab8cf1b6570e1aa6fc83

>> [1]
>> https://developer.arm.com/docs/100940/latest/translation-tables-in-armv8-a
>>
>> Probably the least confusing thing is to drop the level numbers in
>> brackets since I don't believe they directly match any architecture, and
>> hopefully any user of the page walking code is already familiar with the
>> P?D terms used by the kernel.
> 
> That's a fair assumption :)
> Still, maybe we keep your (top-level) for PGD and use (lowest level) for
> PTE and drop those in the middle?

Yes that's a good compromise.

Thanks,

Steve

WARNING: multiple messages have this Message-ID (diff)
From: Steven Price <steven.price@arm.com>
To: Mike Rapoport <rppt@linux.ibm.com>
Cc: "Mark Rutland" <Mark.Rutland@arm.com>,
	x86@kernel.org, "Arnd Bergmann" <arnd@arndb.de>,
	"Ard Biesheuvel" <ard.biesheuvel@linaro.org>,
	"Peter Zijlstra" <peterz@infradead.org>,
	"Catalin Marinas" <catalin.marinas@arm.com>,
	"Dave Hansen" <dave.hansen@linux.intel.com>,
	"Will Deacon" <will.deacon@arm.com>,
	linux-kernel@vger.kernel.org, linux-mm@kvack.org,
	"Jérôme Glisse" <jglisse@redhat.com>,
	"Ingo Molnar" <mingo@redhat.com>,
	"Borislav Petkov" <bp@alien8.de>,
	"Andy Lutomirski" <luto@kernel.org>,
	"H. Peter Anvin" <hpa@zytor.com>,
	"James Morse" <james.morse@arm.com>,
	"Thomas Gleixner" <tglx@linutronix.de>,
	linux-arm-kernel@lists.infradead.org, "Liang,
	Kan" <kan.liang@linux.intel.com>
Subject: Re: [PATCH v5 10/19] mm: pagewalk: Add p4d_entry() and pgd_entry()
Date: Fri, 22 Mar 2019 10:37:14 +0000	[thread overview]
Message-ID: <f3ec9d74-f578-117a-6529-469089e46788@arm.com> (raw)
In-Reply-To: <20190322102930.GA24367@rapoport-lnx>

On 22/03/2019 10:29, Mike Rapoport wrote:
> On Fri, Mar 22, 2019 at 10:11:59AM +0000, Steven Price wrote:
>> On 21/03/2019 21:15, Mike Rapoport wrote:
>>> On Thu, Mar 21, 2019 at 02:19:44PM +0000, Steven Price wrote:
>>>> pgd_entry() and pud_entry() were removed by commit 0b1fbfe50006c410
>>>> ("mm/pagewalk: remove pgd_entry() and pud_entry()") because there were
>>>> no users. We're about to add users so reintroduce them, along with
>>>> p4d_entry() as we now have 5 levels of tables.
>>>>
>>>> Note that commit a00cc7d9dd93d66a ("mm, x86: add support for
>>>> PUD-sized transparent hugepages") already re-added pud_entry() but with
>>>> different semantics to the other callbacks. Since there have never
>>>> been upstream users of this, revert the semantics back to match the
>>>> other callbacks. This means pud_entry() is called for all entries, not
>>>> just transparent huge pages.
>>>>
>>>> Signed-off-by: Steven Price <steven.price@arm.com>
>>>> ---
>>>>  include/linux/mm.h |  9 ++++++---
>>>>  mm/pagewalk.c      | 27 ++++++++++++++++-----------
>>>>  2 files changed, 22 insertions(+), 14 deletions(-)
>>>>
>>>> diff --git a/include/linux/mm.h b/include/linux/mm.h
>>>> index 76769749b5a5..2983f2396a72 100644
>>>> --- a/include/linux/mm.h
>>>> +++ b/include/linux/mm.h
>>>> @@ -1367,10 +1367,9 @@ void unmap_vmas(struct mmu_gather *tlb, struct vm_area_struct *start_vma,
>>>>
>>>>  /**
>>>>   * mm_walk - callbacks for walk_page_range
>>>> + * @pgd_entry: if set, called for each non-empty PGD (top-level) entry
>>>> + * @p4d_entry: if set, called for each non-empty P4D (1st-level) entry
>>>
>>> IMHO, p4d implies the 4th level :)
>>
>> You have a good point there... I was simply working back from the
>> existing definitions (below) of PTE:4th, PMD:3rd, PUD:2nd. But it's
>> already somewhat broken by PGD:0th and my cop-out was calling it "top".
>>
>>> I think it would make more sense to start counting from PTE rather than
>>> from PGD. Then it would be consistent across architectures with fewer
>>> levels.
>>
>> It would also be the opposite way round to architectures such as Arm
>> which number their levels, for example [1] refers to levels 0-3 (with 3
>> being PTE in Linux terms).
> 
> By consistent I meant that for architectures with fewer levels we won't be
> describing PTE as level 4 when the architecture only has 2 levels.

Ah I see, although we've apparently been doing that for over a decade
already[2] :)

[2]
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=e6473092bd9116583ce9ab8cf1b6570e1aa6fc83

>> [1]
>> https://developer.arm.com/docs/100940/latest/translation-tables-in-armv8-a
>>
>> Probably the least confusing thing is to drop the level numbers in
>> brackets since I don't believe they directly match any architecture, and
>> hopefully any user of the page walking code is already familiar with the
>> P?D terms used by the kernel.
> 
> That's a fair assumption :)
> Still, maybe we keep your (top-level) for PGD and use (lowest level) for
> PTE and drop those in the middle?

Yes that's a good compromise.

Thanks,

Steve

_______________________________________________
linux-arm-kernel mailing list
linux-arm-kernel@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/linux-arm-kernel

  reply	other threads:[~2019-03-22 10:37 UTC|newest]

Thread overview: 53+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2019-03-21 14:19 [PATCH v5 00/19] Convert x86 & arm64 to use generic page walk Steven Price
2019-03-21 14:19 ` Steven Price
2019-03-21 14:19 ` [PATCH v5 01/19] arc: mm: Add p?d_large() definitions Steven Price
2019-03-21 14:19   ` Steven Price
2019-03-21 14:19   ` Steven Price
2019-03-21 14:19 ` [PATCH v5 02/19] arm64: " Steven Price
2019-03-21 14:19   ` Steven Price
2019-03-21 14:19 ` [PATCH v5 03/19] mips: " Steven Price
2019-03-21 14:19   ` Steven Price
2019-03-21 14:19 ` [PATCH v5 04/19] powerpc: " Steven Price
2019-03-21 14:19   ` Steven Price
2019-03-21 14:19   ` Steven Price
2019-03-21 14:19   ` Steven Price
2019-03-21 14:19 ` [PATCH v5 05/19] riscv: " Steven Price
2019-03-21 14:19   ` Steven Price
2019-03-21 14:19   ` Steven Price
2019-03-21 14:19 ` [PATCH v5 06/19] s390: " Steven Price
2019-03-21 14:19   ` Steven Price
2019-03-21 14:19 ` [PATCH v5 07/19] sparc: " Steven Price
2019-03-21 14:19   ` Steven Price
2019-03-21 14:19   ` Steven Price
2019-03-21 14:19 ` [PATCH v5 08/19] x86: " Steven Price
2019-03-21 14:19   ` Steven Price
2019-03-21 14:19 ` [PATCH v5 09/19] mm: Add generic p?d_large() macros Steven Price
2019-03-21 14:19   ` Steven Price
2019-03-21 14:19 ` [PATCH v5 10/19] mm: pagewalk: Add p4d_entry() and pgd_entry() Steven Price
2019-03-21 14:19   ` Steven Price
2019-03-21 21:15   ` Mike Rapoport
2019-03-21 21:15     ` Mike Rapoport
2019-03-22 10:11     ` Steven Price
2019-03-22 10:11       ` Steven Price
2019-03-22 10:29       ` Mike Rapoport
2019-03-22 10:29         ` Mike Rapoport
2019-03-22 10:37         ` Steven Price [this message]
2019-03-22 10:37           ` Steven Price
2019-03-21 14:19 ` [PATCH v5 11/19] mm: pagewalk: Allow walking without vma Steven Price
2019-03-21 14:19   ` Steven Price
2019-03-21 14:19 ` [PATCH v5 12/19] mm: pagewalk: Add test_p?d callbacks Steven Price
2019-03-21 14:19   ` Steven Price
2019-03-21 14:19 ` [PATCH v5 13/19] arm64: mm: Convert mm/dump.c to use walk_page_range() Steven Price
2019-03-21 14:19   ` Steven Price
2019-03-21 14:19 ` [PATCH v5 14/19] x86: mm: Don't display pages which aren't present in debugfs Steven Price
2019-03-21 14:19   ` Steven Price
2019-03-21 14:19 ` [PATCH v5 15/19] x86: mm: Point to struct seq_file from struct pg_state Steven Price
2019-03-21 14:19   ` Steven Price
2019-03-21 14:19 ` [PATCH v5 16/19] x86: mm+efi: Convert ptdump_walk_pgd_level() to take a mm_struct Steven Price
2019-03-21 14:19   ` Steven Price
2019-03-21 14:19 ` [PATCH v5 17/19] x86: mm: Convert ptdump_walk_pgd_level_debugfs() to take an mm_struct Steven Price
2019-03-21 14:19   ` Steven Price
2019-03-21 14:19 ` [PATCH v5 18/19] x86: mm: Convert ptdump_walk_pgd_level_core() " Steven Price
2019-03-21 14:19   ` Steven Price
2019-03-21 14:19 ` [PATCH v5 19/19] x86: mm: Convert dump_pagetables to use walk_page_range Steven Price
2019-03-21 14:19   ` Steven Price

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=f3ec9d74-f578-117a-6529-469089e46788@arm.com \
    --to=steven.price@arm.com \
    --cc=Mark.Rutland@arm.com \
    --cc=ard.biesheuvel@linaro.org \
    --cc=arnd@arndb.de \
    --cc=bp@alien8.de \
    --cc=catalin.marinas@arm.com \
    --cc=dave.hansen@linux.intel.com \
    --cc=hpa@zytor.com \
    --cc=james.morse@arm.com \
    --cc=jglisse@redhat.com \
    --cc=kan.liang@linux.intel.com \
    --cc=linux-arm-kernel@lists.infradead.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=luto@kernel.org \
    --cc=mingo@redhat.com \
    --cc=peterz@infradead.org \
    --cc=rppt@linux.ibm.com \
    --cc=tglx@linutronix.de \
    --cc=will.deacon@arm.com \
    --cc=x86@kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.