linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* [PATCH v2] mm: re-allow pinning of zero pfns (again)
@ 2022-08-10 16:53 Alex Williamson
  2022-08-26 19:01 ` Laba, SlawomirX
  2022-08-28  0:59 ` John Hubbard
  0 siblings, 2 replies; 6+ messages in thread
From: Alex Williamson @ 2022-08-10 16:53 UTC (permalink / raw)
  To: akpm
  Cc: alex.sierra, willy, hch, Felix.Kuehling, apopple, david,
	linux-mm, linux-kernel

The below referenced commit makes the same error as 1c563432588d ("mm: fix
is_pinnable_page against a cma page"), re-interpreting the logic to exclude
pinning of the zero page, which breaks device assignment with vfio.

To avoid further subtle mistakes, split the logic into discrete tests.

Suggested-by: Matthew Wilcox <willy@infradead.org>
Suggested-by: Felix Kuehling <felix.kuehling@amd.com>
Link: https://lore.kernel.org/all/165490039431.944052.12458624139225785964.stgit@omen
Fixes: f25cbb7a95a2 ("mm: add zone device coherent type memory support")
Signed-off-by: Alex Williamson <alex.williamson@redhat.com>
---
 include/linux/mm.h |   17 ++++++++++++++---
 1 file changed, 14 insertions(+), 3 deletions(-)

diff --git a/include/linux/mm.h b/include/linux/mm.h
index 18e01474cf6b..835106a9718f 100644
--- a/include/linux/mm.h
+++ b/include/linux/mm.h
@@ -1544,9 +1544,20 @@ static inline bool is_longterm_pinnable_page(struct page *page)
 	if (mt == MIGRATE_CMA || mt == MIGRATE_ISOLATE)
 		return false;
 #endif
-	return !(is_device_coherent_page(page) ||
-		 is_zone_movable_page(page) ||
-		 is_zero_pfn(page_to_pfn(page)));
+	/*
+	 * The zero page might reside in a movable zone, however it may not
+	 * be migrated and can therefore be pinned.  The vfio subsystem pins
+	 * user mappings including the zero page for IOMMU translation.
+	 */
+	if (is_zero_pfn(page_to_pfn(page)))
+		return true;
+
+	/* Coherent device memory must always allow eviction. */
+	if (is_device_coherent_page(page))
+		return false;
+
+	/* Otherwise, non-movable zone pages can be pinned. */
+	return !is_zone_movable_page(page);
 }
 #else
 static inline bool is_longterm_pinnable_page(struct page *page)



^ permalink raw reply related	[flat|nested] 6+ messages in thread

* RE: [PATCH v2] mm: re-allow pinning of zero pfns (again)
  2022-08-10 16:53 [PATCH v2] mm: re-allow pinning of zero pfns (again) Alex Williamson
@ 2022-08-26 19:01 ` Laba, SlawomirX
  2022-08-28  0:59 ` John Hubbard
  1 sibling, 0 replies; 6+ messages in thread
From: Laba, SlawomirX @ 2022-08-26 19:01 UTC (permalink / raw)
  To: Alex Williamson, akpm
  Cc: alex.sierra, willy, hch, Felix.Kuehling, apopple, david,
	linux-mm, linux-kernel

> Subject: [PATCH v2] mm: re-allow pinning of zero pfns (again)
> 
> The below referenced commit makes the same error as 1c563432588d ("mm:
> fix is_pinnable_page against a cma page"), re-interpreting the logic to
> exclude pinning of the zero page, which breaks device assignment with vfio.
> 
> To avoid further subtle mistakes, split the logic into discrete tests.
> 
> Suggested-by: Matthew Wilcox <willy@infradead.org>
> Suggested-by: Felix Kuehling <felix.kuehling@amd.com>
> Link:
> https://lore.kernel.org/all/165490039431.944052.12458624139225785964.stgit
> @omen
> Fixes: f25cbb7a95a2 ("mm: add zone device coherent type memory
> support")
> Signed-off-by: Alex Williamson <alex.williamson@redhat.com>
> ---

Tested-by: Slawomir Laba <slawomirx.laba@intel.com>

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [PATCH v2] mm: re-allow pinning of zero pfns (again)
  2022-08-10 16:53 [PATCH v2] mm: re-allow pinning of zero pfns (again) Alex Williamson
  2022-08-26 19:01 ` Laba, SlawomirX
@ 2022-08-28  0:59 ` John Hubbard
  2022-08-28  1:40   ` Andrew Morton
  2022-08-28 12:37   ` Alex Williamson
  1 sibling, 2 replies; 6+ messages in thread
From: John Hubbard @ 2022-08-28  0:59 UTC (permalink / raw)
  To: Alex Williamson, akpm
  Cc: alex.sierra, willy, hch, Felix.Kuehling, apopple, david,
	linux-mm, linux-kernel

On 8/10/22 09:53, Alex Williamson wrote:
> The below referenced commit makes the same error as 1c563432588d ("mm: fix
> is_pinnable_page against a cma page"), re-interpreting the logic to exclude
> pinning of the zero page, which breaks device assignment with vfio.
> 
> To avoid further subtle mistakes, split the logic into discrete tests.
> 
> Suggested-by: Matthew Wilcox <willy@infradead.org>
> Suggested-by: Felix Kuehling <felix.kuehling@amd.com>
> Link: https://lore.kernel.org/all/165490039431.944052.12458624139225785964.stgit@omen
> Fixes: f25cbb7a95a2 ("mm: add zone device coherent type memory support")
> Signed-off-by: Alex Williamson <alex.williamson@redhat.com>
> ---
>  include/linux/mm.h |   17 ++++++++++++++---
>  1 file changed, 14 insertions(+), 3 deletions(-)
Hi Alex,

Looks good. I'm suggesting a simpler comment, below, because
even though the VFIO folks are thinking about VFIO, here we
are deep in the mm layer and there are lots of non-VFIO callers
that may pin the zero page.

> 
> diff --git a/include/linux/mm.h b/include/linux/mm.h
> index 18e01474cf6b..835106a9718f 100644
> --- a/include/linux/mm.h
> +++ b/include/linux/mm.h
> @@ -1544,9 +1544,20 @@ static inline bool is_longterm_pinnable_page(struct page *page)
>  	if (mt == MIGRATE_CMA || mt == MIGRATE_ISOLATE)
>  		return false;
>  #endif
> -	return !(is_device_coherent_page(page) ||
> -		 is_zone_movable_page(page) ||
> -		 is_zero_pfn(page_to_pfn(page)));
> +	/*
> +	 * The zero page might reside in a movable zone, however it may not
> +	 * be migrated and can therefore be pinned.  The vfio subsystem pins
> +	 * user mappings including the zero page for IOMMU translation.
> +	 */

Those notes are all about (some of) the callers. But it's a simple
answer, really, so how about just this:

	/* The zero page is always allowed to be pinned. */

?

> +	if (is_zero_pfn(page_to_pfn(page)))
> +		return true;
> +
> +	/* Coherent device memory must always allow eviction. */
> +	if (is_device_coherent_page(page))
> +		return false;
> +
> +	/* Otherwise, non-movable zone pages can be pinned. */
> +	return !is_zone_movable_page(page);
>  }
>  #else
>  static inline bool is_longterm_pinnable_page(struct page *page)
> 
> 
> 

Reviewed-by: John Hubbard <jhubbard@nvidia.com>


thanks,

-- 
John Hubbard
NVIDIA

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [PATCH v2] mm: re-allow pinning of zero pfns (again)
  2022-08-28  0:59 ` John Hubbard
@ 2022-08-28  1:40   ` Andrew Morton
  2022-08-28 12:37   ` Alex Williamson
  1 sibling, 0 replies; 6+ messages in thread
From: Andrew Morton @ 2022-08-28  1:40 UTC (permalink / raw)
  To: John Hubbard
  Cc: Alex Williamson, alex.sierra, willy, hch, Felix.Kuehling,
	apopple, david, linux-mm, linux-kernel

On Sat, 27 Aug 2022 17:59:32 -0700 John Hubbard <jhubbard@nvidia.com> wrote:

> 
> 	/* The zero page is always allowed to be pinned. */

Wow, that's really verbose :)

--- a/include/linux/mm.h~mm-re-allow-pinning-of-zero-pfns-again-fix
+++ a/include/linux/mm.h
@@ -1544,11 +1544,7 @@ static inline bool is_longterm_pinnable_
 	if (mt == MIGRATE_CMA || mt == MIGRATE_ISOLATE)
 		return false;
 #endif
-	/*
-	 * The zero page might reside in a movable zone, however it may not
-	 * be migrated and can therefore be pinned.  The vfio subsystem pins
-	 * user mappings including the zero page for IOMMU translation.
-	 */
+	/* The zero page may always be pinned */
 	if (is_zero_pfn(page_to_pfn(page)))
 		return true;
 
_


^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [PATCH v2] mm: re-allow pinning of zero pfns (again)
  2022-08-28  0:59 ` John Hubbard
  2022-08-28  1:40   ` Andrew Morton
@ 2022-08-28 12:37   ` Alex Williamson
  2022-08-28 17:16     ` John Hubbard
  1 sibling, 1 reply; 6+ messages in thread
From: Alex Williamson @ 2022-08-28 12:37 UTC (permalink / raw)
  To: John Hubbard
  Cc: akpm, alex.sierra, willy, hch, Felix.Kuehling, apopple, david,
	linux-mm, linux-kernel

On Sat, 27 Aug 2022 17:59:32 -0700
John Hubbard <jhubbard@nvidia.com> wrote:

> On 8/10/22 09:53, Alex Williamson wrote:
> > The below referenced commit makes the same error as 1c563432588d ("mm: fix
> > is_pinnable_page against a cma page"), re-interpreting the logic to exclude
> > pinning of the zero page, which breaks device assignment with vfio.
> > 
> > To avoid further subtle mistakes, split the logic into discrete tests.
> > 
> > Suggested-by: Matthew Wilcox <willy@infradead.org>
> > Suggested-by: Felix Kuehling <felix.kuehling@amd.com>
> > Link: https://lore.kernel.org/all/165490039431.944052.12458624139225785964.stgit@omen
> > Fixes: f25cbb7a95a2 ("mm: add zone device coherent type memory support")
> > Signed-off-by: Alex Williamson <alex.williamson@redhat.com>
> > ---
> >  include/linux/mm.h |   17 ++++++++++++++---
> >  1 file changed, 14 insertions(+), 3 deletions(-)  
> Hi Alex,
> 
> Looks good. I'm suggesting a simpler comment, below, because
> even though the VFIO folks are thinking about VFIO, here we
> are deep in the mm layer and there are lots of non-VFIO callers
> that may pin the zero page.
> 
> > 
> > diff --git a/include/linux/mm.h b/include/linux/mm.h
> > index 18e01474cf6b..835106a9718f 100644
> > --- a/include/linux/mm.h
> > +++ b/include/linux/mm.h
> > @@ -1544,9 +1544,20 @@ static inline bool is_longterm_pinnable_page(struct page *page)
> >  	if (mt == MIGRATE_CMA || mt == MIGRATE_ISOLATE)
> >  		return false;
> >  #endif
> > -	return !(is_device_coherent_page(page) ||
> > -		 is_zone_movable_page(page) ||
> > -		 is_zero_pfn(page_to_pfn(page)));
> > +	/*
> > +	 * The zero page might reside in a movable zone, however it may not
> > +	 * be migrated and can therefore be pinned.  The vfio subsystem pins
> > +	 * user mappings including the zero page for IOMMU translation.
> > +	 */  
> 
> Those notes are all about (some of) the callers. But it's a simple
> answer, really, so how about just this:
> 
> 	/* The zero page is always allowed to be pinned. */

Sure.  Are we looking for a re-spin with this?  I see Andrew already
added this incremental change to his hotfix-unstable branch separately.
I'd hate for a comment re-spin to delay getting a fix for this problem,
that blocks any VM use cases of VFIO, into mainline any longer.  Thanks,

Alex


^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [PATCH v2] mm: re-allow pinning of zero pfns (again)
  2022-08-28 12:37   ` Alex Williamson
@ 2022-08-28 17:16     ` John Hubbard
  0 siblings, 0 replies; 6+ messages in thread
From: John Hubbard @ 2022-08-28 17:16 UTC (permalink / raw)
  To: Alex Williamson
  Cc: akpm, alex.sierra, willy, hch, Felix.Kuehling, apopple, david,
	linux-mm, linux-kernel

On 8/28/22 05:37, Alex Williamson wrote:
>> Those notes are all about (some of) the callers. But it's a simple
>> answer, really, so how about just this:
>>
>> 	/* The zero page is always allowed to be pinned. */
> 
> Sure.  Are we looking for a re-spin with this?  I see Andrew already
> added this incremental change to his hotfix-unstable branch separately.
> I'd hate for a comment re-spin to delay getting a fix for this problem,
> that blocks any VM use cases of VFIO, into mainline any longer.  Thanks,
> 

Definitely not. Andrews fixup should suffice.


thanks,

-- 
John Hubbard
NVIDIA

^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2022-08-28 17:17 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2022-08-10 16:53 [PATCH v2] mm: re-allow pinning of zero pfns (again) Alex Williamson
2022-08-26 19:01 ` Laba, SlawomirX
2022-08-28  0:59 ` John Hubbard
2022-08-28  1:40   ` Andrew Morton
2022-08-28 12:37   ` Alex Williamson
2022-08-28 17:16     ` John Hubbard

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).