linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* [PATCH v1] hugetlb: pass head page to remove_hugetlb_page()
@ 2021-05-26 23:52 Naoya Horiguchi
  2021-05-27  0:31 ` Matthew Wilcox
                   ` (5 more replies)
  0 siblings, 6 replies; 10+ messages in thread
From: Naoya Horiguchi @ 2021-05-26 23:52 UTC (permalink / raw)
  To: linux-mm, Mike Kravetz
  Cc: Michal Hocko, Oscar Salvador, Muchun Song, Andrew Morton,
	David Hildenbrand, Naoya Horiguchi, linux-kernel

From: Naoya Horiguchi <naoya.horiguchi@nec.com>

When memory_failure() or soft_offline_page() is called on a tail page of
some hugetlb page, "BUG: unable to handle page fault" error can be
triggered.

remove_hugetlb_page() dereferences page->lru, so it's assumed that the
page points to a head page, but one of the caller,
dissolve_free_huge_page(), provides remove_hugetlb_page() with 'page'
which could be a tail page.  So pass 'head' to it, instead.

Fixes: 6eb4e88a6d27 ("hugetlb: create remove_hugetlb_page() to separate functionality")
Signed-off-by: Naoya Horiguchi <naoya.horiguchi@nec.com>
---
 mm/hugetlb.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git v5.13-rc3/mm/hugetlb.c v5.13-rc3_patched/mm/hugetlb.c
index 95918f410c0f..470f7b5b437e 100644
--- v5.13-rc3/mm/hugetlb.c
+++ v5.13-rc3_patched/mm/hugetlb.c
@@ -1793,7 +1793,7 @@ int dissolve_free_huge_page(struct page *page)
 			SetPageHWPoison(page);
 			ClearPageHWPoison(head);
 		}
-		remove_hugetlb_page(h, page, false);
+		remove_hugetlb_page(h, head, false);
 		h->max_huge_pages--;
 		spin_unlock_irq(&hugetlb_lock);
 		update_and_free_page(h, head);
-- 
2.25.1


^ permalink raw reply related	[flat|nested] 10+ messages in thread

* Re: [PATCH v1] hugetlb: pass head page to remove_hugetlb_page()
  2021-05-26 23:52 [PATCH v1] hugetlb: pass head page to remove_hugetlb_page() Naoya Horiguchi
@ 2021-05-27  0:31 ` Matthew Wilcox
  2021-05-27  0:55 ` Mike Kravetz
                   ` (4 subsequent siblings)
  5 siblings, 0 replies; 10+ messages in thread
From: Matthew Wilcox @ 2021-05-27  0:31 UTC (permalink / raw)
  To: Naoya Horiguchi
  Cc: linux-mm, Mike Kravetz, Michal Hocko, Oscar Salvador,
	Muchun Song, Andrew Morton, David Hildenbrand, Naoya Horiguchi,
	linux-kernel

On Thu, May 27, 2021 at 08:52:57AM +0900, Naoya Horiguchi wrote:
> From: Naoya Horiguchi <naoya.horiguchi@nec.com>
> 
> When memory_failure() or soft_offline_page() is called on a tail page of
> some hugetlb page, "BUG: unable to handle page fault" error can be
> triggered.
> 
> remove_hugetlb_page() dereferences page->lru, so it's assumed that the
> page points to a head page, but one of the caller,
> dissolve_free_huge_page(), provides remove_hugetlb_page() with 'page'
> which could be a tail page.  So pass 'head' to it, instead.

I'd like to point out that with folios, this is a compile-time error,
not a run-time error.

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [PATCH v1] hugetlb: pass head page to remove_hugetlb_page()
  2021-05-26 23:52 [PATCH v1] hugetlb: pass head page to remove_hugetlb_page() Naoya Horiguchi
  2021-05-27  0:31 ` Matthew Wilcox
@ 2021-05-27  0:55 ` Mike Kravetz
  2021-05-27  2:43 ` [External] " Muchun Song
                   ` (3 subsequent siblings)
  5 siblings, 0 replies; 10+ messages in thread
From: Mike Kravetz @ 2021-05-27  0:55 UTC (permalink / raw)
  To: Naoya Horiguchi, linux-mm
  Cc: Michal Hocko, Oscar Salvador, Muchun Song, Andrew Morton,
	David Hildenbrand, Naoya Horiguchi, linux-kernel

On 5/26/21 4:52 PM, Naoya Horiguchi wrote:
> From: Naoya Horiguchi <naoya.horiguchi@nec.com>
> 
> When memory_failure() or soft_offline_page() is called on a tail page of
> some hugetlb page, "BUG: unable to handle page fault" error can be
> triggered.
> 
> remove_hugetlb_page() dereferences page->lru, so it's assumed that the
> page points to a head page, but one of the caller,
> dissolve_free_huge_page(), provides remove_hugetlb_page() with 'page'
> which could be a tail page.  So pass 'head' to it, instead.
> 
> Fixes: 6eb4e88a6d27 ("hugetlb: create remove_hugetlb_page() to separate functionality")
> Signed-off-by: Naoya Horiguchi <naoya.horiguchi@nec.com>
> ---
>  mm/hugetlb.c | 2 +-
>  1 file changed, 1 insertion(+), 1 deletion(-)

Thanks Naoya!

Reviewed-by: Mike Kravetz <mike.kravetz@oracle.com>

-- 
Mike Kravetz

> 
> diff --git v5.13-rc3/mm/hugetlb.c v5.13-rc3_patched/mm/hugetlb.c
> index 95918f410c0f..470f7b5b437e 100644
> --- v5.13-rc3/mm/hugetlb.c
> +++ v5.13-rc3_patched/mm/hugetlb.c
> @@ -1793,7 +1793,7 @@ int dissolve_free_huge_page(struct page *page)
>  			SetPageHWPoison(page);
>  			ClearPageHWPoison(head);
>  		}
> -		remove_hugetlb_page(h, page, false);
> +		remove_hugetlb_page(h, head, false);
>  		h->max_huge_pages--;
>  		spin_unlock_irq(&hugetlb_lock);
>  		update_and_free_page(h, head);
> 

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [External] [PATCH v1] hugetlb: pass head page to remove_hugetlb_page()
  2021-05-26 23:52 [PATCH v1] hugetlb: pass head page to remove_hugetlb_page() Naoya Horiguchi
  2021-05-27  0:31 ` Matthew Wilcox
  2021-05-27  0:55 ` Mike Kravetz
@ 2021-05-27  2:43 ` Muchun Song
  2021-05-27  6:58 ` Michal Hocko
                   ` (2 subsequent siblings)
  5 siblings, 0 replies; 10+ messages in thread
From: Muchun Song @ 2021-05-27  2:43 UTC (permalink / raw)
  To: Naoya Horiguchi
  Cc: Linux Memory Management List, Mike Kravetz, Michal Hocko,
	Oscar Salvador, Andrew Morton, David Hildenbrand,
	Naoya Horiguchi, LKML

On Thu, May 27, 2021 at 7:53 AM Naoya Horiguchi <nao.horiguchi@gmail.com> wrote:
>
> From: Naoya Horiguchi <naoya.horiguchi@nec.com>
>
> When memory_failure() or soft_offline_page() is called on a tail page of
> some hugetlb page, "BUG: unable to handle page fault" error can be
> triggered.
>
> remove_hugetlb_page() dereferences page->lru, so it's assumed that the
> page points to a head page, but one of the caller,
> dissolve_free_huge_page(), provides remove_hugetlb_page() with 'page'
> which could be a tail page.  So pass 'head' to it, instead.
>
> Fixes: 6eb4e88a6d27 ("hugetlb: create remove_hugetlb_page() to separate functionality")
> Signed-off-by: Naoya Horiguchi <naoya.horiguchi@nec.com>

Thanks Naoya!

Reviewed-by: Muchun Song <songmuchun@bytedance.com>

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [PATCH v1] hugetlb: pass head page to remove_hugetlb_page()
  2021-05-26 23:52 [PATCH v1] hugetlb: pass head page to remove_hugetlb_page() Naoya Horiguchi
                   ` (2 preceding siblings ...)
  2021-05-27  2:43 ` [External] " Muchun Song
@ 2021-05-27  6:58 ` Michal Hocko
  2021-05-27  7:47 ` Oscar Salvador
  2021-05-27 16:28 ` Mike Kravetz
  5 siblings, 0 replies; 10+ messages in thread
From: Michal Hocko @ 2021-05-27  6:58 UTC (permalink / raw)
  To: Naoya Horiguchi
  Cc: linux-mm, Mike Kravetz, Oscar Salvador, Muchun Song,
	Andrew Morton, David Hildenbrand, Naoya Horiguchi, linux-kernel

On Thu 27-05-21 08:52:57, Naoya Horiguchi wrote:
> From: Naoya Horiguchi <naoya.horiguchi@nec.com>
> 
> When memory_failure() or soft_offline_page() is called on a tail page of
> some hugetlb page, "BUG: unable to handle page fault" error can be
> triggered.
> 
> remove_hugetlb_page() dereferences page->lru, so it's assumed that the
> page points to a head page, but one of the caller,
> dissolve_free_huge_page(), provides remove_hugetlb_page() with 'page'
> which could be a tail page.  So pass 'head' to it, instead.
> 
> Fixes: 6eb4e88a6d27 ("hugetlb: create remove_hugetlb_page() to separate functionality")
> Signed-off-by: Naoya Horiguchi <naoya.horiguchi@nec.com>

This is really nasty and easy to overlook. I have completely missed that
when reviewing and I do remember checking for head vs page as there is
quite some non trivial handling of both here.

Acked-by: Michal Hocko <mhocko@suse.com>

Thanks!

> ---
>  mm/hugetlb.c | 2 +-
>  1 file changed, 1 insertion(+), 1 deletion(-)
> 
> diff --git v5.13-rc3/mm/hugetlb.c v5.13-rc3_patched/mm/hugetlb.c
> index 95918f410c0f..470f7b5b437e 100644
> --- v5.13-rc3/mm/hugetlb.c
> +++ v5.13-rc3_patched/mm/hugetlb.c
> @@ -1793,7 +1793,7 @@ int dissolve_free_huge_page(struct page *page)
>  			SetPageHWPoison(page);
>  			ClearPageHWPoison(head);
>  		}
> -		remove_hugetlb_page(h, page, false);
> +		remove_hugetlb_page(h, head, false);
>  		h->max_huge_pages--;
>  		spin_unlock_irq(&hugetlb_lock);
>  		update_and_free_page(h, head);
> -- 
> 2.25.1

-- 
Michal Hocko
SUSE Labs

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [PATCH v1] hugetlb: pass head page to remove_hugetlb_page()
  2021-05-26 23:52 [PATCH v1] hugetlb: pass head page to remove_hugetlb_page() Naoya Horiguchi
                   ` (3 preceding siblings ...)
  2021-05-27  6:58 ` Michal Hocko
@ 2021-05-27  7:47 ` Oscar Salvador
  2021-05-27  8:22   ` Michal Hocko
  2021-05-27 16:28 ` Mike Kravetz
  5 siblings, 1 reply; 10+ messages in thread
From: Oscar Salvador @ 2021-05-27  7:47 UTC (permalink / raw)
  To: Naoya Horiguchi
  Cc: linux-mm, Mike Kravetz, Michal Hocko, Muchun Song, Andrew Morton,
	David Hildenbrand, Naoya Horiguchi, linux-kernel

On Thu, May 27, 2021 at 08:52:57AM +0900, Naoya Horiguchi wrote:
> From: Naoya Horiguchi <naoya.horiguchi@nec.com>
> 
> When memory_failure() or soft_offline_page() is called on a tail page of
> some hugetlb page, "BUG: unable to handle page fault" error can be
> triggered.
> 
> remove_hugetlb_page() dereferences page->lru, so it's assumed that the
> page points to a head page, but one of the caller,
> dissolve_free_huge_page(), provides remove_hugetlb_page() with 'page'
> which could be a tail page.  So pass 'head' to it, instead.
> 
> Fixes: 6eb4e88a6d27 ("hugetlb: create remove_hugetlb_page() to separate functionality")
> Signed-off-by: Naoya Horiguchi <naoya.horiguchi@nec.com>

Reviewed-by: Oscar Salvador <osalvador@suse.de>

It is probably worth adding a comment in remove_hugetlb_page() noting
that we need a head page, so future users do not repeat the same
mistake.

Thanks 

-- 
Oscar Salvador
SUSE L3

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [PATCH v1] hugetlb: pass head page to remove_hugetlb_page()
  2021-05-27  7:47 ` Oscar Salvador
@ 2021-05-27  8:22   ` Michal Hocko
  0 siblings, 0 replies; 10+ messages in thread
From: Michal Hocko @ 2021-05-27  8:22 UTC (permalink / raw)
  To: Oscar Salvador
  Cc: Naoya Horiguchi, linux-mm, Mike Kravetz, Muchun Song,
	Andrew Morton, David Hildenbrand, Naoya Horiguchi, linux-kernel

On Thu 27-05-21 09:47:44, Oscar Salvador wrote:
> On Thu, May 27, 2021 at 08:52:57AM +0900, Naoya Horiguchi wrote:
> > From: Naoya Horiguchi <naoya.horiguchi@nec.com>
> > 
> > When memory_failure() or soft_offline_page() is called on a tail page of
> > some hugetlb page, "BUG: unable to handle page fault" error can be
> > triggered.
> > 
> > remove_hugetlb_page() dereferences page->lru, so it's assumed that the
> > page points to a head page, but one of the caller,
> > dissolve_free_huge_page(), provides remove_hugetlb_page() with 'page'
> > which could be a tail page.  So pass 'head' to it, instead.
> > 
> > Fixes: 6eb4e88a6d27 ("hugetlb: create remove_hugetlb_page() to separate functionality")
> > Signed-off-by: Naoya Horiguchi <naoya.horiguchi@nec.com>
> 
> Reviewed-by: Oscar Salvador <osalvador@suse.de>
> 
> It is probably worth adding a comment in remove_hugetlb_page() noting
> that we need a head page, so future users do not repeat the same
> mistake.

Ideally this will turn into page folio concept and no comments are
really needed.
-- 
Michal Hocko
SUSE Labs

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [PATCH v1] hugetlb: pass head page to remove_hugetlb_page()
  2021-05-26 23:52 [PATCH v1] hugetlb: pass head page to remove_hugetlb_page() Naoya Horiguchi
                   ` (4 preceding siblings ...)
  2021-05-27  7:47 ` Oscar Salvador
@ 2021-05-27 16:28 ` Mike Kravetz
  2021-05-27 19:54   ` Michal Hocko
  5 siblings, 1 reply; 10+ messages in thread
From: Mike Kravetz @ 2021-05-27 16:28 UTC (permalink / raw)
  To: Naoya Horiguchi, linux-mm
  Cc: Michal Hocko, Oscar Salvador, Muchun Song, Andrew Morton,
	David Hildenbrand, Naoya Horiguchi, linux-kernel

On 5/26/21 4:52 PM, Naoya Horiguchi wrote:
> From: Naoya Horiguchi <naoya.horiguchi@nec.com>
> 
> When memory_failure() or soft_offline_page() is called on a tail page of
> some hugetlb page, "BUG: unable to handle page fault" error can be
> triggered.
> 
> remove_hugetlb_page() dereferences page->lru, so it's assumed that the
> page points to a head page, but one of the caller,
> dissolve_free_huge_page(), provides remove_hugetlb_page() with 'page'
> which could be a tail page.  So pass 'head' to it, instead.
> 
> Fixes: 6eb4e88a6d27 ("hugetlb: create remove_hugetlb_page() to separate functionality")
> Signed-off-by: Naoya Horiguchi <naoya.horiguchi@nec.com>
> ---
>  mm/hugetlb.c | 2 +-
>  1 file changed, 1 insertion(+), 1 deletion(-)
> 
> diff --git v5.13-rc3/mm/hugetlb.c v5.13-rc3_patched/mm/hugetlb.c
> index 95918f410c0f..470f7b5b437e 100644
> --- v5.13-rc3/mm/hugetlb.c
> +++ v5.13-rc3_patched/mm/hugetlb.c
> @@ -1793,7 +1793,7 @@ int dissolve_free_huge_page(struct page *page)
>  			SetPageHWPoison(page);
>  			ClearPageHWPoison(head);
>  		}
> -		remove_hugetlb_page(h, page, false);
> +		remove_hugetlb_page(h, head, false);
>  		h->max_huge_pages--;
>  		spin_unlock_irq(&hugetlb_lock);
>  		update_and_free_page(h, head);
> 

I believe we have the same problem later in the routine when calling
add_hugetlb_page()?

If so, should we combine the changes?  Or, do we need two patches as
the bugs were introduced with different commits?
-- 
Mike Kravetz

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [PATCH v1] hugetlb: pass head page to remove_hugetlb_page()
  2021-05-27 16:28 ` Mike Kravetz
@ 2021-05-27 19:54   ` Michal Hocko
  2021-05-27 21:12     ` Mike Kravetz
  0 siblings, 1 reply; 10+ messages in thread
From: Michal Hocko @ 2021-05-27 19:54 UTC (permalink / raw)
  To: Mike Kravetz
  Cc: Naoya Horiguchi, linux-mm, Oscar Salvador, Muchun Song,
	Andrew Morton, David Hildenbrand, Naoya Horiguchi, linux-kernel

On Thu 27-05-21 09:28:51, Mike Kravetz wrote:
> On 5/26/21 4:52 PM, Naoya Horiguchi wrote:
> > From: Naoya Horiguchi <naoya.horiguchi@nec.com>
> > 
> > When memory_failure() or soft_offline_page() is called on a tail page of
> > some hugetlb page, "BUG: unable to handle page fault" error can be
> > triggered.
> > 
> > remove_hugetlb_page() dereferences page->lru, so it's assumed that the
> > page points to a head page, but one of the caller,
> > dissolve_free_huge_page(), provides remove_hugetlb_page() with 'page'
> > which could be a tail page.  So pass 'head' to it, instead.
> > 
> > Fixes: 6eb4e88a6d27 ("hugetlb: create remove_hugetlb_page() to separate functionality")
> > Signed-off-by: Naoya Horiguchi <naoya.horiguchi@nec.com>
> > ---
> >  mm/hugetlb.c | 2 +-
> >  1 file changed, 1 insertion(+), 1 deletion(-)
> > 
> > diff --git v5.13-rc3/mm/hugetlb.c v5.13-rc3_patched/mm/hugetlb.c
> > index 95918f410c0f..470f7b5b437e 100644
> > --- v5.13-rc3/mm/hugetlb.c
> > +++ v5.13-rc3_patched/mm/hugetlb.c
> > @@ -1793,7 +1793,7 @@ int dissolve_free_huge_page(struct page *page)
> >  			SetPageHWPoison(page);
> >  			ClearPageHWPoison(head);
> >  		}
> > -		remove_hugetlb_page(h, page, false);
> > +		remove_hugetlb_page(h, head, false);
> >  		h->max_huge_pages--;
> >  		spin_unlock_irq(&hugetlb_lock);
> >  		update_and_free_page(h, head);
> > 
> 
> I believe we have the same problem later in the routine when calling
> add_hugetlb_page()?

Can we ever get a tail page there?

> If so, should we combine the changes?  Or, do we need two patches as
> the bugs were introduced with different commits?

If there is an issue then I would go with a separate patch.

Thanks!
-- 
Michal Hocko
SUSE Labs

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [PATCH v1] hugetlb: pass head page to remove_hugetlb_page()
  2021-05-27 19:54   ` Michal Hocko
@ 2021-05-27 21:12     ` Mike Kravetz
  0 siblings, 0 replies; 10+ messages in thread
From: Mike Kravetz @ 2021-05-27 21:12 UTC (permalink / raw)
  To: Michal Hocko
  Cc: Naoya Horiguchi, linux-mm, Oscar Salvador, Muchun Song,
	Andrew Morton, David Hildenbrand, Naoya Horiguchi, linux-kernel

On 5/27/21 12:54 PM, Michal Hocko wrote:
> On Thu 27-05-21 09:28:51, Mike Kravetz wrote:
>> On 5/26/21 4:52 PM, Naoya Horiguchi wrote:
>>> From: Naoya Horiguchi <naoya.horiguchi@nec.com>
>>>
>>> remove_hugetlb_page() dereferences page->lru, so it's assumed that the
>>> page points to a head page, but one of the caller,
>>> dissolve_free_huge_page(), provides remove_hugetlb_page() with 'page'
>>> which could be a tail page.  So pass 'head' to it, instead.
>>>
>>
>> I believe we have the same problem later in the routine when calling
>> add_hugetlb_page()?
> 
> Can we ever get a tail page there?
> 

Yes.  Actually alloc_huge_page_vmemmap() and add_hugetlb_page() calls
later in the same block of code expect head page but we pass 'page' which
could be a tail page.

>> If so, should we combine the changes?  Or, do we need two patches as
>> the bugs were introduced with different commits?
> 
> If there is an issue then I would go with a separate patch.
> 

The calls to alloc_huge_page_vmemmap() and add_hugetlb_page() are only
in Andrew's tree (and next).  So, they can be fixed there.

I'll send a patch shortly.
-- 
Mike Kravetz

^ permalink raw reply	[flat|nested] 10+ messages in thread

end of thread, other threads:[~2021-05-27 21:12 UTC | newest]

Thread overview: 10+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2021-05-26 23:52 [PATCH v1] hugetlb: pass head page to remove_hugetlb_page() Naoya Horiguchi
2021-05-27  0:31 ` Matthew Wilcox
2021-05-27  0:55 ` Mike Kravetz
2021-05-27  2:43 ` [External] " Muchun Song
2021-05-27  6:58 ` Michal Hocko
2021-05-27  7:47 ` Oscar Salvador
2021-05-27  8:22   ` Michal Hocko
2021-05-27 16:28 ` Mike Kravetz
2021-05-27 19:54   ` Michal Hocko
2021-05-27 21:12     ` Mike Kravetz

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).