From: Punit Agrawal <punit.agrawal@arm.com>
To: "Kirill A. Shutemov" <kirill@shutemov.name>
Cc: akpm@linux-foundation.org, linux-mm@kvack.org,
	linux-kernel@vger.kernel.org,
	linux-arm-kernel@lists.infradead.org, catalin.marinas@arm.com,
	will.deacon@arm.com, n-horiguchi@ah.jp.nec.com,
	kirill.shutemov@linux.intel.com, mike.kravetz@oracle.com,
	steve.capper@arm.com, mark.rutland@arm.com,
	hillf.zj@alibaba-inc.com, linux-arch@vger.kernel.org,
	aneesh.kumar@linux.vnet.ibm.com, Michal Hocko <mhocko@suse.com>
Subject: Re: [PATCH v3 2/6] mm, gup: Ensure real head page is ref-counted when using hugepages
Date: Tue, 23 May 2017 16:43:50 +0100
Message-ID: <874lwbh93d.fsf@e105922-lin.cambridge.arm.com>
In-Reply-To: <20170523131312.aim6obne2t5sxtdr@node.shutemov.name> (Kirill A. Shutemov's message of "Tue, 23 May 2017 16:13:12 +0300")

"Kirill A. Shutemov" <kirill@shutemov.name> writes:

> On Mon, May 22, 2017 at 02:36:00PM +0100, Punit Agrawal wrote:
>> When speculatively taking references to a hugepage using
>> page_cache_add_speculative() in gup_huge_pmd(), it is assumed that the
>> page returned by pmd_page() is the head page. Although normally true,
>> this assumption doesn't hold when the hugepage comprises of successive
>> page table entries such as when using contiguous bit on arm64 at PTE or
>> PMD levels.
>> 
>> This can be addressed by ensuring that the page passed to
>> page_cache_add_speculative() is the real head or by de-referencing the
>> head page within the function.
>> 
>> We take the first approach to keep the usage pattern aligned with
>> page_cache_get_speculative() where users already pass the appropriate
>> page, i.e., the de-referenced head.
>> 
>> Apply the same logic to fix gup_huge_[pud|pgd]() as well.
>
> Hm. Okay. But I'm kinda surprise that this is the only place that need to
> be adjusted.
>
> Have you validated all other pmd_page() use-cases?

I came across the gup issues while investigating a failing test from
mce-tests.

I think the problem here is not due to the use of pmd_page() but because
page_cache_[add|get]_speculative() don't ensure they ref-count the head
page as is done in get_page().

Having said that, I had a quick look at the other uses of pmd_page():

Quite a few of them are followed by an explicit BUG_ON() to check that
the page returned is a head page. All other instances seem to be dealing
with transparent hugepages where contiguous hugepages are not supported.

I don't see any call sites that ring alarm bells.

Did you have any particular part of the code in mind where pmd_page()
usage might be a problem?

Thanks,
Punit
