From: Steve Capper <steve.capper@linaro.org>
To: linux-arm-kernel@lists.infradead.org, catalin.marinas@arm.com,
linux@arm.linux.org.uk, linux-mm@kvack.org,
linux-arch@vger.kernel.org
Cc: peterz@infradead.org, gary.robertson@linaro.org,
anders.roxell@linaro.org, akpm@linux-foundation.org,
Steve Capper <steve.capper@linaro.org>
Subject: [RFC PATCH V4 0/7] get_user_pages_fast for ARM and ARM64
Date: Fri, 28 Mar 2014 15:01:25 +0000 [thread overview]
Message-ID: <1396018892-6773-1-git-send-email-steve.capper@linaro.org> (raw)
Hello,
This RFC series implements get_user_pages_fast and __get_user_pages_fast.
These are required for Transparent HugePages to function correctly, as
a futex on a THP tail will otherwise result in an infinite loop (due to
the core implementation of __get_user_pages_fast always returning 0).
This series may also be beneficial for direct-IO heavy workloads and
certain KVM workloads.
The main changes since RFC V3 are:
* fast_gup now generalised and moved to core code.
* pte_special logic now extended to reduce unnecessary icache syncs.
* dropped the pte_accessible logic in fast_gup as it is unnecessary.
I would really appreciate any comments (especially on the validity or
otherwise of the core fast_gup implementation) and/or testers.
Cheers,
--
Steve
Catalin Marinas (1):
arm64: Convert asm/tlb.h to generic mmu_gather
Steve Capper (6):
mm: Introduce a general RCU get_user_pages_fast.
arm: mm: Introduce special ptes for LPAE
arm: mm: Enable HAVE_RCU_TABLE_FREE logic
arm: mm: Enable RCU fast_gup
arm64: mm: Enable HAVE_RCU_TABLE_FREE logic
arm64: mm: Enable RCU fast_gup
arch/arm/Kconfig | 4 +
arch/arm/include/asm/pgtable-2level.h | 2 +
arch/arm/include/asm/pgtable-3level.h | 14 ++
arch/arm/include/asm/pgtable.h | 6 +-
arch/arm/include/asm/tlb.h | 38 ++++-
arch/arm/mm/flush.c | 19 +++
arch/arm64/Kconfig | 4 +
arch/arm64/include/asm/pgtable.h | 4 +
arch/arm64/include/asm/tlb.h | 140 +++-------------
arch/arm64/mm/flush.c | 19 +++
mm/Kconfig | 3 +
mm/Makefile | 1 +
mm/gup.c | 297 ++++++++++++++++++++++++++++++++++
13 files changed, 431 insertions(+), 120 deletions(-)
create mode 100644 mm/gup.c
--
1.8.1.4
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
WARNING: multiple messages have this Message-ID (diff)
From: Steve Capper <steve.capper@linaro.org>
To: linux-arm-kernel@lists.infradead.org, catalin.marinas@arm.com,
linux@arm.linux.org.uk, linux-mm@kvack.org,
linux-arch@vger.kernel.org
Cc: peterz@infradead.org, gary.robertson@linaro.org,
anders.roxell@linaro.org, akpm@linux-foundation.org,
Steve Capper <steve.capper@linaro.org>
Subject: [RFC PATCH V4 0/7] get_user_pages_fast for ARM and ARM64
Date: Fri, 28 Mar 2014 15:01:25 +0000 [thread overview]
Message-ID: <1396018892-6773-1-git-send-email-steve.capper@linaro.org> (raw)
Message-ID: <20140328150125.XsXV22mTc0lm3AQ4DKGcc1kbJBhrOMBh3tyMuFQf8eU@z> (raw)
Hello,
This RFC series implements get_user_pages_fast and __get_user_pages_fast.
These are required for Transparent HugePages to function correctly, as
a futex on a THP tail will otherwise result in an infinite loop (due to
the core implementation of __get_user_pages_fast always returning 0).
This series may also be beneficial for direct-IO heavy workloads and
certain KVM workloads.
The main changes since RFC V3 are:
* fast_gup now generalised and moved to core code.
* pte_special logic now extended to reduce unnecessary icache syncs.
* dropped the pte_accessible logic in fast_gup as it is unnecessary.
I would really appreciate any comments (especially on the validity or
otherwise of the core fast_gup implementation) and/or testers.
Cheers,
--
Steve
Catalin Marinas (1):
arm64: Convert asm/tlb.h to generic mmu_gather
Steve Capper (6):
mm: Introduce a general RCU get_user_pages_fast.
arm: mm: Introduce special ptes for LPAE
arm: mm: Enable HAVE_RCU_TABLE_FREE logic
arm: mm: Enable RCU fast_gup
arm64: mm: Enable HAVE_RCU_TABLE_FREE logic
arm64: mm: Enable RCU fast_gup
arch/arm/Kconfig | 4 +
arch/arm/include/asm/pgtable-2level.h | 2 +
arch/arm/include/asm/pgtable-3level.h | 14 ++
arch/arm/include/asm/pgtable.h | 6 +-
arch/arm/include/asm/tlb.h | 38 ++++-
arch/arm/mm/flush.c | 19 +++
arch/arm64/Kconfig | 4 +
arch/arm64/include/asm/pgtable.h | 4 +
arch/arm64/include/asm/tlb.h | 140 +++-------------
arch/arm64/mm/flush.c | 19 +++
mm/Kconfig | 3 +
mm/Makefile | 1 +
mm/gup.c | 297 ++++++++++++++++++++++++++++++++++
13 files changed, 431 insertions(+), 120 deletions(-)
create mode 100644 mm/gup.c
--
1.8.1.4
WARNING: multiple messages have this Message-ID (diff)
From: steve.capper@linaro.org (Steve Capper)
To: linux-arm-kernel@lists.infradead.org
Subject: [RFC PATCH V4 0/7] get_user_pages_fast for ARM and ARM64
Date: Fri, 28 Mar 2014 15:01:25 +0000 [thread overview]
Message-ID: <1396018892-6773-1-git-send-email-steve.capper@linaro.org> (raw)
Hello,
This RFC series implements get_user_pages_fast and __get_user_pages_fast.
These are required for Transparent HugePages to function correctly, as
a futex on a THP tail will otherwise result in an infinite loop (due to
the core implementation of __get_user_pages_fast always returning 0).
This series may also be beneficial for direct-IO heavy workloads and
certain KVM workloads.
The main changes since RFC V3 are:
* fast_gup now generalised and moved to core code.
* pte_special logic now extended to reduce unnecessary icache syncs.
* dropped the pte_accessible logic in fast_gup as it is unnecessary.
I would really appreciate any comments (especially on the validity or
otherwise of the core fast_gup implementation) and/or testers.
Cheers,
--
Steve
Catalin Marinas (1):
arm64: Convert asm/tlb.h to generic mmu_gather
Steve Capper (6):
mm: Introduce a general RCU get_user_pages_fast.
arm: mm: Introduce special ptes for LPAE
arm: mm: Enable HAVE_RCU_TABLE_FREE logic
arm: mm: Enable RCU fast_gup
arm64: mm: Enable HAVE_RCU_TABLE_FREE logic
arm64: mm: Enable RCU fast_gup
arch/arm/Kconfig | 4 +
arch/arm/include/asm/pgtable-2level.h | 2 +
arch/arm/include/asm/pgtable-3level.h | 14 ++
arch/arm/include/asm/pgtable.h | 6 +-
arch/arm/include/asm/tlb.h | 38 ++++-
arch/arm/mm/flush.c | 19 +++
arch/arm64/Kconfig | 4 +
arch/arm64/include/asm/pgtable.h | 4 +
arch/arm64/include/asm/tlb.h | 140 +++-------------
arch/arm64/mm/flush.c | 19 +++
mm/Kconfig | 3 +
mm/Makefile | 1 +
mm/gup.c | 297 ++++++++++++++++++++++++++++++++++
13 files changed, 431 insertions(+), 120 deletions(-)
create mode 100644 mm/gup.c
--
1.8.1.4
next reply other threads:[~2014-03-28 15:01 UTC|newest]
Thread overview: 57+ messages / expand[flat|nested] mbox.gz Atom feed top
2014-03-28 15:01 Steve Capper [this message]
2014-03-28 15:01 ` [RFC PATCH V4 0/7] get_user_pages_fast for ARM and ARM64 Steve Capper
2014-03-28 15:01 ` Steve Capper
2014-03-28 15:01 ` [RFC PATCH V4 1/7] mm: Introduce a general RCU get_user_pages_fast Steve Capper
2014-03-28 15:01 ` Steve Capper
2014-03-28 15:01 ` Steve Capper
2014-03-28 15:01 ` [RFC PATCH V4 2/7] arm: mm: Introduce special ptes for LPAE Steve Capper
2014-03-28 15:01 ` Steve Capper
2014-03-28 15:01 ` Steve Capper
2014-03-28 15:01 ` [RFC PATCH V4 3/7] arm: mm: Enable HAVE_RCU_TABLE_FREE logic Steve Capper
2014-03-28 15:01 ` Steve Capper
2014-03-28 15:01 ` Steve Capper
2014-05-01 11:11 ` Catalin Marinas
2014-05-01 11:11 ` Catalin Marinas
2014-05-01 11:11 ` Catalin Marinas
2014-05-01 11:44 ` Steve Capper
2014-05-01 11:44 ` Steve Capper
2014-05-01 11:44 ` Steve Capper
2014-03-28 15:01 ` [RFC PATCH V4 4/7] arm: mm: Enable RCU fast_gup Steve Capper
2014-03-28 15:01 ` Steve Capper
2014-03-28 15:01 ` Steve Capper
2014-03-28 15:01 ` [RFC PATCH V4 5/7] arm64: Convert asm/tlb.h to generic mmu_gather Steve Capper
2014-03-28 15:01 ` Steve Capper
2014-03-28 15:01 ` Steve Capper
2014-03-28 15:01 ` [RFC PATCH V4 6/7] arm64: mm: Enable HAVE_RCU_TABLE_FREE logic Steve Capper
2014-03-28 15:01 ` Steve Capper
2014-03-28 15:01 ` Steve Capper
2014-04-30 15:20 ` Catalin Marinas
2014-04-30 15:20 ` Catalin Marinas
2014-04-30 15:20 ` Catalin Marinas
2014-04-30 15:33 ` Catalin Marinas
2014-04-30 15:33 ` Catalin Marinas
2014-04-30 15:33 ` Catalin Marinas
2014-04-30 15:38 ` Steve Capper
2014-04-30 15:38 ` Steve Capper
2014-04-30 15:38 ` Steve Capper
2014-04-30 17:21 ` Catalin Marinas
2014-04-30 17:21 ` Catalin Marinas
2014-04-30 17:21 ` Catalin Marinas
2014-05-01 7:34 ` Steve Capper
2014-05-01 7:34 ` Steve Capper
2014-05-01 7:34 ` Steve Capper
2014-05-01 9:52 ` Catalin Marinas
2014-05-01 9:52 ` Catalin Marinas
2014-05-01 9:52 ` Catalin Marinas
2014-05-01 9:57 ` Peter Zijlstra
2014-05-01 9:57 ` Peter Zijlstra
2014-05-01 9:57 ` Peter Zijlstra
2014-05-01 10:04 ` Catalin Marinas
2014-05-01 10:04 ` Catalin Marinas
2014-05-01 10:04 ` Catalin Marinas
2014-05-01 10:15 ` Steve Capper
2014-05-01 10:15 ` Steve Capper
2014-05-01 10:15 ` Steve Capper
2014-03-28 15:01 ` [RFC PATCH V4 7/7] arm64: mm: Enable RCU fast_gup Steve Capper
2014-03-28 15:01 ` Steve Capper
2014-03-28 15:01 ` Steve Capper
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1396018892-6773-1-git-send-email-steve.capper@linaro.org \
--to=steve.capper@linaro.org \
--cc=akpm@linux-foundation.org \
--cc=anders.roxell@linaro.org \
--cc=catalin.marinas@arm.com \
--cc=gary.robertson@linaro.org \
--cc=linux-arch@vger.kernel.org \
--cc=linux-arm-kernel@lists.infradead.org \
--cc=linux-mm@kvack.org \
--cc=linux@arm.linux.org.uk \
--cc=peterz@infradead.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.