* [PATCH 0/4] copy_page improvements
@ 2016-02-02 12:46 Will Deacon
  2016-02-02 12:46 ` [PATCH 1/4] arm64: prefetch: don't provide spin_lock_prefetch with LSE Will Deacon
                   ` (5 more replies)
  0 siblings, 6 replies; 7+ messages in thread
From: Will Deacon @ 2016-02-02 12:46 UTC (permalink / raw)
  To: linux-arm-kernel

Hi all,

This patch series aims to improve our copy_page implementation on CPUs
both with and without hardware prefetchers. It is inspired by a previous
patch series from Andrew, and one of the patches is lifted directly from
there:

  http://lists.infradead.org/pipermail/linux-arm-kernel/2016-January/399132.html

Changes from that series:

  * Applies on top of mainline
  * Inverted the capability to the more generic ARM64_HAS_NO_HW_PREFETCH
  * Re-jigged the MIDR detection to try to keep it simple

I've left copy_template alone for now, since the template really deals
with 64 bytes per iteration, which would need changing.

Andrew -- it would be helpful if you could take this for a spin on
ThunderX to make sure it didn't screw anything up.

Cheers,

Will

--->8

Andrew Pinski (1):
  arm64: lib: patch in prfm for copy_page if requested

Will Deacon (3):
  arm64: prefetch: don't provide spin_lock_prefetch with LSE
  arm64: prefetch: add alternative pattern for CPUs without a prefetcher
  arm64: lib: improve copy_page to deal with 128 bytes at a time

 arch/arm64/include/asm/cpufeature.h |  3 +-
 arch/arm64/include/asm/cputype.h    | 17 +++++++++-
 arch/arm64/include/asm/processor.h  |  7 +++--
 arch/arm64/kernel/cpu_errata.c      | 18 ++---------
 arch/arm64/kernel/cpufeature.c      | 17 ++++++++++
 arch/arm64/lib/copy_page.S          | 63 ++++++++++++++++++++++++++++++++-----
 6 files changed, 98 insertions(+), 27 deletions(-)

-- 
2.1.4

* [PATCH 1/4] arm64: prefetch: don't provide spin_lock_prefetch with LSE
  2016-02-02 12:46 [PATCH 0/4] copy_page improvements Will Deacon
@ 2016-02-02 12:46 ` Will Deacon
  2016-02-02 12:46 ` [PATCH 2/4] arm64: prefetch: add alternative pattern for CPUs without a prefetcher Will Deacon
                   ` (4 subsequent siblings)
  5 siblings, 0 replies; 7+ messages in thread
From: Will Deacon @ 2016-02-02 12:46 UTC (permalink / raw)
  To: linux-arm-kernel

The LSE atomics rely on us not dirtying data at L1 if we can avoid it;
otherwise, many of the potential scalability benefits are lost.

This patch replaces spin_lock_prefetch with a nop when the LSE atomics
are in use, so that users don't shoot themselves in the foot by causing
needless coherence traffic at L1.
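
Conceptually, the change selects between two behaviours at boot time via
the alternatives framework. The compile-time sketch below is only an
approximation of that run-time choice; HAVE_LSE_ATOMICS is a hypothetical
stand-in for the ARM64_HAS_LSE_ATOMICS capability, not a real kernel symbol:

/*
 * Sketch only: approximates the boot-time choice made by the
 * ARM64_LSE_ATOMIC_INSN alternative in the hunk below with a
 * compile-time switch.  HAVE_LSE_ATOMICS is a hypothetical macro
 * standing in for the ARM64_HAS_LSE_ATOMICS CPU capability.
 */
static inline void spin_lock_prefetch_sketch(const void *ptr)
{
#ifdef HAVE_LSE_ATOMICS
	/* LSE in use: do nothing rather than pull the lock line into L1 */
	asm volatile("nop");
#else
	/* LL/SC in use: prefetch the line for store, streaming */
	asm volatile("prfm pstl1strm, %a0" : : "p" (ptr));
#endif
}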

Signed-off-by: Will Deacon <will.deacon@arm.com>
---
 arch/arm64/include/asm/processor.h | 7 +++++--
 1 file changed, 5 insertions(+), 2 deletions(-)

diff --git a/arch/arm64/include/asm/processor.h b/arch/arm64/include/asm/processor.h
index 4acb7ca94fcd..31b76fce4477 100644
--- a/arch/arm64/include/asm/processor.h
+++ b/arch/arm64/include/asm/processor.h
@@ -29,6 +29,7 @@
 
 #include <linux/string.h>
 
+#include <asm/alternative.h>
 #include <asm/fpsimd.h>
 #include <asm/hw_breakpoint.h>
 #include <asm/pgtable-hwdef.h>
@@ -177,9 +178,11 @@ static inline void prefetchw(const void *ptr)
 }
 
 #define ARCH_HAS_SPINLOCK_PREFETCH
-static inline void spin_lock_prefetch(const void *x)
+static inline void spin_lock_prefetch(const void *ptr)
 {
-	prefetchw(x);
+	asm volatile(ARM64_LSE_ATOMIC_INSN(
+		     "prfm pstl1strm, %a0",
+		     "nop") : : "p" (ptr));
 }
 
 #define HAVE_ARCH_PICK_MMAP_LAYOUT
-- 
2.1.4

* [PATCH 2/4] arm64: prefetch: add alternative pattern for CPUs without a prefetcher
  2016-02-02 12:46 [PATCH 0/4] copy_page improvements Will Deacon
  2016-02-02 12:46 ` [PATCH 1/4] arm64: prefetch: don't provide spin_lock_prefetch with LSE Will Deacon
@ 2016-02-02 12:46 ` Will Deacon
  2016-02-02 12:46 ` [PATCH 3/4] arm64: lib: improve copy_page to deal with 128 bytes at a time Will Deacon
                   ` (3 subsequent siblings)
  5 siblings, 0 replies; 7+ messages in thread
From: Will Deacon @ 2016-02-02 12:46 UTC (permalink / raw)
  To: linux-arm-kernel

Most CPUs have a hardware prefetcher which generally performs better
without explicit prefetch instructions issued by software. However,
some CPUs (e.g. Cavium ThunderX) rely solely on explicit prefetch
instructions.

This patch adds an alternative pattern (ARM64_HAS_NO_HW_PREFETCH) to
allow our library code to make use of explicit prefetch instructions
during things like copy routines only when the CPU does not have the
capability to perform the prefetching itself.
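
To make the MIDR matching below easier to follow, here is a rough
standalone sketch of what the new ThunderX check boils down to. The
constants are derived from the cputype.h and cpufeature.c hunks in this
patch; the code is illustrative only, not kernel code:

/*
 * Illustrative sketch (not kernel code) of the ThunderX detection
 * added below.  MIDR_EL1 field layout: [31:24] implementer,
 * [23:20] variant, [19:16] architecture, [15:4] part number,
 * [3:0] revision.
 */
#include <stdbool.h>
#include <stdint.h>

#define MIDR_MODEL_MASK   0xff0ffff0u /* implementer | architecture | partnum */
#define MIDR_REV_VAR_MASK 0x00f0000fu /* variant | revision */
#define MIDR_THUNDERX     0x430f0a10u /* Cavium (0x43), part 0x0a1 */

static bool thunderx_has_no_hw_prefetch(uint32_t midr)
{
	uint32_t rv = midr & MIDR_REV_VAR_MASK;

	/* Variant 0 (pass 1.x) or variant 1 (pass 2.x), any revision */
	return (midr & MIDR_MODEL_MASK) == MIDR_THUNDERX &&
	       rv <= ((1u << 20) | 0xfu);
}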

Signed-off-by: Will Deacon <will.deacon@arm.com>
---
 arch/arm64/include/asm/cpufeature.h |  3 ++-
 arch/arm64/include/asm/cputype.h    | 17 ++++++++++++++++-
 arch/arm64/kernel/cpu_errata.c      | 18 +++---------------
 arch/arm64/kernel/cpufeature.c      | 17 +++++++++++++++++
 4 files changed, 38 insertions(+), 17 deletions(-)

diff --git a/arch/arm64/include/asm/cpufeature.h b/arch/arm64/include/asm/cpufeature.h
index 8f271b83f910..8d56bd8550dc 100644
--- a/arch/arm64/include/asm/cpufeature.h
+++ b/arch/arm64/include/asm/cpufeature.h
@@ -30,8 +30,9 @@
 #define ARM64_HAS_LSE_ATOMICS			5
 #define ARM64_WORKAROUND_CAVIUM_23154		6
 #define ARM64_WORKAROUND_834220			7
+#define ARM64_HAS_NO_HW_PREFETCH		8
 
-#define ARM64_NCAPS				8
+#define ARM64_NCAPS				9
 
 #ifndef __ASSEMBLY__
 
diff --git a/arch/arm64/include/asm/cputype.h b/arch/arm64/include/asm/cputype.h
index 1a5949364ed0..7540284a17fe 100644
--- a/arch/arm64/include/asm/cputype.h
+++ b/arch/arm64/include/asm/cputype.h
@@ -57,11 +57,22 @@
 #define MIDR_IMPLEMENTOR(midr)	\
 	(((midr) & MIDR_IMPLEMENTOR_MASK) >> MIDR_IMPLEMENTOR_SHIFT)
 
-#define MIDR_CPU_PART(imp, partnum) \
+#define MIDR_CPU_MODEL(imp, partnum) \
 	(((imp)			<< MIDR_IMPLEMENTOR_SHIFT) | \
 	(0xf			<< MIDR_ARCHITECTURE_SHIFT) | \
 	((partnum)		<< MIDR_PARTNUM_SHIFT))
 
+#define MIDR_CPU_MODEL_MASK (MIDR_IMPLEMENTOR_MASK | MIDR_PARTNUM_MASK | \
+			     MIDR_ARCHITECTURE_MASK)
+
+#define MIDR_IS_CPU_MODEL_RANGE(midr, model, rv_min, rv_max)		\
+({									\
+	u32 _model = (midr) & MIDR_CPU_MODEL_MASK;			\
+	u32 rv = (midr) & (MIDR_REVISION_MASK | MIDR_VARIANT_MASK);	\
+									\
+	_model == (model) && rv >= (rv_min) && rv <= (rv_max);		\
+ })
+
 #define ARM_CPU_IMP_ARM			0x41
 #define ARM_CPU_IMP_APM			0x50
 #define ARM_CPU_IMP_CAVIUM		0x43
@@ -75,6 +86,10 @@
 
 #define CAVIUM_CPU_PART_THUNDERX	0x0A1
 
+#define MIDR_CORTEX_A53 MIDR_CPU_MODEL(ARM_CPU_IMP_ARM, ARM_CPU_PART_CORTEX_A53)
+#define MIDR_CORTEX_A57 MIDR_CPU_MODEL(ARM_CPU_IMP_ARM, ARM_CPU_PART_CORTEX_A57)
+#define MIDR_THUNDERX	MIDR_CPU_MODEL(ARM_CPU_IMP_CAVIUM, CAVIUM_CPU_PART_THUNDERX)
+
 #ifndef __ASSEMBLY__
 
 /*
diff --git a/arch/arm64/kernel/cpu_errata.c b/arch/arm64/kernel/cpu_errata.c
index feb6b4efa641..e6bc988e8dbf 100644
--- a/arch/arm64/kernel/cpu_errata.c
+++ b/arch/arm64/kernel/cpu_errata.c
@@ -21,24 +21,12 @@
 #include <asm/cputype.h>
 #include <asm/cpufeature.h>
 
-#define MIDR_CORTEX_A53 MIDR_CPU_PART(ARM_CPU_IMP_ARM, ARM_CPU_PART_CORTEX_A53)
-#define MIDR_CORTEX_A57 MIDR_CPU_PART(ARM_CPU_IMP_ARM, ARM_CPU_PART_CORTEX_A57)
-#define MIDR_THUNDERX	MIDR_CPU_PART(ARM_CPU_IMP_CAVIUM, CAVIUM_CPU_PART_THUNDERX)
-
-#define CPU_MODEL_MASK (MIDR_IMPLEMENTOR_MASK | MIDR_PARTNUM_MASK | \
-			MIDR_ARCHITECTURE_MASK)
-
 static bool __maybe_unused
 is_affected_midr_range(const struct arm64_cpu_capabilities *entry)
 {
-	u32 midr = read_cpuid_id();
-
-	if ((midr & CPU_MODEL_MASK) != entry->midr_model)
-		return false;
-
-	midr &= MIDR_REVISION_MASK | MIDR_VARIANT_MASK;
-
-	return (midr >= entry->midr_range_min && midr <= entry->midr_range_max);
+	return MIDR_IS_CPU_MODEL_RANGE(read_cpuid_id(), entry->midr_model,
+				       entry->midr_range_min,
+				       entry->midr_range_max);
 }
 
 #define MIDR_RANGE(model, min, max) \
diff --git a/arch/arm64/kernel/cpufeature.c b/arch/arm64/kernel/cpufeature.c
index 5c90aa490a2b..3615d7d7c9af 100644
--- a/arch/arm64/kernel/cpufeature.c
+++ b/arch/arm64/kernel/cpufeature.c
@@ -621,6 +621,18 @@ static bool has_useable_gicv3_cpuif(const struct arm64_cpu_capabilities *entry)
 	return has_sre;
 }
 
+static bool has_no_hw_prefetch(const struct arm64_cpu_capabilities *entry)
+{
+	u32 midr = read_cpuid_id();
+	u32 rv_min, rv_max;
+
+	/* Cavium ThunderX pass 1.x and 2.x */
+	rv_min = 0;
+	rv_max = (1 << MIDR_VARIANT_SHIFT) | MIDR_REVISION_MASK;
+
+	return MIDR_IS_CPU_MODEL_RANGE(midr, MIDR_THUNDERX, rv_min, rv_max);
+}
+
 static const struct arm64_cpu_capabilities arm64_features[] = {
 	{
 		.desc = "GIC system register CPU interface",
@@ -651,6 +663,11 @@ static const struct arm64_cpu_capabilities arm64_features[] = {
 		.min_field_value = 2,
 	},
 #endif /* CONFIG_AS_LSE && CONFIG_ARM64_LSE_ATOMICS */
+	{
+		.desc = "Software prefetching using PRFM",
+		.capability = ARM64_HAS_NO_HW_PREFETCH,
+		.matches = has_no_hw_prefetch,
+	},
 	{},
 };
 
-- 
2.1.4

* [PATCH 3/4] arm64: lib: improve copy_page to deal with 128 bytes at a time
  2016-02-02 12:46 [PATCH 0/4] copy_page improvements Will Deacon
  2016-02-02 12:46 ` [PATCH 1/4] arm64: prefetch: don't provide spin_lock_prefetch with LSE Will Deacon
  2016-02-02 12:46 ` [PATCH 2/4] arm64: prefetch: add alternative pattern for CPUs without a prefetcher Will Deacon
@ 2016-02-02 12:46 ` Will Deacon
  2016-02-02 12:46 ` [PATCH 4/4] arm64: lib: patch in prfm for copy_page if requested Will Deacon
                   ` (2 subsequent siblings)
  5 siblings, 0 replies; 7+ messages in thread
From: Will Deacon @ 2016-02-02 12:46 UTC (permalink / raw)
  To: linux-arm-kernel

We want to avoid lots of different copy_page implementations, settling
for something that is "good enough" everywhere and hopefully easy to
understand and maintain whilst we're at it.

This patch reworks our copy_page implementation based on discussions
with Cavium on the list and benchmarking on Cortex-A processors so that:

  - The loop is unrolled to copy 128 bytes per iteration

  - The reads are offset so that we read from the next 128-byte block
    in the same iteration that we store the previous block

  - Explicit prefetch instructions are removed for now, since they hurt
    performance on CPUs with hardware prefetching

  - The loop exit condition is calculated at the start of the loop
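
As a rough illustration of the resulting structure (the real
implementation is the assembly below), a C model of the software
pipeline looks something like this, assuming a 4 KiB page for
concreteness:

/*
 * Rough C model of the software-pipelined copy below (illustrative
 * only; the real implementation is the assembly diff).  Assumes a
 * 4 KiB page.  Each iteration stores the 128-byte block loaded on
 * the previous iteration and loads the next one.
 */
#include <string.h>

#define MODEL_PAGE_SIZE 4096

static void copy_page_model(unsigned char *dst, const unsigned char *src)
{
	unsigned char block[128];
	long remaining = MODEL_PAGE_SIZE - 128;

	memcpy(block, src, sizeof(block));		/* prologue load */
	src += 128;

	do {
		remaining -= 128;
		memcpy(dst, block, sizeof(block));	/* store previous block */
		memcpy(block, src, sizeof(block));	/* load next block */
		dst += 128;
		src += 128;
	} while (remaining > 0);

	memcpy(dst, block, sizeof(block));		/* epilogue store */
}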

Signed-off-by: Will Deacon <will.deacon@arm.com>
---
 arch/arm64/lib/copy_page.S | 46 ++++++++++++++++++++++++++++++++++++++--------
 1 file changed, 38 insertions(+), 8 deletions(-)

diff --git a/arch/arm64/lib/copy_page.S b/arch/arm64/lib/copy_page.S
index 512b9a7b980e..2534533ceb1d 100644
--- a/arch/arm64/lib/copy_page.S
+++ b/arch/arm64/lib/copy_page.S
@@ -27,20 +27,50 @@
  *	x1 - src
  */
 ENTRY(copy_page)
-	/* Assume cache line size is 64 bytes. */
-	prfm	pldl1strm, [x1, #64]
-1:	ldp	x2, x3, [x1]
+	ldp	x2, x3, [x1]
 	ldp	x4, x5, [x1, #16]
 	ldp	x6, x7, [x1, #32]
 	ldp	x8, x9, [x1, #48]
-	add	x1, x1, #64
-	prfm	pldl1strm, [x1, #64]
+	ldp	x10, x11, [x1, #64]
+	ldp	x12, x13, [x1, #80]
+	ldp	x14, x15, [x1, #96]
+	ldp	x16, x17, [x1, #112]
+
+	mov	x18, #(PAGE_SIZE - 128)
+	add	x1, x1, #128
+1:
+	subs	x18, x18, #128
+
 	stnp	x2, x3, [x0]
+	ldp	x2, x3, [x1]
 	stnp	x4, x5, [x0, #16]
+	ldp	x4, x5, [x1, #16]
 	stnp	x6, x7, [x0, #32]
+	ldp	x6, x7, [x1, #32]
 	stnp	x8, x9, [x0, #48]
-	add	x0, x0, #64
-	tst	x1, #(PAGE_SIZE - 1)
-	b.ne	1b
+	ldp	x8, x9, [x1, #48]
+	stnp	x10, x11, [x0, #64]
+	ldp	x10, x11, [x1, #64]
+	stnp	x12, x13, [x0, #80]
+	ldp	x12, x13, [x1, #80]
+	stnp	x14, x15, [x0, #96]
+	ldp	x14, x15, [x1, #96]
+	stnp	x16, x17, [x0, #112]
+	ldp	x16, x17, [x1, #112]
+
+	add	x0, x0, #128
+	add	x1, x1, #128
+
+	b.gt	1b
+
+	stnp	x2, x3, [x0]
+	stnp	x4, x5, [x0, #16]
+	stnp	x6, x7, [x0, #32]
+	stnp	x8, x9, [x0, #48]
+	stnp	x10, x11, [x0, #64]
+	stnp	x12, x13, [x0, #80]
+	stnp	x14, x15, [x0, #96]
+	stnp	x16, x17, [x0, #112]
+
 	ret
 ENDPROC(copy_page)
-- 
2.1.4

* [PATCH 4/4] arm64: lib: patch in prfm for copy_page if requested
  2016-02-02 12:46 [PATCH 0/4] copy_page improvements Will Deacon
                   ` (2 preceding siblings ...)
  2016-02-02 12:46 ` [PATCH 3/4] arm64: lib: improve copy_page to deal with 128 bytes at a time Will Deacon
@ 2016-02-02 12:46 ` Will Deacon
  2016-02-05 17:53 ` [PATCH 0/4] copy_page improvements Andrew Pinski
  2016-02-09 11:55 ` Catalin Marinas
  5 siblings, 0 replies; 7+ messages in thread
From: Will Deacon @ 2016-02-02 12:46 UTC (permalink / raw)
  To: linux-arm-kernel

From: Andrew Pinski <apinski@cavium.com>

On ThunderX T88 pass 1 and pass 2, there is no hardware prefetching,
so we need to patch in explicit software prefetch instructions.
On the affected hardware, prefetching improves this code by 60% over
the original code and by 2x over the code without prefetching, as
measured with the benchmark code at
https://github.com/apinski-cavium/copy_page_benchmark
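
The schedule being patched in is: two prefetches ahead of the stream
before the copy starts, plus one per loop iteration that keeps the
stream roughly three 128-byte blocks ahead of the loads. A loose C
sketch of just that schedule (using __builtin_prefetch purely for
illustration; the actual patch uses prfm pldl1strm selected by
boot-time alternatives, and a 4 KiB page is assumed):

/*
 * Loose sketch of the prefetch schedule patched in below (illustrative
 * only).  In __builtin_prefetch, rw=0 (read) and locality=0 (streaming)
 * roughly correspond to the pldl1strm hint.
 */
static void prefetch_schedule_sketch(const char *src)
{
	/* Before the copy starts: request the next two 128-byte blocks */
	__builtin_prefetch(src + 128, 0, 0);
	__builtin_prefetch(src + 256, 0, 0);

	/*
	 * In the real loop each iteration copies the block at 'off' and
	 * issues one prefetch three blocks further on, so the stream
	 * stays ahead of the loads.
	 */
	for (long off = 128; off < 4096; off += 128)
		__builtin_prefetch(src + off + 384, 0, 0);
}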

Signed-off-by: Andrew Pinski <apinski@cavium.com>
Signed-off-by: Will Deacon <will.deacon@arm.com>
---
 arch/arm64/lib/copy_page.S | 17 +++++++++++++++++
 1 file changed, 17 insertions(+)

diff --git a/arch/arm64/lib/copy_page.S b/arch/arm64/lib/copy_page.S
index 2534533ceb1d..4c1e700840b6 100644
--- a/arch/arm64/lib/copy_page.S
+++ b/arch/arm64/lib/copy_page.S
@@ -18,6 +18,8 @@
 #include <linux/const.h>
 #include <asm/assembler.h>
 #include <asm/page.h>
+#include <asm/cpufeature.h>
+#include <asm/alternative.h>
 
 /*
  * Copy a page from src to dest (both are page aligned)
@@ -27,6 +29,15 @@
  *	x1 - src
  */
 ENTRY(copy_page)
+alternative_if_not ARM64_HAS_NO_HW_PREFETCH
+	nop
+	nop
+alternative_else
+	# Prefetch two cache lines ahead.
+	prfm    pldl1strm, [x1, #128]
+	prfm    pldl1strm, [x1, #256]
+alternative_endif
+
 	ldp	x2, x3, [x1]
 	ldp	x4, x5, [x1, #16]
 	ldp	x6, x7, [x1, #32]
@@ -41,6 +52,12 @@ ENTRY(copy_page)
 1:
 	subs	x18, x18, #128
 
+alternative_if_not ARM64_HAS_NO_HW_PREFETCH
+	nop
+alternative_else
+	prfm    pldl1strm, [x1, #384]
+alternative_endif
+
 	stnp	x2, x3, [x0]
 	ldp	x2, x3, [x1]
 	stnp	x4, x5, [x0, #16]
-- 
2.1.4

* [PATCH 0/4] copy_page improvements
  2016-02-02 12:46 [PATCH 0/4] copy_page improvements Will Deacon
                   ` (3 preceding siblings ...)
  2016-02-02 12:46 ` [PATCH 4/4] arm64: lib: patch in prfm for copy_page if requested Will Deacon
@ 2016-02-05 17:53 ` Andrew Pinski
  2016-02-09 11:55 ` Catalin Marinas
  5 siblings, 0 replies; 7+ messages in thread
From: Andrew Pinski @ 2016-02-05 17:53 UTC (permalink / raw)
  To: linux-arm-kernel

On 2/2/2016 4:46 AM, Will Deacon wrote:
> Hi all,
>
> This patch series aims to improve our copy_page implementation on CPUs
> both with and without hardware prefetchers. It is inspired by a previous
> patch series from Andrew, and one of the patches is lifted directly from
> there:
>
>    http://lists.infradead.org/pipermail/linux-arm-kernel/2016-January/399132.html
>
> Changes from that series:
>
>    * Applies on top of mainline
>    * Inverted the capability to the more generic ARM64_HAS_NO_HW_PREFETCH
>    * Re-jigged the MIDR detection to try to keep it simple
>
> I've left copy_template alone for now, since the template really deals
> with 64 bytes per iteration, which would need changing.

Right, I had also looked at doing that for newlib/glibc, but the code is
too complex.
>
> Andrew -- it would be helpful if you could take this for a spin on
> ThunderX to make sure it didn't screw anything up.

Yes, it works and I get the needed prefetches in copy_page.

Tested-by: Andrew Pinski <apinski@cavium.com>

Thanks,
Andrew

>
> Cheers,
>
> Will
>
> --->8
>
> Andrew Pinski (1):
>    arm64: lib: patch in prfm for copy_page if requested
>
> Will Deacon (3):
>    arm64: prefetch: don't provide spin_lock_prefetch with LSE
>    arm64: prefetch: add alternative pattern for CPUs without a prefetcher
>    arm64: lib: improve copy_page to deal with 128 bytes at a time
>
>   arch/arm64/include/asm/cpufeature.h |  3 +-
>   arch/arm64/include/asm/cputype.h    | 17 +++++++++-
>   arch/arm64/include/asm/processor.h  |  7 +++--
>   arch/arm64/kernel/cpu_errata.c      | 18 ++---------
>   arch/arm64/kernel/cpufeature.c      | 17 ++++++++++
>   arch/arm64/lib/copy_page.S          | 63 ++++++++++++++++++++++++++++++++-----
>   6 files changed, 98 insertions(+), 27 deletions(-)
>

* [PATCH 0/4] copy_page improvements
  2016-02-02 12:46 [PATCH 0/4] copy_page improvements Will Deacon
                   ` (4 preceding siblings ...)
  2016-02-05 17:53 ` [PATCH 0/4] copy_page improvements Andrew Pinski
@ 2016-02-09 11:55 ` Catalin Marinas
  5 siblings, 0 replies; 7+ messages in thread
From: Catalin Marinas @ 2016-02-09 11:55 UTC (permalink / raw)
  To: linux-arm-kernel

On Tue, Feb 02, 2016 at 12:46:22PM +0000, Will Deacon wrote:
> Andrew Pinski (1):
>   arm64: lib: patch in prfm for copy_page if requested
> 
> Will Deacon (3):
>   arm64: prefetch: don't provide spin_lock_prefetch with LSE
>   arm64: prefetch: add alternative pattern for CPUs without a prefetcher
>   arm64: lib: improve copy_page to deal with 128 bytes at a time

Patches queued for 4.6. Thanks.

-- 
Catalin
