* [PATCH v2 0/3] arm64: Add optimized memset/memcpy functions
@ 2021-08-10  7:13 Stefan Roese
From: Stefan Roese @ 2021-08-10  7:13 UTC
  To: u-boot; +Cc: Rasmus Villemoes, Wolfgang Denk, sjg, trini


On an NXP LX2160-based platform it has been noticed that the currently
implemented memset/memcpy functions for aarch64 are suboptimal.
In particular, the memset() that clears the NXP MC firmware memory is
very expensive (time-wise).

With the optimized functions, a speedup of roughly a factor of 6 has
been measured.

This patchset now adds the optimized functions ported from this
repository:
https://github.com/ARM-software/optimized-routines

As the optimized memset function makes use of the dc opcode, which needs
the caches to be enabled, an additional check is added and a simple
memset version is used when the caches are disabled.
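
The fallback described above can be illustrated in C. This is only a sketch of the idea, not the code from this patchset, which lives in the assembly file arch/arm/lib/memset-arm64.S. All names below (dcache_enabled_flag, dcache_is_enabled, memset_simple, memset_optimized, memset_checked) are hypothetical, and the "optimized" routine is just a stand-in that calls the C library memset().

```c
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-in for the real "is the data cache on?" query. */
static bool dcache_enabled_flag;

static bool dcache_is_enabled(void)
{
	return dcache_enabled_flag;
}

/* Simple byte-wise memset: uses no cache maintenance instructions. */
static void *memset_simple(void *dst, int c, size_t n)
{
	unsigned char *p = dst;

	while (n--)
		*p++ = (unsigned char)c;

	return dst;
}

/*
 * Stand-in for the optimized routine. The imported assembly version may
 * use the dc opcode, which is why it must only run with caches enabled.
 */
static void *memset_optimized(void *dst, int c, size_t n)
{
	return memset(dst, c, n);
}

/* Pick the safe variant whenever the caches are disabled. */
void *memset_checked(void *dst, int c, size_t n)
{
	if (!dcache_is_enabled())
		return memset_simple(dst, c, n);

	return memset_optimized(dst, c, n);
}
```

Both paths produce the same memory contents; only the speed and the set of instructions used differ, so callers do not need to know which variant ran.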

Please note that checkpatch.pl complains about some issues with this
imported file: arch/arm/lib/asmdefs.h
Since it's imported, I deliberately did not make any changes here, to
make potential future syncing easier.

Thanks,
Stefan

Changes in v2:
- Add file names and locations and git commit ID from imported files
  to the commit message
- New patch

Stefan Roese (3):
  arm64: arch/arm/lib: Add optimized memset/memcpy functions
  arm64: memset-arm64: Use simple memset when cache is disabled
  arm64: Kconfig: Enable usage of optimized memset/memcpy

 arch/arm/Kconfig            |  10 +-
 arch/arm/lib/Makefile       |   5 +
 arch/arm/lib/asmdefs.h      |  98 +++++++++++++++
 arch/arm/lib/memcpy-arm64.S | 241 ++++++++++++++++++++++++++++++++++++
 arch/arm/lib/memset-arm64.S | 146 ++++++++++++++++++++++
 5 files changed, 494 insertions(+), 6 deletions(-)
 create mode 100644 arch/arm/lib/asmdefs.h
 create mode 100644 arch/arm/lib/memcpy-arm64.S
 create mode 100644 arch/arm/lib/memset-arm64.S

-- 
2.32.0



Thread overview: 8+ messages
2021-08-10  7:13 [PATCH v2 0/3] arm64: Add optimized memset/memcpy functions Stefan Roese
2021-08-10  7:13 ` [PATCH v2 1/3] arm64: arch/arm/lib: " Stefan Roese
2021-08-10 11:30   ` Rasmus Villemoes
2021-08-10 11:39     ` Stefan Roese
2021-08-10  7:13 ` [PATCH v2 2/3] arm64: memset-arm64: Use simple memset when cache is disabled Stefan Roese
2021-08-10  9:27   ` Rasmus Villemoes
2021-08-10  9:47     ` Stefan Roese
2021-08-10  7:13 ` [PATCH v2 3/3] arm64: Kconfig: Enable usage of optimized memset/memcpy Stefan Roese
