On Wed, Aug 11, 2021 at 04:02:39PM +0200, Stefan Roese wrote: > > On an NXP LX2160 based platform it has been noticed, that the currently > implemented memset/memcpy functions for aarch64 are suboptimal. > Especially the memset() for clearing the NXP MC firmware memory is very > expensive (time-wise). > > By using optimized functions, a speedup of ~ factor 6 has been measured. To be clear, you re-measured with the cache check code added, and this is the speed up? -- Tom