From: w@1wt.eu (Willy Tarreau)
To: linux-arm-kernel@lists.infradead.org
Subject: Call for testing/opinions: Optimized memset/memcpy
Date: Sat, 13 Jul 2013 19:24:45 +0200
Message-ID: <20130713172445.GL32054@1wt.eu>
In-Reply-To: <loom.20130713T172357-560@post.gmane.org>
Hello Harm,
On Sat, Jul 13, 2013 at 03:51:07PM +0000, Harm Hanemaaijer wrote:
> Hello,
>
> I've been doing some work on optimizing the memset/memcpy family of
> functions for modern ARM platforms, including copy_page, memset,
> memzero, memcpy, copy_from_user and copy_to_user. It appears that
> there is room for improvement, especially with regard to using an
> optimal preload strategy for armv6/v7 architectures as well as
> aligning the write target. For example, on an armv6-based platform
> (RPi) I am seeing a 80% speed-up in copy_page and large sized
> memcpy. Gains in the range 10-25% are seen on a Cortex A8 device.
Interesting, especially for devices that have a narrow DDR bus where
we want to shave every possible bus cycle!
(...)
> So in short, I am looking for opinions, and test results especially
> from the userspace benchmark, to see the relative merit of these
> optimizations on different platforms.
OK, I've run bench.script on the following platforms:
- Snowball board: a dual-core 1 GHz Cortex-A9 from STE (A9500).
It has some 32-bit LPDDR2 soldered on the CPU (package on package).
The test ran only in ARMv7 mode.
root@snowball:tmp# cat /proc/cpuinfo
processor : 0
model name : ARMv7 Processor rev 1 (v7l)
BogoMIPS : 4.80
Features : swp half thumb fastmult vfp edsp neon vfpv3 tls
CPU implementer : 0x41
CPU architecture: 7
CPU variant : 0x2
CPU part : 0xc09
CPU revision : 1
- Armada XP-GP board: a quad-core 1.6 GHz Marvell Armada-XP (PJ4Bv2)
CPU. It has 64-bit DDR3-1600 RAM on a DIMM. The tests were run in ARMv7
and Thumb2 modes. The difference between the two modes was small.
root@xpgp:tmp# cat /proc/cpuinfo
processor : 0
model name : ARMv7 Processor rev 2 (v7l)
BogoMIPS : 1594.16
Features : swp half thumb fastmult vfp edsp vfpv3 tls idiva idivt
CPU implementer : 0x56
CPU architecture: 7
CPU variant : 0x2
CPU part : 0x584
CPU revision : 2
- Mirabox: a single-core 1.2 GHz Marvell Armada370 (PJ4B) CPU. It uses
16-bit DDR3-1200 soldered onboard. The tests were run in ARMv7 and
Thumb2 modes. It can be useful to compare with the XP-GP above because
its CPU can be seen as a scaled-down version of the previous one, with
1/4 of the DRAM bus width, and both have the DRAM at half CPU frequency.
root@mirabox:tmp# cat /proc/cpuinfo
processor : 0
model name : ARMv7 Processor rev 1 (v7l)
BogoMIPS : 597.60
Features : swp half thumb fastmult vfp edsp vfpv3 vfpv3d16 tls idivt
CPU implementer : 0x56
CPU architecture: 7
CPU variant : 0x1
CPU part : 0x581
CPU revision : 1
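
To put the DRAM comparison between the XP-GP and the Mirabox into numbers, here is a rough sketch of the theoretical peak bandwidth for both. It assumes that "DDR3-1600" and "DDR3-1200" denote 1600 and 1200 million transfers per second (the usual DDR naming convention), and the helper names are made up for illustration. Real copy throughput will be far below these figures.

```python
def peak_bandwidth_mb_s(bus_width_bits, mega_transfers_per_s):
    """Theoretical peak in MB/s: bytes per transfer times MT/s."""
    return bus_width_bits / 8 * mega_transfers_per_s


def dram_clock_mhz(mega_transfers_per_s):
    """DDR moves data on both clock edges, so clock = MT/s / 2."""
    return mega_transfers_per_s / 2


# Assumption: the number after "DDR3-" is the transfer rate in MT/s.
xp_gp_peak = peak_bandwidth_mb_s(64, 1600)
mirabox_peak = peak_bandwidth_mb_s(16, 1200)

print("XP-GP   peak: %.0f MB/s, DRAM clock %.0f MHz (CPU 1600 MHz)"
      % (xp_gp_peak, dram_clock_mhz(1600)))
print("Mirabox peak: %.0f MB/s, DRAM clock %.0f MHz (CPU 1200 MHz)"
      % (mirabox_peak, dram_clock_mhz(1200)))
print("peak ratio XP-GP / Mirabox: %.2f" % (xp_gp_peak / mirabox_peak))
```

Under these assumptions the DRAM clock comes out at half the CPU clock on both boards, which matches the description above.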
I'm attaching all the results.
Hoping this helps,
Willy
-------------- next part --------------
libc memcpy:
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 599.89 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 600.57 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 597.81 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 598.70 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 595.39 MB/s
kernel memcpy (original):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 618.28 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 615.10 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 618.15 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 615.02 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 621.19 MB/s
kernel memcpy (optimized):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 618.03 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 612.97 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 614.82 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 611.68 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 616.50 MB/s
libc memcpy:
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 363.92 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 365.71 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 363.92 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 365.73 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 365.63 MB/s
kernel memcpy (original):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 381.35 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 383.49 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 381.49 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 383.32 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 381.47 MB/s
kernel memcpy (optimized):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 426.75 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 426.75 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 426.75 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 426.69 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 424.72 MB/s
libc memcpy:
Mixed multiples of 4 from 4 to 130, word aligned: 311.75 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 310.30 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 311.74 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 310.22 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 311.76 MB/s
kernel memcpy (original):
Mixed multiples of 4 from 4 to 130, word aligned: 327.84 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 327.89 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 327.87 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 326.25 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 327.87 MB/s
kernel memcpy (optimized):
Mixed multiples of 4 from 4 to 130, word aligned: 364.50 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 366.29 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 364.51 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 366.24 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 366.31 MB/s
kernel copy_from_user (optimized):
Mixed multiples of 4 from 4 to 130, word aligned: 361.11 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 362.86 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 361.10 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 362.86 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 361.13 MB/s
kernel copy_to_user (optimized):
Mixed multiples of 4 from 4 to 130, word aligned: 366.61 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 364.79 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 366.56 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 366.60 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 364.84 MB/s
libc memcpy:
4096 bytes page aligned: 356.71 MB/s
4096 bytes page aligned: 355.04 MB/s
4096 bytes page aligned: 356.67 MB/s
4096 bytes page aligned: 354.98 MB/s
4096 bytes page aligned: 356.68 MB/s
kernel memcpy (original):
4096 bytes page aligned: 355.32 MB/s
4096 bytes page aligned: 356.96 MB/s
4096 bytes page aligned: 355.31 MB/s
4096 bytes page aligned: 357.01 MB/s
4096 bytes page aligned: 355.30 MB/s
kernel memcpy (optimized):
4096 bytes page aligned: 341.05 MB/s
4096 bytes page aligned: 339.37 MB/s
4096 bytes page aligned: 341.04 MB/s
4096 bytes page aligned: 339.37 MB/s
4096 bytes page aligned: 341.03 MB/s
kernel copy_page (original):
4096 bytes page aligned: 382.31 MB/s
4096 bytes page aligned: 384.19 MB/s
4096 bytes page aligned: 382.29 MB/s
4096 bytes page aligned: 384.25 MB/s
4096 bytes page aligned: 382.30 MB/s
kernel copy_page (optimized):
4096 bytes page aligned: 340.55 MB/s
4096 bytes page aligned: 338.96 MB/s
4096 bytes page aligned: 340.60 MB/s
4096 bytes page aligned: 338.96 MB/s
4096 bytes page aligned: 340.56 MB/s
libc memcpy:
Mixed from 1 to 1023 (power law), unaligned: 513.06 MB/s
Mixed from 1 to 1023 (power law), unaligned: 513.02 MB/s
Mixed from 1 to 1023 (power law), unaligned: 512.94 MB/s
Mixed from 1 to 1023 (power law), unaligned: 510.37 MB/s
Mixed from 1 to 1023 (power law), unaligned: 513.35 MB/s
kernel memcpy (original):
Mixed from 1 to 1023 (power law), unaligned: 532.66 MB/s
Mixed from 1 to 1023 (power law), unaligned: 535.20 MB/s
Mixed from 1 to 1023 (power law), unaligned: 532.29 MB/s
Mixed from 1 to 1023 (power law), unaligned: 535.41 MB/s
Mixed from 1 to 1023 (power law), unaligned: 535.59 MB/s
kernel memcpy (optimized):
Mixed from 1 to 1023 (power law), unaligned: 528.33 MB/s
Mixed from 1 to 1023 (power law), unaligned: 531.12 MB/s
Mixed from 1 to 1023 (power law), unaligned: 527.64 MB/s
Mixed from 1 to 1023 (power law), unaligned: 530.72 MB/s
Mixed from 1 to 1023 (power law), unaligned: 528.05 MB/s
libc memset:
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 888.47 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 884.25 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 888.42 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 888.49 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 884.05 MB/s
kernel memset (original):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 962.84 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 958.71 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 963.20 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 958.83 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 962.86 MB/s
kernel memset (optimized):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1004.37 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 999.61 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1004.49 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 999.43 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1004.46 MB/s
kernel memzero (original):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 922.59 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 926.98 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 926.99 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 922.46 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 927.07 MB/s
kernel memzero (optimized):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 930.00 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 934.53 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 930.89 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 935.60 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 935.32 MB/s
libc memset:
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 520.37 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 520.42 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 517.93 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 520.36 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 517.84 MB/s
kernel memset (original):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 594.94 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 591.54 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 594.39 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 594.45 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 591.58 MB/s
kernel memset (optimized):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 658.84 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 655.68 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 658.78 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 655.58 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 658.85 MB/s
kernel memzero (original):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 567.21 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 569.94 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 569.92 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 567.08 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 569.93 MB/s
kernel memzero (optimized):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 586.06 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 588.64 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 585.75 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 588.86 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 588.66 MB/s
libc memset:
4096 bytes page aligned: 2052.77 MB/s
4096 bytes page aligned: 2052.69 MB/s
4096 bytes page aligned: 2042.84 MB/s
4096 bytes page aligned: 2052.72 MB/s
4096 bytes page aligned: 2042.30 MB/s
kernel memset (original):
4096 bytes page aligned: 1920.98 MB/s
4096 bytes page aligned: 1911.66 MB/s
4096 bytes page aligned: 1921.13 MB/s
4096 bytes page aligned: 1921.17 MB/s
4096 bytes page aligned: 1911.92 MB/s
kernel memset (optimized):
4096 bytes page aligned: 1900.46 MB/s
4096 bytes page aligned: 1891.21 MB/s
4096 bytes page aligned: 1900.52 MB/s
4096 bytes page aligned: 1891.16 MB/s
4096 bytes page aligned: 1900.64 MB/s
kernel memzero (original):
4096 bytes page aligned: 1910.57 MB/s
4096 bytes page aligned: 1920.05 MB/s
4096 bytes page aligned: 1920.02 MB/s
4096 bytes page aligned: 1910.87 MB/s
4096 bytes page aligned: 1920.06 MB/s
kernel memzero (optimized):
4096 bytes page aligned: 1917.74 MB/s
4096 bytes page aligned: 1927.05 MB/s
4096 bytes page aligned: 1917.28 MB/s
4096 bytes page aligned: 1927.11 MB/s
4096 bytes page aligned: 1926.87 MB/s
libc memset:
Mixed from 1 to 1023 (power law), unaligned: 759.37 MB/s
Mixed from 1 to 1023 (power law), unaligned: 759.42 MB/s
Mixed from 1 to 1023 (power law), unaligned: 755.88 MB/s
Mixed from 1 to 1023 (power law), unaligned: 759.32 MB/s
Mixed from 1 to 1023 (power law), unaligned: 756.04 MB/s
kernel memset (original):
Mixed from 1 to 1023 (power law), unaligned: 802.77 MB/s
Mixed from 1 to 1023 (power law), unaligned: 798.89 MB/s
Mixed from 1 to 1023 (power law), unaligned: 801.62 MB/s
Mixed from 1 to 1023 (power law), unaligned: 802.67 MB/s
Mixed from 1 to 1023 (power law), unaligned: 798.07 MB/s
kernel memset (optimized):
Mixed from 1 to 1023 (power law), unaligned: 862.50 MB/s
Mixed from 1 to 1023 (power law), unaligned: 857.72 MB/s
Mixed from 1 to 1023 (power law), unaligned: 862.52 MB/s
Mixed from 1 to 1023 (power law), unaligned: 857.00 MB/s
Mixed from 1 to 1023 (power law), unaligned: 860.71 MB/s
kernel memzero (original):
Mixed from 1 to 1023 (power law), unaligned: 784.48 MB/s
Mixed from 1 to 1023 (power law), unaligned: 780.41 MB/s
Mixed from 1 to 1023 (power law), unaligned: 784.97 MB/s
Mixed from 1 to 1023 (power law), unaligned: 781.14 MB/s
Mixed from 1 to 1023 (power law), unaligned: 783.99 MB/s
kernel memzero (optimized):
Mixed from 1 to 1023 (power law), unaligned: 793.48 MB/s
Mixed from 1 to 1023 (power law), unaligned: 796.39 MB/s
Mixed from 1 to 1023 (power law), unaligned: 792.86 MB/s
Mixed from 1 to 1023 (power law), unaligned: 796.20 MB/s
Mixed from 1 to 1023 (power law), unaligned: 796.68 MB/s
-------------- next part --------------
libc memcpy:
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 614.78 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 618.39 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 614.90 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 618.16 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 614.83 MB/s
kernel memcpy (original):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 654.11 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 650.60 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 653.49 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 653.81 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 649.56 MB/s
kernel memcpy (optimized):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 653.09 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 650.86 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 653.72 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 650.74 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 653.71 MB/s
libc memcpy:
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 332.22 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 333.86 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 332.22 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 333.86 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 333.77 MB/s
kernel memcpy (original):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 365.63 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 365.65 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 363.96 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 365.63 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 363.95 MB/s
kernel memcpy (optimized):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 403.08 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 401.21 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 403.06 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 401.23 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 403.02 MB/s
libc memcpy:
Mixed multiples of 4 from 4 to 130, word aligned: 293.84 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 293.87 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 293.79 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 292.46 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 293.78 MB/s
kernel memcpy (original):
Mixed multiples of 4 from 4 to 130, word aligned: 312.63 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 314.11 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 312.64 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 314.05 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 312.63 MB/s
kernel memcpy (optimized):
Mixed multiples of 4 from 4 to 130, word aligned: 347.08 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 345.40 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 347.01 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 347.06 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 347.05 MB/s
kernel copy_from_user (optimized):
Mixed multiples of 4 from 4 to 130, word aligned: 338.99 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 337.42 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 338.96 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 337.42 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 339.07 MB/s
kernel copy_to_user (optimized):
Mixed multiples of 4 from 4 to 130, word aligned: 336.61 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 338.16 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 336.61 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 338.21 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 336.58 MB/s
libc memcpy:
4096 bytes page aligned: 358.08 MB/s
4096 bytes page aligned: 356.32 MB/s
4096 bytes page aligned: 358.07 MB/s
4096 bytes page aligned: 356.39 MB/s
4096 bytes page aligned: 358.08 MB/s
kernel memcpy (original):
4096 bytes page aligned: 356.76 MB/s
4096 bytes page aligned: 358.47 MB/s
4096 bytes page aligned: 356.76 MB/s
4096 bytes page aligned: 358.47 MB/s
4096 bytes page aligned: 356.86 MB/s
kernel memcpy (optimized):
4096 bytes page aligned: 342.33 MB/s
4096 bytes page aligned: 340.66 MB/s
4096 bytes page aligned: 342.32 MB/s
4096 bytes page aligned: 340.70 MB/s
4096 bytes page aligned: 342.31 MB/s
kernel copy_page (original):
4096 bytes page aligned: 381.93 MB/s
4096 bytes page aligned: 383.87 MB/s
4096 bytes page aligned: 381.97 MB/s
4096 bytes page aligned: 383.86 MB/s
4096 bytes page aligned: 381.98 MB/s
kernel copy_page (optimized):
4096 bytes page aligned: 341.86 MB/s
4096 bytes page aligned: 341.83 MB/s
4096 bytes page aligned: 341.86 MB/s
4096 bytes page aligned: 341.80 MB/s
4096 bytes page aligned: 341.85 MB/s
libc memcpy:
Mixed from 1 to 1023 (power law), unaligned: 484.57 MB/s
Mixed from 1 to 1023 (power law), unaligned: 482.42 MB/s
Mixed from 1 to 1023 (power law), unaligned: 484.45 MB/s
Mixed from 1 to 1023 (power law), unaligned: 482.49 MB/s
Mixed from 1 to 1023 (power law), unaligned: 484.27 MB/s
kernel memcpy (original):
Mixed from 1 to 1023 (power law), unaligned: 503.45 MB/s
Mixed from 1 to 1023 (power law), unaligned: 505.11 MB/s
Mixed from 1 to 1023 (power law), unaligned: 502.65 MB/s
Mixed from 1 to 1023 (power law), unaligned: 505.09 MB/s
Mixed from 1 to 1023 (power law), unaligned: 502.69 MB/s
kernel memcpy (optimized):
Mixed from 1 to 1023 (power law), unaligned: 490.07 MB/s
Mixed from 1 to 1023 (power law), unaligned: 490.26 MB/s
Mixed from 1 to 1023 (power law), unaligned: 486.98 MB/s
Mixed from 1 to 1023 (power law), unaligned: 489.95 MB/s
Mixed from 1 to 1023 (power law), unaligned: 487.95 MB/s
libc memset:
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 844.51 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 840.39 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 844.37 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 840.68 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 844.55 MB/s
kernel memset (original):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 886.05 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 890.19 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 890.11 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 885.76 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 889.84 MB/s
kernel memset (optimized):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 930.57 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 934.93 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 930.50 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 934.75 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 930.35 MB/s
kernel memzero (original):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 860.46 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 860.40 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 860.34 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 860.40 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 856.31 MB/s
kernel memzero (optimized):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 881.67 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 877.42 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 881.60 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 877.48 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 881.70 MB/s
libc memset:
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 496.66 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 499.04 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 498.98 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 496.62 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 498.96 MB/s
kernel memset (original):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 551.78 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 554.33 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 551.63 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 554.13 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 551.60 MB/s
kernel memset (optimized):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 601.07 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 597.87 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 601.06 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 601.08 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 598.38 MB/s
kernel memzero (original):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 525.40 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 522.99 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 525.42 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 522.74 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 525.28 MB/s
kernel memzero (optimized):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 556.46 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 559.02 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 559.16 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 559.00 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 559.13 MB/s
libc memset:
4096 bytes page aligned: 2029.13 MB/s
4096 bytes page aligned: 2038.87 MB/s
4096 bytes page aligned: 2029.11 MB/s
4096 bytes page aligned: 2038.82 MB/s
4096 bytes page aligned: 2028.82 MB/s
kernel memset (original):
4096 bytes page aligned: 1918.99 MB/s
4096 bytes page aligned: 1909.79 MB/s
4096 bytes page aligned: 1919.03 MB/s
4096 bytes page aligned: 1918.82 MB/s
4096 bytes page aligned: 1918.96 MB/s
kernel memset (optimized):
4096 bytes page aligned: 1920.02 MB/s
4096 bytes page aligned: 1910.71 MB/s
4096 bytes page aligned: 1920.03 MB/s
4096 bytes page aligned: 1910.58 MB/s
4096 bytes page aligned: 1919.89 MB/s
kernel memzero (original):
4096 bytes page aligned: 1885.37 MB/s
4096 bytes page aligned: 1894.53 MB/s
4096 bytes page aligned: 1885.11 MB/s
4096 bytes page aligned: 1894.52 MB/s
4096 bytes page aligned: 1894.52 MB/s
kernel memzero (optimized):
4096 bytes page aligned: 1895.10 MB/s
4096 bytes page aligned: 1894.72 MB/s
4096 bytes page aligned: 1885.82 MB/s
4096 bytes page aligned: 1895.08 MB/s
4096 bytes page aligned: 1885.86 MB/s
libc memset:
Mixed from 1 to 1023 (power law), unaligned: 737.90 MB/s
Mixed from 1 to 1023 (power law), unaligned: 734.13 MB/s
Mixed from 1 to 1023 (power law), unaligned: 737.61 MB/s
Mixed from 1 to 1023 (power law), unaligned: 734.18 MB/s
Mixed from 1 to 1023 (power law), unaligned: 737.53 MB/s
kernel memset (original):
Mixed from 1 to 1023 (power law), unaligned: 786.00 MB/s
Mixed from 1 to 1023 (power law), unaligned: 786.00 MB/s
Mixed from 1 to 1023 (power law), unaligned: 785.98 MB/s
Mixed from 1 to 1023 (power law), unaligned: 782.09 MB/s
Mixed from 1 to 1023 (power law), unaligned: 785.96 MB/s
kernel memset (optimized):
Mixed from 1 to 1023 (power law), unaligned: 813.68 MB/s
Mixed from 1 to 1023 (power law), unaligned: 817.65 MB/s
Mixed from 1 to 1023 (power law), unaligned: 813.22 MB/s
Mixed from 1 to 1023 (power law), unaligned: 817.10 MB/s
Mixed from 1 to 1023 (power law), unaligned: 813.94 MB/s
kernel memzero (original):
Mixed from 1 to 1023 (power law), unaligned: 746.57 MB/s
Mixed from 1 to 1023 (power law), unaligned: 746.77 MB/s
Mixed from 1 to 1023 (power law), unaligned: 742.82 MB/s
Mixed from 1 to 1023 (power law), unaligned: 746.56 MB/s
Mixed from 1 to 1023 (power law), unaligned: 743.25 MB/s
kernel memzero (optimized):
Mixed from 1 to 1023 (power law), unaligned: 785.01 MB/s
Mixed from 1 to 1023 (power law), unaligned: 781.21 MB/s
Mixed from 1 to 1023 (power law), unaligned: 785.10 MB/s
Mixed from 1 to 1023 (power law), unaligned: 781.19 MB/s
Mixed from 1 to 1023 (power law), unaligned: 784.99 MB/s
-------------- next part --------------
libc memcpy:
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 944.06 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 939.55 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 936.32 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 938.91 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 935.52 MB/s
kernel memcpy (original):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 921.58 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 918.61 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 915.82 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 915.27 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 911.62 MB/s
kernel memcpy (optimized):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 908.06 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 905.13 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 907.52 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 906.64 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 907.89 MB/s
libc memcpy:
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 547.23 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 547.29 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 546.17 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 547.24 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 547.50 MB/s
kernel memcpy (original):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 541.90 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 541.91 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 541.93 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 542.91 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 541.95 MB/s
kernel memcpy (optimized):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 615.08 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 614.48 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 615.11 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 615.07 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 614.90 MB/s
libc memcpy:
Mixed multiples of 4 from 4 to 130, word aligned: 459.28 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 459.87 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 459.40 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 459.62 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 459.40 MB/s
kernel memcpy (original):
Mixed multiples of 4 from 4 to 130, word aligned: 457.91 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 458.35 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 457.98 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 458.22 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 457.85 MB/s
kernel memcpy (optimized):
Mixed multiples of 4 from 4 to 130, word aligned: 545.62 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 544.90 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 545.52 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 545.42 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 545.54 MB/s
kernel copy_from_user (optimized):
Mixed multiples of 4 from 4 to 130, word aligned: 485.72 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 484.69 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 484.78 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 485.02 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 485.64 MB/s
kernel copy_to_user (optimized):
Mixed multiples of 4 from 4 to 130, word aligned: 489.08 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 491.05 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 492.40 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 493.27 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 491.08 MB/s
libc memcpy:
4096 bytes page aligned: 1027.53 MB/s
4096 bytes page aligned: 1020.33 MB/s
4096 bytes page aligned: 1026.20 MB/s
4096 bytes page aligned: 1025.76 MB/s
4096 bytes page aligned: 1024.70 MB/s
kernel memcpy (original):
4096 bytes page aligned: 1026.80 MB/s
4096 bytes page aligned: 1027.25 MB/s
4096 bytes page aligned: 1026.46 MB/s
4096 bytes page aligned: 1020.09 MB/s
4096 bytes page aligned: 1027.83 MB/s
kernel memcpy (optimized):
4096 bytes page aligned: 841.49 MB/s
4096 bytes page aligned: 847.07 MB/s
4096 bytes page aligned: 840.32 MB/s
4096 bytes page aligned: 847.07 MB/s
4096 bytes page aligned: 841.32 MB/s
kernel copy_page (original):
4096 bytes page aligned: 948.27 MB/s
4096 bytes page aligned: 940.34 MB/s
4096 bytes page aligned: 946.30 MB/s
4096 bytes page aligned: 942.02 MB/s
4096 bytes page aligned: 948.32 MB/s
kernel copy_page (optimized):
4096 bytes page aligned: 850.59 MB/s
4096 bytes page aligned: 857.73 MB/s
4096 bytes page aligned: 851.24 MB/s
4096 bytes page aligned: 858.75 MB/s
4096 bytes page aligned: 851.73 MB/s
libc memcpy:
Mixed from 1 to 1023 (power law), unaligned: 715.47 MB/s
Mixed from 1 to 1023 (power law), unaligned: 714.09 MB/s
Mixed from 1 to 1023 (power law), unaligned: 715.65 MB/s
Mixed from 1 to 1023 (power law), unaligned: 714.83 MB/s
Mixed from 1 to 1023 (power law), unaligned: 712.47 MB/s
kernel memcpy (original):
Mixed from 1 to 1023 (power law), unaligned: 721.70 MB/s
Mixed from 1 to 1023 (power law), unaligned: 719.15 MB/s
Mixed from 1 to 1023 (power law), unaligned: 721.34 MB/s
Mixed from 1 to 1023 (power law), unaligned: 718.81 MB/s
Mixed from 1 to 1023 (power law), unaligned: 721.02 MB/s
kernel memcpy (optimized):
Mixed from 1 to 1023 (power law), unaligned: 635.79 MB/s
Mixed from 1 to 1023 (power law), unaligned: 636.97 MB/s
Mixed from 1 to 1023 (power law), unaligned: 635.52 MB/s
Mixed from 1 to 1023 (power law), unaligned: 636.23 MB/s
Mixed from 1 to 1023 (power law), unaligned: 636.05 MB/s
libc memset:
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1323.49 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1326.82 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1348.12 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1328.57 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1324.56 MB/s
kernel memset (original):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1786.48 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1782.46 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1776.21 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1745.68 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1771.53 MB/s
kernel memset (optimized):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1770.77 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1759.21 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1721.21 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1782.98 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1762.74 MB/s
kernel memzero (original):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1745.20 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1763.23 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1743.48 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1766.37 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1728.34 MB/s
kernel memzero (optimized):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1682.73 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1660.62 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1695.76 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1703.42 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1766.86 MB/s
libc memset:
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 901.11 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 901.81 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 889.89 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 886.94 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 899.02 MB/s
kernel memset (original):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1142.87 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1145.74 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1141.91 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1142.41 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1143.23 MB/s
kernel memset (optimized):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1129.60 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1132.20 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1131.63 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1131.37 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1128.10 MB/s
kernel memzero (original):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1110.96 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1105.10 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1106.56 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1107.89 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1105.29 MB/s
kernel memzero (optimized):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1081.12 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1086.37 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1086.06 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1086.13 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1085.48 MB/s
libc memset:
4096 bytes page aligned: 1371.96 MB/s
4096 bytes page aligned: 1362.53 MB/s
4096 bytes page aligned: 1383.10 MB/s
4096 bytes page aligned: 1356.89 MB/s
4096 bytes page aligned: 1367.61 MB/s
kernel memset (original):
4096 bytes page aligned: 1321.56 MB/s
4096 bytes page aligned: 1337.12 MB/s
4096 bytes page aligned: 1318.98 MB/s
4096 bytes page aligned: 1330.80 MB/s
4096 bytes page aligned: 1324.66 MB/s
kernel memset (optimized):
4096 bytes page aligned: 1317.07 MB/s
4096 bytes page aligned: 1305.07 MB/s
4096 bytes page aligned: 1311.78 MB/s
4096 bytes page aligned: 1301.32 MB/s
4096 bytes page aligned: 1305.47 MB/s
kernel memzero (original):
4096 bytes page aligned: 1320.70 MB/s
4096 bytes page aligned: 1317.15 MB/s
4096 bytes page aligned: 1380.78 MB/s
4096 bytes page aligned: 1316.34 MB/s
4096 bytes page aligned: 1363.25 MB/s
kernel memzero (optimized):
4096 bytes page aligned: 1302.89 MB/s
4096 bytes page aligned: 1349.68 MB/s
4096 bytes page aligned: 1305.33 MB/s
4096 bytes page aligned: 1338.91 MB/s
4096 bytes page aligned: 1304.71 MB/s
libc memset:
Mixed from 1 to 1023 (power law), unaligned: 1296.85 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1281.93 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1284.15 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1303.82 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1289.72 MB/s
kernel memset (original):
Mixed from 1 to 1023 (power law), unaligned: 1635.98 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1631.05 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1630.50 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1629.33 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1640.34 MB/s
kernel memset (optimized):
Mixed from 1 to 1023 (power law), unaligned: 1674.27 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1661.84 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1670.77 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1656.26 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1664.30 MB/s
kernel memzero (original):
Mixed from 1 to 1023 (power law), unaligned: 1583.12 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1576.78 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1579.13 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1571.27 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1554.87 MB/s
kernel memzero (optimized):
Mixed from 1 to 1023 (power law), unaligned: 1613.16 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1624.66 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1613.26 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1624.16 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1611.64 MB/s
-------------- next part --------------
libc memcpy:
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 938.28 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 938.13 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 938.22 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 937.87 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 938.26 MB/s
kernel memcpy (original):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 992.48 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 992.77 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 992.53 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 992.82 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 992.45 MB/s
kernel memcpy (optimized):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 869.57 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 870.32 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 869.57 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 870.32 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 869.65 MB/s
libc memcpy:
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 506.25 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 506.18 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 506.17 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 506.16 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 506.19 MB/s
kernel memcpy (original):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 542.36 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 542.08 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 541.74 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 542.09 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 542.71 MB/s
kernel memcpy (optimized):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 568.31 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 567.96 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 567.96 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 567.81 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 567.88 MB/s
libc memcpy:
Mixed multiples of 4 from 4 to 130, word aligned: 425.27 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 425.41 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 425.29 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 426.54 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 425.58 MB/s
kernel memcpy (original):
Mixed multiples of 4 from 4 to 130, word aligned: 458.17 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 458.13 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 458.73 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 458.32 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 458.95 MB/s
kernel memcpy (optimized):
Mixed multiples of 4 from 4 to 130, word aligned: 503.75 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 503.23 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 503.38 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 502.87 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 503.40 MB/s
kernel copy_from_user (optimized):
Mixed multiples of 4 from 4 to 130, word aligned: 486.47 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 485.02 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 485.65 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 485.20 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 485.11 MB/s
kernel copy_to_user (optimized):
Mixed multiples of 4 from 4 to 130, word aligned: 456.43 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 455.72 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 455.60 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 455.58 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 456.06 MB/s
libc memcpy:
4096 bytes page aligned: 2733.85 MB/s
4096 bytes page aligned: 2734.82 MB/s
4096 bytes page aligned: 2735.47 MB/s
4096 bytes page aligned: 2733.74 MB/s
4096 bytes page aligned: 2735.10 MB/s
kernel memcpy (original):
4096 bytes page aligned: 2763.15 MB/s
4096 bytes page aligned: 2764.57 MB/s
4096 bytes page aligned: 2762.87 MB/s
4096 bytes page aligned: 2764.31 MB/s
4096 bytes page aligned: 2763.97 MB/s
kernel memcpy (optimized):
4096 bytes page aligned: 2021.61 MB/s
4096 bytes page aligned: 2022.85 MB/s
4096 bytes page aligned: 2021.30 MB/s
4096 bytes page aligned: 2022.75 MB/s
4096 bytes page aligned: 2021.18 MB/s
kernel copy_page (original):
4096 bytes page aligned: 1536.64 MB/s
4096 bytes page aligned: 1536.07 MB/s
4096 bytes page aligned: 1536.62 MB/s
4096 bytes page aligned: 1536.44 MB/s
4096 bytes page aligned: 1536.04 MB/s
kernel copy_page (optimized):
4096 bytes page aligned: 2029.46 MB/s
4096 bytes page aligned: 2028.46 MB/s
4096 bytes page aligned: 2029.26 MB/s
4096 bytes page aligned: 2028.49 MB/s
4096 bytes page aligned: 2029.51 MB/s
libc memcpy:
Mixed from 1 to 1023 (power law), unaligned: 677.42 MB/s
Mixed from 1 to 1023 (power law), unaligned: 677.45 MB/s
Mixed from 1 to 1023 (power law), unaligned: 677.43 MB/s
Mixed from 1 to 1023 (power law), unaligned: 677.49 MB/s
Mixed from 1 to 1023 (power law), unaligned: 677.55 MB/s
kernel memcpy (original):
Mixed from 1 to 1023 (power law), unaligned: 705.91 MB/s
Mixed from 1 to 1023 (power law), unaligned: 705.96 MB/s
Mixed from 1 to 1023 (power law), unaligned: 706.14 MB/s
Mixed from 1 to 1023 (power law), unaligned: 706.18 MB/s
Mixed from 1 to 1023 (power law), unaligned: 706.32 MB/s
kernel memcpy (optimized):
Mixed from 1 to 1023 (power law), unaligned: 671.04 MB/s
Mixed from 1 to 1023 (power law), unaligned: 671.49 MB/s
Mixed from 1 to 1023 (power law), unaligned: 671.19 MB/s
Mixed from 1 to 1023 (power law), unaligned: 671.87 MB/s
Mixed from 1 to 1023 (power law), unaligned: 671.50 MB/s
libc memset:
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1288.97 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1288.99 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1288.74 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1288.95 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1288.51 MB/s
kernel memset (original):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1698.82 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1695.12 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1695.28 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1699.55 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1698.91 MB/s
kernel memset (optimized):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1826.35 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1826.33 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1833.66 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1833.25 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1834.97 MB/s
kernel memzero (original):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1608.61 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1603.63 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1606.36 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1608.51 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1607.49 MB/s
kernel memzero (optimized):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1654.00 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1653.34 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1653.09 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1647.16 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1653.98 MB/s
libc memset:
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 779.98 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 780.05 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 779.98 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 780.09 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 779.82 MB/s
kernel memset (original):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 971.07 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 969.65 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 969.63 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 969.63 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 969.45 MB/s
kernel memset (optimized):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1166.68 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1166.31 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1166.68 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1166.41 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1166.45 MB/s
kernel memzero (original):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 915.94 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 915.88 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 916.08 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 915.77 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 915.94 MB/s
kernel memzero (optimized):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 980.79 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 981.17 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 981.46 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 981.44 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 981.17 MB/s
libc memset:
4096 bytes page aligned: 2808.48 MB/s
4096 bytes page aligned: 2809.23 MB/s
4096 bytes page aligned: 2809.10 MB/s
4096 bytes page aligned: 2808.32 MB/s
4096 bytes page aligned: 2808.85 MB/s
kernel memset (original):
4096 bytes page aligned: 4285.77 MB/s
4096 bytes page aligned: 4286.95 MB/s
4096 bytes page aligned: 4285.80 MB/s
4096 bytes page aligned: 4287.03 MB/s
4096 bytes page aligned: 4286.30 MB/s
kernel memset (optimized):
4096 bytes page aligned: 4332.88 MB/s
4096 bytes page aligned: 4333.13 MB/s
4096 bytes page aligned: 4332.22 MB/s
4096 bytes page aligned: 4333.00 MB/s
4096 bytes page aligned: 4331.64 MB/s
kernel memzero (original):
4096 bytes page aligned: 4286.68 MB/s
4096 bytes page aligned: 4286.68 MB/s
4096 bytes page aligned: 4286.96 MB/s
4096 bytes page aligned: 4286.31 MB/s
4096 bytes page aligned: 4285.41 MB/s
kernel memzero (optimized):
4096 bytes page aligned: 4307.47 MB/s
4096 bytes page aligned: 4306.33 MB/s
4096 bytes page aligned: 4307.97 MB/s
4096 bytes page aligned: 4305.94 MB/s
4096 bytes page aligned: 4307.61 MB/s
libc memset:
Mixed from 1 to 1023 (power law), unaligned: 1150.12 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1149.80 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1150.06 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1149.76 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1149.91 MB/s
kernel memset (original):
Mixed from 1 to 1023 (power law), unaligned: 1482.23 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1483.26 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1483.42 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1482.48 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1483.19 MB/s
kernel memset (optimized):
Mixed from 1 to 1023 (power law), unaligned: 1683.39 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1680.19 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1681.58 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1680.15 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1680.06 MB/s
kernel memzero (original):
Mixed from 1 to 1023 (power law), unaligned: 1357.13 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1357.31 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1356.41 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1357.16 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1356.60 MB/s
kernel memzero (optimized):
Mixed from 1 to 1023 (power law), unaligned: 1469.08 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1470.31 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1469.47 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1468.80 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1469.37 MB/s
-------------- next part --------------
libc memcpy:
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 869.54 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 869.27 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 869.78 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 869.52 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 869.50 MB/s
kernel memcpy (original):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 954.22 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 954.17 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 954.16 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 954.08 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 954.19 MB/s
kernel memcpy (optimized):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 852.17 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 852.53 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 852.37 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 852.44 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 852.45 MB/s
libc memcpy:
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 455.51 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 457.69 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 455.01 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 455.30 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 455.68 MB/s
kernel memcpy (original):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 512.36 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 512.02 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 512.47 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 512.47 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 512.66 MB/s
kernel memcpy (optimized):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 538.32 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 537.83 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 538.36 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 538.29 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 539.25 MB/s
libc memcpy:
Mixed multiples of 4 from 4 to 130, word aligned: 392.90 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 388.25 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 388.67 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 392.51 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 392.09 MB/s
kernel memcpy (original):
Mixed multiples of 4 from 4 to 130, word aligned: 433.21 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 433.73 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 433.34 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 433.91 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 433.43 MB/s
kernel memcpy (optimized):
Mixed multiples of 4 from 4 to 130, word aligned: 474.10 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 474.06 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 474.29 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 474.10 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 473.95 MB/s
kernel copy_from_user (optimized):
Mixed multiples of 4 from 4 to 130, word aligned: 455.22 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 455.10 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 454.55 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 454.71 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 454.86 MB/s
kernel copy_to_user (optimized):
Mixed multiples of 4 from 4 to 130, word aligned: 429.08 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 429.08 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 429.42 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 429.12 MB/s
Mixed multiples of 4 from 4 to 130, word aligned: 429.59 MB/s
libc memcpy:
4096 bytes page aligned: 2698.97 MB/s
4096 bytes page aligned: 2703.85 MB/s
4096 bytes page aligned: 2706.42 MB/s
4096 bytes page aligned: 2701.26 MB/s
4096 bytes page aligned: 2699.65 MB/s
kernel memcpy (original):
4096 bytes page aligned: 2735.92 MB/s
4096 bytes page aligned: 2735.76 MB/s
4096 bytes page aligned: 2739.53 MB/s
4096 bytes page aligned: 2737.95 MB/s
4096 bytes page aligned: 2735.23 MB/s
kernel memcpy (optimized):
4096 bytes page aligned: 2016.76 MB/s
4096 bytes page aligned: 2015.85 MB/s
4096 bytes page aligned: 2016.87 MB/s
4096 bytes page aligned: 2015.99 MB/s
4096 bytes page aligned: 2018.49 MB/s
kernel copy_page (original):
4096 bytes page aligned: 1533.05 MB/s
4096 bytes page aligned: 1533.36 MB/s
4096 bytes page aligned: 1533.81 MB/s
4096 bytes page aligned: 1533.62 MB/s
4096 bytes page aligned: 1533.05 MB/s
kernel copy_page (optimized):
4096 bytes page aligned: 2016.48 MB/s
4096 bytes page aligned: 2019.79 MB/s
4096 bytes page aligned: 2016.49 MB/s
4096 bytes page aligned: 2017.68 MB/s
4096 bytes page aligned: 2018.23 MB/s
libc memcpy:
Mixed from 1 to 1023 (power law), unaligned: 640.12 MB/s
Mixed from 1 to 1023 (power law), unaligned: 640.23 MB/s
Mixed from 1 to 1023 (power law), unaligned: 640.13 MB/s
Mixed from 1 to 1023 (power law), unaligned: 640.34 MB/s
Mixed from 1 to 1023 (power law), unaligned: 640.36 MB/s
kernel memcpy (original):
Mixed from 1 to 1023 (power law), unaligned: 681.11 MB/s
Mixed from 1 to 1023 (power law), unaligned: 680.79 MB/s
Mixed from 1 to 1023 (power law), unaligned: 681.19 MB/s
Mixed from 1 to 1023 (power law), unaligned: 680.93 MB/s
Mixed from 1 to 1023 (power law), unaligned: 681.05 MB/s
kernel memcpy (optimized):
Mixed from 1 to 1023 (power law), unaligned: 645.50 MB/s
Mixed from 1 to 1023 (power law), unaligned: 644.98 MB/s
Mixed from 1 to 1023 (power law), unaligned: 645.10 MB/s
Mixed from 1 to 1023 (power law), unaligned: 644.91 MB/s
Mixed from 1 to 1023 (power law), unaligned: 645.03 MB/s
libc memset:
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1246.47 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1246.77 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1246.49 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1246.87 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1246.58 MB/s
kernel memset (original):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1609.02 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1612.50 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1612.66 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1614.68 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1609.93 MB/s
kernel memset (optimized):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1744.85 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1747.18 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1748.65 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1745.03 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1745.42 MB/s
kernel memzero (original):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1509.51 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1510.41 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1509.70 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1508.00 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1508.73 MB/s
kernel memzero (optimized):
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1615.44 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1617.76 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1612.05 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1616.54 MB/s
Mixed powers of 2 from 4 to 4096 (power law), word aligned: 1610.91 MB/s
libc memset:
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 735.51 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 735.65 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 735.62 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 735.75 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 735.83 MB/s
kernel memset (original):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 884.22 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 884.39 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 884.11 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 885.90 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 884.09 MB/s
kernel memset (optimized):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1025.79 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1025.70 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1025.98 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1025.56 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 1025.59 MB/s
kernel memzero (original):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 831.09 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 830.34 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 830.77 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 830.50 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 830.64 MB/s
kernel memzero (optimized):
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 919.83 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 920.16 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 919.50 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 919.75 MB/s
Mixed multiples of 4 from 4 to 1024 (power law), word aligned: 920.02 MB/s
libc memset:
4096 bytes page aligned: 2789.85 MB/s
4096 bytes page aligned: 2790.47 MB/s
4096 bytes page aligned: 2789.64 MB/s
4096 bytes page aligned: 2790.60 MB/s
4096 bytes page aligned: 2789.42 MB/s
kernel memset (original):
4096 bytes page aligned: 4292.31 MB/s
4096 bytes page aligned: 4292.19 MB/s
4096 bytes page aligned: 4291.39 MB/s
4096 bytes page aligned: 4291.91 MB/s
4096 bytes page aligned: 4291.29 MB/s
kernel memset (optimized):
4096 bytes page aligned: 4321.51 MB/s
4096 bytes page aligned: 4319.98 MB/s
4096 bytes page aligned: 4321.53 MB/s
4096 bytes page aligned: 4319.93 MB/s
4096 bytes page aligned: 4321.46 MB/s
kernel memzero (original):
4096 bytes page aligned: 4243.19 MB/s
4096 bytes page aligned: 4242.35 MB/s
4096 bytes page aligned: 4243.32 MB/s
4096 bytes page aligned: 4242.29 MB/s
4096 bytes page aligned: 4243.34 MB/s
kernel memzero (optimized):
4096 bytes page aligned: 4261.67 MB/s
4096 bytes page aligned: 4262.59 MB/s
4096 bytes page aligned: 4262.13 MB/s
4096 bytes page aligned: 4262.75 MB/s
4096 bytes page aligned: 4262.62 MB/s
libc memset:
Mixed from 1 to 1023 (power law), unaligned: 1084.53 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1084.89 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1084.61 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1084.71 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1084.43 MB/s
kernel memset (original):
Mixed from 1 to 1023 (power law), unaligned: 1364.45 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1363.67 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1364.87 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1364.47 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1364.17 MB/s
kernel memset (optimized):
Mixed from 1 to 1023 (power law), unaligned: 1508.02 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1510.44 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1508.57 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1508.86 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1510.14 MB/s
kernel memzero (original):
Mixed from 1 to 1023 (power law), unaligned: 1261.52 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1261.24 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1262.57 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1260.26 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1261.35 MB/s
kernel memzero (optimized):
Mixed from 1 to 1023 (power law), unaligned: 1412.76 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1412.17 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1413.32 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1412.77 MB/s
Mixed from 1 to 1023 (power law), unaligned: 1413.13 MB/s