arch/86: AVX RAID5 xor checksumming v1

* arch/86: AVX RAID5 xor checksumming v1
@ 2012-04-18 22:58 Jim Kukunas
  2012-04-18 22:58 ` [PATCH] raid5: add AVX optimized RAID5 checksumming Jim Kukunas
  2012-04-19  0:02 ` arch/86: AVX RAID5 xor checksumming v1 NeilBrown
  0 siblings, 2 replies; 4+ messages in thread
From: Jim Kukunas @ 2012-04-18 22:58 UTC (permalink / raw)
  To: neilb; +Cc: hpa, linux-kernel, linux-raid

Hi Folks,

The following patch adds an AVX implementation of the RAID5 xor checksumming
functions.

This version differs from version 0, in that it:
	0) checks whether assembler supports AVX
	1) replaces __attribute__(aligned(32)) with __aligned(32)
	2) adds do/while(0); to macro blocks

Based on xor_speed, the AVX implementation appears to be ~32% faster than the
SSE implementation on my i7 2600:

        generic_sse: 15088.000 MB/sec
	avx: 19936.000 MB/sec

Thanks.

^ permalink raw reply	[flat|nested] 4+ messages in thread