All of lore.kernel.org
 help / color / mirror / Atom feed
* arch/86: AVX RAID5 xor checksumming
@ 2012-04-10 17:22 Jim Kukunas
  2012-04-10 17:22 ` [PATCH] raid5: add AVX optimized RAID5 checksumming Jim Kukunas
  2012-04-10 17:37 ` arch/86: AVX RAID5 xor checksumming H. Peter Anvin
  0 siblings, 2 replies; 8+ messages in thread
From: Jim Kukunas @ 2012-04-10 17:22 UTC (permalink / raw)
  To: x86; +Cc: hpa, neilb, linux-kernel, mingo, tglx

Hi Folks,

The following patch adds an AVX implementation of the RAID5 xor checksumming
functions.

Based on xor_speed, the AVX implementation appears to be ~32% faster than the
SSE implementation on my i7 2600:

        generic_sse: 15088.000 MB/sec
	avx: 19936.000 MB/sec

Thanks.


^ permalink raw reply	[flat|nested] 8+ messages in thread
* arch/86: AVX RAID5 xor checksumming v1
@ 2012-04-18 22:58 Jim Kukunas
  2012-04-18 22:58 ` [PATCH] raid5: add AVX optimized RAID5 checksumming Jim Kukunas
  0 siblings, 1 reply; 8+ messages in thread
From: Jim Kukunas @ 2012-04-18 22:58 UTC (permalink / raw)
  To: neilb; +Cc: hpa, linux-kernel, linux-raid

Hi Folks,

The following patch adds an AVX implementation of the RAID5 xor checksumming
functions.

This version differs from version 0, in that it:
	0) checks whether assembler supports AVX
	1) replaces __attribute__(aligned(32)) with __aligned(32)
	2) adds do/while(0); to macro blocks

Based on xor_speed, the AVX implementation appears to be ~32% faster than the
SSE implementation on my i7 2600:

        generic_sse: 15088.000 MB/sec
	avx: 19936.000 MB/sec

Thanks.

^ permalink raw reply	[flat|nested] 8+ messages in thread
* AVX RAID5 xor checksumming
@ 2012-03-29 21:44 Jim Kukunas
  2012-03-29 21:44 ` [PATCH] raid5: add AVX optimized RAID5 checksumming Jim Kukunas
  0 siblings, 1 reply; 8+ messages in thread
From: Jim Kukunas @ 2012-03-29 21:44 UTC (permalink / raw)
  To: linux-raid; +Cc: hpa, neilb

Hi Folks,

The following patch adds an AVX implementation of the RAID5 xor checksumming
functions.

Based on xor_speed, the AVX implementation appears to be ~32% faster than the
SSE implementation on my i7 2600:

	generic_sse: 15088.000 MB/sec
	avx: 19936.000 MB/sec

Thanks.


^ permalink raw reply	[flat|nested] 8+ messages in thread

end of thread, other threads:[~2012-04-18 22:58 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2012-04-10 17:22 arch/86: AVX RAID5 xor checksumming Jim Kukunas
2012-04-10 17:22 ` [PATCH] raid5: add AVX optimized RAID5 checksumming Jim Kukunas
2012-04-10 17:37 ` arch/86: AVX RAID5 xor checksumming H. Peter Anvin
2012-04-10 20:14   ` NeilBrown
2012-04-10 21:20     ` H. Peter Anvin
2012-04-14 12:02       ` Ingo Molnar
  -- strict thread matches above, loose matches on Subject: below --
2012-04-18 22:58 arch/86: AVX RAID5 xor checksumming v1 Jim Kukunas
2012-04-18 22:58 ` [PATCH] raid5: add AVX optimized RAID5 checksumming Jim Kukunas
2012-03-29 21:44 AVX RAID5 xor checksumming Jim Kukunas
2012-03-29 21:44 ` [PATCH] raid5: add AVX optimized RAID5 checksumming Jim Kukunas

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.