Re: Where exactly will arch_fast_hash be used

* Re: Where exactly will arch_fast_hash be used
@ 2014-12-07  5:20 George Spelvin
  2014-12-07  9:28 ` Herbert Xu
  2014-12-07 13:14 ` Hannes Frederic Sowa
  0 siblings, 2 replies; 26+ messages in thread
From: George Spelvin @ 2014-12-07  5:20 UTC (permalink / raw)
  To: herbert; +Cc: dborkman, hannes, linux, linux-kernel, netdev, tgraf

If you want DoS-resistant hash tables, I'm working on adding SipHash
to the kernel.

This is a keyed pseudo-random function designed specifically for that
application.  I am starting with ext4 directory hashes, and then intended
to expand to secure sequence numbers (since it's far faster than MD5).

(I'm trying to figure out a good interface, since the crypto API
is a bit heavy for something to heavily optimized.)

But one comment caught my eye:
> Even if security wasn't an issue, straight CRC32 has really poor
> lower-order bit distribution, which makes it a terrible choice for
> a hash table that simply uses the lower-order bits.

Er... huh?  That's the first time I've heard that claim, and while I'm not
Philip Koopman or Guy Castagnoli, I thought I understood CRCs pretty well.

CRCs generally mix bits pretty well.  The sparse 16-bit CRCs chosen
for implementation simplicity had some limitations, but the Castagnoli
polynomial is quite dense.

And their mathematical symmetry means that the low bits really shouldn't
be any different from any other bits.  But if it is an issue, it's just
as easy work to shift down the correct number of high bits rather than
using the low.

Can you point me to a source for that statement?

^ permalink raw reply	[flat|nested] 26+ messages in thread