komihash 5.0

Question

komihash 5.0

avaneev opened this issue a year ago · comments

Hi!

Version 5.0.
Simplified handling of the "final byte", yielding 0.5 cycles/hash improvement.
Rearranged Loop64 memory addresses, yielding 0.5 GB/s large-block hashing improvement.
Note that the output values of the function changed.

https://github.com/avaneev/komihash

Reini Urban commented a year ago

done

Aleksey Vaneev commented a year ago

Thanks!

Reini Urban · Answer 1 · Tue Jun 13 2023 20:47:53 GMT+0800 (China Standard Time)

branch komihash5

Aleksey Vaneev · Answer 2 · Wed Jun 14 2023 17:52:36 GMT+0800 (China Standard Time)

updated to v5.1, very minor change

Aleksey Vaneev · Answer 3 · Thu Jun 15 2023 08:54:05 GMT+0800 (China Standard Time)

Please also update the code size, it may be lower now.

Aleksey Vaneev · Answer 4 · Wed Jun 21 2023 19:35:15 GMT+0800 (China Standard Time)

Bulk performance is still 12.3GB/s, for no technically understandable reason, and laughably close to the new polymur-hash. Should really be about 19 GB/s...

dumblob · Answer 5 · Wed Jun 21 2023 19:42:02 GMT+0800 (China Standard Time)

Bulk performance is still 12.3GB/s, for no technically understandable reason

Perhaps there is not enough research into this topic. I myself do not know of any reliable way how to predict performance (non-)contributions (e.g. in percent) of certain programming "tuples" of operations/instructions, patterns, and techniques.

Perhaps I should try some very fine-grained complexity metrics to see if there is any correlation.

Reini Urban · Answer 6 · Wed Jun 21 2023 20:21:44 GMT+0800 (China Standard Time)

actually the size is larger now

objdump -dC build/SMHasher |less
25830 - 25d5b: 1323

Aleksey Vaneev · Answer 7 · Wed Jun 21 2023 20:56:30 GMT+0800 (China Standard Time)

Here are komihash test results on a large variety of platforms: https://bench.cr.yp.to/results-hash.html
Reini's compiler is likely misconfigured.

Aleksey Vaneev · Answer 8 · Wed Dec 06 2023 01:41:24 GMT+0800 (China Standard Time)

I've found out that it's GCC which creates a much slow 64-byte hashing code on Zen platform. With Clang, or GCC on Intel platforms there are no issues. This does not affect small-string timings, though.

Aleksey Vaneev · Answer 9 · Thu Dec 07 2023 01:28:34 GMT+0800 (China Standard Time)

Strangely enough, the komihash_stream_oneshot() function does perform as expected with GCC on Zen. There's some issue with the compiler on this code.