klauspost / reedsolomon

Reed-Solomon Erasure Coding in Go

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Optimization for power

ghatwala opened this issue · comments

Hi @klauspost ,

I have been trying to understand the process for creating optimized go code for power(ppc64le) in here "https://github.com/klauspost/reedsolomon" similar to the optimization done for amd(galois_amd64.go) and arm(galois_arm64.go).

My understanding is below for example on "amd64"
The routines that have been optimized for amd64 and are present in " galois_amd64.go" in klauspost repo.
o galMulSlice
o galMulSliceXor
o sliceXor
The un-optimized implementation of the above routines is in "galois_noasm.go " and this would be the one that is getting invoked for Power currently. ?

To implement the equivalent power assembly - which intel equivalent needs to be considered - AVX2 or SSSE3 ?

This has been fixed with #88

Indeed!