_mm_movemask_epi8 regression
jacksonrnewhouse opened this issue · comments
Jackson Newhouse commented
The aarch64 code path for _mm_movemask_epi8 introduced in #50 looks like a regression once it is actually compiled. The default implementation compiles to 7 instructions with no constants, while the "fast path" compiles to 14 instructions plus a constant. Should it be reverted?
fast path: https://godbolt.org/z/41s54d
default: https://godbolt.org/z/xsYfz8
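For anyone comparing the two listings, here is a rough scalar sketch of what either path has to produce: _mm_movemask_epi8 gathers the most significant bit of each of the 16 input bytes into a 16-bit mask, with byte 0 landing in bit 0. The helper name movemask_epi8_ref is hypothetical and is not part of the library; it is only meant as a reference to check both code paths against, not as a suggested implementation.

```c
#include <stdint.h>

/* Hypothetical scalar reference (not part of the library).
 * Bit i of the result is the most significant bit of bytes[i],
 * which is the documented behaviour of _mm_movemask_epi8. */
static int movemask_epi8_ref(const uint8_t bytes[16])
{
    int mask = 0;
    for (int i = 0; i < 16; i++) {
        /* bytes[i] >> 7 is 0 or 1; place it at bit position i. */
        mask |= (bytes[i] >> 7) << i;
    }
    return mask;
}
```

Running both the default and the "fast path" builds over the same inputs and comparing against this helper should confirm they agree before any timing numbers are compared.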
marktwtn commented
I'll run a simple timing experiment on _mm_movemask_epi8.
marktwtn commented