An optimized CPU version of DGEMV with comparable/faster performance than Intel MKL.
SIMD, AVX512, OpenMP.
We require processors supporting AVX512F instructions to compile and run the code.
Highly optimized DGEMV on CPU with both serial and parallel performance better than MKL and OpenBLAS.
An optimized CPU version of DGEMV with comparable/faster performance than Intel MKL.
SIMD, AVX512, OpenMP.
We require processors supporting AVX512F instructions to compile and run the code.
Highly optimized DGEMV on CPU with both serial and parallel performance better than MKL and OpenBLAS.
GNU General Public License v3.0