huchinlp / Hands-on-GEMM

A tutorial on GEMM

Hands-on-GEMM

A GEMM tutorial by Yi Zhang/张译 at NLP Lab., Northeastern University.

Performance

SGEMM performance comparison

Usage

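In the src/cuda folder, find the GEMM kernel whose performance you want to measure and note its name. Then return to the project root, run mkdir build, and run make benchmark_xxx, where xxx is that name.

For example, to measure the performance of the matrix multiplication in double_buffer_yhs_refine_gemm.cu, run: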

make benchmark_double_buffer_yhs_refine

The resulting binary will appear in the bin folder.
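The naming rule can be sketched as a small shell helper. This is only an illustration: `gemm_target` is a hypothetical function, not part of the repository, and it assumes every kernel follows the `<name>_gemm.cu` to `benchmark_<name>` pattern seen in the example above.

```shell
# Hypothetical helper (not part of the repository): derive the make target
# from a kernel file name, ASSUMING the <name>_gemm.cu -> benchmark_<name>
# pattern from the example above holds for every kernel in src/cuda.
gemm_target() {
    file=$(basename "$1")                      # drop the directory part
    printf 'benchmark_%s\n' "${file%_gemm.cu}" # strip the _gemm.cu suffix
}

# Print the target for the kernel used in the example.
gemm_target src/cuda/double_buffer_yhs_refine_gemm.cu
# prints: benchmark_double_buffer_yhs_refine
```

From the project root, the printed target can then be passed to make after creating the build folder, for instance `mkdir build` followed by `make "$(gemm_target src/cuda/double_buffer_yhs_refine_gemm.cu)"`.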

Tutorial

Zhihu link: here

WeChat official account link: here

About

License: GNU General Public License v3.0

Languages

Cuda 95.9%, C++ 2.9%, Makefile 1.1%, C 0.0%