DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Repository from Github https://github.comdeepseek-ai/DeepGEMMRepository from Github https://github.comdeepseek-ai/DeepGEMM
youzjuer opened this issue 3 months ago · comments
I just wanna change A and B from fp8 to fp32,but I found it will occur an error in tma desc as:
why
DeepGEMM does not support fp32 input.