The gemm kernel does not use swizzled shared memory layout.
haruhi55 opened this issue · comments
TiledCUDA/include/cell/traits/gemm.hpp
Lines 35 to 39 in e375ddc
The GEMM kernel does not utilize swizzled shared memory.
TiledCUDA is a highly efficient kernel template library designed to elevate CUDA C’s level of abstraction for processing tiles.
haruhi55 opened this issue · comments
TiledCUDA/include/cell/traits/gemm.hpp
Lines 35 to 39 in e375ddc
The GEMM kernel does not utilize swizzled shared memory.