TiledCUDA is a highly efficient kernel template library designed to elevate CUDA C’s level of abstraction for processing tiles.
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool
haruhi55 opened this issue 2 months ago · comments
Simplify configuration of copy plan for row-major and column-major shared memory tiles.