TiledTensor / TiledCUDA

TiledCUDA is a highly efficient kernel template library designed to elevate CUDA C’s level of abstraction for processing tiles.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Rename `SharedTileIterator` to `TileIterator` to reduce redundancy

haruhi55 opened this issue · comments

TileIterator works for both global memory tiles and shared memory tiles. There is no significant difference between these two, as both are contiguous memory compared to distributed thread-local register files.

Therefore, shall we rename SharedTileIterator to TileIterator to simplify the concept?

Agreed, we can rename SharedTileIterator to TileIterator so that it can be used for both Global and Shared memory.

Let me make this change.