Add a straightforward implementation for TileIterator

Question

Add a straightforward implementation for TileIterator

haruhi55 opened this issue a month ago · comments

By dividing a large tile into smaller sub-tiles, we create a 2D array of sub-tiles. The TileIterator provides a logical view of these sub-tiles, allowing us to use logical indices to iterate over them.

Given a shared memory tile has a row-major layout like this:

using Tile = SharedTile<float, RowMajor<4, 12>>;

// Suppose a `Tile` has contents like this:
0.0, 1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 9.0, 10.0, 11.0, 
12.0, 13.0, 14.0, 15.0, 16.0, 17.0, 18.0, 19.0, 20.0, 21.0, 22.0, 23.0, 
24.0, 25.0, 26.0, 27.0, 28.0, 29.0, 30.0, 31.0, 32.0, 33.0, 34.0, 35.0, 
36.0, 37.0, 38.0, 39.0, 40.0, 41.0, 42.0, 43.0, 44.0, 45.0, 46.0, 47.0,

Chunk a large Tile along two dimensions, and we get a grid of sub-tiles. Use the iterator to iterate over sub-tiles along the first dimension:

using Iterator = SharedTileIterator<Tile, TileShape<2, 4>>;

for (int i = 0; i < Iterator::sc0; ++i) {
    tiles(i, _).to_tile().dump_value();
}

Expected outputs are:

// Iteration-[0, _], the returned type is a `SharedTile`

0.0, 1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 9.0, 10.0, 11.0, 
12.0, 13.0, 14.0, 15.0, 16.0, 17.0, 18.0, 19.0, 20.0, 21.0, 22.0, 23.0, 

// Iteration-[1, _], the returned type is a `SharedTile`
24.0, 25.0, 26.0, 27.0, 28.0, 29.0, 30.0, 31.0, 32.0, 33.0, 34.0, 35.0, 
36.0, 37.0, 38.0, 39.0, 40.0, 41.0, 42.0, 43.0, 44.0, 45.0, 46.0, 47.0,