csarofeen / pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Home Page:http://pytorch.org

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Persistent buffer with L2 cache

naoyam opened this issue · comments

L2 cache is quite large on A100 and later generations. Should be used in addition to registers and shared memory.