ag1988 / top_k_attention

The accompanying code for "Memory-efficient Transformers via Top-k Attention" (Ankit Gupta, Guy Dar, Shaya Goodman, David Ciprut, Jonathan Berant. SustaiNLP 2021).

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

ag1988/top_k_attention Stargazers