jlamprou / Infini-Attention

Efficient Infinite Context Transformers with Infini-attention: PyTorch implementation + QwenMoE implementation + training script + 1M-context passkey retrieval

Home Page: https://arxiv.org/abs/2404.07143

