HazyResearch / based

Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

HazyResearch/based Stargazers