[NeurIPS 2023] Sparse Modular Activation for Efficient Sequence Modeling
Home Page:https://arxiv.org/abs/2306.11197
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool