qutang / mixture-of-experts

A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

qutang/mixture-of-experts Issues

No issues in this repository yet.