giangdip2410 / HyperRouter

Code for the paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork".

Repository from GitHub: https://github.com/giangdip2410/HyperRouter
