Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"
Repository from Github https://github.comgiangdip2410/HyperRouterRepository from Github https://github.comgiangdip2410/HyperRouter