Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets
Home Page:https://arxiv.org/abs/2305.10429
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool