tt6746690 / doremi

Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets

Home Page:https://arxiv.org/abs/2305.10429

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

tt6746690/doremi Issues

No issues in this repository yet.