huggingface / nanotron

Minimalistic large language model 3D-parallelism training

[Unit Test] Add unit test for DoReMi's trainer

xrsrke opened this issue