Eval dataset is hard coded to be "openwebtext_ppl"
dlwh opened this issue
The eval dataset name shouldn't be hard-coded. Related to #112.
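A minimal sketch of one possible direction for a fix, assuming the hard-coded string is a key built from the dataset name plus a `_ppl` suffix. That is a guess from the string itself, not something confirmed in the codebase. The function `eval_metric_key` and the `config` layout are hypothetical and are not Mistral's actual API:

```python
def eval_metric_key(dataset_id: str, suffix: str = "ppl") -> str:
    """Build the eval key from the configured dataset name (hypothetical helper)."""
    # Normalize whitespace and case so the same dataset always maps to the same key.
    return f"{dataset_id.strip().lower()}_{suffix}"


# Hypothetical config layout; a real run would load this from its config file.
config = {"dataset": {"id": "openwebtext"}}

# The key now follows the configured dataset instead of a string literal.
key = eval_metric_key(config["dataset"]["id"])
print(key)
```

With this shape, pointing the config at a different dataset changes the eval key without touching the evaluation code.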
Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging Face 🤗 Transformers.