yandex / YaLM-100B

Pretrained language model with 100B parameters

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Dataset information

finetunej opened this issue · comments

Thanks for the very interesting model release.

If possible, could a bit information about the dataset used for training be provided (e.g. language split percentages)?

Thank you for your interest! We have just added detailed dataset description.

Thanks, very helpful!