Dataset information

Question

finetunej opened this issue 2 years ago · comments

Thanks for the very interesting model release.

If possible, could a bit information about the dataset used for training be provided (e.g. language split percentages)?

Ruslan Vasilev · Answer 1 · Sun Jun 26 2022 16:46:00 GMT+0800 (China Standard Time)

Thank you for your interest! We have just added detailed dataset description.

finetune · Answer 2 · Sun Jun 26 2022 19:00:50 GMT+0800 (China Standard Time)

Thanks, very helpful!