Dataset information
finetunej opened this issue · comments
Thanks for the very interesting model release.
If possible, could a bit information about the dataset used for training be provided (e.g. language split percentages)?
Thank you for your interest! We have just added detailed dataset description.
Thanks, very helpful!