Digital-Umuganda / common_voice_dataset_rw

Kinyarwanda common voice dataset

The Kinyarwanda Common Voice dataset is a collection of Kinyarwanda sentences gathered to train the Kinyarwanda DeepSpeech model. It contains more than 1,200,000 sentences.

To check out the dataset, go to the Common Voice website, select Kinyarwanda in the language option, and choose a release depending on the amount of data you want. Note: the latest release has more data than earlier ones.
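
Once a release is downloaded and extracted, its sentences can be read from the tab-separated metadata files that ship with it. The following is a minimal sketch, assuming the usual Common Voice layout in which a file such as `validated.tsv` has `path` and `sentence` columns. The helper name `load_sentences` and the sample rows are hypothetical placeholders used only for illustration; they are not part of this repository or of any real release.

```python
import csv
import io


def load_sentences(tsv_file):
    """Return (clip path, sentence) pairs from a Common Voice style TSV file.

    Assumes the file has a header row with 'path' and 'sentence' columns.
    """
    # QUOTE_NONE keeps quotation marks inside sentences as literal text.
    reader = csv.DictReader(tsv_file, delimiter="\t", quoting=csv.QUOTE_NONE)
    return [(row["path"], row["sentence"]) for row in reader]


# In-memory stand-in for a release file (placeholder rows, not real data).
sample = io.StringIO(
    "client_id\tpath\tsentence\n"
    "abc\tclip_0001.mp3\tMuraho\n"
    "def\tclip_0002.mp3\tMurakoze\n"
)

pairs = load_sentences(sample)
print(len(pairs), "sentences loaded")
```

For a real release, the in-memory sample would be replaced with something like `open("validated.tsv", encoding="utf-8", newline="")`, where the exact file name and location depend on the release you downloaded.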

About

License: Mozilla Public License 2.0