deepspeech common-voice kinyarwanda kinyarwanda-corpus

Kinyarwanda common voice dataset

The Kinyarwanda common voice dataset is a dataset of Kinyarwanda sentences collected in order to train the kinyarwanda deepspeech model. The Kinyarwanda common voice dataset is made of 1,200,000 million + sentences

To check out the common voice dataset go to the common voice website select Kinyarwanda in the language option and you can choose releases depending on the amount of data you want. Note: the latest release have more data

About

deepspeech common-voice kinyarwanda kinyarwanda-corpus

Mozilla Public License 2.0