support the commonvoice dataset

Question

dpriskorn opened this issue 2 years ago · comments

how to handle big downloads?
do the sentences have a unique persistent ID?
any API for lookup using the ID?

emailed mozilla 2 days ago to ask

Nizo Priskorn · Answer 1 · Thu Jan 27 2022 04:56:54 GMT+0800 (China Standard Time)

No answer from Mozilla yet. Maybe the best way forward is to count lines in their file with sentences?