Common Voice's repositories
common-voice
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
cv-dataset
Metadata and versioning details for the Common Voice dataset
commonvoice-fr
Tooling for producing French dataset for Common Voice
sentence-collector
Tool to collect and review sentences for Common Voice
CorporaCreator
Command line tool to create corpora for Common Voice
cv-sentence-extractor
Scraping Wikipedia for fair use sentences
community-playbook
Mozilla Voice Community Playbook
common-voice-bundler
Script for bundling Common Voice (https://commonvoice.mozilla.org/) clips by language
our-voices-model-competition
Our Voices Competition
voicebot-telegram
Voicebot for contributing voice snippets to voice.mozilla.org
common-voice-methodology
A living document outlining a methodological approach for building read speech sentence corpora.
wikipedia-data
Different analysis and files from wikipedia text analysis
helm-charts
Common Voice Helm Charts
text-tools
Manipulate sentences.
common-voice-yumie2
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
cv-global-sprint-sentences
This is a repo that will contain all the reviewed sentences collected by the global sprint.
other-useful-datasets
This is our new repository to make other open speech datasets from the community easier to find. If you'd like to add yours, please get in touch!
voice-corpora-automation
Automation for generating the common voice corpora
bulk-sentence-dutch
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
common-voice-1
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
connected-react-router
A Redux binding for React Router v4
mp3-duration-reporter
Calculate the individual and total duration of a directory full of .mp3 files