speech-data-collection

There are 3 repositories under the speech-data-collection topic.

MahtaFetrat / ManaTTS-Persian-Speech-Dataset
ManaTTS is the largest open Persian speech dataset, with 114+ hours of transcribed audio. It includes the data collection pipeline and tools, and is suitable for training Persian text-to-speech models.
data-preprocessing dataset-preparation forced-alignment mana-tts persian persian-speech speech-corpus speech-data-collection speech-dataset speech-processing speech-synthesis text-to-speech text-to-speech-dataset tts tts-dataset data-collection manatts
Language: Jupyter Notebook 44
MahtaFetrat / GPTInformal-Persian-Speech-Dataset
A freely licensed Persian TTS dataset including 6+ hours of audio-text pairs with subject
data-collection data-preprocessing dataset-preparation forced-alignment mana-tts persian persian-speech speech-corpus speech-data-collection speech-dataset speech-processing speech-synthesis text-to-speech text-to-speech-dataset tts tts-dataset manatts
9
MahtaFetrat / VirgoolInformal-Speech-Dataset
A dataset of informal Persian audio and text chunks, along with a fully open processing pipeline, suitable for ASR and TTS tasks. Created from crawled content on virgool.io.
asr asr-evaluation forced-alignment persian persian-speech-dataset persian-speech-recognition persian-text-to-speech speech-dataset speech-processing tts persian-speech-corpus speech-data-collection
Language: Jupyter Notebook 5