rajivpoddar / text_audio_align

Generate text and audio splits from strings successfully aligned by Gentle

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Dump of files related to text and audio alignment, language models, and ASR data generation/testing/prediction.

Aligner and manifest generator

Generate text and audio splits from strings successfully aligned by Gentle, and write a manifest file of the generated files.

aligner

with asr_data_gen.py and aligner.py moved to /PATH/TO/gentle/,

$ cd gentle
$ python asr_data_gen.py FILE_ID

manifest generator

$ python pytorch_manifest.py (optional: --files_dir /PATH/TO/WAV_AND_TXT/DIRS/ --out_file /PATH/TO/MANIFEST_FILE/)

About

Generate text and audio splits from strings successfully aligned by Gentle


Languages

Language:Python 93.7%Language:Shell 6.3%