contraslash / sphinx-ops-isolated-words-models

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Sphinx OPS Isolated Word Models

This is the main repository to build an acoustic model for sphinx based on the Open Speech Corpus Isolated Word Corpus.

First execute the script 01_download_word_recordings.py, this script will fetch all data from OPS.

Then execute the script 02_convert_mp4_to_wav.py, to execute this script you must have FFMpeg installed and on your path.

After you need to prepare sphinx configuration data, to achieve this:

Then you can call the script 03_configure_sphinx.py, this script will configure almost all the files required by sphinx, but to create a custom language model you need to execute 04_generate_language_model.sh.

Make sure you have sphinxtrain installed on your pc

Now execute

sphinxtrain -t ops_isolated_words setup

After this in your etc folder you will have a full structure or what you need for your project

Please check this link for further information.

Search for $CFG_HMM_TYPE and select .semi If you are on a multicore machine change $CFG_QUEUE_TYPE to Queue::POSIX and $CFG_NPART and $DEC_CFG_NPART to your machine cores

Then execute the train

sphinxtrain run

This could take some time.

To check the results

pocketsphinx_continuous -hmm model_parameters/ops_isolated_words.ci_semi/ -lm etc/ops_isolated_words.lm.DMP -dict etc/ops_isolated_words.dic -inmic yes   

About


Languages

Language:HTML 99.2%Language:Python 0.7%Language:Shell 0.0%