robmsmt / KerasDeepSpeech

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

Repository from Github https://github.comrobmsmt/KerasDeepSpeech

keras deepspeech asr ctc coreml speechrecognition speech-to-text deep-learning machine-learning neural-networks baidu speech deeplearning neural-network nn

Keras DeepSpeech

Repository for experimenting with different CTC based model designs for ASR. Supports live recording and testing of speech and quickly creates customised datasets using own-voice dataset creation scripts!

OVERVIEW

SETUP

Recommended > use virtualenv installed with python2.7 (3.x untested and will not work with Core ML)
git clone https://github.com/robmsmt/KerasDeepSpeech
pip install -r requirements.txt
Get the data using the import/download scripts in the folder, LibriSpeech is a good example.
Download the language model (large file) run ./lm/get_lm.sh

RUN

To Train, simply run python run-train.py In order to specify training/validation files use python run-train.py --train_files <csvfile> --valid_files <csvfile> (see run-train for complete arguments list)
To Test, run python run-test.py --test_files <datacsvfile>

CREDIT

Mozilla DeepSpeech
Baidu DS1 & DS2 papers

Licence

The content of this project itself is licensed under the GNU General Public License. Copyright © 2018

Contributing

Have a question? Like the tool? Don't like it? Open an issue and let's talk about it! Pull requests are appreciated!

About

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

keras deepspeech asr ctc coreml speechrecognition speech-to-text deep-learning machine-learning neural-networks baidu speech deeplearning neural-network nn

GNU Affero General Public License v3.0

Languages

Language:Python 82.2%Language:Jupyter Notebook 17.7%Language:Shell 0.1%