Create pre-trained model on Librispeech

Question

Create pre-trained model on Librispeech

SeanNaren opened this issue 8 years ago · comments

There has been a lot of attempts to get the model trained on Librispeech 1k hours of training data so providing a trained model I think would be very beneficial (as well as the steps to replicate).

Prepare dataset on file system
Create LMDB and make modifications to support this
Train till convergence
Add model to repo and document

Stretch goal:

Allow loading of previous model and training from these weights

Nicolás Melendez commented 8 years ago

+1

Sean Naren · Answer 1 · Thu Sep 29 2016 17:38:40 GMT+0800 (China Standard Time)

I've added a checklist to the main issue which I'll update as I complete tasks!

Sean Naren · Answer 2 · Fri Nov 25 2016 18:28:46 GMT+0800 (China Standard Time)

I'm training the model now. I've modified params and will explain how I trained this model from the ground up. It doesn't strictly follow the DeepSpeech architecture mainly out of performance reasons and aiding convergence.

Yannis Assael · Answer 3 · Thu Jan 05 2017 01:07:00 GMT+0800 (China Standard Time)

Hi @SeanNaren, are you planning to upload the model? The community thanks you!

Sean Naren · Answer 4 · Thu Jan 05 2017 03:35:57 GMT+0800 (China Standard Time)

@iassael Sorry for the disappearance... will upload the models tomorrow and give the WER/CERs currently for them. Still plan on improving the smaller version but will get the release out!

Yannis Assael · Answer 5 · Thu Jan 05 2017 08:15:05 GMT+0800 (China Standard Time)

@SeanNaren that's amazing. If you need any help with hosting let me know!

Sean Naren · Answer 6 · Thu Jan 05 2017 23:17:34 GMT+0800 (China Standard Time)

Here we are guys :) Huuuge thank you to @maciejkorzepa for giving me the LibriSpeech model!

I think we could squeeze a little bit more out of some of the models in terms of accuracies, but they are a good baseline to start from!

@iassael Thanks for the offer of hosting, Seems like github handled them alright

Ryan Leary · Answer 7 · Wed Jan 25 2017 05:12:36 GMT+0800 (China Standard Time)

@SeanNaren do you have information on how you were able to make this converge?