wav2letter++
wav2letter++ is a fast open source speech processing toolkit from Facebook AI Research. It is written entirely in C++ and uses the ArrayFire tensor library and the flashlight machine learning library for maximum efficiency. Our approach is detailed in this arXiv paper.
Building wav2letter++
See Building Instructions for details.
Full documentation
We also provide complete recipes for WSJ, Timit and Librispeech and they be found in recipes
folder.
Citation
If you use the code in your paper, then please cite it as:
@article{pratap2018w2l,
author = {Vineel Pratap, Awni Hannun, Qiantong Xu, Jeff Cai, Jacob Kahn, Gabriel Synnaeve, Vitaliy Liptchinsky, Ronan Collobert},
title = {wav2letter++: The Fastest Open-source Speech Recognition System},
journal = {CoRR},
volume = {abs/1812.07625},
year = {2018},
url = {https://arxiv.org/abs/1812.07625},
}
Join the wav2letter community
- Facebook page: https://www.facebook.com/groups/717232008481207/
- Google group: https://groups.google.com/forum/#!forum/wav2letter-users
- Contact: vineelkpratap@fb.com, awni@fb.com, qiantong@fb.com, jcai@fb.com, jacobkahn@fb.com, gab@fb.com, vitaliy888@fb.com, locronan@fb.com
See the CONTRIBUTING file for how to help out.
License
wav2letter++ is BSD-licensed, as found in the LICENSE file.