wav2letter++

wav2letter++ is a highly efficient end-to-end automatic speech recognition (ASR) toolkit written entirely in C++, leveraging ArrayFire and flashlight.

The toolkit started with models that predict letters directly from the raw waveform and has since evolved into an all-purpose end-to-end ASR research toolkit, supporting a wide range of models and learning techniques. It also includes a very efficient modular beam-search decoder for both structured learning (CTC, ASG) and seq2seq approaches.

Important disclaimer: as a number of models from this repository could be used for other modalities, we moved most of the code to flashlight.

This repository includes recipes to reproduce a number of research papers, as well as pre-trained models.

Data preparation for our training and evaluation can be found in the data folder.

Previous iterations of wav2letter can be found in the following branches:

wav2letter-v0.2 branch (before the wav2letter and flashlight codebases were merged).
wav2letter-lua branch (written in Lua).

Build recipes

First, install flashlight with all its dependencies. Then run:

mkdir build && cd build && cmake .. && make -j8

If flashlight or ArrayFire are installed in nonstandard paths via CMAKE_INSTALL_PREFIX, they can be found by passing -Dflashlight_DIR=[PREFIX]/usr/share/flashlight/cmake/ -DArrayFire_DIR=[PREFIX]/usr/share/ArrayFire/cmake when running cmake.
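
As a minimal sketch of the nonstandard-path case, the snippet below composes the configure command from a single prefix and prints it for inspection. The default PREFIX value and the variable names are assumptions made for illustration; only the two -D flags and their directory layout come from the text above. Paths containing spaces would need extra quoting.

```shell
# Hypothetical install prefix (an assumption); replace it with the value
# you passed as CMAKE_INSTALL_PREFIX when installing flashlight and ArrayFire.
PREFIX="${PREFIX:-$HOME/w2l-deps}"

# CMake package directories, following the layout described above.
FLASHLIGHT_CMAKE_DIR="$PREFIX/usr/share/flashlight/cmake/"
ARRAYFIRE_CMAKE_DIR="$PREFIX/usr/share/ArrayFire/cmake"

# Compose the configure command and print it so it can be checked
# before it is run from inside the build directory.
CONFIGURE_CMD="cmake .. -Dflashlight_DIR=$FLASHLIGHT_CMAKE_DIR -DArrayFire_DIR=$ARRAYFIRE_CMAKE_DIR"
echo "$CONFIGURE_CMD"
```

Once the printed command looks right, run it from the build directory in place of the plain cmake .. step, then run make -j8 as before.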

Join the wav2letter community

Facebook page: https://www.facebook.com/groups/717232008481207/
Google group: https://groups.google.com/forum/#!forum/wav2letter-users
Contact: vineelkpratap@fb.com, awni@fb.com, qiantong@fb.com, jacobkahn@fb.com, antares@fb.com, avidov@fb.com, gab@fb.com, vitaliy888@fb.com, locronan@fb.com

See the CONTRIBUTING file for how to help out.

License

wav2letter++ is BSD-licensed, as found in the LICENSE file.

