xiaosdawn / wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

Home Page:https://github.com/facebookresearch/wav2letter/wiki

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

wav2letter++

CircleCI Join the chat at https://gitter.im/wav2letter/community

Important Note:

wav2letter has been moved and consolidated into Flashlight in the ASR application.

Future wav2letter development will occur in Flashlight.

To build the old, pre-consolidation version of wav2letter, checkout the wav2letter v0.2 release, which depends on the old Flashlight v0.2 release. The wav2letter-lua project can be found on the wav2letter-lua branch, accordingly.

For more information on wav2letter++, see or cite this arXiv paper.

Recipes

This repository includes recipes to reproduce the following research papers as well as pre-trained models:

Data preparation for training and evaluation can be found in data directory.

Building the Recipes

First, install Flashlight with the ASR application. Then, after cloning the project source:

mkdir build && cd build
cmake .. && make -j8

If Flashlight or ArrayFire are installed in nonstandard paths via a custom CMAKE_INSTALL_PREFIX, they can be found by passing

-Dflashlight_DIR=[PREFIX]/usr/share/flashlight/cmake/ -DArrayFire_DIR=[PREFIX]/usr/share/ArrayFire/cmake

when running cmake.

Join the wav2letter community

License

wav2letter++ is BSD-licensed, as found in the LICENSE file.

About

Facebook AI Research's Automatic Speech Recognition Toolkit

https://github.com/facebookresearch/wav2letter/wiki

License:Other


Languages

Language:C++ 51.3%Language:Python 37.1%Language:Shell 5.7%Language:CMake 3.5%Language:Perl 1.5%Language:Dockerfile 0.5%Language:C 0.4%