panayi / gentle

gentle forced aligner

Home Page:https://lowerquality.com/gentle/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Gentle

Robust yet lenient forced-aligner built on Kaldi. A tool for aligning speech with text.

Getting Started

There are three ways to install Gentle.

  1. Download the pre-built Mac application. This package includes a GUI that will start the server and a browser. It only works on Mac OS.

  2. Use the Docker image. Just run docker run -P lowerquality/gentle. This works on all platforms supported by Docker.

  3. Download the source code and run ./install.sh. Then run python3 serve.py to start the server. This works on Mac and Linux.

Using Gentle

By default, the aligner listens at http://localhost:8765. That page has a graphical interface for transcribing audio, viewing results, and downloading data.

There is also a REST API so you can use Gentle in your programs. Here's an example of how to use the API with CURL:

curl -F "audio=@audio.mp3" -F "transcript=@words.txt" "http://localhost:8765/transcriptions?async=false"

If you've downloaded the source code you can also run the aligner as a command line program:

git clone https://github.com/lowerquality/gentle.git
cd gentle
./install.sh
python3 align.py audio.mp3 words.txt

The default behaviour outputs the JSON to stdout. See python3 align.py --help for options.

About

gentle forced aligner

https://lowerquality.com/gentle/

License:MIT License


Languages

Language:Shell 42.3%Language:C++ 40.3%Language:Python 7.2%Language:Perl 5.8%Language:C 1.5%Language:TeX 1.5%Language:Cuda 0.6%Language:HTML 0.5%Language:Makefile 0.2%Language:MATLAB 0.0%Language:Dockerfile 0.0%Language:Awk 0.0%Language:sed 0.0%