asticode / go-asticoqui

Golang bindings for Coqui's speech-to-text library


Golang bindings for Coqui's 🐸STT speech-to-text library.

asticoqui is compatible with versions v1.0.0, v1.1.0, and v1.2.0 of 🐸STT.

Installation

Install tflite

Run the following command:

    $ pip3 install --extra-index-url https://google-coral.github.io/py-repo/ tflite_runtime

To run against a CUDA-enabled GPU (optional), set the environment variable STT_TFLITE_DELEGATE=gpu.

Install Coqui STT

  1. Fetch an up-to-date native_client.*.tar.xz matching your system from the 🐸STT releases. For example, on macOS:
    $ wget https://github.com/coqui-ai/STT/releases/download/v1.2.0/native_client.tflite.macOS.tar.xz
  2. Extract its contents to $HOME/.coqui/. For example, on macOS:
    $ mkdir -p $HOME/.coqui/
    $ tar -xvf native_client.tflite.macOS.tar.xz -C $HOME/.coqui/
  3. Set environment variables to point to the native client:
    $ export CGO_LDFLAGS="-L$HOME/.coqui/"
    $ export CGO_CXXFLAGS="-I$HOME/.coqui/"
    $ export LD_LIBRARY_PATH="$HOME/.coqui/:$LD_LIBRARY_PATH"
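
A forgotten or mistyped export is a common reason for a cgo build to fail at link time. As a rough sanity check, the sketch below reads CGO_LDFLAGS from the current shell and reports whether each -L directory exists. The helper name libDirs is hypothetical and is not part of asticoqui. It assumes the flags are separated by whitespace and that the paths contain no spaces.

```go
package main

import (
	"fmt"
	"os"
	"strings"
)

// libDirs returns the directories named by -L<dir> entries in a linker-flags
// string. Assumption: flags are whitespace-separated and paths have no spaces.
// The "-L <dir>" form (with a space) is not handled.
func libDirs(flags string) []string {
	var dirs []string
	for _, f := range strings.Fields(flags) {
		if strings.HasPrefix(f, "-L") && len(f) > 2 {
			dirs = append(dirs, f[2:])
		}
	}
	return dirs
}

func main() {
	dirs := libDirs(os.Getenv("CGO_LDFLAGS"))
	if len(dirs) == 0 {
		fmt.Println("CGO_LDFLAGS has no -L entry; was the export run in this shell?")
		return
	}
	for _, d := range dirs {
		// Report each directory the linker would be asked to search.
		if info, err := os.Stat(d); err != nil || !info.IsDir() {
			fmt.Printf("%s: missing or not a directory\n", d)
		} else {
			fmt.Printf("%s: ok\n", d)
		}
	}
}
```

Run it in the same shell session as the export commands. It only confirms that the directories exist, not that they contain the native client library.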

Install asticoqui

Install dependencies

Run the following command:

    $ go get -u github.com/asticode/go-asticoqui/...

Install executables

Run the following command:

    $ go install github.com/asticode/go-asticoqui/cmd

Example Usage

Get the pre-trained model and scorer

Go to this page and click Enter Email to Download at the bottom of the page. Download model.tflite and huge_vocabulary.scorer.

Get the audio files

Run the following commands:

    $ cd $HOME/.coqui
    $ wget https://github.com/coqui-ai/STT/releases/download/v1.2.0/audio-1.2.0.tar.gz
    $ tar -xvzf audio-1.2.0.tar.gz

Use this client

Run the following commands:

    $ go run coqui/main.go -model model.tflite -scorer huge_vocabulary.scorer -audio audio/2830-3980-0043.wav
    
        Text: experience proves this
    
    $ go run coqui/main.go -model model.tflite -scorer huge_vocabulary.scorer -audio audio/4507-16021-0012.wav
    
        Text: why should one hall on the way
        
    $ go run coqui/main.go -model model.tflite -scorer huge_vocabulary.scorer -audio audio/8455-210777-0068.wav
    
        Text: your power is sufficient i said
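
The commands above pass .wav files to the client. For readers feeding their own recordings to a speech-to-text model, the sketch below shows one way to pull raw samples out of a WAV file using only the Go standard library. It is not taken from the asticoqui sources. The function name pcm16Samples is hypothetical, and the code assumes uncompressed 16-bit little-endian PCM without validating the "fmt " chunk (sample rate and channel count are not checked).

```go
package main

import (
	"encoding/binary"
	"errors"
	"fmt"
	"os"
)

// pcm16Samples walks the RIFF chunks of a WAV file held in memory and returns
// the contents of its "data" chunk as signed 16-bit samples.
// Assumption: the audio is uncompressed 16-bit little-endian PCM.
func pcm16Samples(wav []byte) ([]int16, error) {
	if len(wav) < 12 || string(wav[0:4]) != "RIFF" || string(wav[8:12]) != "WAVE" {
		return nil, errors.New("not a RIFF/WAVE file")
	}
	for off := 12; off+8 <= len(wav); {
		id := string(wav[off : off+4])
		size := int(binary.LittleEndian.Uint32(wav[off+4 : off+8]))
		body := off + 8
		if size < 0 {
			return nil, errors.New("chunk size overflows int")
		}
		if id == "data" {
			end := body + size
			if end > len(wav) || end < body {
				end = len(wav) // tolerate a truncated file or a placeholder size
			}
			samples := make([]int16, (end-body)/2)
			for i := range samples {
				samples[i] = int16(binary.LittleEndian.Uint16(wav[body+2*i:]))
			}
			return samples, nil
		}
		off = body + size + size%2 // chunks are padded to an even length
	}
	return nil, errors.New("no data chunk found")
}

func main() {
	if len(os.Args) < 2 {
		fmt.Println("usage: go run . <file.wav>")
		return
	}
	raw, err := os.ReadFile(os.Args[1])
	if err != nil {
		fmt.Println("read error:", err)
		return
	}
	samples, err := pcm16Samples(raw)
	if err != nil {
		fmt.Println("decode error:", err)
		return
	}
	fmt.Printf("%d samples\n", len(samples))
}
```

Walking the chunk list, rather than assuming a fixed 44-byte header, keeps the sketch working for files that carry extra metadata chunks before the audio data.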


License: MIT License
