Griefed / glados-tts

A GLaDOS TTS, using Forward Tacotron and HiFiGAN. Inference is fast and stable, even on the CPU. A low quality vocoder model is included for mobile use. Rudimentary TTS script included. Works perfectly on Linux, partially on Maybe someone smarter than me can make a GUI.

GLaDOS Text-to-speech (TTS) Voice Generator

Neural network based TTS Engine.

If you want to just play around with the TTS, this works as stand-alone.

python3 glados-tts/glados.py

The TTS engine can also run remotely, on a machine more powerful than the Pi, to process TTS in-house (execute from the glados-tts directory):

python3 engine-remote.py

The default port is 8124. Be sure to update the TTS_ENGINE_API variable in settings.env in your main Glados-voice-assistant directory:

TTS_ENGINE_API			= http://192.168.1.3:8124/synthesize/
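
As a rough sketch of how a client might call the remote engine, the snippet below builds a request URL from the TTS_ENGINE_API value and saves the response to a file. It assumes the text is appended, URL-encoded, to the /synthesize/ path and that the server answers with raw audio bytes. Neither detail is stated in this README, so check engine-remote.py before relying on it. The helper names build_synthesize_url and fetch_speech are hypothetical.

```python
from urllib.parse import quote
from urllib.request import urlopen

# Value taken from settings.env; adjust to your own host and port.
TTS_ENGINE_API = "http://192.168.1.3:8124/synthesize/"


def build_synthesize_url(base_url: str, text: str) -> str:
    """Append URL-encoded text to the endpoint (assumed request format)."""
    return base_url.rstrip("/") + "/" + quote(text, safe="")


def fetch_speech(base_url: str, text: str, out_path: str, timeout: float = 30.0) -> None:
    """Request speech for `text` and write the response body to `out_path`.

    Assumes the server returns raw audio bytes in the response body.
    """
    with urlopen(build_synthesize_url(base_url, text), timeout=timeout) as response:
        audio = response.read()
    with open(out_path, "wb") as f:
        f.write(audio)
```

With the engine running, a call such as fetch_speech(TTS_ENGINE_API, "Hello, test subject.", "output.wav") would save the result locally, provided the assumptions above hold.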

Description

The initial, regular Tacotron model was trained first on LJSpeech, and then on a heavily modified version of the Ellen McLain dataset (all non-Portal 2 voice lines removed, punctuation added).

  • The Forward Tacotron model was only trained on about 600 voice lines.
  • The HiFiGAN model was generated through transfer learning from the sample.
  • All models have been optimized and quantized.

Installation Instructions

If you want to install the TTS Engine on your machine, please follow the steps below.

  1. Install the espeak synthesizer according to the installation instructions for your operating system.
  2. Install the required Python packages, e.g., by running pip install -r requirements.txt
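
Before running the engine, it can help to confirm that step 1 succeeded. The sketch below looks for an espeak executable on PATH using only the standard library. The function name find_espeak is hypothetical, and accepting espeak-ng as an alternative is an assumption; this README only mentions espeak.

```python
import shutil


def find_espeak(candidates=("espeak", "espeak-ng")):
    """Return the path of the first candidate executable found on PATH, or None."""
    for name in candidates:
        path = shutil.which(name)
        if path is not None:
            return path
    return None


path = find_espeak()
print(path if path else "espeak not found on PATH; install it first")
```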

Building on docker

   docker build -t glados-tts .
   docker run -it --rm -p 8124:8124 glados-tts serve --port 8124
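
To check that the container is actually listening on the published port, a small TCP probe is enough. This is a minimal sketch using the standard library; the function name is_port_open is hypothetical, and it only verifies that something accepts connections on the port, not that the TTS engine itself is healthy.

```python
import socket


def is_port_open(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


# 8124 is the port published by the docker run command above.
print("engine reachable:", is_port_open("127.0.0.1", 8124))
```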

Development using docker

   docker run -it --rm -p 8124:8124 -v Path/To/Your/Folder:/app glados-tts /bin/bash

Now you can edit the files in your folder and run the server from inside the container. Alternatively, attach VS Code to the container and edit the files from there; because the folder is mounted into the container, the changes will be reflected in your folder.

About

License: MIT License


Languages

Python 96.1%, Batchfile 2.0%, Dockerfile 1.9%