Yeongtae / tacorn

Combination of the Tacotron-2 implementation by Rayhane-mamah with the WaveRNN-inspired method by fatchord

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

tacorn

WARNING1: The pre-trained models are (yet again) not compatible with the latest version of Rayhane-mamah's Tacotron2 repository.

WARNING2: This is experimental, messy and will most likely not be developed further.

This repository combines the Tacotron-2 implementation of Rayhane-mamah (https://github.com/Rayhane-mamah/Tacotron-2) with the WaveRNN-inspired (but heavily diverged) method by fatchord (https://github.com/fatchord/WaveRNN).

Samples

Synthesis

If you just want to synthesize from the pre-trained (English, LJ) models, currently you just have to run

bash install.sh

then:

python synthesize.py

Usage:

usage: synthesize.py [-h] [--sentences_file SENTENCES_FILE]
                     [--output_dir OUTPUT_DIR]

optional arguments:
  -h, --help            show this help message and exit
  --sentences_file SENTENCES_FILE
                        Input file containing sentences to synthesize
  --output_dir OUTPUT_DIR
                        Output folder for synthesized wavs

Please note that the install.sh script pulling a pre-trained model is just a temporary solution and will be changed with future version.

Training

install.sh grabs pre-trained models from the LJ dataset (https://keithito.com/LJ-Speech-Dataset/), so you don't necessarily have to do this step.

To continue training on the LJ dataset, or start from scratch:

bash install.sh
cd tacotron
wget http://data.keithito.com/data/speech/LJSpeech-1.1.tar.bz2
bunzip LJSpeech-1.1.tar.bz2
tar xf LJSpeech-1.1.tar
python3 preprocess.py
python3 train.py
# you can stop after synthesis of the GTA mels, once it's in wavenet training
cd ..
bash preprocess.sh
bash train.sh

If you're happy with the Tacotron output before it finished by itself, you can also interrupt the training and do:

cat 1|0|0| > logs-Tacotron-2/state_log
python3 train.py 

Pre-trained models

About

Combination of the Tacotron-2 implementation by Rayhane-mamah with the WaveRNN-inspired method by fatchord

License:MIT License


Languages

Language:Python 98.5%Language:Shell 1.5%