Tacotron-Pytorch

An efficient PyTorch implementation of Tacotron.

Paper: Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

Requirements

pip3 install -r requirements.txt

File structure

Hyperparameters.py --- contain all hyperparameters
Network.py --- encoder\decoder
Modules.py --- some modules for tacotron
Loss.py --- calculate loss
Data.py --- load dataset
utils.py --- some util function for loading and saving data
Synthesis.py --- generate wav file

How to train

Download dataset --- LJSpeech or thchs30
Adjust hyperparameters in Hyperparameters.py
make a directory named log in the parent of parent directory of Tacotron code

--- log
|    |
|    --- log[log_number]
|
--- code
     |
     --- Tacotron
             |
             --- train.py
             |
             --- Network.py
             |
           ......

run train.py

python3 train.py [log_number] [dataset_size] [start_epoch]

[log_number]: the log directory number
[dataset_size]: int or all
[start_epoch]: which epoch start to train (0 if start from scratch )

for example:
python3 train.py 0 all 0

How to generate wav

run Synthesis.py

python3 Synthesis.py [log_number] [epoch]

[log_number]: the log directory number
[epoch]: model of which epoch to load

About

An implementation of Tacotron with Pytorch0.4

pytorch tacotron tts

MIT License

Languages

Language:Python 100.0%