thepowerfuldeez / tacotron2

My implementation of tacotron2 model

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Tacotron2 implementation

Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions (https://arxiv.org/abs/1712.05884)
  • Your dataset should contain txt file with lines audio_id|text (LJSpeech format)
  • Mel-spectrograms are created before training and are located in mels/ folder

audio_id – only name of spectrogram npy file (without .npy extension) that is located in mels/ folder w.r.t. to metadata file.

USAGE:

  • pip install -r requirements.txt (you will need python 3.8 + pytorch 1.3 + CUDA10.1)
  • Run python train.py

About

My implementation of tacotron2 model


Languages

Language:Python 100.0%