Rahul2991 / WaveVAE

A Pytorch implementation of WaveVAE ("Parallel Neural Text-to-Speech")

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

WaveVAE

work in progress

Note that my implementation isn't stable yet.

A Pytorch Implementation of WaveVAE (Mel Spectrogram --> Waveform)

part of "Parallel Neural Text-to-Speech"

Requirements

  1. Install Required Packages
pip install -r requirements.txt
  1. For Pytorch https://pytorch.org/get-started/locally/

Note : Tested on Python 3.9.16, Pytorch 2.2.0 (CUDA 11.8) on Win 10

Examples

Step 1. Download Dataset

Step 2. Preprocessing (Preparing Mel Spectrogram)

python preprocessing.py --in_dir ljspeech --out_dir DATASETS/ljspeech

Step 3. Train Model

python train.py --model_name wavevae_1 --batch_size 4 --num_gpu 2

Step 4. Synthesize

--load_step CHECKPOINT : the # of the model's global training step (also depicted in the trained weight file)

python synthesize.py --model_name wavevae_1 --load_step 10000 --num_samples 5

References

About

A Pytorch implementation of WaveVAE ("Parallel Neural Text-to-Speech")

License:MIT License


Languages

Language:Python 100.0%