Gary Wang's repositories
WaveRNN-Pytorch
Fatcord's Alternative WaveRNN (Faster training)
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
tacotron2-vae
Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"
learn2learn
PyTorch Meta-learning Framework for Researchers
MelGAN-Pytorch
A Pytorch Implementation of MelGAN
tacotron2-gst
Tacotron2 with Global Style Tokens
Autoregressive-Predictive-Coding
Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning
GST-Tacotron-1
A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
hugo-quick-start
Hugo Quick Start on Render
librispeech-alignments
Word alignments generated by the Montreal Forced Aligner for the Librispeech dataset
project-CURRENNT-scripts
This repository contains the scripts to use CURRENNT
raw_voice_cleanup
Examples of cleaning up raw voices
self-attention-tacotron
An implementation of "Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language" https://arxiv.org/abs/1810.11960
UniversalVocoding
A PyTorch implementation of "Robust Universal Neural Vocoding"