BridgetteSong's repositories
ExpressiveTacotron
This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN, Non-attentive Tacotron, GST, VAE, GMVAE, and X-vectors for building prosody encoder.
BunchedLPCnet
This repository provides UNOFFICIAL Bunched LPCNet implementations with Pytorch.
efficient_tts
Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"
multiband-hifigan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Parallel-Tacotron2
Pytorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
Robust_Fine_Grained_Prosody_Control
Pytorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis (Unofficial)
SpeechSplit
Unsupervised Speech Decomposition Via Triple Information Bottleneck
StyleSpeech
PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation
TTS_TFLite
This repository is a collection of TTS Models in TFLite
UnivNet-pytorch
UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation
VocGAN
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
VQMIVC
Official implementation of VQMIVC: One-shot Voice Conversion @ Interspeech 2021