BridgetteSong

BridgetteSong's repositories

ExpressiveTacotron

This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN, Non-attentive Tacotron, GST, VAE, GMVAE, and X-vectors for building prosody encoder.

Language:Python74 4 2

BunchedLPCnet

This repository provides UNOFFICIAL Bunched LPCNet implementations with Pytorch.

Language:CBSD-3-Clause14 3 3

Tacotron2

Language:Python13 3 1

ddsp

DDSP: Differentiable Digital Signal Processing

Language:PythonApache-2.01 10

Attentions-in-Tacotron

Language:Python010

DL-Art-School

DLAS - A configuration-driven trainer for generative models

Language:PythonApache-2.0000

efficient_tts

Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"

Language:PythonMIT010

hubert

HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion

Language:PythonMIT000

LVCNet

LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation

Language:PythonApache-2.0010

multiband-hifigan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language:PythonMIT010

Parallel-Tacotron2

Pytorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Language:PythonMIT010

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

Language:Jupyter NotebookMIT010

Robust_Fine_Grained_Prosody_Control

Pytorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis (Unofficial)

Language:PythonBSD-3-Clause010

SpeechSplit

Unsupervised Speech Decomposition Via Triple Information Bottleneck

Language:PythonMIT010

STYLER

STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech

Language:PythonMIT010

StyleSpeech

PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation

Language:PythonMIT010

TFGAN

TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis

Language:PythonApache-2.0010

TTS_TFLite

This repository is a collection of TTS Models in TFLite

Language:Jupyter NotebookApache-2.0010

UnivNet-pytorch

UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation

MIT000

VocGAN

VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network

Language:PythonMIT010

VQMIVC

Official implementation of VQMIVC: One-shot Voice Conversion @ Interspeech 2021

MIT000