G-Wang

followers

following

stars

Google

New York

Gary Wang's repositories

WaveRNN-Pytorch

Fatcord's Alternative WaveRNN (Faster training)

Language:PythonMIT126 12 12

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonNOASSERTION2 10

tacotron2-vae

Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"

Language:Jupyter NotebookBSD-3-Clause2 30

forexfun

Language:Python1 20

learn2learn

PyTorch Meta-learning Framework for Researchers

Language:PythonMIT1 10

melgan-neurips

Language:PythonMIT1 10

MelGAN-Pytorch

A Pytorch Implementation of MelGAN

Language:Jupyter Notebook1 10

tacotron2-gst

Tacotron2 with Global Style Tokens

Language:Jupyter NotebookBSD-3-Clause1 20

Autoregressive-Predictive-Coding

Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning

Language:Python010

awr

Implementation of advantage-weighted regression.

Language:PythonMIT010

CMPT_419_TTS_Report

020

ctrl

Conditional Transformer Language Model for Controllable Generation

Language:PythonBSD-3-Clause010

flite

A small fast portable speech synthesis system

Language:CNOASSERTION020

flowseq

Generative Flow based Sequence-to-Sequence Toolkit written in Python.

Language:PythonApache-2.0010

gentle

gentle forced aligner

Language:PythonMIT010

GST-Tacotron-1

A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis

Language:PythonMIT020

hugo-quick-start

Hugo Quick Start on Render

020

librispeech-alignments

Word alignments generated by the Montreal Forced Aligner for the Librispeech dataset

Language:Python010

lingvo

Lingvo

Language:PythonApache-2.0020

melgan

Unofficial PyTorch implementation of MelGAN vocoder (WIP, audio sample at Issue #3)

Language:PythonBSD-3-Clause010

moco

PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722

Language:PythonNOASSERTION010

planet

Deep Planning Network: Control from pixels by latent planning with learned dynamics

Language:PythonApache-2.0020

project-CURRENNT-scripts

This repository contains the scripts to use CURRENNT

Language:PythonBSD-3-Clause020

raw_voice_cleanup

Examples of cleaning up raw voices

Language:C++BSD-3-Clause020

rlpyt

Reinforcement Learning in PyTorch

Language:PythonMIT010

self-attention-tacotron

An implementation of "Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language" https://arxiv.org/abs/1810.11960

Language:PythonBSD-3-Clause030

TTS

Deep learning for Text to Speech

Language:Jupyter NotebookMPL-2.0020

UniversalVocoding

A PyTorch implementation of "Robust Universal Neural Vocoding"

Language:PythonMIT010

vae_tacotron

Language:PythonMIT020

WaveRNN-1

A WaveRNN implementation

Language:PythonMIT020