Shaojin Ding's repositories
Adversarial-Many-to-Many-VC
[InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by Shaojin Ding, Guanlong Zhao, Ricardo Gutierrez-Osuna
GroupLatentEmbedding
Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Interspeech 2019]
fac-via-ppg
Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)
PyTorch_Speaker_Verification
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
darts.pytorch1.1
Implementation with latest PyTorch (v1.1) for multi-gpu DARTS https://arxiv.org/abs/1806.09055
DeepSpeaker-pytorch
Speaker embedding(verification and recognition) using Pytorch
Detectron.pytorch
A pytorch implementation of Detectron. Both training from scratch and inferring directly from pretrained Detectron weights are available.
dragonmapper
Identification and conversion functions for Chinese text processing
faster-rcnn.pytorch
A faster pytorch implementation of faster r-cnn
Listen-Attend-and-Spell-Pytorch
Listen Attend and Spell (LAS) implement in pytorch
Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
Python-Wrapper-for-World-Vocoder
A Python wrapper for the high-quality vocoder "World"
pytorch-vq-vae
PyTorch implementation of VQ-VAE by Aäron van den Oord et al.
rasta_py
RASTA-PLP and MFCC tool based rasta-mat
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
shaojinding.github.io
shaojinding.github.io
speaker-id
This repository contains audio samples and supplementary materials accompanying publications related to the speaker-id team at Google.
Speech_Recognition_with_Tensorflow
Implementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.
VAE-GMVAE
This repository contains the implementation of the VAE and Gaussian Mixture VAE using TensorFlow and several network architectures
VQ-VAE-Speech
PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]
wavenet_vocoder
WaveNet vocoder