Gary Wang's starred repositories
MelNet-SpeechGeneration
Implementation of MelNet in PyTorch to generate high-fidelity audio samples
denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Models in PyTorch
denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in PyTorch
implicit_alignment
Code for the ICML 2020 paper "Implicit Class-Conditioned Domain Alignment for Unsupervised Domain Adaptation"
GAN-Slimming
[ECCV 2020] "All-in-One GAN Compression by Unified Optimization" by Haotao Wang, Shupeng Gui, Haichuan Yang, Ji Liu, and Zhangyang Wang
voxceleb_trainer
In defence of metric learning for speaker recognition
WavAugment
A library for speech data augmentation in the time domain
gard-adversarial-speaker-id
Adversarial attack and defense strategies for deep speaker recognition systems
SC-WaveRNN
Official PyTorch implementation of Speaker Conditional WaveRNN
TurboTransformers
A fast and user-friendly runtime for transformer inference (BERT, ALBERT, GPT-2, decoders, etc.) on CPU and GPU.
allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
hopfield-layers
Hopfield Networks is All You Need
contrastive-unpaired-translation
Contrastive unpaired image-to-image translation, with faster and lighter training than CycleGAN (ECCV 2020, in PyTorch)
TransCoder
Public release of the TransCoder research project https://arxiv.org/pdf/2006.03511.pdf