Yunlin Chen's repositories
auorange
Audio LPC (linear predictive coding) using mel spectrogram, compatible with LPCNet
crank
Non-parallel Voice Conversion
CS-Books
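🔥🔥 More than 1,000 classic computer science books, personal notes, and the resources referenced in the author's articles published on various platforms. Book topics include C/C++, Java, Python, Go, data structures and algorithms, operating systems, backend architecture, computer systems, databases, computer networks, design patterns, front-end development, assembly, and interview write-ups from both campus and experienced-hire recruiting.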
deep-voice-conversion
Deep neural networks for voice conversion (voice style transfer) in TensorFlow
DurIAN
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.
face-nn
Game character face customization: generates realistic faces using a neural style transfer framework
google-research
Google Research
jukebox
Code for the paper "Jukebox: A Generative Model for Music"
kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
melgan
MelGAN vocoder (compatible with NVIDIA/tacotron2)
multiband_melgan
An unofficial implementation of https://arxiv.org/abs/2005.05106
Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
Neural-Voice-Cloning-With-Few-Samples
This repository has an implementation of "Neural Voice Cloning With Few Samples"
ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with PyTorch
PiENet
Pitch estimation network (PiENet) for noise-robust neural F0 estimation of speech signals
pitch-net
Audio samples of our paper "PitchNet: Unsupervised Singing Voice Conversion with Pitch Adversarial Network" (accepted by ICASSP2020).
PPSpeech
PPSpeech: Phrase based Parallel End-to-End TTS System
pytorch-CycleGAN-and-pix2pix
Image-to-Image Translation in PyTorch
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
spleeter
Deezer source separation library including pretrained models.
torch_npss
PyTorch implementation of Neural Parametric Singing Synthesizer (singing voice synthesis)
vae_tacotron2
VAE Tacotron 2, an alternative to GST Tacotron
Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
Wav2Pix
Speech-conditioned face generation using Generative Adversarial Networks
WaveGrad
Implementation of Google Brain's WaveGrad vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.
WaveRNN
PyTorch implementation of DeepMind's WaveRNN model
WGANSing
Multi-voice singing voice synthesis