Yunlin Chen's repositories
AnimateAnyone-unofficial
Unofficial Implementation of Animate Anyone
VPEval
VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation
tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
audiowmark
Audio Watermarking
CS-Books
🔥🔥超过1000本的计算机经典书籍、个人笔记资料以及本人在各平台发表文章中所涉及的资源等。书籍资源包括C/C++、Java、Python、Go语言、数据结构与算法、操作系统、后端架构、计算机系统知识、数据库、计算机网络、设计模式、前端、汇编以及校招社招各种面经~
kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
NeuralVoicePuppetryMMD
This github contains the network architectures of NeuralVoicePuppetry.
WaveGrad
Implementation of Google Brain's WaveGrad vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.
PPSpeech
PPSpeech: Phrase based Parallel End-to-End TTS System
Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
auorange
Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet
chrome-music-lab
A collection of experiments for exploring how music works, all built with the Web Audio API.
jukebox
Code for the paper "Jukebox: A Generative Model for Music"
ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch
multiband_melgan
An unofficial implementation of https://arxiv.org/abs/2005.05106
DurIAN
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.
crank
Non-parallel Voice Conversion
google-research
Google Research
pitch-net
Audio samples of our paper "PitchNet: Unsupervised Singing Voice Conversion with Pitch Adversarial Network" (accepted by ICASSP2020).
spleeter
Deezer source separation library including pretrained models.
WGANSing
Multi-voice singing voice synthesis
face-nn
游戏捏脸,基于神经风格迁移框架生成逼真人脸
melgan
MelGAN vocoder (compatible with NVIDIA/tacotron2)
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
torch_npss
pytorch implementation of Neural Parametric Singing Synthesizer 歌声合成
pytorch-CycleGAN-and-pix2pix
Image-to-Image Translation in PyTorch
vae_tacotron2
VAE Tacotron 2, an alternative of GST Tacotron