Yunlin Chen's starred repositories
Awesome-Singing-Voice-Synthesis-and-Singing-Voice-Conversion
A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related interesting works (such as Music Synthesis, Automatic Music Transcription, Automatic MOS Prediction, SSL-based ASR...etc).
fast-vid2vid
The code for ECCV22 paper "Fast-Vid2Vid: Spatial-Temporal Compression for Video-to-Video Synthesis"
Tune-A-Video
Unofficial implementation of Tune-A-Video
python-ffmpeg-video-streaming
📼 Package media content for online streaming(DASH and HLS) using FFmpeg
ffmpeg-python
Python bindings for FFmpeg - with complex filtering support
ffmpeg-rtmp
Ffmpeg RTMP example
diffused-heads
Official repository for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
Deep3DFaceRecon_pytorch
Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.
frame-interpolation
FILM: Frame Interpolation for Large Motion, In ECCV 2022.
Awesome-Image-Harmonization
A curated list of papers, code and resources pertaining to image harmonization.
Face2FaceRHO
The Official PyTorch Implementation for Face2Face^ρ (ECCV2022)
chrome-music-lab
A collection of experiments for exploring how music works, all built with the Web Audio API.
Deep3DFaceReconstruction
Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019)
TTS-Portuguese-Corpus
Open Source Text-To-Speech Portuguese Dataset
FastDeploy
⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio 20+ main stream scenarios and 150+ SOTA models with end-to-end optimization, multi-platform and multi-framework support.
MB-iSTFT-VITS
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform
stable-diffusion
A latent text-to-image diffusion model
audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
NeuralVoicePuppetryMMD
This github contains the network architectures of NeuralVoicePuppetry.