Yunlin Chen (linzai1992)

linzai1992

Geek Repo

Company:@Microsoft

Location:Suzhou

Github PK Tool:Github PK Tool

Yunlin Chen's starred repositories

Awesome-Singing-Voice-Synthesis-and-Singing-Voice-Conversion

A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related interesting works (such as Music Synthesis, Automatic Music Transcription, Automatic MOS Prediction, SSL-based ASR...etc).

Stargazers:386Issues:0Issues:0

fast-vid2vid

The code for ECCV22 paper "Fast-Vid2Vid: Spatial-Temporal Compression for Video-to-Video Synthesis"

Language:PythonStargazers:156Issues:0Issues:0

Mead

MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation [ECCV2020]

Language:PythonLicense:MITStargazers:227Issues:0Issues:0

Tune-A-Video

Unofficial implementation of Tune-A-Video

Language:PythonStargazers:188Issues:0Issues:0

python-ffmpeg-video-streaming

📼 Package media content for online streaming(DASH and HLS) using FFmpeg

Language:PythonLicense:MITStargazers:827Issues:0Issues:0

ffmpeg-python

Python bindings for FFmpeg - with complex filtering support

Language:PythonLicense:Apache-2.0Stargazers:9662Issues:0Issues:0

ffmpeg-rtmp

Ffmpeg RTMP example

Language:CStargazers:12Issues:0Issues:0

vall-e

An unofficial PyTorch implementation of the audio LM VALL-E

Language:PythonLicense:MITStargazers:2912Issues:0Issues:0

diffused-heads

Official repository for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation

Language:PythonLicense:NOASSERTIONStargazers:447Issues:0Issues:0

Deep3DFaceRecon_pytorch

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.

Language:PythonLicense:MITStargazers:1592Issues:0Issues:0

frame-interpolation

FILM: Frame Interpolation for Large Motion, In ECCV 2022.

Language:PythonLicense:Apache-2.0Stargazers:2740Issues:0Issues:0

RAD-NeRF

Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition

Language:PythonLicense:MITStargazers:858Issues:0Issues:0
Language:C++License:Apache-2.0Stargazers:1519Issues:0Issues:0

css10

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

Language:HTMLLicense:Apache-2.0Stargazers:456Issues:0Issues:0

Awesome-Image-Harmonization

A curated list of papers, code and resources pertaining to image harmonization.

Stargazers:403Issues:0Issues:0

Face2FaceRHO

The Official PyTorch Implementation for Face2Face^ρ (ECCV2022)

Language:PythonLicense:BSD-3-ClauseStargazers:212Issues:0Issues:0

chrome-music-lab

A collection of experiments for exploring how music works, all built with the Web Audio API.

Language:JavaScriptLicense:Apache-2.0Stargazers:2115Issues:0Issues:0

botsim

BotSIM - a data-efficient end-to-end Bot SIMulation toolkit for evaluation, diagnosis, and improvement of commercial chatbots

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:113Issues:0Issues:0

Deep3DFaceReconstruction

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019)

Language:PythonLicense:MITStargazers:2142Issues:0Issues:0

TTS-Portuguese-Corpus

Open Source Text-To-Speech Portuguese Dataset

License:CC-BY-4.0Stargazers:147Issues:0Issues:0

FastDeploy

⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio 20+ main stream scenarios and 150+ SOTA models with end-to-end optimization, multi-platform and multi-framework support.

Language:C++License:Apache-2.0Stargazers:2817Issues:0Issues:0

MB-iSTFT-VITS

Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform

Language:PythonLicense:Apache-2.0Stargazers:403Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:229Issues:0Issues:0

stable-diffusion

A latent text-to-image diffusion model

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:66590Issues:0Issues:0

audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Language:PythonLicense:MITStargazers:2324Issues:0Issues:0

Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Language:Jupyter NotebookStargazers:544Issues:0Issues:0

GradTTS

Pytorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech"

Language:PythonLicense:MITStargazers:168Issues:0Issues:0

AVFR-Gan

Audio-Visual Generative Adversarial Network for Face Reenactment

Stargazers:155Issues:0Issues:0

AudioDVP

AudioDVP:Photorealistic Audio-driven Video Portraits

Language:PythonStargazers:295Issues:0Issues:0

NeuralVoicePuppetryMMD

This github contains the network architectures of NeuralVoicePuppetry.

License:NOASSERTIONStargazers:76Issues:0Issues:0