Yunlin Chen (linzai1992)

linzai1992

Geek Repo

Company:@Microsoft

Location:Suzhou

Github PK Tool:Github PK Tool

Yunlin Chen's starred repositories

Deep3DFaceRecon_pytorch

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.

Language:PythonLicense:MITStargazers:1620Issues:0Issues:0

frame-interpolation

FILM: Frame Interpolation for Large Motion, In ECCV 2022.

Language:PythonLicense:Apache-2.0Stargazers:2772Issues:0Issues:0

RAD-NeRF

Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition

Language:PythonLicense:MITStargazers:865Issues:0Issues:0
Language:C++License:Apache-2.0Stargazers:1542Issues:0Issues:0

css10

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

Language:HTMLLicense:Apache-2.0Stargazers:456Issues:0Issues:0

Awesome-Image-Harmonization

A curated list of papers, code and resources pertaining to image harmonization.

Stargazers:405Issues:0Issues:0

Face2FaceRHO

The Official PyTorch Implementation for Face2Face^ρ (ECCV2022)

Language:PythonLicense:BSD-3-ClauseStargazers:212Issues:0Issues:0

chrome-music-lab

A collection of experiments for exploring how music works, all built with the Web Audio API.

Language:JavaScriptLicense:Apache-2.0Stargazers:2116Issues:0Issues:0

botsim

BotSIM - a data-efficient end-to-end Bot SIMulation toolkit for evaluation, diagnosis, and improvement of commercial chatbots

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:113Issues:0Issues:0

Deep3DFaceReconstruction

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019)

Language:PythonLicense:MITStargazers:2152Issues:0Issues:0

TTS-Portuguese-Corpus

Open Source Text-To-Speech Portuguese Dataset

License:CC-BY-4.0Stargazers:148Issues:0Issues:0

FastDeploy

⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio 20+ main stream scenarios and 150+ SOTA models with end-to-end optimization, multi-platform and multi-framework support.

Language:C++License:Apache-2.0Stargazers:2857Issues:0Issues:0

MB-iSTFT-VITS

Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform

Language:PythonLicense:Apache-2.0Stargazers:404Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:229Issues:0Issues:0

stable-diffusion

A latent text-to-image diffusion model

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:66946Issues:0Issues:0

audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Language:PythonLicense:MITStargazers:2336Issues:0Issues:0

Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Language:Jupyter NotebookStargazers:545Issues:0Issues:0

GradTTS

Pytorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech"

Language:PythonLicense:MITStargazers:174Issues:0Issues:0

AVFR-Gan

Audio-Visual Generative Adversarial Network for Face Reenactment

Stargazers:156Issues:0Issues:0

AudioDVP

AudioDVP:Photorealistic Audio-driven Video Portraits

Language:PythonStargazers:295Issues:0Issues:0

NeuralVoicePuppetryMMD

This github contains the network architectures of NeuralVoicePuppetry.

License:NOASSERTIONStargazers:76Issues:0Issues:0

NeuralVoicePuppetry

This github contains the network architectures of NeuralVoicePuppetry.

License:NOASSERTIONStargazers:172Issues:0Issues:0

CVPR2022-DaGAN

Official code for CVPR2022 paper: Depth-Aware Generative Adversarial Network for Talking Head Video Generation

Language:PythonLicense:NOASSERTIONStargazers:956Issues:0Issues:0

SSP-NeRF

[ECCV 2022 Oral] Code for "Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation"

Language:PythonStargazers:229Issues:0Issues:0

LIA

[ICLR 22] Latent Image Animator: Learning to Animate Images via Latent Space Navigation

Language:PythonLicense:NOASSERTIONStargazers:581Issues:0Issues:0

Text2Video

ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with phonetic dictionary".

Language:PythonStargazers:415Issues:0Issues:0

DFRF

[ECCV2022] The implementation for "Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis".

Language:PythonLicense:MITStargazers:334Issues:0Issues:0

VToonify

[SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:3516Issues:0Issues:0

Expression-Net

Deep 3DMM facial expression parameter extraction

Language:PythonStargazers:511Issues:0Issues:0