linzai1992

followers

following

stars

@Microsoft

Suzhou

Yunlin Chen's starred repositories

Deep3DFaceRecon_pytorch

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.

Language:PythonMIT162000

frame-interpolation

FILM: Frame Interpolation for Large Motion, In ECCV 2022.

Language:PythonApache-2.0277200

RAD-NeRF

Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition

Language:PythonMIT86500

havenask

Language:C++Apache-2.0154200

css10

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

Language:HTMLApache-2.045600

Awesome-Image-Harmonization

A curated list of papers, code and resources pertaining to image harmonization.

Face2FaceRHO

The Official PyTorch Implementation for Face2Face^ρ (ECCV2022)

Language:PythonBSD-3-Clause21200

chrome-music-lab

A collection of experiments for exploring how music works, all built with the Web Audio API.

Language:JavaScriptApache-2.0211600

botsim

BotSIM - a data-efficient end-to-end Bot SIMulation toolkit for evaluation, diagnosis, and improvement of commercial chatbots

Language:Jupyter NotebookBSD-3-Clause11300

Deep3DFaceReconstruction

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019)

Language:PythonMIT215200

TTS-Portuguese-Corpus

Open Source Text-To-Speech Portuguese Dataset

CC-BY-4.014800

FastDeploy

⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio 20+ main stream scenarios and 150+ SOTA models with end-to-end optimization, multi-platform and multi-framework support.

Language:C++Apache-2.0285700

MB-iSTFT-VITS

Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform

Language:PythonApache-2.040400

perceiver-ar

Language:PythonApache-2.022900

stable-diffusion

A latent text-to-image diffusion model

Language:Jupyter NotebookNOASSERTION6694600

audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Language:PythonMIT233600

Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Language:Jupyter Notebook54500

GradTTS

Pytorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech"

Language:PythonMIT17400

AVFR-Gan

Audio-Visual Generative Adversarial Network for Face Reenactment

AudioDVP

AudioDVP:Photorealistic Audio-driven Video Portraits

Language:Python29500

NeuralVoicePuppetryMMD

This github contains the network architectures of NeuralVoicePuppetry.

NOASSERTION7600

NeuralVoicePuppetry

This github contains the network architectures of NeuralVoicePuppetry.

NOASSERTION17200

CVPR2022-DaGAN

Official code for CVPR2022 paper: Depth-Aware Generative Adversarial Network for Talking Head Video Generation

Language:PythonNOASSERTION95600

awesome_talking_face_generation

SSP-NeRF

[ECCV 2022 Oral] Code for "Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation"

Language:Python22900

LIA

[ICLR 22] Latent Image Animator: Learning to Animate Images via Latent Space Navigation

Language:PythonNOASSERTION58100

Text2Video

ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with phonetic dictionary".

Language:Python41500

DFRF

[ECCV2022] The implementation for "Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis".

Language:PythonMIT33400

VToonify

[SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer

Language:Jupyter NotebookNOASSERTION351600

Expression-Net

Deep 3DMM facial expression parameter extraction

Language:Python51100