Yunlin Chen (linzai1992)

Company: @Microsoft

Location: Suzhou

Yunlin Chen's starred repositories

StreamDiffusion

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Language: Python | License: Apache-2.0 | Stargazers: 8867 | Issues: 72 | Issues: 82
Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2840 | Issues: 24 | Issues: 71

vq-vae-2-pytorch

Implementation of Generating Diverse High-Fidelity Images with VQ-VAE-2 in PyTorch

Language: Python | License: NOASSERTION | Stargazers: 1511 | Issues: 20 | Issues: 77

dreamtalk

Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

Language: Python | License: MIT | Stargazers: 1306 | Issues: 27 | Issues: 41

fish-speech

Brand new TTS solution

Language: Python | License: BSD-3-Clause | Stargazers: 839 | Issues: 26 | Issues: 80

InternVL

[CVPR 2024 Oral] InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks - An Open-Source Alternative to ViT-22B

Language: Python | License: MIT | Stargazers: 780 | Issues: 10 | Issues: 65

Make-A-Character

Official repo for Make-A-Character: High Quality Text-to-3D Character Generation within Minutes

En3D

Official implementation of "En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data"

Language: Python | License: Apache-2.0 | Stargazers: 391 | Issues: 40 | Issues: 6

Awesome-Talking-Head-Synthesis

💬 An extensive collection of exceptional resources dedicated to the captivating world of talking face synthesis! ⭐ If you find this repo useful, please give it a star! 🤩

emotion2vec

Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

SVD_Xtend

Stable Video Diffusion Training Code and Extensions.

ZeroSpeech

VQ-VAE for Acoustic Unit Discovery and Voice Conversion

Aurora

🐳 Aurora is a [Chinese Version] MoE model. Aurora is a further work based on Mixtral-8x7B, which activates the chat capability of the model's Chinese open domain.

Language: Python | License: Apache-2.0 | Stargazers: 254 | Issues: 8 | Issues: 16

megatts2

Unofficial implementation of Megatts2

Language: Python | License: MIT | Stargazers: 199 | Issues: 19 | Issues: 11

havatar

[TOG 2023] HAvatar: High-fidelity Head Avatar via Facial Model Conditioned Neural Radiance Field

Bridge-TTS

Official codebase for "Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis" (https://arxiv.org/abs/2312.03491).

PitchExtractor

Deep Neural Pitch Extractor for Voice Conversion and TTS Training

Language: Python | License: MIT | Stargazers: 103 | Issues: 5 | Issues: 7

LatentAvatar

A PyTorch implementation of "LatentAvatar: Learning Latent Expression Code for Expressive Neural Head Avatar"

Language: Python | License: MIT | Stargazers: 96 | Issues: 19 | Issues: 10

AvatarMAV

A PyTorch implementation of "AvatarMAV: Fast 3D Head Avatar Reconstruction Using Motion-Aware Neural Voxels"

Language: Python | License: MIT | Stargazers: 83 | Issues: 15 | Issues: 9

NVEdit

Official PyTorch implementation for the paper "Neural Video Fields Editing"

SyncMVD

Official PyTorch & Diffusers implementation of "Text-Guided Texturing by Synchronized Multi-View Diffusion"

Language: Python | License: MIT | Stargazers: 70 | Issues: 4 | Issues: 0

BFRffusion

Official code for "Towards Real-World Blind Face Restoration with Generative Diffusion Prior"

Language: Python | License: MIT | Stargazers: 54 | Issues: 0 | Issues: 0

DeepDance

Code repo of the paper "DeepDance: Music-to-Dance Motion Choreography with Adversarial Learning"

Language: Python | License: GPL-3.0 | Stargazers: 52 | Issues: 3 | Issues: 3

xiaoicesing2

The source code for the paper XiaoiceSing2 (Interspeech 2023)

Language: Python | License: BSD-3-Clause | Stargazers: 41 | Issues: 6 | Issues: 0

Pro-Motion

Plan, Posture and Go: Towards Open-World Text-to-Motion Generation

Stargazers: 32 | Issues: 0 | Issues: 0

music2dance

Audio-driven synthesis of choreographic movements using GANs

Wikiformer

Code for AAAI 2024 paper Wikiformer

Language: Python | License: Apache-2.0 | Stargazers: 15 | Issues: 0 | Issues: 0