Yunlin Chen (linzai1992)

linzai1992

Geek Repo

Company:@Microsoft

Location:Suzhou

Github PK Tool:Github PK Tool

Yunlin Chen's starred repositories

InstaFlow

:zap: InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)

Language:PythonLicense:MITStargazers:1022Issues:0Issues:0

syncnet_python

Out of time: automated lip sync in the wild

Language:PythonLicense:MITStargazers:612Issues:0Issues:0

VideoMamba

VideoMamba: State Space Model for Efficient Video Understanding

Language:PythonLicense:Apache-2.0Stargazers:632Issues:0Issues:0

V3D

V3D: Video Diffusion Models are Effective 3D Generators

Language:PythonStargazers:410Issues:0Issues:0

InternLM-XComposer

InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.

Language:PythonStargazers:1836Issues:0Issues:0

speech-datasets-collection

a curated list of speech datasets (110+ datasets, 75+ easy to download)

License:Apache-2.0Stargazers:66Issues:0Issues:0

ZMM-TTS

ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations

Language:CLicense:BSD-3-ClauseStargazers:89Issues:0Issues:0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:17314Issues:0Issues:0

SpeechMOS

Easy-to-Use Speech MOS predictors

Language:PythonLicense:MITStargazers:166Issues:0Issues:0

parler-tts

Inference and training library for high-quality TTS models.

Language:PythonLicense:Apache-2.0Stargazers:2721Issues:0Issues:0

MoVQGAN

MoVQGAN - model for the image encoding and reconstruction

Language:Jupyter NotebookStargazers:104Issues:0Issues:0

TATS

Official PyTorch implementation of TATS: A Long Video Generation Framework with Time-Agnostic VQGAN and Time-Sensitive Transformer (ECCV 2022)

Language:PythonLicense:MITStargazers:246Issues:0Issues:0

MeBT

official implementation of the paper: Towards End-to-End Generative Modeling of Long Videos with Memory-Efficient Bidirectional Transformers (CVPR 2023)

Language:PythonStargazers:28Issues:0Issues:0

vqvae-vqgan-pytorch-lightning

VQ-VAE/GAN implementation in pytorch-lightning

Language:PythonLicense:MITStargazers:34Issues:0Issues:0

taming-transformers

Taming Transformers for High-Resolution Image Synthesis

Language:Jupyter NotebookLicense:MITStargazers:5466Issues:0Issues:0

all-in-one

[CVPR2023] All in One: Exploring Unified Video-Language Pre-training

Language:PythonStargazers:274Issues:0Issues:0

MiniSora-DiT

minisora-DiT, a DiT reproduction based on XTuner from the open source community MiniSora

Language:PythonLicense:Apache-2.0Stargazers:31Issues:0Issues:0

StableVITON

[CVPR2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On

Language:PythonStargazers:771Issues:0Issues:0

maskgit

Official Jax Implementation of MaskGIT

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:387Issues:0Issues:0

LOVECon

Official implementation for "LOVECon: Text-driven Training-free Long Video Editing with ControlNet"

Language:PythonLicense:MITStargazers:35Issues:0Issues:0
Language:PythonStargazers:60Issues:0Issues:0

dust3r

DUSt3R: Geometric 3D Vision Made Easy

Language:PythonLicense:NOASSERTIONStargazers:4437Issues:0Issues:0

CVTHead

[WACV 2024] "CVTHead: One-shot Controllable Head Avatar with Vertex-feature Transformer"

Language:PythonStargazers:68Issues:0Issues:0
Language:PythonStargazers:5525Issues:0Issues:0

latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language:Jupyter NotebookLicense:MITStargazers:10842Issues:0Issues:0

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonLicense:Apache-2.0Stargazers:10657Issues:0Issues:0

minisora

MiniSora: A community aims to explore the implementation path and future development direction of Sora.

Language:PythonLicense:Apache-2.0Stargazers:1064Issues:0Issues:0

DynamiCrafter

DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Language:PythonLicense:Apache-2.0Stargazers:1883Issues:0Issues:0

magvit2-pytorch

Implementation of MagViT2 Tokenizer in Pytorch

Language:PythonLicense:MITStargazers:452Issues:0Issues:0

SoraReview

The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models".

Stargazers:465Issues:0Issues:0