YuKun Zhou's starred repositories
InfoGrowth
Efficient and Online Dataset Growth Algorithm (with cleanness and diversity awareness) to deal with growing web data
Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
ERNIE-Layout-Pytorch
An unofficial Pytorch implementation of ERNIE-Layout which is originally released through PaddleNLP.
PixArt-sigma
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
consistency_models
Official repo for consistency models.
ControlNet
Let us control diffusion models!
magvit2-pytorch
Implementation of MagViT2 Tokenizer in Pytorch
InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
stable-diffusion
A latent text-to-image diffusion model
dpm-solver
Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (Neurips 2022 Oral)
NeurIPS_2022-Generative_Hyper_Representations
Code Repository for the NeurIPS 2022 paper: "Hyper-Representations as Generative Models: Sampling Unseen Neural Network Weights".