Dacheng Li's starred repositories
ring-attention-pytorch
Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in PyTorch
rectified-flow-pytorch
Implementation of rectified flow and some of its follow-up research and improvements in PyTorch
SEED-Story
SEED-Story: Multimodal Long Story Generation with Large Language Model
big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
LivePortrait
Bring portraits to life!
retrieval-scaling
Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore".
Video-ChatGPT
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
WildAvatar_Toolbox
[ArXiv 2024] WildAvatar: Web-scale In-the-wild Video Dataset for 3D Avatar Creation
PixArt-sigma
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
diffusion-forcing
Code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
q-diffusion
[ICCV 2023] Q-Diffusion: Quantizing Diffusion Models.
delta-iris
Efficient World Models with Context-Aware Tokenization. ICML 2024
PipeFusion
A Suite for Parallel Inference of Diffusion Transformers (DiTs) on multi-GPU Clusters