Dacheng Li's starred repositories

ring-attention-pytorch

Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch

Language:PythonLicense:MITStargazers:415Issues:0Issues:0

rectified-flow-pytorch

Implementation of rectified flow and some of its followup research / improvements in Pytorch

Language:PythonLicense:MITStargazers:82Issues:0Issues:0

SEED-Story

SEED-Story: Multimodal Long Story Generation with Large Language Model

Language:PythonLicense:NOASSERTIONStargazers:536Issues:0Issues:0

flute

Fast Matrix Multiplications for Lookup Table-Quantized LLMs

Language:CudaLicense:Apache-2.0Stargazers:20Issues:0Issues:0

Video-MME

✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

Stargazers:313Issues:0Issues:0

big_vision

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2053Issues:0Issues:0

LivePortrait

Bring portraits to life!

Language:PythonLicense:MITStargazers:7278Issues:0Issues:0

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonLicense:MITStargazers:3860Issues:0Issues:0

retrieval-scaling

Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".

Language:PythonStargazers:54Issues:0Issues:0

GRUtopia

GRUtopia: Dream General Robots in a City at Scale

Language:PythonLicense:MITStargazers:245Issues:0Issues:0

Video-ChatGPT

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

Language:PythonLicense:CC-BY-4.0Stargazers:1080Issues:0Issues:0

WildAvatar_Toolbox

[ArXiv 2024] WildAvatar: Web-scale In-the-wild Video Dataset for 3D Avatar Creation

Language:PythonLicense:NOASSERTIONStargazers:67Issues:0Issues:0
Stargazers:65Issues:0Issues:0

qserve

QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving

Language:PythonLicense:Apache-2.0Stargazers:350Issues:0Issues:0

VADER

Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various reward models such as HPS, PickScore, VideoMAE, VJEPA, YOLO, Aesthetics etc.

Language:PythonStargazers:110Issues:0Issues:0
Language:PythonLicense:MITStargazers:43Issues:0Issues:0

prm800k

800,000 step-level correctness labels on LLM solutions to MATH problems

Language:PythonLicense:MITStargazers:1357Issues:0Issues:0

LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Language:PythonLicense:Apache-2.0Stargazers:2558Issues:0Issues:0

FreeInit

[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models

Language:PythonLicense:MITStargazers:454Issues:0Issues:0

PixArt-sigma

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Language:PythonLicense:AGPL-3.0Stargazers:1488Issues:0Issues:0

HPSv2

Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:329Issues:0Issues:0

unsloth

Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonLicense:Apache-2.0Stargazers:12879Issues:0Issues:0

RouteLLM

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!

Language:PythonLicense:Apache-2.0Stargazers:2052Issues:0Issues:0

diffusion-forcing

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Language:PythonLicense:MITStargazers:327Issues:0Issues:0
Language:PythonStargazers:113Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:46Issues:0Issues:0

q-diffusion

[ICCV 2023] Q-Diffusion: Quantizing Diffusion Models.

Language:PythonLicense:MITStargazers:301Issues:0Issues:0

delta-iris

Efficient World Models with Context-Aware Tokenization. ICML 2024

Language:PythonLicense:GPL-3.0Stargazers:56Issues:0Issues:0

VBench

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Language:PythonLicense:Apache-2.0Stargazers:403Issues:0Issues:0

PipeFusion

A Suite for Parallel Inference of Diffusion Transformers (DiTs) on multi-GPU Clusters

Language:PythonLicense:Apache-2.0Stargazers:123Issues:0Issues:0