DachengLi1

dacheng-li.info

Dacheng Li's starred repositories

ring-attention-pytorch

Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch

Language: Python | MIT | 41500

rectified-flow-pytorch

Implementation of rectified flow and some of its followup research / improvements in Pytorch

Language: Python | MIT | 8200

SEED-Story

SEED-Story: Multimodal Long Story Generation with Large Language Model

Language: Python | NOASSERTION | 53600

flute

Fast Matrix Multiplications for Lookup Table-Quantized LLMs

Language: Cuda | Apache-2.0 | 2000

Video-MME

✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

big_vision

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Language: Jupyter Notebook | Apache-2.0 | 205300

LivePortrait

Bring portraits to life!

Language: Python | MIT | 727800

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language: Python | MIT | 386000

retrieval-scaling

Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore".

Language: Python | 5400

GRUtopia

GRUtopia: Dream General Robots in a City at Scale

Language: Python | MIT | 24500

Video-ChatGPT

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

Language: Python | CC-BY-4.0 | 108000

WildAvatar_Toolbox

[ArXiv 2024] WildAvatar: Web-scale In-the-wild Video Dataset for 3D Avatar Creation

Language: Python | NOASSERTION | 6700

VEnhancer

6500

qserve

QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving

Language: Python | Apache-2.0 | 35000

VADER

Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various reward models such as HPS, PickScore, VideoMAE, VJEPA, YOLO, Aesthetics etc.

Language: Python | 11000

MPS

Language: Python | MIT | 4300

prm800k

800,000 step-level correctness labels on LLM solutions to MATH problems

Language: Python | MIT | 135700

LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Language: Python | Apache-2.0 | 255800

FreeInit

[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models

Language: Python | MIT | 45400

PixArt-sigma

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Language: Python | AGPL-3.0 | 148800

HPSv2

Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis

Language: Jupyter Notebook | Apache-2.0 | 32900

unsloth

Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language: Python | Apache-2.0 | 1287900

RouteLLM

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!

Language: Python | Apache-2.0 | 205200

diffusion-forcing

Code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Language: Python | MIT | 32700

OpenVid-1M

Language: Python | 11300

IRASim

Language: Python | Apache-2.0 | 4600

q-diffusion

[ICCV 2023] Q-Diffusion: Quantizing Diffusion Models.

Language: Python | MIT | 30100

delta-iris

Efficient World Models with Context-Aware Tokenization. ICML 2024

Language: Python | GPL-3.0 | 5600

VBench

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Language: Python | Apache-2.0 | 40300

PipeFusion

A Suite for Parallel Inference of Diffusion Transformers (DiTs) on multi-GPU Clusters

Language: Python | Apache-2.0 | 12300