Chaojian Li's starred repositories
starter-workflows
Accelerating new GitHub Actions workflows
arxiv-latex-cleaner
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
Awesome-LLM-Inference
📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
cs249r_book
Collaborative book Machine Learning Systems
TensorRT-Model-Optimizer
TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, sparsity, distillation, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed on NVIDIA GPUs.
tinymembench
Simple benchmark for memory throughput and latency
Grendel-GS
Ongoing research training gaussian splatting at scale by distributed system
nerfbaselines
Reproducible evaluation of NeRF methods
ShiftAddLLM
ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization
LogarithmicPosit
[DAC'24] Official Implementation of the Logarithmic Posit (LP) Number System