Rongjie Yi's starred repositories
SpeculativeDecodingPapers
📰 Must-read papers and blogs on Speculative Decoding ⚡️
awesome-mixture-of-experts
A collection of AWESOME things about mixture-of-experts
Awesome-LLM-Inference
📖 A curated list of awesome LLM inference papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention, etc.
once-for-all
[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment
Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
llm_interview_note
Notes covering mainly the knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers
gemma_pytorch
The official PyTorch implementation of Google's Gemma models
Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
Awesome-Diffusion-Model-Based-Image-Editing-Methods
Diffusion Model-Based Image Editing: A Survey (arXiv)
smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
stable-diffusion.cpp
Stable Diffusion in pure C/C++
Efficient_Foundation_Model_Survey
Survey Paper List - Efficient LLM and Foundation Models
Personal_LLM_Agents_Survey
Paper list for Personal LLM Agents
Awesome-Quantization-Papers
List of papers related to neural network quantization in recent AI conferences and journals.
Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs