Dinghow Yang's starred repositories
opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
baize-chatbot
Let ChatGPT teach your own chatbot in hours with a single GPU!
DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
ThunderKittens
Tile primitives for speedy kernels
splatter-image
Official implementation of `Splatter Image: Ultra-Fast Single-View 3D Reconstruction' CVPR 2024
chat_templates
Chat Templates for 🤗 HuggingFace Large Language Models
ChunkLlama
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
Agent-FLAN
[ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models
LLMDebugger
LDB: A Large Language Model Debugger via Verifying Runtime Execution Step by Step
SpeculativeDecodingPapers
📰 Must-read papers and blogs on Speculative Decoding ⚡️
Spec-Bench
Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)
LM-Infinite
Implementation of paper "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
PointMetaBase
This is a PyTorch implementation of PointMetaBase proposed by our paper "Meta Architecure for Point Cloud Analysis"
Grounded_3D-LLM
Code&Data for Grounded 3D-LLM with Referent Tokens
llm-scheduling-artifact
Artifact of OSDI '24 paper, ”Llumnix: Dynamic Scheduling for Large Language Model Serving“
Backdoor_DPR
Code for "Backdoor Attacks on Dense Passage Retrievers for Disseminating Misinformation"