zzxxxl's starred repositories
MobileAgent
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
refusal_direction
Code and results accompanying the paper "Refusal in Language Models Is Mediated by a Single Direction".
PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
llm-applications
A comprehensive guide to building RAG-based LLM applications for production.
TinyChatEngine
TinyChatEngine: On-Device LLM Inference Library
llama3-from-scratch
llama3 implementation one matrix multiplication at a time
LLMSpeculativeSampling
Fast inference from large lauguage models via speculative decoding