zzxxxl's starred repositories
llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
llama_index
LlamaIndex is a data framework for your LLM applications
Langchain-Chatchat
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain
LocalAI
:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed inference
MobileAgent
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
evolutionary-model-merge
Official repository of Evolutionary Optimization of Model Merging Recipes
CUDA-Learn-Notes
🎉 CUDA Learn Notes with PyTorch: fp32、fp16/bf16、fp8/int8、flash_attn、sgemm、sgemv、warp/block reduce、dot prod、elementwise、softmax、layernorm、rmsnorm、hist etc.
gpt-assistant-android
免费的ChatGPT API的安卓语音助手,可用音量键唤起并进行语音交流,支持联网、Vision拍照识图、提问模板等功能 | A free ChatGPT API voice assistant for Android, activated via volume keys for voice interaction, supporting features such as network connectivity, Vision photo recognition, and question templates.
flash-attention-minimal
Flash Attention in ~100 lines of CUDA (forward pass only)
refusal_direction
Code and results accompanying the paper "Refusal in Language Models Is Mediated by a Single Direction".
LLM-Extrapolation
Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"
lottery-ticket-adaptation
Lottery Ticket Adaptation
Evaluation-Multimodal-LLMs-Survey
A Survey on Benchmarks of Multimodal Large Language Models
RepresentationSurgery
Representation Surgery for Multi-Task Model Merging. ICML, 2024.