llama

There are 178 repositories under the llama topic.

ollama / ollama
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
deepseek gemma gemma3 gemma3n go golang gpt-oss llama llama2 llama3 llava llm llms mistral ollama phi4 qwen
Language:Go 155539
vllm-project / vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
amd blackwell cuda deepseek deepseek-v3 gpt gpt-oss inference kimi llama llm llm-serving model-serving moe openai pytorch qwen qwen3 tpu transformer
Language:Python 62456
hiyouga / LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
agent ai deepseek fine-tuning gemma gpt instruction-tuning large-language-models llama llama3 llm lora moe nlp peft qlora quantization qwen rlhf transformers
Language:Python 62040
unslothai / unsloth
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
agent deepseek deepseek-r1 fine-tuning gemma gemma3 gpt-oss llama llama3 llm llms mistral openai qwen qwen3 reinforcement-learning text-to-speech tts unsloth voice-cloning
Language:Python 48027
Aider-AI / aider
aider is AI pair programming in your terminal
anthropic chatgpt claude-3 cli command-line gemini gpt-3 gpt-35-turbo gpt-4 gpt-4o llama openai sonnet
Language:Python 38261
mudler / LocalAI
🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P and decentralized inference
ai api audio-generation decentralized distributed gemma image-generation libp2p llama llm mamba mcp mistral musicgen object-detection rerank rwkv stable-diffusion text-generation tts
Language:Go 37778
chatchat-space / Langchain-Chatchat
Langchain-Chatchat (formerly langchain-ChatGLM): a local-knowledge-based RAG and Agent app built with langchain on LLMs such as ChatGLM, Qwen and Llama
chatbot chatchat chatglm chatgpt embedding faiss fastchat gpt knowledge-base langchain langchain-chatglm llama llm milvus ollama qwen rag retrieval-augmented-generation streamlit xinference
Language:Python 36466
fishaudio / fish-speech
SOTA Open Source TTS
llama transformer tts valle vits vqgan vqvae
Language:Python 24005
haotian-liu / LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
chatbot chatgpt foundation-models gpt-4 instruction-tuning llama llama-2 llama2 llava multi-modality multimodal vision-language-model visual-language-learning
Language:Python 23912
HqWu-HITCS / Awesome-Chinese-LLM
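A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and are cheaper to train, covering base models, domain-specific fine-tunes and applications, datasets, and tutorials.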
awesome-lists chatglm chinese llama llm nlp
21646
yamadashy / repomix
📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, DeepSeek, Perplexity, Gemini, Gemma, Llama, Grok, and more.
ai anthropic artificial-intelligence chatbot chatgpt claude deepseek developer-tools gemini genai generative-ai gpt javascript language-model llama llm mcp nodejs openai typescript
Language:TypeScript 20089
sgl-project / sglang
SGLang is a fast serving framework for large language models and vision language models.
blackwell cuda deepseek deepseek-r1 deepseek-v3 deepseek-v3-2 gpt-oss inference kimi llama llama3 llava llm llm-serving moe openai pytorch qwen3 transformer vlm
Language:Python 19970
ymcui / Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment
alpaca alpaca-2 large-language-models llama llama-2 llm lora nlp plm pre-trained-language-models quantization
Language:Python 18940
meta-llama / llama-cookbook
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using the Llama model family and how to use the models on various provider services.
ai finetuning langchain llama llama2 llm machine-learning python pytorch vllm
Language:Jupyter Notebook 18013
GaiZhenbiao / ChuanhuChatGPT
GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.
chatbot chatgpt-api chatglm claude dalle3 ernie gemini gemma llama midjourney minimax moss ollama qwen spark stablelm inspurai
Language:Python 15434
LlamaFamily / Llama-Chinese
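Llama Chinese community: continuously collects the latest Llama learning resources and aims to build the best open-source ecosystem for Chinese Llama models; fully open source and available for commercial use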
agent llama llama4 llm pretraining rl
Language:Python 14730
AstrBotDevs / AstrBot
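✨ All-in-one LLM chatbot platform and development framework ✨ Supports QQ, QQ Channel, Telegram, WeCom, Feishu, DingTalk | Knowledge base, MCP server, OpenAI, DeepSeek, Gemini, SiliconFlow, Moonshot AI, Ollama, OneAPI, Dify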
agent ai chatbot chatgpt docker gemini gpt llama llm mcp openai python qq qqbot qqchannel telegram
Language:Python 13166
cocktailpeanut / dalai
The simplest way to run LLaMA on your local machine
ai llama llm
Language:CSS 13038
PaddlePaddle / PaddleNLP
Easy-to-use and powerful LLM and SLM library with awesome model zoo.
nlp embedding bert ernie paddlenlp pretrained-models transformers information-extraction question-answering search-engine semantic-analysis sentiment-analysis neural-search uie document-intelligence compression llm distributed-training llama
Language:Python 12836
bentoml / OpenLLM
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.
bentoml fine-tuning llama llama2 llama3-1 llama3-2 llama3-2-vision llm llm-inference llm-ops llm-serving llmops mistral mlops model-inference open-source-llm openllm vicuna
Language:Python 11920
ludwig-ai / ludwig
Low-code framework for building custom LLMs, neural networks, and other AI models
computer-vision data-centric data-science deep deep-learning deeplearning fine-tuning learning llama llama2 llm llm-training machine-learning machinelearning mistral ml natural-language natural-language-processing neural-network pytorch
Language:Python 11611
TheR1D / shell_gpt
A command-line productivity tool powered by AI large language models like GPT-4 that helps you accomplish your tasks faster and more efficiently.
chatgpt cheat-sheet cli commands gpt-3 gpt-4 linux llama llm ollama openai productivity python shell terminal
Language:Python 11516
getumbrel / llama-gpt
A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!
ai chatgpt code-llama codellama gpt gpt-4 gpt4all llama llama-2 llama-cpp llama2 llamacpp llm localai openai self-hosted
Language:TypeScript 10990
modelscope / ms-swift
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Phi4, ...) (AAAI 2025).
deepseek-r1 embedding grpo internvl liger llama llama4 llm lora megatron moe multimodal open-r1 peft qwen3 qwen3-next qwen3-omni qwen3-vl reranker sft
Language:Python 10939
tensorzero / tensorzero
TensorZero is an open-source stack for industrial-grade LLM applications. It unifies an LLM gateway, observability, optimization, evaluation, and experimentation.
ai ai-engineering anthropic artificial-intelligence deep-learning genai generative-ai gpt large-language-models llama llm llmops llms machine-learning ml ml-engineering mlops openai python rust
Language:Rust 10519
dataelement / bisheng
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.
agent ai chatbot enterprise finetune genai gpt langchian llama llm llmdevops llmops ocr openai orchestration python rag react sft workflow
Language:TypeScript 9887
bigscience-workshop / petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
bloom deep-learning distributed-systems language-models large-language-models machine-learning neural-networks pytorch volunteer-computing pipeline-parallelism tensor-parallelism guanaco llama chatbot gpt transformer nlp pretrained-models falcon mixtral
Language:Python 9830
langchain4j / langchain4j
LangChain4j is an open-source Java library that simplifies the integration of LLMs into Java applications through a unified API, providing access to popular LLMs and vector databases. It makes implementing RAG, tool calling (including support for MCP), and agents easy. LangChain4j integrates seamlessly with various enterprise Java frameworks.
anthropic chatgpt chroma embeddings gemini gpt huggingface java langchain llama llm llms milvus ollama onnx openai openai-api pgvector pinecone vector-database
Language:Java 9529
LostRuins / koboldcpp
Run GGUF models easily with a KoboldAI UI. One File. Zero Install.
koboldcpp llamacpp llm koboldai llama ggml gguf gemma language-model mistral
Language:C++ 8854
xorbitsai / inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
artificial-intelligence chatglm deployment flan-t5 gemma ggml glm4 inference llama llama3 llamacpp llm machine-learning mistral openai-api pytorch qwen vllm whisper wizardlm
Language:Python 8704
oumi-ai / oumi
Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!
dpo evaluation fine-tuning gpt-oss gpt-oss-120b gpt-oss-20b inference llama llms sft slms vlms
Language:Python 8590
SJTU-IPADS / PowerInfer
High-speed Large Language Model Serving for Local Deployment
large-language-models llama llm llm-inference local-inference
Language:C++ 8378
reorproject / reor
Private & local AI personal knowledge management app for high entropy people.
ai lancedb llama llamacpp local-first markdown note-taking ollama pkm rag second-brain vector-database
Language:JavaScript 8364
LianjiaTech / BELLE
BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational large language model)
bloom chinese-nlp gpt-evaluation gpt-q instruct-finetune instruct-gpt instruction-set llama lora open-models
Language:HTML 8260
zilliztech / GPTCache
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
chatbot chatgpt chatgpt-api llm milvus similarity-search vector-search aigc openai memcache gpt langchain autogpt redis babyagi llama-index llama dolly semantic-search
Language:Python 7817
ymcui / Chinese-LLaMA-Alpaca-2
Phase 2 of the Chinese LLaMA-2 & Alpaca-2 LLM project, with 64K long-context models
64k alpaca alpaca-2 alpaca2 flash-attention large-language-models llama llama-2 llama2 llm nlp rlhf yarn
Language:Python 7175
