There are 24 repositories under the llamacpp topic.
The all-in-one Desktop & Docker AI application with full RAG and AI Agent capabilities.
Private & local AI personal knowledge management app.
A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy-to-use API.
Your AI second brain. A copilot to get answers to your questions, whether they come from your own notes or from the internet. Use powerful online LLMs (e.g. GPT-4) or private local ones (e.g. Mistral). Self-host locally or use our web app. Access from Obsidian, Emacs, the desktop app, the web, or WhatsApp.
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.
A C#/.NET library to run LLM models (🦙LLaMA/LLaVA) on your local device efficiently.
The most no-nonsense, locally or API-hosted AI code completion plugin for Visual Studio Code - like GitHub Copilot but completely free and 100% private.
Chat with your favourite LLaMA models in a native macOS app.
The TypeScript library for building AI applications.
A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.
Believe in AI democratization: llama for Node.js, backed by llama-rs, llama.cpp, and rwkv.cpp, runs locally on your laptop CPU. Supports LLaMA/Alpaca/GPT4All/Vicuna/RWKV models.
Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.
Calculate token/s & GPU memory requirements for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization.
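The kind of estimate such a calculator produces can be sketched with back-of-envelope arithmetic: weight memory is roughly parameter count times bits per weight, plus some headroom for activations and the KV cache. This is an illustrative approximation, not the linked tool's actual method; the 1.2 overhead factor is an assumption.

```python
def estimate_gpu_memory_gb(n_params_billion: float,
                           bits_per_weight: float,
                           overhead_factor: float = 1.2) -> float:
    """Rough VRAM estimate for a quantized LLM.

    weight bytes = params * bits / 8; the overhead factor is an
    illustrative fudge for activations and KV cache, not a measured value.
    """
    weight_bytes = n_params_billion * 1e9 * bits_per_weight / 8
    return weight_bytes * overhead_factor / 1e9

# A 7B model at 4-bit quantization: ~3.5 GB of weights, ~4.2 GB with overhead.
print(f"{estimate_gpu_memory_gb(7, 4):.1f} GB")
```

The same formula shows why quantization matters: the same 7B model in fp16 (16 bits per weight) needs roughly four times the memory.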
Use Code Llama with Visual Studio Code and the Continue extension. A local LLM alternative to GitHub Copilot.
A simple "Be My Eyes" web app with a llama.cpp/llava backend
llama.cpp with the BakLLaVA model describes what it sees.
Local-first semantic code search and chat, powered by vector embeddings and LLMs.
The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). It lets users chat with LLMs, execute structured function calls, and get structured output, and it also works with models not fine-tuned for JSON output or function calling.
The "vicuna-installation-guide" provides step-by-step instructions for installing and configuring Vicuna 13B and 7B.
An innovative library for efficient LLM inference via low-bit quantization
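The general idea behind low-bit quantization can be illustrated with a minimal symmetric int4 sketch: scale floats so the largest magnitude maps to the top of the integer range, round, and store the integers plus one scale. This is a generic illustration of the technique, not this library's actual algorithm or API.

```python
def quantize_int4(weights):
    """Symmetric per-tensor int4 quantization: map floats to [-7, 7]."""
    scale = max(abs(w) for w in weights) / 7 or 1.0  # avoid zero scale
    q = [round(w / scale) for w in weights]
    return q, scale

def dequantize(q, scale):
    """Recover approximate floats from the stored integers."""
    return [v * scale for v in q]

w = [0.12, -0.7, 0.35, 0.02]
q, s = quantize_int4(w)
approx = dequantize(q, s)  # close to w, but stored in 4 bits per weight
```

Each weight now needs only 4 bits instead of 32, at the cost of a reconstruction error bounded by half the scale; real libraries refine this with per-group scales, asymmetric ranges, and outlier handling.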
An AI-powered cybersecurity chatbot designed to provide helpful and accurate answers to your cybersecurity-related queries, and to perform code analysis and scan analysis.
A suite of custom nodes for ComfyUI that includes GPT text-prompt generation, LoadVideo, SaveVideo, LoadFramesFromFolder, and FrameInterpolator.
AubAI brings you on-device gen-AI capabilities, including offline text generation and more, directly within your app.