Georgi Gerganov's starred repositories
LLocalSearch
LLocalSearch is a fully local search aggregator built on LLM agents. The user asks a question and a chain of LLMs works out the answer; the progress of the agents and the final answer are shown along the way. No OpenAI or Google API keys are needed.
distributed-llama
Tensor parallelism is all you need. Run LLMs on weak devices or make powerful devices even more powerful by distributing the workload and dividing the RAM usage.
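The idea behind splitting the workload and dividing RAM usage can be sketched with a toy tensor-parallel matrix-vector product. This is an illustrative sketch only, not distributed-llama's actual implementation: a layer's weight matrix is sharded row-wise across N "devices", each holding and multiplying only its own shard, so per-device weight memory drops to roughly 1/N.

```python
# Toy sketch of row-wise tensor parallelism (illustrative, not the repo's code):
# each "device" stores only its shard of W and computes its slice of y = W @ x.

def matvec(rows, x):
    """Dense matrix-vector product over plain Python lists."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in rows]

def split_rows(W, n_devices):
    """Split W row-wise into n_devices contiguous shards."""
    shard = (len(W) + n_devices - 1) // n_devices
    return [W[i:i + shard] for i in range(0, len(W), shard)]

def parallel_matvec(W, x, n_devices):
    """Each 'device' multiplies its shard; the partial results are concatenated."""
    partials = [matvec(shard, x) for shard in split_rows(W, n_devices)]
    return [y for part in partials for y in part]

W = [[1, 2], [3, 4], [5, 6], [7, 8]]
x = [10, 1]
assert parallel_matvec(W, x, 2) == matvec(W, x)  # [12, 34, 56, 78]
```

In a real setup the shards live on separate machines and only the small input/output vectors cross the network, which is what makes the RAM division worthwhile.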
openvino-plugins-ai-audacity
A set of AI-enabled effects, generators, and analyzers for Audacity®.
llama-cpp-agent
The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). It lets users chat with LLMs, execute structured function calls, and get structured output, and it also works with models that were not fine-tuned for JSON output or function calling.
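The structured-function-call pattern such a framework implements can be sketched as follows. Every name here (`tool`, `dispatch`, the JSON shape) is a hypothetical illustration, not llama-cpp-agent's real API: the model is prompted to emit a JSON object naming a registered function, and a small dispatcher validates and executes the call.

```python
# Hypothetical sketch of LLM function calling (names are NOT llama-cpp-agent's
# real API): the model emits JSON naming a registered function plus arguments,
# and a dispatcher looks the function up and runs it.
import json

REGISTRY = {}

def tool(fn):
    """Register a plain Python function as callable by the model."""
    REGISTRY[fn.__name__] = fn
    return fn

@tool
def add(a: int, b: int) -> int:
    return a + b

def dispatch(model_output: str):
    """Parse the model's JSON reply and run the requested function."""
    call = json.loads(model_output)
    fn = REGISTRY[call["function"]]        # KeyError signals an unknown function
    return fn(**call["arguments"])

# A model not fine-tuned for function calling can still be steered into this
# shape via few-shot prompting or grammar-constrained sampling.
reply = '{"function": "add", "arguments": {"a": 2, "b": 3}}'
print(dispatch(reply))  # 5
```

The framework's claim of working with non-fine-tuned models corresponds to constraining or coaxing the model into the agreed JSON shape rather than relying on special function-calling training.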
chatllm.cpp
Pure C++ implementation of several models for real-time chatting on your computer (CPU)
llama-cpp-wasm
WebAssembly (Wasm) Build and Bindings for llama.cpp
llama_cpp-rs
High-level, optionally asynchronous Rust bindings to llama.cpp
rk3588-npu
Reverse engineering the RK3588 NPU
whispercpp
Pybind11 bindings for Whisper.cpp
emoji_finder