mitko's starred repositories
LocalAI
:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed inference
mistral-inference
Official inference library for Mistral models
PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
postgresml
Postgres with GPUs for ML/AI apps.
LLocalSearch
LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progress of the agents and the final answer. No OpenAI or Google API keys are needed.
alignment-handbook
Robust recipes to align language models with human and AI preferences
sqlite-vec
A vector search SQLite extension that runs anywhere!
cohere-toolkit
Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.
ml-fastvit
This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" ICCV 2023
Phi-3CookBook
This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks.
llm-datasets
High-quality datasets, tools, and concepts for LLM fine-tuning.
fsdp_qlora
Training LLMs with QLoRA + FSDP
pgvectorscale
A complement to pgvector for high performance, cost efficient vector search on large workloads.
PicoMLXServer
The easiest way to run the fastest MLX-based LLMs locally
openinference
Auto-Instrumentation for AI Observability
cohere-terrarium
A simple Python sandbox for helpful LLM data agents
function-calling-eval
A framework for evaluating function calls made by LLMs