Tail's repositories
autogen
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
BentoML
Build Production-Grade AI Applications
builder
Drag and drop headless CMS for React, Vue, Svelte, Qwik, and more
chatllm.cpp
Pure C++ implementation of several models for real-time chatting on your computer (CPU)
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
langchain
⚡ Building applications with LLMs through composability ⚡
List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words
List of Dirty, Naughty, Obscene, and Otherwise Bad Words
litellm
Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)
llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
LLMLingua
To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.
lollms-webui
Lord of Large Language Models Web User Interface
modal-labs-examples
Examples of programs built using Modal
OLMo
Modeling, training, eval, and inference code for OLMo
OneDiffusion
OneDiffusion: Run any Stable Diffusion models and fine-tuned weights with ease
open-interpreter
A natural language interface for computers
openai-kit
A community Swift package used to interact with the OpenAI API
openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
OpenLLM
Operating LLMs in production
phpservermon
PHP Server Monitor
qdrant
Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
quivr
Your GenAI Second Brain 🧠 A personal productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, that you can share with users ! Local & Private alternative to OpenAI GPTs & ChatGPT powered by retrieval-augmented generation.
ShushTranscribe
Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app
StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
trt-llm-rag-windows
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs