richard's starred repositories

PruneMe

Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models

Language:PythonStargazers:161Issues:0Issues:0

DistillKit

An Open Source Toolkit For LLM Distillation

Language:PythonLicense:AGPL-3.0Stargazers:228Issues:0Issues:0

sglang

SGLang is yet another fast serving framework for large language models and vision language models.

Language:PythonLicense:Apache-2.0Stargazers:4265Issues:0Issues:0

awesome-production-llm

A curated list of awesome open-source libraries for production LLM

License:MITStargazers:290Issues:0Issues:0

MINT-1T

MINT-1T: A one trillion token multimodal interleaved dataset.

Stargazers:712Issues:0Issues:0

Shapeshift

Transform JSON objects using vector embeddings

Language:TypeScriptStargazers:378Issues:0Issues:0

MobileLLM

MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.

Language:PythonLicense:NOASSERTIONStargazers:906Issues:0Issues:0

txtai

💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows

Language:PythonLicense:Apache-2.0Stargazers:8556Issues:0Issues:0

korvus

Korvus is a search SDK that unifies the entire RAG pipeline in a single database query. Built on top of Postgres with bindings for Python, JavaScript, Rust and C.

Language:RustLicense:MITStargazers:1205Issues:0Issues:0

R2R-Application

react + next.js dashboard for R2R: production-ready RAG engine with a sh*t ton of features.

Language:TypeScriptLicense:MITStargazers:59Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:161Issues:0Issues:0

persona-hub

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Language:PythonStargazers:700Issues:0Issues:0

Algorithm-Of-Thoughts

My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"

Language:PythonLicense:MITStargazers:84Issues:0Issues:0

tau-bench

Code and Data for Tau-Bench

Language:PythonLicense:MITStargazers:81Issues:0Issues:0

graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Language:PythonLicense:MITStargazers:15815Issues:0Issues:0

llama3.1-rag-bot

Llama3.1 RAG system

Language:PythonStargazers:17Issues:0Issues:0

R2R

The Supabase for RAG - R2R lets you build, scale, and manage user-facing Retrieval-Augmented Generation applications in production.

Language:PythonLicense:MITStargazers:3095Issues:0Issues:0

speechbrain

A PyTorch-based Speech Toolkit

Language:PythonLicense:Apache-2.0Stargazers:8434Issues:0Issues:0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:66304Issues:0Issues:0

rocksdb

A library that provides an embeddable, persistent key-value store for fast storage.

Language:C++License:GPL-2.0Stargazers:28119Issues:0Issues:0

baml

BAML is a templating language to write typed LLM functions. Check out the promptfiddle.com playground

Language:RustLicense:Apache-2.0Stargazers:712Issues:0Issues:0

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:24948Issues:0Issues:0

MoA

Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models

Language:PythonLicense:Apache-2.0Stargazers:2482Issues:0Issues:0

Qwen-Agent

Agent framework and applications built upon Qwen2, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.

Language:PythonLicense:NOASSERTIONStargazers:2930Issues:0Issues:0

quickchart

Chart image and QR code web API

Language:JavaScriptLicense:AGPL-3.0Stargazers:1635Issues:0Issues:0

LLM-PlayLab

This playlab encompasses a multitude of projects crafted through the utilization of Large Language Models, showcasing the versatility and impact of these models across various applications.

License:Apache-2.0Stargazers:69Issues:0Issues:0

Llama3-on-Mobile

This repository is an implementation of quantizing and converting the Llama3-8B-Instruct model weights and deploying it on Android for on-device inference.

Language:MakefileLicense:MITStargazers:37Issues:0Issues:0

lancedb

Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!

Language:PythonLicense:Apache-2.0Stargazers:3908Issues:0Issues:0

JamAIBase

Firebase for AI Agents: Open-source backend platform that puts powerful generative models at the core of your database. With managed memory and RAG capabilities, developers can easily build AI agents, enhance their apps with generative tables, and create magical UI experiences.

Language:PythonLicense:Apache-2.0Stargazers:158Issues:0Issues:0