Charles Foster's starred repositories

llama.cpp

LLM inference in C/C++

mlx

MLX: An array framework for Apple silicon

llamafile

Distribute and run LLMs with a single file.

Language:C++License:NOASSERTIONStargazers:13769Issues:126Issues:285

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language:PythonLicense:Apache-2.0Stargazers:10891Issues:90Issues:991

unsloth

Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonLicense:Apache-2.0Stargazers:7037Issues:54Issues:264

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language:PythonLicense:BSD-3-ClauseStargazers:5071Issues:57Issues:80

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language:PythonLicense:Apache-2.0Stargazers:3764Issues:110Issues:109

higgsfield

Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3252Issues:79Issues:1

RAGatouille

Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.

Language:PythonLicense:Apache-2.0Stargazers:2107Issues:21Issues:131

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language:PythonLicense:Apache-2.0Stargazers:1629Issues:16Issues:72

langroid

Harness LLMs with Multi-Agent Programming

Language:PythonLicense:MITStargazers:1522Issues:17Issues:127

ai-exploits

A collection of real world AI/ML exploits for responsibly disclosed vulnerabilities

Language:PythonLicense:NOASSERTIONStargazers:1223Issues:27Issues:1

distilabel

⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.

Language:PythonLicense:Apache-2.0Stargazers:819Issues:12Issues:208

ATLAS

A principled instruction benchmark on formulating effective queries and prompts for large language models (LLMs). Our paper: https://arxiv.org/abs/2312.16171

Language:PythonLicense:Apache-2.0Stargazers:673Issues:15Issues:5

HALOs

A library with extensible implementations of DPO, KTO, PPO, and other human-aware loss functions (HALOs).

Language:PythonLicense:Apache-2.0Stargazers:536Issues:6Issues:14

Memory-Cache

MemoryCache is an experimental development project to turn a local desktop environment into an on-device AI agent

Language:JavaScriptLicense:MPL-2.0Stargazers:519Issues:15Issues:29

flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Language:PythonLicense:MITStargazers:399Issues:14Issues:1

annotated-mamba

Annotated version of the Mamba paper

Language:Jupyter NotebookLicense:MITStargazers:397Issues:23Issues:2
Language:PythonLicense:CC-BY-4.0Stargazers:226Issues:11Issues:16

stripedhyena

Repository for StripedHyena, a state-of-the-art beyond Transformer architecture

Language:PythonLicense:Apache-2.0Stargazers:216Issues:4Issues:6

zoology

Understand and test language model architectures on synthetic tasks.

Language:PythonLicense:Apache-2.0Stargazers:133Issues:15Issues:15

accelerated-scan

Accelerated First Order Parallel Associative Scan

Language:PythonLicense:MITStargazers:89Issues:6Issues:4
Language:PythonLicense:MITStargazers:56Issues:4Issues:1

eurisko

Doug Lenat's EURISKO from SAIL archives circa 1981

json-gpt

Fast and simple library to get correct JSON output from GPT

Language:PythonStargazers:4Issues:2Issues:0