Andrej's starred repositories

build-your-own-x

Master programming by recreating your favorite technologies from scratch.

langchain

🦜🔗 Build context-aware reasoning applications

Language: Python | License: MIT | Stargazers: 82602 | Issues: 645 | Issues: 6550

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Python | License: MIT | Stargazers: 59946 | Issues: 505 | Issues: 0

tesseract

Tesseract Open Source OCR Engine (main repository)

Language: C++ | License: Apache-2.0 | Stargazers: 57884 | Issues: 1681 | Issues: 2595

Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Language: Python | License: Apache-2.0 | Stargazers: 36621 | Issues: 422 | Issues: 1643

llama_index

LlamaIndex is a data framework for your LLM applications

Language: Python | License: MIT | Stargazers: 30743 | Issues: 224 | Issues: 3636

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language: Python | License: Apache-2.0 | Stargazers: 28743 | Issues: 336 | Issues: 266

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language: Python | License: Apache-2.0 | Stargazers: 22421 | Issues: 181 | Issues: 3467

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python | License: Apache-2.0 | Stargazers: 17934 | Issues: 178 | Issues: 2437

dalle-mini

DALL·E Mini - Generate images from a text prompt

Language: Python | License: Apache-2.0 | Stargazers: 14632 | Issues: 111 | Issues: 155

mlx

MLX: An array framework for Apple silicon

triton

Development repository for the Triton language and compiler

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Language: Python | License: NOASSERTION | Stargazers: 7522 | Issues: 74 | Issues: 463

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language: Python | License: Apache-2.0 | Stargazers: 6731 | Issues: 109 | Issues: 135

litgpt

Pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.

Language: Python | License: Apache-2.0 | Stargazers: 6526 | Issues: 69 | Issues: 586

xv6-riscv

Xv6 for RISC-V

Language: C | License: NOASSERTION | Stargazers: 6116 | Issues: 94 | Issues: 84

unsloth

2-5X faster 80% less memory LLM finetuning

Language: Python | License: Apache-2.0 | Stargazers: 5780 | Issues: 49 | Issues: 226

lit-llama

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Language: Python | License: Apache-2.0 | Stargazers: 5779 | Issues: 68 | Issues: 264

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language: Python | License: BSD-3-Clause | Stargazers: 5056 | Issues: 57 | Issues: 80

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language: Python | License: Apache-2.0 | Stargazers: 5009 | Issues: 38 | Issues: 32

AlphaCodium

Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering"

Language: Python | License: AGPL-3.0 | Stargazers: 3020 | Issues: 43 | Issues: 14

cramming

Cramming the training of a (BERT-type) language model into limited compute.

Language: Python | License: MIT | Stargazers: 1231 | Issues: 22 | Issues: 32

hlb-CIFAR10

Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)

Language: Python | License: Apache-2.0 | Stargazers: 1187 | Issues: 20 | Issues: 3

VQ-Diffusion

Official implementation of VQ-Diffusion

Language: Python | License: MIT | Stargazers: 831 | Issues: 11 | Issues: 34

yet-another-applied-llm-benchmark

A benchmark to evaluate language models on questions I've previously asked them to solve.

Language: Python | License: GPL-3.0 | Stargazers: 759 | Issues: 0 | Issues: 0

fine-tune-mistral

Fine-tune mistral-7B on 3090s, a100s, h100s

Language: Python | License: MIT | Stargazers: 679 | Issues: 6 | Issues: 5

inbox_cleaner

A python script to help manage a Gmail inbox by filtering out promotional emails using GPT-3 or GPT-4.

llm_rules

RuLES: a benchmark for evaluating rule-following in language models

Language: Python | License: Apache-2.0 | Stargazers: 191 | Issues: 1 | Issues: 1

bpeasy

Fast bare-bones BPE for modern tokenizer training

Language: Python | License: MIT | Stargazers: 125 | Issues: 1 | Issues: 0

vit-vqgan

JAX implementation of ViT-VQGAN

Language: Python | License: MIT | Stargazers: 51 | Issues: 2 | Issues: 0