karpathy

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Language:PythonApache-2.05904 67 269

gemma.cpp

lightweight, standalone C++ inference engine for Google's Gemma models.

Language:C++Apache-2.05823 38 77

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language:PythonBSD-3-Clause5392 63 96

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language:PythonApache-2.05191 38 37

AlphaCodium

Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""

Language:PythonAGPL-3.03315 51 17

cramming

Cramming the training of a (BERT-type) language model into limited compute.

Language:PythonMIT1263 22 34

hlb-CIFAR10

Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)

Language:PythonApache-2.01203 20 3

yet-another-applied-llm-benchmark

A benchmark to evaluate language models on questions I've previously asked them to solve.

Language:PythonGPL-3.0811 17 9

fine-tune-mistral

Fine-tune mistral-7B on 3090s, a100s, h100s

Language:PythonMIT696 6 5

attorch

A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.

Language:PythonMIT424 8 2

inbox_cleaner

A python script to help manage a Gmail inbox by filtering out promotional emails using GPT-3 or GPT-4.

Language:Python410 5 6

llm_rules

RuLES: a benchmark for evaluating rule-following in language models

Language:PythonApache-2.0202 2 3

bpeasy

Fast bare-bones BPE for modern tokenizer training

Language:PythonMIT129 20