Benjamin Warner's starred repositories
tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
flash-attention
Fast and memory-efficient exact attention
ml-engineering
Machine Learning Engineering Open Book
LLMsPracticalGuide
A curated list of practical guides and resources for LLMs (LLM tree, examples, papers)
llama-recipes
Scripts for fine-tuning Llama 2 with composable FSDP & PEFT methods, covering single- and multi-node GPU setups. Supports default and custom datasets for applications such as summarization and question answering, along with a number of inference solutions (e.g. HF TGI, vLLM) for local or cloud deployment. Includes demo apps showcasing Llama 2 for WhatsApp & Messenger.
latexify_py
A library to generate LaTeX expressions from Python code.
bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
RAGatouille
Easily use and train state-of-the-art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease of use, backed by research.
mamba-minimal
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
pillow-simd
The friendly PIL fork
sd-akashic
A compendium of information regarding Stable Diffusion (SD)
awesome-stable-diffusion
Curated list of awesome resources for the Stable Diffusion AI Model.
stable-fast
An inference performance optimization framework for Hugging Face Diffusers on NVIDIA GPUs.
the-art-of-debugging
The Art of Debugging
CushyStudio
🛋 The AI and Generative Art platform for everyone
CLIP-Guided-Diffusion
Just playing with getting CLIP Guided Diffusion running locally, rather than having to use Colab.
multipack_sampler
Multipack distributed sampler for fast padding-free training of LLMs
triton-autodiff
An experiment in using Tangent to autodiff Triton