Will Thompson's repositories
mistral_7b_lora_example
A simple example illustrating how to fine-tune Mistral 7B via (q)LoRA
alignment-handbook
Robust recipes to align language models with human and AI preferences
axolotl
Go ahead and axolotl questions
chain-of-verification
This repository implements the Chain-of-Verification paper by Meta AI
DNA-Diffusion
🧬 Understanding the code of life: Generative modeling of regulatory DNA sequences with diffusion probabilistic models 💨
flash-attention
Fast and memory-efficient exact attention
generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
gpt-engineer
Specify what you want it to build, the AI asks for clarification, and then builds it.
jax-triton
jax-triton contains integrations between JAX and OpenAI Triton
neurips_llm_efficiency_challenge
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
cv
Print-friendly, minimalist CV page
dont_know_jax
Learning JAX
lit-gpt
Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
llm_steer
Steer LLM outputs toward a specific topic or subject and enhance response capabilities via activation engineering, i.e. adding steering vectors
micrograd
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
mistral-src
Reference implementation of Mistral AI 7B v0.1 model.
open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
paxml
Pax is a JAX-based machine learning framework for training large-scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry-leading model FLOP utilization rates.
torchtitan
A native PyTorch library for large model training
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
vocode-python
🤖 Build voice-based LLM agents. Modular + open source.
xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.