will-thompson-k

followers

following

stars

@tempuslabs

https://willthompson.name

@will_thompson_k

Will Thompson's starred repositories

get-started-with-JAX

The purpose of this repo is to make it easy to get started with JAX, Flax, and Haiku. It contains my "Machine Learning with JAX" series of tutorials (YouTube videos and Jupyter Notebooks) as well as the content I found useful while learning about the JAX ecosystem.

Language:Jupyter NotebookMIT58400

torchtitan

A native PyTorch Library for large model training

Language:PythonBSD-3-Clause119300

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language:PythonApache-2.0407500

llm-swarm

Manage scalable open LLM inference endpoints in Slurm clusters

Language:PythonMIT17600

jan

Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM)

Language:TypeScriptAGPL-3.01981200

paxml

Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry leading model flop utilization rates.

Language:PythonApache-2.041900

transformer-debugger

Language:PythonMIT392800

luigi

Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.

Language:PythonApache-2.01744400

NeMo-Guardrails

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

Language:PythonNOASSERTION359200

open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

Language:C++Apache-2.0405700

llm_steer

Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vectors

Language:PythonMIT17900

weak-to-strong

Language:PythonMIT244400

mistral-inference

Official inference library for Mistral models

Language:Jupyter NotebookApache-2.0898600

axolotl

Go ahead and axolotl questions

Language:PythonApache-2.0649000

LLM-Benchmark-Logs

Just a bunch of benchmark logs for different LLMs

MIT11000

AlpaCare

Language:PythonApache-2.05900

chain-of-verification

Language:Python2600

swarm-jax

Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes

Language:Python22900

litgpt

Pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.

Language:PythonApache-2.0752000

speculative-decoding

Explorations into some recent techniques surrounding speculative decoding

Language:PythonMIT16400

awesome-hallucination-detection

List of papers on hallucination detection in LLMs.

Apache-2.043800

YaLM-100B

Pretrained language model with 100B parameters

Language:PythonApache-2.0372700

LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Language:PythonMIT948300

LLM-Agent-Paper-List

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

triton-transformer

Implementation of a Transformer, but completely in Triton

Language:PythonMIT22500

micrograd

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Language:Jupyter NotebookMIT862600

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Language:PythonNOASSERTION784100

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonApache-2.02070600

litllm

Language:TypeScript1700

pgvector

Open-source vector similarity search for Postgres

Language:CNOASSERTION996500