Will Thompson (will-thompson-k)

will-thompson-k

Geek Repo

Company:@tempuslabs

Home Page:https://willthompson.name

Twitter:@will_thompson_k

Github PK Tool:Github PK Tool

Will Thompson's starred repositories

get-started-with-JAX

The purpose of this repo is to make it easy to get started with JAX, Flax, and Haiku. It contains my "Machine Learning with JAX" series of tutorials (YouTube videos and Jupyter Notebooks) as well as the content I found useful while learning about the JAX ecosystem.

Language:Jupyter NotebookLicense:MITStargazers:584Issues:0Issues:0

torchtitan

A native PyTorch Library for large model training

Language:PythonLicense:BSD-3-ClauseStargazers:1193Issues:0Issues:0

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language:PythonLicense:Apache-2.0Stargazers:4075Issues:0Issues:0

llm-swarm

Manage scalable open LLM inference endpoints in Slurm clusters

Language:PythonLicense:MITStargazers:176Issues:0Issues:0

jan

Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM)

Language:TypeScriptLicense:AGPL-3.0Stargazers:19812Issues:0Issues:0

paxml

Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry leading model flop utilization rates.

Language:PythonLicense:Apache-2.0Stargazers:419Issues:0Issues:0
Language:PythonLicense:MITStargazers:3928Issues:0Issues:0

luigi

Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.

Language:PythonLicense:Apache-2.0Stargazers:17444Issues:0Issues:0

NeMo-Guardrails

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

Language:PythonLicense:NOASSERTIONStargazers:3592Issues:0Issues:0

open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

Language:C++License:Apache-2.0Stargazers:4057Issues:0Issues:0

llm_steer

Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vectors

Language:PythonLicense:MITStargazers:179Issues:0Issues:0
Language:PythonLicense:MITStargazers:2444Issues:0Issues:0

mistral-inference

Official inference library for Mistral models

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:8986Issues:0Issues:0

axolotl

Go ahead and axolotl questions

Language:PythonLicense:Apache-2.0Stargazers:6490Issues:0Issues:0

LLM-Benchmark-Logs

Just a bunch of benchmark logs for different LLMs

License:MITStargazers:110Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:59Issues:0Issues:0
Language:PythonStargazers:26Issues:0Issues:0

swarm-jax

Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes

Language:PythonStargazers:229Issues:0Issues:0

litgpt

Pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.

Language:PythonLicense:Apache-2.0Stargazers:7520Issues:0Issues:0

speculative-decoding

Explorations into some recent techniques surrounding speculative decoding

Language:PythonLicense:MITStargazers:164Issues:0Issues:0

awesome-hallucination-detection

List of papers on hallucination detection in LLMs.

License:Apache-2.0Stargazers:438Issues:0Issues:0

YaLM-100B

Pretrained language model with 100B parameters

Language:PythonLicense:Apache-2.0Stargazers:3727Issues:0Issues:0

LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Language:PythonLicense:MITStargazers:9483Issues:0Issues:0

LLM-Agent-Paper-List

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

Stargazers:5557Issues:0Issues:0

triton-transformer

Implementation of a Transformer, but completely in Triton

Language:PythonLicense:MITStargazers:225Issues:0Issues:0

micrograd

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Language:Jupyter NotebookLicense:MITStargazers:8626Issues:0Issues:0

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Language:PythonLicense:NOASSERTIONStargazers:7841Issues:0Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:20706Issues:0Issues:0
Language:TypeScriptStargazers:17Issues:0Issues:0

pgvector

Open-source vector similarity search for Postgres

Language:CLicense:NOASSERTIONStargazers:9965Issues:0Issues:0