jayelm

Jesse Mu's starred repositories

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language: Python | Apache-2.0 | 29388 339 268

guidance

A guidance language for controlling large language models.

Language: Jupyter Notebook | MIT | 18805 117 529

dalle-mini

DALL·E Mini - Generate images from a text prompt

Language: Python | Apache-2.0 | 14752 113 155

qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Language: Jupyter Notebook | MIT | 9957 84 248

GPU-Puzzles

Solve puzzles. Learn CUDA.

Language: Jupyter Notebook | MIT | 9257 44 31

MarkovJunior

Probabilistic language based on pattern matching and constraint propagation, 153 examples

Language: C# | MIT | 7454 93 28

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language: Python | MIT | 6583 37 1091

metaseq

Repo for external large-scale work

Language: Python | MIT | 6458 112 294

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language: Python | MIT | 4462 50 290

leap.nvim

Neovim's answer to the mouse 🦘

Language: Fennel | MIT | 4317 15 174

self-instruct

Aligning pretrained language models with instruction data generated by themselves.

Language: Python | Apache-2.0 | 4091 56 19

riffusion-hobby

Stable diffusion for real-time music generation

Language: Python | MIT | 3370 39 93

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language: Python | Apache-2.0 | 2184 23 58

natbot

Drive a browser with GPT-3

Language: Python | MIT | 1899 48 10

MineDojo

Building Open-Ended Embodied Agents with Internet-Scale Knowledge

Language: Java | MIT | 1765 28 119

FLAN

Language: Python | Apache-2.0 | 1463 32 75

MiniChain

A tiny library for coding with large language models.

Language: Python | MIT | 1208 15 11

stable-diffusion

Language: Jupyter Notebook | MIT | 1028 23 25

meltingpot

A suite of test scenarios for multi-agent reinforcement learning.

Language: Python | Apache-2.0 | 593 16 107

lab2d

A customisable 2D platform for agent-based AI research

Language: C++ | Apache-2.0 | 422 14 30

BIG-Bench-Hard

Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them

MIT | 416 3 9

simulacra-aesthetic-captions

Dataset of prompts, synthetic AI generated images, and aesthetic ratings.

392 13 9

cascades

Python library which enables complex compositions of language models such as scratchpads, chain of thought, tool use, selection-inference, and more.

Language: Python | Apache-2.0 | 193 11 1

STaR

Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)

Language: Python | Apache-2.0 | 118 3 1

prontoqa

Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task.

Language: Python | Apache-2.0 | 113 5 6

kilogram

The KiloGram Tangrams dataset

Language: Jupyter Notebook | 50 1 1

sst

Language: Python | MIT | 49 2 5

marl-ae-comm

PyTorch implementation for all models and environments in the paper "Learning to Ground Multi-Agent Communication with Autoencoders"

Language: Python | 43 2 2

ELV

Language: Python | MIT | 20 2 1

stable-ouroboros

Infinite chains of captions and generations

Language: Python | MIT | 8 20