Jesse Mu (jayelm)

Company: Stanford University

Location: Stanford, CA

Home Page: https://cs.stanford.edu/~muj/


Organizations
bccss
haleyhouse

Jesse Mu's starred repositories

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language: Python | License: Apache-2.0 | Stargazers: 29388 | Issues: 339 | Issues: 268

guidance

A guidance language for controlling large language models.

Language: Jupyter Notebook | License: MIT | Stargazers: 18805 | Issues: 117 | Issues: 529

dalle-mini

DALL·E Mini - Generate images from a text prompt

Language: Python | License: Apache-2.0 | Stargazers: 14752 | Issues: 113 | Issues: 155

qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Language: Jupyter Notebook | License: MIT | Stargazers: 9957 | Issues: 84 | Issues: 248

GPU-Puzzles

Solve puzzles. Learn CUDA.

Language: Jupyter Notebook | License: MIT | Stargazers: 9257 | Issues: 44 | Issues: 31

MarkovJunior

Probabilistic language based on pattern matching and constraint propagation, 153 examples

Language: C# | License: MIT | Stargazers: 7454 | Issues: 93 | Issues: 28

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language: Python | License: MIT | Stargazers: 6583 | Issues: 37 | Issues: 1091

metaseq

Repo for external large-scale work

Language: Python | License: MIT | Stargazers: 6458 | Issues: 112 | Issues: 294

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language: Python | License: MIT | Stargazers: 4462 | Issues: 50 | Issues: 290

leap.nvim

Neovim's answer to the mouse 🦘

Language: Fennel | License: MIT | Stargazers: 4317 | Issues: 15 | Issues: 174

self-instruct

Aligning pretrained language models with instruction data generated by themselves.

Language: Python | License: Apache-2.0 | Stargazers: 4091 | Issues: 56 | Issues: 19

riffusion-hobby

Stable diffusion for real-time music generation

Language: Python | License: MIT | Stargazers: 3370 | Issues: 39 | Issues: 93

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language: Python | License: Apache-2.0 | Stargazers: 2184 | Issues: 23 | Issues: 58

natbot

Drive a browser with GPT-3

Language: Python | License: MIT | Stargazers: 1899 | Issues: 48 | Issues: 10

MineDojo

Building Open-Ended Embodied Agents with Internet-Scale Knowledge

Language: Java | License: MIT | Stargazers: 1765 | Issues: 28 | Issues: 119
Language: Python | License: Apache-2.0 | Stargazers: 1463 | Issues: 32 | Issues: 75

MiniChain

A tiny library for coding with large language models.

Language: Python | License: MIT | Stargazers: 1208 | Issues: 15 | Issues: 11
Language: Jupyter Notebook | License: MIT | Stargazers: 1028 | Issues: 23 | Issues: 25

meltingpot

A suite of test scenarios for multi-agent reinforcement learning.

Language: Python | License: Apache-2.0 | Stargazers: 593 | Issues: 16 | Issues: 107

lab2d

A customisable 2D platform for agent-based AI research

Language: C++ | License: Apache-2.0 | Stargazers: 422 | Issues: 14 | Issues: 30

BIG-Bench-Hard

Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them

simulacra-aesthetic-captions

Dataset of prompts, synthetic AI generated images, and aesthetic ratings.

cascades

Python library which enables complex compositions of language models such as scratchpads, chain of thought, tool use, selection-inference, and more.

Language: Python | License: Apache-2.0 | Stargazers: 193 | Issues: 11 | Issues: 1

STaR

Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)

Language: Python | License: Apache-2.0 | Stargazers: 118 | Issues: 3 | Issues: 1

prontoqa

Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task.

Language: Python | License: Apache-2.0 | Stargazers: 113 | Issues: 5 | Issues: 6

kilogram

The KiloGram Tangrams dataset

Language: Jupyter Notebook | Stargazers: 50 | Issues: 1 | Issues: 1
Language: Python | License: MIT | Stargazers: 49 | Issues: 2 | Issues: 5

marl-ae-comm

PyTorch implementation for all models and environments in the paper "Learning to Ground Multi-Agent Communication with Autoencoders"

Language: Python | License: MIT | Stargazers: 20 | Issues: 2 | Issues: 1

stable-ouroboros

Infinite chains of captions and generations

Language: Python | License: MIT | Stargazers: 8 | Issues: 2 | Issues: 0