Nathan Lambert's starred repositories
elevenlabs-python
The official Python API for ElevenLabs Text to Speech.
alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
RLHF-Reward-Modeling
Recipes to train reward model for RLHF.
awesome-o1
A bibliography and survey of the papers surrounding o1
yet-another-applied-llm-benchmark
A benchmark to evaluate language models on questions I've previously asked them to solve.
DataDreamer
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤
arena-hard-auto
Arena-Hard-Auto: An automatic LLM benchmark.
pandoc-book-template
A simple Pandoc template to build documents and ebooks.
smol-podcaster
smol-podcaster is your podcast production agent 🎙️
awesome-open-source-lms
Friends of OLMo and their links.
async_rlhf
Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models
adapt-demos
Lightweight tools for quick and easy LLM demo's
m-rewardbench
Official Code for M-RᴇᴡᴀʀᴅBᴇɴᴄʜ: Evaluating Reward Models in Multilingual Settings
general-preference-model
Official implementation of paper "Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment" (https://arxiv.org/abs/2410.02197)
noncompliance
This repository contains data, code and models for contextual noncompliance.
tr-gy-8013-Fa-24
Course material for Fall 24 Deep Learning for Urban Systems Course