Nathan Lambert (natolambert)

natolambert

User data from Github https://github.com/natolambert

Company:Ai2 // Interconnects.ai

Location:Berkeley, CA

Home Page:https://natolambert.com

GitHub:@natolambert

Twitter:@natolambert


Organizations
PisterLab

Nathan Lambert's starred repositories

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Language:PythonLicense:Apache-2.0Stargazers:5598Issues:33Issues:517

Pearl

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Language:Jupyter NotebookLicense:MITStargazers:2785Issues:37Issues:60

elevenlabs-python

The official Python API for ElevenLabs Text to Speech.

Language:PythonLicense:MITStargazers:2432Issues:52Issues:284

PufferLib

Simplifying reinforcement learning for complex game environments

alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1681Issues:9Issues:160

nomic

Interact, analyze and structure massive text, image, embedding, audio and video datasets

RLHF-Reward-Modeling

Recipes to train reward model for RLHF.

Language:PythonLicense:Apache-2.0Stargazers:1238Issues:19Issues:38

awesome-o1

A bibliography and survey of the papers surrounding o1

yet-another-applied-llm-benchmark

A benchmark to evaluate language models on questions I've previously asked them to solve.

Language:PythonLicense:GPL-3.0Stargazers:980Issues:18Issues:17

DataDreamer

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models.   🤖💤

Language:PythonLicense:MITStargazers:979Issues:8Issues:27

arena-hard-auto

Arena-Hard-Auto: An automatic LLM benchmark.

Language:PythonLicense:Apache-2.0Stargazers:757Issues:8Issues:39

OLMoE

OLMoE: Open Mixture-of-Experts Language Models

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:672Issues:11Issues:12

rlhf-book

Textbook on reinforcement learning from human feedback

Language:TeXLicense:MITStargazers:476Issues:16Issues:10

pandoc-book-template

A simple Pandoc template to build documents and ebooks.

Language:CSSLicense:MITStargazers:423Issues:11Issues:15

smol-podcaster

smol-podcaster is your podcast production agent 🎙️

Language:PythonLicense:MITStargazers:332Issues:7Issues:5

RLAIF-V

[CVPR'25] RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness

awesome-open-source-lms

Friends of OLMo and their links.

WildBench

Benchmarking LLMs with Challenging Tasks from Real Users

Language:PythonLicense:Apache-2.0Stargazers:218Issues:4Issues:9

LLMBar

[ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following

Language:PythonLicense:MITStargazers:121Issues:7Issues:4

maya

Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya

Language:PythonLicense:Apache-2.0Stargazers:107Issues:4Issues:0

wildguard

Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

Language:PythonLicense:NOASSERTIONStargazers:65Issues:3Issues:2

EasyLM

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

Language:PythonLicense:Apache-2.0Stargazers:65Issues:1Issues:0

async_rlhf

Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models

adapt-demos

Lightweight tools for quick and easy LLM demo's

Language:PythonLicense:Apache-2.0Stargazers:26Issues:2Issues:8

m-rewardbench

Official Code for M-RᴇᴡᴀʀᴅBᴇɴᴄʜ: Evaluating Reward Models in Multilingual Settings

Language:PythonLicense:MITStargazers:26Issues:7Issues:11

general-preference-model

Official implementation of paper "Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment" (https://arxiv.org/abs/2410.02197)

Language:PythonLicense:Apache-2.0Stargazers:22Issues:2Issues:0

noncompliance

This repository contains data, code and models for contextual noncompliance.

Language:PythonLicense:MITStargazers:20Issues:2Issues:1

tr-gy-8013-Fa-24

Course material for Fall 24 Deep Learning for Urban Systems Course

Language:Jupyter NotebookLicense:MITStargazers:3Issues:0Issues:0