natolambert

User data from Github https://github.com/natolambert

followers

following

stars

Ai2 // Interconnects.ai

Berkeley, CA

https://natolambert.com

Organizations

PisterLab

Nathan Lambert's starred repositories

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Language:PythonApache-2.05598 33 517

Pearl

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Language:Jupyter NotebookMIT2785 37 60

elevenlabs-python

The official Python API for ElevenLabs Text to Speech.

Language:PythonMIT2432 52 284

PufferLib

Simplifying reinforcement learning for complex game environments

Language:CMIT1734 13 19

alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Language:Jupyter NotebookApache-2.01681 9 160

nomic

Interact, analyze and structure massive text, image, embedding, audio and video datasets

Language:Python1561 28 65

RLHF-Reward-Modeling

Recipes to train reward model for RLHF.

Language:PythonApache-2.01238 19 38

awesome-o1

A bibliography and survey of the papers surrounding o1

Language:TeX1178 26 1

yet-another-applied-llm-benchmark

A benchmark to evaluate language models on questions I've previously asked them to solve.

Language:PythonGPL-3.0980 18 17

DataDreamer

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤

Language:PythonMIT979 8 27

MAP-NEO

Language:Python929 12 34

arena-hard-auto

Arena-Hard-Auto: An automatic LLM benchmark.

Language:PythonApache-2.0757 8 39

OLMoE

OLMoE: Open Mixture-of-Experts Language Models

Language:Jupyter NotebookApache-2.0672 11 12

PickScore

Language:PythonMIT483 3 34

rlhf-book

Textbook on reinforcement learning from human feedback

Language:TeXMIT476 16 10

pandoc-book-template

A simple Pandoc template to build documents and ebooks.

Language:CSSMIT423 11 15

smol-podcaster

smol-podcaster is your podcast production agent 🎙️

Language:PythonMIT332 7 5

RLAIF-V

[CVPR'25] RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness

Language:Python307 6 36

awesome-open-source-lms

Friends of OLMo and their links.

WildBench

Benchmarking LLMs with Challenging Tasks from Real Users

Language:PythonApache-2.0218 4 9

LLMBar

[ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following

Language:PythonMIT121 7 4

maya

Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya

Language:PythonApache-2.0107 40

wildguard

Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

Language:PythonNOASSERTION65 3 2

EasyLM

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

Language:PythonApache-2.065 10

async_rlhf

Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models

Language:Python32 3 2

adapt-demos

Lightweight tools for quick and easy LLM demo's

Language:PythonApache-2.026 2 8

m-rewardbench

Official Code for M-RᴇᴡᴀʀᴅBᴇɴᴄʜ: Evaluating Reward Models in Multilingual Settings

Language:PythonMIT26 7 11

general-preference-model

Official implementation of paper "Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment" (https://arxiv.org/abs/2410.02197)

Language:PythonApache-2.022 20

noncompliance

This repository contains data, code and models for contextual noncompliance.

Language:PythonMIT20 2 1

tr-gy-8013-Fa-24

Course material for Fall 24 Deep Learning for Urban Systems Course

Language:Jupyter NotebookMIT300