Beast code in Giters

accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Language: Python | Apache-2.0 | 7094 96 1394

Eureka

Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)

Language: Jupyter Notebook | MIT | 2671 24 31

sglang

SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.

Language: Python | Apache-2.0 | 2467 31 223

webdataset

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Language: Python | BSD-3-Clause | 1997 21 289

ReAct

[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models

Language: Jupyter Notebook | MIT | 1629 16 27

nanotron

Minimalistic large language model 3D-parallelism training

Language: Python | Apache-2.0 | 818 40 54

alpaca_farm

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

Language: Python | Apache-2.0 | 719 8 41

quiet-star

Code for Quiet-STaR

Language: Python | Apache-2.0 | 300 10 6

reward-bench

RewardBench: the first evaluation tool for reward models.

Language: Python | Apache-2.0 | 202 4 39

dynolog

Dynolog is a telemetry daemon for performance monitoring and tracing. It exports metrics from different components in the system, such as the Linux kernel, CPU, disks, Intel PT, and GPUs. Dynolog also integrates with PyTorch and can trigger traces for distributed training applications.

Language: C++ | MIT | 176 13 16

proceduralia

Pierluca D'Oro's starred repositories

system-design-primer

open-interpreter

professional-programming

llm-course

maybe

memray

ml-engineering

PhotoMaker

accelerate

Eureka

sglang

webdataset

ReAct

megablocks-public

nanotron

alpaca_farm

quiet-star

humanoid-bench

reward-bench

dynolog

RLHF-Reward-Modeling

SALMON

DiffusionDPO

ArCHer

RLCD

icl_task_vectors

rlfh-gen-div

args

Uncertainty-Aware-Language-Agent

weenygrad