Pierluca D'Oro's starred repositories

system-design-primer

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Language:PythonLicense:NOASSERTIONStargazers:257665Issues:6646Issues:280

open-interpreter

A natural language interface for computers

Language:PythonLicense:AGPL-3.0Stargazers:49112Issues:367Issues:846

professional-programming

A collection of learning resources for curious software engineers

Language:PythonLicense:MITStargazers:45454Issues:976Issues:26

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:29919Issues:316Issues:53

maybe

The OS for your personal finances

Language:RubyLicense:AGPL-3.0Stargazers:26998Issues:143Issues:225

memray

Memray is a memory profiler for Python

Language:PythonLicense:Apache-2.0Stargazers:12631Issues:62Issues:163

ml-engineering

Machine Learning Engineering Open Book

Language:PythonLicense:CC-BY-SA-4.0Stargazers:9935Issues:100Issues:18

PhotoMaker

PhotoMaker

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:8380Issues:96Issues:117

accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Language:PythonLicense:Apache-2.0Stargazers:7094Issues:96Issues:1394

Eureka

Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)

Language:Jupyter NotebookLicense:MITStargazers:2671Issues:24Issues:31

sglang

SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.

Language:PythonLicense:Apache-2.0Stargazers:2467Issues:31Issues:223

webdataset

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Language:PythonLicense:BSD-3-ClauseStargazers:1997Issues:21Issues:289

ReAct

[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models

Language:Jupyter NotebookLicense:MITStargazers:1629Issues:16Issues:27
Language:PythonLicense:Apache-2.0Stargazers:857Issues:8Issues:0

nanotron

Minimalistic large language model 3D-parallelism training

Language:PythonLicense:Apache-2.0Stargazers:818Issues:40Issues:54

alpaca_farm

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

Language:PythonLicense:Apache-2.0Stargazers:719Issues:8Issues:41

quiet-star

Code for Quiet-STaR

Language:PythonLicense:Apache-2.0Stargazers:300Issues:10Issues:6
Language:PythonLicense:NOASSERTIONStargazers:241Issues:6Issues:5

reward-bench

RewardBench: the first evaluation tool for reward models.

Language:PythonLicense:Apache-2.0Stargazers:202Issues:4Issues:39

dynolog

Dynolog is a telemetry daemon for performance monitoring and tracing. It exports metrics from different components in the system like the linux kernel, CPU, disks, Intel PT, GPUs etc. Dynolog also integrates with pytorch and can trigger traces for distributed training applications.

Language:C++License:MITStargazers:176Issues:13Issues:16

RLHF-Reward-Modeling

A recipe to train reward models for RLHF.

Language:PythonLicense:Apache-2.0Stargazers:158Issues:6Issues:7

SALMON

Self-Alignment with Principle-Following Reward Models

Language:PythonLicense:GPL-3.0Stargazers:126Issues:5Issues:1

DiffusionDPO

Code for "Diffusion Model Alignment Using Direct Preference Optimization"

Language:PythonLicense:Apache-2.0Stargazers:116Issues:5Issues:8

ArCHer

Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"

RLCD

Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment

Language:PythonLicense:MITStargazers:54Issues:5Issues:3

rlfh-gen-div

This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity

Language:PythonLicense:NOASSERTIONStargazers:29Issues:3Issues:3

Uncertainty-Aware-Language-Agent

This is the official repo for Towards Uncertainty-Aware Language Agent.

Language:PythonLicense:MITStargazers:11Issues:1Issues:0

weenygrad

Minimalist vector AD

Language:PythonLicense:MITStargazers:9Issues:0Issues:0