lorenzflow

Lorenz Wolf's starred repositories

Kernel-Functional-Data

Jupyter Notebook of code used in the numerics for the paper "A Kernel Two-Sample Test for Functional Data" by George Wynne and Andrew B. Duncan.

Language: Jupyter Notebook | 700

This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the fact that not all levels are equally useful for agents to learn from during training.

Language: Python | License: NOASSERTION | 8200

bsuite

bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent

Language: Python | License: Apache-2.0 | 148500

llm_optimization

A repo for RLHF training and BoN over LLMs, with support for reward model ensembles.

Language: Python | License: MIT | 2400

pytorch_sac

PyTorch implementation of Soft Actor-Critic (SAC)

Language: Jupyter Notebook | License: MIT | 48300

dm_control

Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

Language: Python | License: Apache-2.0 | 366500

bandits

Bayesian Bandits

Language: Jupyter Notebook | License: MIT | 6300

arxiv-latex-cleaner

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Language: Python | License: Apache-2.0 | 509600

OpenAgents

[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild

Language: Python | License: Apache-2.0 | 381000

D4RL

A collection of reference environments for offline reinforcement learning

Language: Python | License: Apache-2.0 | 125700

TextWorld

TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

Language: Jupyter Notebook | License: NOASSERTION | 117300

rliable

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

Language: Jupyter Notebook | License: Apache-2.0 | 72700

rap-rank-reconstruction

Code for reproducing https://arxiv.org/abs/2211.03128

Language: Python | 1000

deep-Q-networks

Implementations of algorithms from the Q-learning family. Implementations include: DQN, DDQN, Dueling DQN, PER+DQN, Noisy DQN, C51

Language: Jupyter Notebook | 26300

ml_collections

ML Collections is a library of Python Collections designed for ML use cases.

Language: Python | License: Apache-2.0 | 86800

xmanager

A platform for managing machine learning experiments

Language: Python | License: Apache-2.0 | 81200

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language: Python | License: MIT | 1554100

hindsight-experience-replay

This is a PyTorch implementation of Hindsight Experience Replay (HER), with experiments on all Fetch robotic environments.

Language: Python | License: MIT | 38900

modular-rl

[ICML 2020] PyTorch Code for "One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control"

Language: Jupyter Notebook | License: NOASSERTION | 21100

hands-on-rl

Free course that takes you from zero to Reinforcement Learning PRO 🦸🏻‍🦸🏽

Language: Jupyter Notebook | License: MIT | 103300

awesome-mlss

🤖 Machine Learning Summer School deadlines

Language: HTML | License: MIT | 263400

lit-llama

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Language: Python | License: Apache-2.0 | 591000

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python | License: MIT | 3526700

nanoPALM

Language: Python | License: MIT | 14100

Megatron-LM

Ongoing research training transformer models at scale

Language: Python | License: NOASSERTION | 954800

rl-starter-files

RL starter files to immediately train, visualize and evaluate an agent without writing a single line of code

Language: Python | License: MIT | 63800

commonsense-rl

Knowledge-Aware RL agents with Commonsense Reasoning

Language: Inform 7 | License: Apache-2.0 | 7500

TFDMNet

Learning Convolutional Neural Networks in the Frequency Domain

Language: Python | 1100

DIAYN-PyTorch

Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.

Language: Python | License: MIT | 5600