Lorenz Wolf's starred repositories
Kernel-Functional-Data
Jupyter Notebook containing the code used for the numerical experiments in the paper "A Kernel Two-Sample Test for Functional Data" by George Wynne and Andrew B. Duncan.
level-replay
This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the fact that not all levels are equally useful for agents to learn from during training.
llm_optimization
A repo for RLHF training and best-of-n (BoN) sampling over LLMs, with support for reward model ensembles.
pytorch_sac
PyTorch implementation of Soft Actor-Critic (SAC)
dm_control
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
arxiv-latex-cleaner
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
OpenAgents
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
rap-rank-reconstruction
Code for reproducing https://arxiv.org/abs/2211.03128
deep-Q-networks
Implementations of algorithms from the Q-learning family. Implementations include: DQN, DDQN, Dueling DQN, PER+DQN, Noisy DQN, C51
ml_collections
ML Collections is a library of Python Collections designed for ML use cases.
hindsight-experience-replay
A PyTorch implementation of Hindsight Experience Replay (HER), with experiments on all Fetch robotic environments.
modular-rl
[ICML 2020] PyTorch Code for "One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control"
hands-on-rl
Free course that takes you from zero to Reinforcement Learning PRO 🦸🏻🦸🏽
awesome-mlss
🤖 Machine Learning Summer School deadlines
Megatron-LM
Ongoing research training transformer models at scale
rl-starter-files
RL starter files for immediately training, visualizing, and evaluating an agent without writing a single line of code
commonsense-rl
Knowledge-Aware RL agents with Commonsense Reasoning
DIAYN-PyTorch
Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.