timoklein

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language:PythonNOASSERTION000

ClustPy

A Python library for advanced clustering algorithms

Language:PythonBSD-3-Clause000

cpp_optim

Nonlinear optimization examples in C++

Language:C++000

dmcgym

Language:PythonMIT000

garage

A toolkit for reproducible reinforcement learning research.

Language:PythonMIT000

HIR

Language:Python000

markov-abstractions-ablations

DM control Markov component ablations

Language:PythonMIT000

outlier_detection

Class based Python implementations of outlier detection algorithms.

Language:Scilab000

minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

MIT000

Minigrid

Simple and easily configurable grid world environments for reinforcement learning

NOASSERTION000

rl_graph_breaks

An example of torch.compile graph breaks in RL code using SAC-discrete as an example

Language:Python000

udemy_cpp

C++ Course

Language:Makefile020

wandb_tutorial

Code example for some basic wandb functionality

Language:Python000