sweetice

Codebase for Evolutionary Reinforcement Learning (ERL) from the paper "Evolution-Guided Policy Gradients in Reinforcement Learning" published at NeurIPS 2018

Language:Python212 5 4

REDQ

Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.

Language:PythonMIT148 5 7

sunrise

SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning

Language:Python119 6 4

dice_rl

Language:PythonApache-2.098 6 10

mpo

PyTorch Implementation of the Maximum a Posteriori Policy Optimisation

Language:PythonGPL-3.070 2 10

DA-in-visualRL

Collection of papers and resources for data augmentation (DA) in visual reinforcement learning (RL).

69 40

generalized_dt

Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)

Language:Python6504

curl_rainbow

Language:PythonMIT52 3 6

revisiting-ppo

Language:Jupyter NotebookMIT47 40

deep-successor-features-for-transfer

A reusable framework for successor features for transfer in deep reinforcement learning using keras.

Language:PythonNOASSERTION39 30

OffCon3

📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)

Language:PythonMIT24 1 1

neural-approx-ss-lfi

Codes for ICLR 21 paper: Neural Approximate Sufficient Statistics for Implicit Models

Language:Jupyter Notebook19 20

KWNG

A Pytorch implementation of the KWNG estimator

Language:PythonBSD-3-Clause14 20

LeagueSandbox-RL-Learning

Modified version of the LeagueSandbox project which relies on a Redis server to accept actions and send observations. Intended for reinforcement learning within v4.20 League of Legends.

Language:C#AGPL-3.010 2 2

WNPG

implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies

Language:Python10 10

ppg

Phasic Policy Gradient

Language:Python9 1 1

adaptive_estimators

Code for ICLR 2019 paper "Adaptive Estimators Show Information Compression in Deep Neural Networks" (https://openreview.net/forum?id=SkeZisA5t7)

Language:PythonMIT5 10