Safe RL Team

Safe RL Team's repositories

viper-verifiable-rl-impl

Implementation of the VIPER algorithm introduced in "Verifiable Reinforcement Learning via Policy Extraction" by Bastani et al.

Language: Python | 12 3 2

topics-in-RL

A compilation of recent machine learning papers focused on safe reinforcement learning

Language: EJS | License: CC-BY-4.0 | 5 10

curriculum-learning-poster

Poster about Curriculum Induction for Safe Reinforcement Learning

Language: TeX | 2 20

Uncertainty-Based-Offline-RL-with-Diversified-Q-Ensemble

Blog post about Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble

Language: Jupyter Notebook | 2 30

Adaptive-Reward-Penalty-in-Safe-Reinforcement-Learning

In this work, we integrated RCPO into PPO and reproduced the results from the original paper. This blog post summarises our work and elaborates on our ideas and findings.

Language: HTML | License: MIT | 100

CARL-params

Code: Caution Parameters in "Cautious Adaptation for Reinforcement Learning in Safety-Critical Settings"

Language: Python | 1 10

curriculum-learning

Blog post about Curriculum Induction for Safe Reinforcement Learning

Language: JavaScript | License: CC-BY-4.0 | 1 30

presentations

Slides presenting state-of-the-art papers on safe reinforcement learning

License: MIT | 1 30

safe-action-repetition-article

Language: EJS | License: CC-BY-4.0 | 1 10

viper-verifiable-reinforcement-learning

The blog post accompanying the implementation of the paper "Viper: Verifiable Reinforcement Learning via Policy Extraction" by Bastani et al.

Language: EJS | License: CC-BY-4.0 | 1 30

adversarial-policies-pytorch-blog

Blog post for our implementation of the paper "Adversarial Policies: Attacking Deep Reinforcement Learning"

Language: HTML | 040

barrier-certificates

Code for Barrier Certificates Blog: https://safe-rl-team.github.io/barrier-certificates/

Language: HTML | License: Unlicense | 020

Blog-Post-about-There-is-No-Turning-Back

Language: HTML | 010

lambda-bo

Hyperparameter optimization for LAMBDA using Bayesian optimization

Language: Python | License: MIT | 010

PID

Blog post about Responsive Safety in Reinforcement Learning by PID Lagrangian Methods

Language: HTML | 000

CARL

Blog: Caution Parameters in "Cautious Adaptation for Reinforcement Learning in Safety-Critical Settings"

Language: JavaScript | License: Unlicense | 010

lambda-bo-blog

Lambda Bayesian optimization blog

Language: HTML | License: CC-BY-4.0 | 010

post--example

Example Distill article repository—clone, rename, start writing!

Language: EJS | License: CC-BY-4.0 | 000

Uncertainty-Based-Offline-RL-with-Diversified-Q-Ensemble-Implementation

This is a reimplementation of the EDAC algorithm in PyTorch. It was created as part of a university project and used for a blog post: https://github.com/Safe-RL-Team/Uncertainty-Based-Offline-RL-with-Diversified-Q-Ensemble

Language: Python | 020

Safe-RL-Team

Safe RL Team's repositories

viper-verifiable-rl-impl

topics-in-RL

curriculum-learning-poster

Uncertainty-Based-Offline-RL-with-Diversified-Q-Ensemble

Adaptive-Reward-Penalty-in-Safe-Reinforcement-Learning

barrier_certificates_code

CARL-params

curriculum-learning

presentations

rl-from-human-preferences

safe-action-repetition

safe-action-repetition-article

SRL-NLC

viper-verifiable-reinforcement-learning

adversarial-policies-pytorch-blog

advice-distillation-blog

barrier-certificates

Blog-Post-about-There-is-No-Turning-Back

CPO-Blog

lambda-bo

PID

advice-distillation-code

CARL

cpo

lambda-bo-blog

NoTurningBack

post--example

RCPO

SRL-NLC-Report

Uncertainty-Based-Offline-RL-with-Diversified-Q-Ensemble-Implementation