Safe RL Team (Safe-RL-Team)

Safe RL Team

Safe-RL-Team

Geek Repo

Repositories of the Safe Reinforcement Learning Team @nigroup

Location:Germany

Github PK Tool:Github PK Tool

Safe RL Team's repositories

viper-verifiable-rl-impl

Implementation of the VIPER algorithm introduced in "Verifiable Reinforcement Learning via Policy Extraction" by Bastani et al.

topics-in-RL

A compilation of recent machine learning papers focused on safe reinforcement learning

Language:EJSLicense:CC-BY-4.0Stargazers:5Issues:1Issues:0

curriculum-learning-poster

Poster about Curriculum Induction for Safe Reinforcement Learning

Language:TeXStargazers:2Issues:2Issues:0

Uncertainty-Based-Offline-RL-with-Diversified-Q-Ensemble

Blog post about Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble

Language:Jupyter NotebookStargazers:2Issues:3Issues:0

Adaptive-Reward-Penalty-in-Safe-Reinforcement-Learning

In this work we have implemented RCPO into PPO and recreated the results from the original paper. This Blog post summarises our work and elaborates on our ideas and findings.

Language:HTMLLicense:MITStargazers:1Issues:0Issues:0

CARL-params

Code: Caution Parameters in "Cautious Adaptation for Reinforcement Learning in Safety-Critical Settings"

Language:PythonStargazers:1Issues:1Issues:0

curriculum-learning

Blog Post about Curriculum Induction for Safe Reinforcement Learning

Language:JavaScriptLicense:CC-BY-4.0Stargazers:1Issues:3Issues:0

presentations

slides presenting state-of-art papers on safe reinforcement learning

License:MITStargazers:1Issues:3Issues:0
Language:EJSLicense:CC-BY-4.0Stargazers:1Issues:1Issues:0
Language:Jupyter NotebookLicense:MITStargazers:1Issues:1Issues:0

viper-verifiable-reinforcement-learning

The blog post accompanying the implementation of the paper "Viper: Verifiable Reinforcement Learning via Policy Extraction" by Bastani et al.

Language:EJSLicense:CC-BY-4.0Stargazers:1Issues:3Issues:0

adversarial-policies-pytorch-blog

Blog post for our implementation of the paper "Adversarial Policies: Attacking Deep Reinforcement Learning"

Language:HTMLStargazers:0Issues:4Issues:0
Language:HTMLLicense:MITStargazers:0Issues:2Issues:0

barrier-certificates

Code for Barrier Certificates Blog: https://safe-rl-team.github.io/barrier-certificates/

Language:HTMLLicense:UnlicenseStargazers:0Issues:2Issues:0
Language:HTMLStargazers:0Issues:1Issues:0

CPO-Blog

Our main blog

Language:HTMLLicense:CC-BY-4.0Stargazers:0Issues:1Issues:0

lambda-bo

Bayesian optimization hyperparameter optimization for LAMBDA

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

PID

Blog post about Responsive Safety in Reinforcement Learning by PID Lagrangian Methods

Language:HTMLStargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:0Issues:2Issues:0

CARL

Blog: Caution Parameters in "Cautious Adaptation for Reinforcement Learning in Safety-Critical Settings"

Language:JavaScriptLicense:UnlicenseStargazers:0Issues:1Issues:0
Language:PythonLicense:MITStargazers:0Issues:1Issues:0

lambda-bo-blog

Lambda Bayesian optimization blog

Language:HTMLLicense:CC-BY-4.0Stargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:1Issues:0

post--example

Example Distill article repository—clone, rename, start writing!

Language:EJSLicense:CC-BY-4.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Language:EJSLicense:CC-BY-4.0Stargazers:0Issues:1Issues:0

Uncertainty-Based-Offline-RL-with-Diversified-Q-Ensemble-Implementation

This is a reimplementation of the EDAC algorithm in PyTorch. It was created as part of an University project and used for a blog post: https://github.com/Safe-RL-Team/Uncertainty-Based-Offline-RL-with-Diversified-Q-Ensemble

Language:PythonStargazers:0Issues:2Issues:0