sudo-michael

followers

following

stars

Vancouver, BC

https://sudo-michael.github.io/

Michael Lu's starred repositories

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonMIT30216 428 4186

Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Language:PythonMIT6891 38 448

reptyr

Reparent a running program to a new terminal

Language:CMIT5790 96 69

zotero-better-bibtex

Make Zotero effective for us LaTeX holdouts

Language:TypeScriptMIT5242 47 2189

arxiv-latex-cleaner

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Language:PythonApache-2.05222 33 52

IsaacGymEnvs

Isaac Gym Reinforcement Learning Environments

Language:PythonNOASSERTION1934 37 202

bsuite

bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent

Language:PythonApache-2.01501 60 31

omnisafe

JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.

Language:PythonApache-2.0912 38 103

melee

A decompilation of Super Smash Bros Melee brought to you by a bunch of clever folks.

Language:Assembly663 28 68

AgileRL

Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools.

Language:PythonApache-2.0575 7 24

CommonLoopUtils

CLU lets you write beautiful training loops in JAX.

Language:Jupyter NotebookApache-2.0320 100

implicit_q_learning

Language:PythonMIT226 5 9

agd

Automatic gradient descent

Language:TeX207 4 3

expt

Experiment. Plot. Tabulate.

Language:PythonMIT67 4 16

saferl_kit

Language:PythonMIT56 1 2

fastrlap-release

Language:PythonMIT55 4 2

emdp

Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations

Language:PythonMIT47 5 8

effective-horizon

Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"

Language:Python41 3 3

wcsac

A PyTorch implementation of "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"

Language:PythonMIT39 4 1

LyapunovLearning

Language:Python37 30

Optimally-Weighted-PINNs

Language:Python37 90

mbppol

This repository has code for the paper "Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization Algorithm" accepted at NeurIPS 2022.

Language:PythonMIT24 2 3

refineCBF

Language:Python19 10

ube-mbrl

Model-Based Uncertainty in Value Functions (AISTATS2023)

Language:PythonAGPL-3.015 40

seditor

Code release for the paper "Towards Safe Reinforcement Learning with a Safety Editor Policy", Yu et al., arXiv 2022

Language:Dockerfile13 2 3

Safe-panda-gym

OpenaAI Gym Franka Emika Panda robot environment based on PyBullet.

Language:PythonMIT1100

CUP-safe-rl

NeurIPS2022: Constrained Update Projection Approach to Safe Policy Optimization

Language:Python11 1 1

mesa-safe-rl

Language:Python7 10

dae

Official repository for "Direct Advantage Estimation"

Language:PythonMIT5 30

ESB-CPO

Implement of paper: Efficient Exploration Using Extra Safety Budget in Constrained Policy Optimization

Language:Python5 10