Michael Lu (sudo-michael)

Location: Vancouver, BC

Home Page: https://sudo-michael.github.io/

Twitter: @sudo_mlu

Michael Lu's starred repositories

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language: Python | License: MIT | Stargazers: 30216 | Issues: 428 | Issues: 4186

Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Language: Python | License: MIT | Stargazers: 6891 | Issues: 38 | Issues: 448

reptyr

Reparent a running program to a new terminal

zotero-better-bibtex

Make Zotero effective for us LaTeX holdouts

Language: TypeScript | License: MIT | Stargazers: 5242 | Issues: 47 | Issues: 2189

arxiv-latex-cleaner

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Language: Python | License: Apache-2.0 | Stargazers: 5222 | Issues: 33 | Issues: 52

IsaacGymEnvs

Isaac Gym Reinforcement Learning Environments

Language: Python | License: NOASSERTION | Stargazers: 1934 | Issues: 37 | Issues: 202

bsuite

bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent

Language: Python | License: Apache-2.0 | Stargazers: 1501 | Issues: 60 | Issues: 31

omnisafe

JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.

Language: Python | License: Apache-2.0 | Stargazers: 912 | Issues: 38 | Issues: 103

melee

A decompilation of Super Smash Bros Melee brought to you by a bunch of clever folks.

AgileRL

Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools.

Language: Python | License: Apache-2.0 | Stargazers: 575 | Issues: 7 | Issues: 24

CommonLoopUtils

CLU lets you write beautiful training loops in JAX.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 320 | Issues: 10 | Issues: 0

agd

Automatic gradient descent

expt

Experiment. Plot. Tabulate.

Language: Python | License: MIT | Stargazers: 67 | Issues: 4 | Issues: 16
Language: Python | License: MIT | Stargazers: 56 | Issues: 1 | Issues: 2

emdp

Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations

Language: Python | License: MIT | Stargazers: 47 | Issues: 5 | Issues: 8

effective-horizon

Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"

wcsac

A PyTorch implementation of "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"

Language: Python | License: MIT | Stargazers: 39 | Issues: 4 | Issues: 1

mbppol

Code for the paper "Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization Algorithm", accepted at NeurIPS 2022.

Language: Python | License: MIT | Stargazers: 24 | Issues: 2 | Issues: 3
Language: Python | Stargazers: 19 | Issues: 1 | Issues: 0

ube-mbrl

Model-Based Uncertainty in Value Functions (AISTATS 2023)

Language: Python | License: AGPL-3.0 | Stargazers: 15 | Issues: 4 | Issues: 0

seditor

Code release for the paper "Towards Safe Reinforcement Learning with a Safety Editor Policy", Yu et al., arXiv 2022

Safe-panda-gym

OpenAI Gym Franka Emika Panda robot environment based on PyBullet.

Language: Python | License: MIT | Stargazers: 11 | Issues: 0 | Issues: 0

CUP-safe-rl

NeurIPS 2022: Constrained Update Projection Approach to Safe Policy Optimization

Language: Python | Stargazers: 7 | Issues: 1 | Issues: 0

dae

Official repository for "Direct Advantage Estimation"

Language: Python | License: MIT | Stargazers: 5 | Issues: 3 | Issues: 0

ESB-CPO

Implementation of the paper "Efficient Exploration Using Extra Safety Budget in Constrained Policy Optimization"

Language: Python | Stargazers: 5 | Issues: 1 | Issues: 0