sudo-michael

followers

following

stars

Vancouver, BC

https://sudo-michael.github.io/

Michael Lu's starred repositories

PythonRobotics

Python sample codes for robotics algorithms.

Language:PythonNOASSERTION22528 509 348

optuna

A hyperparameter optimization framework

Language:PythonMIT10358 115 1658

sioyek

Sioyek is a PDF viewer with a focus on textbooks and research papers

Language:CGPL-3.06929 40 875

HEBO

Bayesian optimisation & Reinforcement Learning library developped by Huawei Noah's Ark Lab

Language:Jupyter Notebook3154 338 47

dm-haiku

JAX-based neural network library

Language:PythonApache-2.02853 39 250

image-to-latex

Convert images of LaTex math equations into LaTex code.

Language:PythonMIT2026 20 27

ml-pen-and-paper-exercises

Pen and paper exercises in machine learning

Language:TeX1880 30 3

awesome-jax

JAX - A curated list of resources https://github.com/google/jax

CC0-1.01436 43 8

autodidact

A pedagogical implementation of Autograd

Language:Jupyter NotebookMIT931 18 4

chex

Language:PythonApache-2.0753 18 45

distrax

Language:PythonApache-2.0522 19 45

Safe-Reinforcement-Learning-Baselines

The repository is for safe reinforcement learning baselines.

Language:Jupyter Notebook450 110

Safe-Policy-Optimization

NeurIPS 2023: Safe Policy Optimization: A benchmark repository for safe reinforcement learning algorithms

Language:PythonApache-2.0315 7 10

TD3_BC

Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL

Language:PythonMIT304 4 3

gym-line-follower

Line follower robot simulator environment for Open AI Gym.

Language:PythonMIT107 5 4

rl_with_resets

JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"

Language:PythonMIT96 30

Conjugate-Gradient

Painless conjugate gradient notebooks

Language:Jupyter Notebook71 1 1

safe-mbrl

Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method

Language:Python63 2 1

Bullet-Safety-Gym

An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.

Language:PythonMIT62 3 2

cvpo-safe-rl

Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)

Language:PythonGPL-3.061 4 10

Dropout-Q-Functions-for-Doubly-Efficient-Reinforcement-Learning

Source files to replicate experiments in my ICLR 2022 paper.

Language:Python58 4 1

la-mbda

LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization

Language:PythonMIT30 2 7

reachability-based_trajectory_safeguard

We use reachability to ensure the safety of a decision agent acting on a dynamic system in real-time. We compute the Forward Reachable Set offline and use it online to adjust any potentially unsafe decisions that cause a collision with an obstacle.

Language:MATLAB29 3 1

safety_rl

Language:PythonNOASSERTION2000

LeaveNoTrace

Leave No Trace is an algorithm for safe reinforcement learning.

Language:PythonApache-2.015 7 1

HJxB

Continuous-Time/State/Action Fitted Value Iteration via Hamilton-Jacobi-Bellman (HJB)

Language:Python14 40

rl_with_jax

clear single-file JAX implementations of common RL algorithms

Language:PythonMIT13 10

ISSA

Code for paper "Model-free Safe Control for Zero-Violation Reinforcement Learning" at Conference on Robot Learning (CoRL) 2021.

Language:PythonMIT8 30

natural-policy-gradient-reinforcement-learning

code for Understanding the Effects of Second-Order Approximations in Natural Policy Gradient Reinforcement Learning

Language:PythonMIT3 10

naturalgradient

code for blog post https://gebob19.github.io/natural-gradient/

Language:PythonMIT1 10