Michael Lu (sudo-michael)

sudo-michael

Geek Repo

Location:Vancouver, BC

Home Page:https://sudo-michael.github.io/

Twitter:@sudo_mlu

Github PK Tool:Github PK Tool

Michael Lu's starred repositories

PythonRobotics

Python sample codes for robotics algorithms.

Language:PythonLicense:NOASSERTIONStargazers:22528Issues:509Issues:348

optuna

A hyperparameter optimization framework

Language:PythonLicense:MITStargazers:10358Issues:115Issues:1658

sioyek

Sioyek is a PDF viewer with a focus on textbooks and research papers

Language:CLicense:GPL-3.0Stargazers:6929Issues:40Issues:875

HEBO

Bayesian optimisation & Reinforcement Learning library developped by Huawei Noah's Ark Lab

Language:Jupyter NotebookStargazers:3154Issues:338Issues:47

dm-haiku

JAX-based neural network library

Language:PythonLicense:Apache-2.0Stargazers:2853Issues:39Issues:250

image-to-latex

Convert images of LaTex math equations into LaTex code.

Language:PythonLicense:MITStargazers:2026Issues:20Issues:27

ml-pen-and-paper-exercises

Pen and paper exercises in machine learning

awesome-jax

JAX - A curated list of resources https://github.com/google/jax

autodidact

A pedagogical implementation of Autograd

Language:Jupyter NotebookLicense:MITStargazers:931Issues:18Issues:4
Language:PythonLicense:Apache-2.0Stargazers:753Issues:18Issues:45
Language:PythonLicense:Apache-2.0Stargazers:522Issues:19Issues:45

Safe-Reinforcement-Learning-Baselines

The repository is for safe reinforcement learning baselines.

Language:Jupyter NotebookStargazers:450Issues:11Issues:0

Safe-Policy-Optimization

NeurIPS 2023: Safe Policy Optimization: A benchmark repository for safe reinforcement learning algorithms

Language:PythonLicense:Apache-2.0Stargazers:315Issues:7Issues:10

TD3_BC

Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL

Language:PythonLicense:MITStargazers:304Issues:4Issues:3

gym-line-follower

Line follower robot simulator environment for Open AI Gym.

Language:PythonLicense:MITStargazers:107Issues:5Issues:4

rl_with_resets

JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"

Language:PythonLicense:MITStargazers:96Issues:3Issues:0

Conjugate-Gradient

Painless conjugate gradient notebooks

Language:Jupyter NotebookStargazers:71Issues:1Issues:1

safe-mbrl

Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method

Bullet-Safety-Gym

An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.

Language:PythonLicense:MITStargazers:62Issues:3Issues:2

cvpo-safe-rl

Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)

Language:PythonLicense:GPL-3.0Stargazers:61Issues:4Issues:10

Dropout-Q-Functions-for-Doubly-Efficient-Reinforcement-Learning

Source files to replicate experiments in my ICLR 2022 paper.

la-mbda

LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization

Language:PythonLicense:MITStargazers:30Issues:2Issues:7

reachability-based_trajectory_safeguard

We use reachability to ensure the safety of a decision agent acting on a dynamic system in real-time. We compute the Forward Reachable Set offline and use it online to adjust any potentially unsafe decisions that cause a collision with an obstacle.

Language:PythonLicense:NOASSERTIONStargazers:20Issues:0Issues:0

LeaveNoTrace

Leave No Trace is an algorithm for safe reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:15Issues:7Issues:1

HJxB

Continuous-Time/State/Action Fitted Value Iteration via Hamilton-Jacobi-Bellman (HJB)

Language:PythonStargazers:14Issues:4Issues:0

rl_with_jax

clear single-file JAX implementations of common RL algorithms

Language:PythonLicense:MITStargazers:13Issues:1Issues:0

ISSA

Code for paper "Model-free Safe Control for Zero-Violation Reinforcement Learning" at Conference on Robot Learning (CoRL) 2021.

Language:PythonLicense:MITStargazers:8Issues:3Issues:0

natural-policy-gradient-reinforcement-learning

code for Understanding the Effects of Second-Order Approximations in Natural Policy Gradient Reinforcement Learning

Language:PythonLicense:MITStargazers:3Issues:1Issues:0

naturalgradient

code for blog post https://gebob19.github.io/natural-gradient/

Language:PythonLicense:MITStargazers:1Issues:1Issues:0