mahaitongdae

Haitong Ma's repositories

Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Language: Python | License: MIT | 2 0 0

pytorch-value-iteration-networks

PyTorch implementation of Value Iteration Networks (NIPS 2016 best paper)

Language: Python | License: BSD-3-Clause | 2 0 0

quad_nn

Sim-to-(Multi)-Real: Transfer of Low-Level Robust Control Policies to Multiple Quadrotors (IROS 2019)

Language: Python | License: MIT | 1 0 0

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language: Python | License: MIT | 0 0 0

CLF-CBF-QP

MATLAB class and functions to simulate a system implementing a control Lyapunov function / control barrier function quadratic program (CLF-CBF-QP) controller

Language: MATLAB | 0 0 0

cpo-pytorch

An implementation of Constrained Policy Optimization (Achiam 2017) in PyTorch

Language: Python | 0 0 0

CQL

Code for conservative Q-learning

Language: Python | 0 0 0

Distributional-Soft-Actor-Critic

Language: Python | 0 0 0

focops

PyTorch implementation of First Order Constrained Optimization in Policy Space (FOCOPS).

Language: Python | 0 0 0

gym-carla

An OpenAI Gym wrapper for the CARLA simulator

Language: Python | License: MIT | 0 0 0

jetson-inference

Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.

Language: C++ | License: MIT | 0 0 0

jmlr-style-file

LaTeX style file for the Journal of Machine Learning Research

Language: TeX | 0 0 0

papi

Example implementations for the paper "Projections for Approximate Policy Iteration"

Language: Python | 0 0 0

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR), and Generative Adversarial Imitation Learning (GAIL).

Language: Python | License: MIT | 0 0 0

Deep-reinforcement-learning-with-pytorch

pytorch-value-iteration-networks

quad_nn

baselines

basics_of_ros

CLF-CBF-QP

cpo-pytorch

CQL

deepreach

Distributional-Soft-Actor-Critic

Emergency-braking-env

focops

gym-carla

jetson-inference

jmlr-style-file

NEXT-learning-to-plan

papi

Predictive_Entropy_Search

pybullet_ur5_gripper

pytorch-a2c-ppo-acktr-gail

Pytorch-NCE

quad_sim2multireal

rcheng805.github.io

releasing-research-code

RL-CBF

safety-gym

Spearmint

spinningup

Tensorboard2Seaborn

xml_map_render