Haitong Ma (mahaitongdae)

mahaitongdae

Geek Repo

Company:@lina-robotics-lab at Harvard SEAS

Location:Allston, MA

Home Page:https://scholar.harvard.edu/haitongma

Github PK Tool:Github PK Tool

Haitong Ma's repositories

Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Language:PythonLicense:MITStargazers:2Issues:0Issues:0

pytorch-value-iteration-networks

Pytorch implementation of Value Iteration Networks (NIPS 2016 best paper)

Language:PythonLicense:BSD-3-ClauseStargazers:2Issues:0Issues:0

quad_nn

Sim-to-(Multi)-Real: Transfer of Low-Level Robust Control Policies to Multiple Quadrotors IROS 2019

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:C++Stargazers:0Issues:0Issues:0

CLF-CBF-QP

Matlab class/functions to simulate a system implementing a control lyapunov-control barrier function quadratic program controller

Language:MATLABStargazers:0Issues:0Issues:0

cpo-pytorch

An implementation of Constrained Policy Optimization (Achiam 2017) in PyTorch

Language:PythonStargazers:0Issues:0Issues:0

CQL

Code for conservative Q-learning

Language:PythonStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

focops

Pytorch Implementation for First Order Constrained Optimization in Policy Space (FOCOPS).

Language:PythonStargazers:0Issues:0Issues:0

gym-carla

An OpenAI gym wrapper for CARLA simulator

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

jetson-inference

Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.

Language:C++License:MITStargazers:0Issues:0Issues:0

jmlr-style-file

LaTeX style file for the Journal of Machine Learning Research

Language:TeXStargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

papi

Example implementations for paper "Projections for Approximate Policy Iteration" paper

Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Pytorch-NCE

The Noise Contrastive Estimation for softmax output written in Pytorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

quad_sim2multireal

Repository for IROS 2019

Language:PythonStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

releasing-research-code

Tips for releasing research code in Machine Learning (with official NeurIPS 2020 recommendations)

License:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

safety-gym

Tools for accelerating safe exploration research.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Spearmint

Spearmint Bayesian optimization codebase

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

spinningup

An educational resource to help anyone learn deep reinforcement learning.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Tensorboard2Seaborn

Plot Tensorflow Summary Event in a Beautiful Way 🌈

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

xml_map_render

Render map for xml networks.

Language:PythonStargazers:0Issues:1Issues:0