Yunhao (Robin) Tang's repositories

onpolicybaselines

on-policy optimization baselines for deep reinforcement learning

icml2021-pengqlambda

Revisiting Peng's Q(lambda) for Modern Reinforcement Learning

Language:PythonLicense:MITStargazers:16Issues:2Issues:0

neurips2021-meta-gradient-offpolicy-evaluation

Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021

Language:PythonStargazers:12Issues:2Issues:0

nstep-sil

Code for NeurIPS 2020 paper 'Self-imitation Learning via Generalized Lower bound Q-learning'

Language:PythonStargazers:11Issues:2Issues:0

Variational-DQN

Variational DQN encourages efficient exploration and allows for parameter update using black box variational inference

Language:PythonLicense:MITStargazers:9Issues:2Issues:1

gym-mac

gym that works on new mac. full credit to https://github.com/lobachevzky/gym

Language:PythonLicense:NOASSERTIONStargazers:1Issues:1Issues:0

learn2branch

Exact Combinatorial Optimization with Graph Convolutional Neural Networks (NeurIPS 2019)

Language:PythonLicense:MITStargazers:1Issues:2Issues:0
Language:PythonStargazers:1Issues:2Issues:0

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

bullet3

Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.

Language:C++License:NOASSERTIONStargazers:0Issues:1Issues:0

gym

A toolkit for developing and comparing reinforcement learning algorithms.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:2Issues:0
Language:PythonStargazers:0Issues:1Issues:0

open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

Language:C++License:Apache-2.0Stargazers:0Issues:2Issues:0

PySCIPOpt

Python interface for the SCIP Optimization Suite

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

scip-dagger

A branch-and-bound ILP solver

Language:CStargazers:0Issues:1Issues:0