Yunhao (Robin) Tang's repositories
onpolicybaselines
on-policy optimization baselines for deep reinforcement learning
icml2021-pengqlambda
Revisiting Peng's Q(lambda) for Modern Reinforcement Learning
neurips2021-meta-gradient-offpolicy-evaluation
Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021
Variational-DQN
Variational DQN encourages efficient exploration and allows for parameter update using black box variational inference
learn2branch
Exact Combinatorial Optimization with Graph Convolutional Neural Networks (NeurIPS 2019)
open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
scip-dagger
A branch-and-bound ILP solver