zhanghc12's repositories
cartpole_solver
Deep Q-Network (DQN) for CartPole game from OpenAI gym
CQL
Code for conservative Q-learning
doom-py
ViZDoom Python wrapper
GAC
Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"
homework
Imitation Learning Homework 1
mbpo
Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"
models
Models built with TensorFlow
modular_rl
Implementation of TRPO and related algorithms
mopo
Code for MOPO: Model-based Offline Policy Optimization
multiagent-competition
Repository for competitive multi-agent environments
opiq
Code for Optimistic Exploration even with a Pessimistic Initialisation
reinforcement-learning
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
rllab
rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.
seed_rl
SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.
spree_abc
spree support multi site
tensorpack
Neural Network Toolbox on TensorFlow
zhc.github.io
Build a Jekyll blog in minutes, without touching the command line.