cafe / Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces

Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces

Link to paper

Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym.

This paper introduces Wolpertinger training algorithm that extends the Deep Deterministic Policy Gradient training algorithm introduced in this paper. I extended stevenpjg's implementation of DDPG algorithm found here licensed under the MIT license.

About

Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym


Languages

Language:Python 100.0%