- Playing Atari with Deep Reinforcement Learning, V. Mnih et al., NIPS Workshop, 2013. pdf
- Human-level control through deep reinforcement learning, V. Mnih et al., Nature, 2015. pdf
- Deterministic Policy Gradient Algorithms, D. Silver et al., ICML, 2015. pdf
- Trust Region Policy Optimization, J. Schulman et al., ICML, 2015. pdf
- Deep Reinforcement Learning with Double Q-learning, H. van Hasselt et al., arXiv, 2015. pdf
- Prioritized Experience Replay, T. Schaul et al., ICLR, 2016. pdf
- Mastering the game of Go with deep neural networks and tree search, D. Silver et al., Nature, 2016. pdf
- Dueling Network Architectures for Deep Reinforcement Learning, Z. Wang et al., arXiv, 2015. pdf