Paper: A Deeper Look at Experience Replay Author: Shangtong Zhang and Richard S. Sutton
- Nonlinear function representation on LunarLander-v2
- Did not implement timeout or partial-episode-bootstrap (PEB)
A Deeper Look at Experience Replay (Zhang and Sutton, 2017)
Paper: A Deeper Look at Experience Replay Author: Shangtong Zhang and Richard S. Sutton
A Deeper Look at Experience Replay (Zhang and Sutton, 2017)
MIT License