Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
Home Page:https://mofanpy.com/tutorials/machine-learning/reinforcement-learning/
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool
ananasfl opened this issue 2 years ago · comments
请问Morvan, DQN的代码中,计算q_target时,是否未考虑done为True的情况,即q_target = Reward? 存储在Replay memory中的经验也未包含done。请问为什么呢?