DQN的代码中，计算q_target时未考虑done为true的情况

Question

ananasfl opened this issue 2 years ago · comments

请问Morvan, DQN的代码中，计算q_target时，是否未考虑done为True的情况，即q_target = Reward?
存储在Replay memory中的经验也未包含done。请问为什么呢？