Exercise 3.28
NakuraMino opened this issue · comments
Why is there no decay term? Is there a reason why it doesn't become (r + gamma * v_*(s'))?
You guys are right, I forgot it :) Will patch tonight.
Thanks!
Solutions of Reinforcement Learning, An Introduction
NakuraMino opened this issue · comments
Why is there no decay term? Is there a reason why it doesn't become (r + gamma * v_*(s'))?
You guys are right, I forgot it :) Will patch tonight.
Thanks!