td error

Question

wadaniel opened this issue 3 years ago · comments

Hi there, I guess you missed the discount factor while computing the TD error:
https://github.com/XanderJC/scalable-birl/blob/main/sbirl/models.py#L189

wadaniel · Answer 1 · Wed Apr 28 2021 03:11:37 GMT+0800 (China Standard Time)

I found that you are not using a discount factor, respectively gamma equals 1.0