LyWangPX / Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutions

Solutions to Reinforcement Learning: An Introduction


Exercise 6.1

Hyperion-shuo opened this issue

It looks like a typo. I will double-check tomorrow.
Thanks for your response.

I see. The definition of u is a typo: I should have written u_t = ... instead of u_{t+1}.
The answer has been updated. Thanks for your contribution.

This is still not solved, because

u_t = V_{t+1}(S_t) - V_t(S_t) ≠ V_{t+1}(S_{t+1}) - V_t(S_{t+1}),

and even if you apply u_{t+1} = V_{t+2}(S_{t+1}) - V_{t+1}(S_{t+1}), that is still not equal to V_{t+1}(S_{t+1}) - V_t(S_{t+1}).

For state S_{k+1}, u_{k+1} tells you about V_{k+2} and V_{k+1}, but the solution needs to express the difference between V_{k+1} and V_k.

The solution can never be written in terms of u_t as you have defined it.
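For reference, here is a sketch of the unrolled derivation (my notation, not the posted solution's): writing the per-step TD error as δ_k = R_{k+1} + γ V_k(S_{k+1}) - V_k(S_k) and taking the value of the terminal state to be 0, the correction term involves the change in the value of the *next* state between steps k and k+1, which is exactly the quantity that u_t as defined cannot express:

$$
\begin{aligned}
G_t - V_t(S_t) &= \delta_t + \gamma\bigl(G_{t+1} - V_{t+1}(S_{t+1})\bigr) + \gamma\bigl(V_{t+1}(S_{t+1}) - V_t(S_{t+1})\bigr) \\
&= \sum_{k=t}^{T-1} \gamma^{k-t}\,\delta_k \;+\; \sum_{k=t}^{T-1} \gamma^{k-t+1}\bigl(V_{k+1}(S_{k+1}) - V_k(S_{k+1})\bigr).
\end{aligned}
$$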

You can check the final equation; it must exactly match

G_t - V_t(S_t)
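To make that concrete, here is a minimal numerical sketch (mine, not from the repo; every number in it is made up for illustration). It builds a short episode with a value table V_k that changes at every step and checks that the Monte Carlo error G_0 - V_0(S_0) equals the discounted sum of TD errors plus the γ(V_{k+1}(S_{k+1}) - V_k(S_{k+1})) corrections:

```python
import numpy as np

rng = np.random.default_rng(0)
gamma = 0.9
T = 5                                   # episode length; S_T is terminal
S = rng.integers(0, 3, size=T + 1)      # states S_0 .. S_T (labels 0..2)
R = rng.normal(size=T + 1)              # rewards; R[k + 1] plays the role of R_{k+1}

# V[k] is the value table in use at time k; it changes at every step,
# which is exactly the situation Exercise 6.1 asks about.
V = [rng.normal(size=3) for _ in range(T + 1)]

def v(k, t):
    """Value of S_t under the table used at time k; terminal value is 0."""
    return 0.0 if t == T else V[k][S[t]]

# Monte Carlo error at t = 0: G_0 - V_0(S_0)
G0 = sum(gamma ** k * R[k + 1] for k in range(T))
mc_error = G0 - v(0, 0)

# Sum of TD errors delta_k = R_{k+1} + gamma * V_k(S_{k+1}) - V_k(S_k),
# each plus the correction gamma * (V_{k+1}(S_{k+1}) - V_k(S_{k+1})).
td_sum = 0.0
for k in range(T):
    delta_k = R[k + 1] + gamma * v(k, k + 1) - v(k, k)
    correction = gamma * (v(k + 1, k + 1) - v(k, k + 1))
    td_sum += gamma ** k * (delta_k + correction)

print(np.isclose(mc_error, td_sum))     # expected: True
```

If the correction is replaced by γ·u_{k+1} with u as defined in the posted solution, i.e. γ(V_{k+2}(S_{k+1}) - V_{k+1}(S_{k+1})), the two sides generally no longer agree for value tables that differ at every step, which is the point of the comment above.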