Working on the first environment, the stick (pole balancing) task.
Open questions regarding pole balance:
Q1: Is the game deterministic? How can we find that out ourselves, given only the environment?
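On Q1, one empirical check is to replay the same action sequence from the same seed several times and compare everything the environment returns. The sketch below assumes a hypothetical interface, reset(seed) returning an observation and step(action) returning (obs, reward, done); the real environment's API may differ. ToyEnv is a made-up stand-in, not the pole environment. Identical traces only mean no counterexample was found; a single mismatch is enough to show the dynamics are stochastic (or that the seed does not control all of the randomness).

```python
import random

def rollout(make_env, actions, seed):
    """Run a fixed action sequence and record everything the env returns."""
    env = make_env()
    trace = [env.reset(seed)]
    for action in actions:
        obs, reward, done = env.step(action)
        trace.append((obs, reward, done))
        if done:
            break
    return trace

def looks_deterministic(make_env, actions, seed=0, trials=5):
    """True if repeated rollouts with the same seed and actions are identical."""
    reference = rollout(make_env, actions, seed)
    return all(rollout(make_env, actions, seed) == reference
               for _ in range(trials))

class ToyEnv:
    """Hypothetical stand-in environment (assumption, for illustration only)."""
    def __init__(self, noise=0.0):
        self.noise = noise
        self.x = 0.0

    def reset(self, seed):
        # This toy env always starts at 0 and ignores the seed.
        self.x = 0.0
        return self.x

    def step(self, action):
        # Move right for action 1, left otherwise, plus optional unseeded noise.
        self.x += (1 if action == 1 else -1) + self.noise * random.random()
        return self.x, 1.0, abs(self.x) > 3

actions = [1, 0, 1, 1, 0]
print(looks_deterministic(lambda: ToyEnv(noise=0.0), actions))
print(looks_deterministic(lambda: ToyEnv(noise=0.5), actions))
```

If the start state itself is randomized, fix the seed (or the start state) first; otherwise a random start is indistinguishable from random transitions in this test.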
Q2: How do we apply Q-learning to a problem with continuous states?
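On Q2, one common option is to discretize each state dimension into bins and run ordinary tabular Q-learning on the bin indices; the other is to replace the table with a function approximator for Q(s, a). Below is a minimal sketch of the binning route. The bounds, bin counts, and the two-dimensional state are hypothetical values chosen for illustration, not taken from the actual environment.

```python
from bisect import bisect_right
from collections import defaultdict

def make_edges(low, high, n_bins):
    """Interior edges that split [low, high] into n_bins equal intervals."""
    width = (high - low) / n_bins
    return [low + width * i for i in range(1, n_bins)]

def discretize(state, edges_per_dim):
    """Map a continuous state vector to a tuple of bin indices.

    Values below the lowest edge land in bin 0, values above the highest
    edge land in the last bin, so out-of-range states are clamped.
    """
    return tuple(bisect_right(edges, x)
                 for x, edges in zip(state, edges_per_dim))

def q_update(Q, s, a, r, s_next, done, n_actions, alpha=0.1, gamma=0.99):
    """One tabular Q-learning step on discretized states."""
    best_next = 0.0 if done else max(Q[(s_next, b)] for b in range(n_actions))
    Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])

# Hypothetical bounds for a 2-D state (e.g. position and angle).
edges = [make_edges(-2.4, 2.4, 4), make_edges(-0.2, 0.2, 4)]
Q = defaultdict(float)  # unseen (state, action) pairs start at 0.0

s = discretize((0.5, -0.15), edges)
s_next = discretize((0.6, -0.05), edges)
q_update(Q, s, a=1, r=1.0, s_next=s_next, done=False, n_actions=2)
print(s, s_next, Q[(s, 1)])
```

Coarser bins learn faster but blur states that need different actions; finer bins make the table grow exponentially with the number of state dimensions, which is the usual reason to move to function approximation.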
Resources: http://www.davidqiu.com:8888/research/nature14236.pdf