Backwards induction

Question

Backwards induction

ADGEfficiency opened this issue 6 years ago · comments

Backwards induction is a dynamic programming model - it uses an environment model (i.e. a model that predicts state transitions and rewards) to solve MDPs.

Backwards induction can be implemented for any energy_py environment by manipulating the time series structure of the energy_py envs. This will allow any energy_py env to be solved by backwards induction.

I have drafted out an implementation of backwards induction and start to think about how to manipulate the env structure.

Next step is for me to upload these!