This module looks at policy based methods of reinforcement learning, principally the drawbacks to value based methods like Q learning that motivate the use of policy gradients.
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool