deep-rl-zoo

Algorithms

Policy Gradient and Actor Critic Methods
    Vanilla Policy Gradient (VPG)

Value Based Methods with function approximation
    Deep Q Networks (DQN)