nima-siboni/cart-pole-deep-RL-actor-critic Issues
Tensorflow Update
Closed
Solving the inverted pendulum problem with deep-RL actor-critic (with shared network between the value-evaluation and the policy, epsilon-greedy policy). Some implementation issues concerning the stability are discussed.