nima-siboni / cart-pole-deep-RL-actor-critic

Solving the inverted pendulum (cart-pole) problem with a deep-RL actor-critic method. A single network is shared between the value evaluation and the policy, and actions are chosen with an epsilon-greedy policy. Some implementation issues concerning stability are discussed.
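As a rough illustration of what a shared network with epsilon-greedy action selection can look like, here is a minimal NumPy sketch. It is not the repository's code: the class name `SharedActorCritic`, the helper `epsilon_greedy`, the layer sizes, and the 4-dimensional observation with 2 discrete actions are all assumptions made for this example, and the training update is left out entirely.

```python
import numpy as np


class SharedActorCritic:
    """Hypothetical two-headed network: one shared hidden layer feeds
    both a policy head (action probabilities) and a value head."""

    def __init__(self, n_obs=4, n_hidden=32, n_actions=2, seed=0):
        rng = np.random.default_rng(seed)
        # Shared trunk used by both heads (sizes are assumptions).
        self.w_shared = rng.normal(0.0, 0.1, size=(n_obs, n_hidden))
        self.b_shared = np.zeros(n_hidden)
        # Policy head: one logit per discrete action.
        self.w_policy = rng.normal(0.0, 0.1, size=(n_hidden, n_actions))
        self.b_policy = np.zeros(n_actions)
        # Value head: a single scalar state-value estimate.
        self.w_value = rng.normal(0.0, 0.1, size=(n_hidden, 1))
        self.b_value = np.zeros(1)

    def forward(self, obs):
        """Return (action probabilities, state value) for one observation."""
        hidden = np.tanh(obs @ self.w_shared + self.b_shared)
        logits = hidden @ self.w_policy + self.b_policy
        logits = logits - logits.max()  # shift for a numerically stable softmax
        exp_logits = np.exp(logits)
        probs = exp_logits / exp_logits.sum()
        value = float((hidden @ self.w_value + self.b_value)[0])
        return probs, value


def epsilon_greedy(probs, epsilon, rng):
    """With probability epsilon pick a uniformly random action,
    otherwise pick the action the policy head rates highest."""
    if rng.random() < epsilon:
        return int(rng.integers(len(probs)))
    return int(np.argmax(probs))


# Usage: one forward pass and one action choice on a made-up observation.
rng = np.random.default_rng(1)
net = SharedActorCritic()
obs = np.array([0.0, 0.1, 0.02, -0.1])
probs, value = net.forward(obs)
action = epsilon_greedy(probs, epsilon=0.1, rng=rng)
```

Because both heads read the same hidden layer, gradients from the value loss and the policy loss would update the same shared weights during training, which is one plausible place for the stability issues mentioned above to arise.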
