ranzuh / semi-gradient-sarsa

Semi-gradient Sarsa in OpenAI Gym environments

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Semi-gradient Sarsa in OpenAI Gym environments

  • On-policy model free Sarsa reinforcement learning algorithm
  • Linear function approximation using tilecoder
  • Solving OpenAI Gym environments MountainCar and CartPole (over 10k timesteps for cartpole)
  • It seems it's not enough for LunarLander, a neural network may be needed

cartpole.gif

Algorithm is from Sutton & Barto's RL Book

Tile coder is from http://incompleteideas.net/tiles/tiles3.html

About

Semi-gradient Sarsa in OpenAI Gym environments


Languages

Language:Jupyter Notebook 96.8%Language:Python 3.2%