mgiammatteo / stochastic_policy_search

Implementation of three stochastic policy search algorithms for RL

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Hill Climbing

This repository contains the implementation of the following stochastic policy search algorithms:

  • Vanilla hill climbing
  • Hill climbing with simulated annealing
  • Steepest ascent hill climbing with adaptive noise scaling

They are all applied to the OpenAi Gym Cartpole-v0 environment, for simplicity.

About

Implementation of three stochastic policy search algorithms for RL


Languages

Language:Jupyter Notebook 100.0%