LanxinL / HOOF

Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Fast Efficient Hyperparameter Tuning for Policy Gradients (https://arxiv.org/abs/1902.06583)

Implementation of HOOF for A2C and TNPG. The code is based on OpenAI Baselines implementation: https://github.com/openai/baselines

To run the code:

  1. Add your MuJoCo key to the folder
  2. Build the docker with build.sh and then run it with run.sh
  3. The parameters for each environment is in the yaml file
  4. Run the code with run_A2C.sh or run_TRPO_TNPG.sh with the yaml filename as argument
  5. For other environments, just create a new yaml file and add it to the folder And that's it!

About

Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583


Languages

Language:Python 96.8%Language:Dockerfile 2.4%Language:Shell 0.8%