tmoer / a0c

Code for the paper Alpha Zero in Continuous Action Space (A0C) (https://arxiv.org/pdf/1805.09613.pdf)

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

A0C

This project simulates the Alpha Zero algorithm for continuous action spaces (A0C). The associated paper is available from here: https://arxiv.org/pdf/1805.09613.pdf

Installation

You can clone this project by running: git clone https://github.com/tmoer/cursus.git

Requirements are available from ./requirements.txt

Run the project

Run the project with python3 agent.py

Hyperparameter settings can be modified in ./config/hps

Project organization

.
├── .gitignore
├── CITATION.md
├── LICENSE.md
├── README.md
├── requirements.txt
├── bin                <- Compiled and external code, ignored by git (PG)
│   └── external       <- Any external source code, ignored by git (RO)
├── config             <- Configuration files (HW)
├── data               <- All project data, ignored by git
│   ├── processed      <- The final, canonical data sets for modeling. (PG)
│   ├── raw            <- The original, immutable data dump. (RO)
│   └── temp           <- Intermediate data that has been transformed. (PG)
├── docs               <- Documentation notebook for users (HW)
│   ├── manuscript     <- Manuscript source, e.g., LaTeX, Markdown, etc. (HW)
│   └── reports        <- Other project reports and notebooks (e.g. Jupyter, .Rmd) (HW)
├── results
│   ├── figures        <- Figures for the manuscript or reports (PG)
│   └── output         <- Other output for the manuscript or reports (PG)
└── src                <- Source code for this project (HW)

License

This project is licensed under the terms of the MIT License

Citation

Please cite this project as described here.

About

Code for the paper Alpha Zero in Continuous Action Space (A0C) (https://arxiv.org/pdf/1805.09613.pdf)

License:MIT License


Languages

Language:Python 88.2%Language:Shell 11.8%