cptanalatriste / banana-hunter

A deep-reinforcement learning agent that loves bananas.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

banana-hunter

A deep-reinforcement learning agent that loves bananas, trained using Deep Q-Networks (DQN).

Project Details:

Banana-Hunter is a deep-reinforcement learning agent designed for the Banana Collectors environment from the Unity ML-Agents Toolkit.

The banana collector environment

The state is represented via a vector of 37 elements, corresponding to the agent's perception of the objects (i.e. bananas) around him. Our agent has four possible actions:

  1. Move forward, represented by 0.
  2. Move backwards, represented by 1.
  3. Turn left, represented by 2.
  4. Turn right, represented by 3.

We considered our agent has mastered the task when he reached an average score of 13, over 100 episodes.

Getting Started

Before running your agent, be sure to accomplish this first:

  1. Clone this repository.
  2. Download the banana collector environment appropriate to your operating system (available here).
  3. Place the environment file in the cloned repository folder.
  4. Setup an appropriate Python environment. Instructions available [here.] (https://github.com/udacity/deep-reinforcement-learning)

Instructions

You can start running and training the agent by exploring Navigation.ipynb. Also available in the repository:

  • banana_hunter.py contains the agent code.
  • banana_manager.py has the code for training the agent.

About

A deep-reinforcement learning agent that loves bananas.


Languages

Language:Jupyter Notebook 78.7%Language:Python 21.3%