deep-reinforcement-learning pytorch python deep-q-network

banana-hunter

A deep reinforcement learning agent that loves bananas, trained using Deep Q-Networks (DQN).

Project Details

Banana-Hunter is a deep reinforcement learning agent designed for the Banana Collectors environment from the Unity ML-Agents Toolkit.

The state is a vector of 37 elements describing the agent's perception of the objects (such as bananas) around it. The agent has four possible actions:

Move forward, represented by 0.
Move backward, represented by 1.
Turn left, represented by 2.
Turn right, represented by 3.
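
The action encoding above can be captured in a small enum so that the integer indices stay readable in agent code. The sketch below also shows epsilon-greedy selection, a common exploration strategy for DQN agents. The names `Action` and `select_action`, and the choice of epsilon-greedy, are illustrative assumptions and are not taken from this repository.

```python
import random
from enum import IntEnum


class Action(IntEnum):
    """Discrete actions listed above, keyed by their integer index."""
    FORWARD = 0
    BACKWARD = 1
    LEFT = 2
    RIGHT = 3


def select_action(q_values, epsilon, rng=random):
    """Epsilon-greedy choice over one Q-value per action (hypothetical helper)."""
    if len(q_values) != len(Action):
        raise ValueError("expected one Q-value per action")
    if rng.random() < epsilon:
        # Explore: pick any action uniformly at random.
        return Action(rng.randrange(len(Action)))
    # Exploit: pick the action with the highest estimated value.
    best = max(range(len(q_values)), key=lambda i: q_values[i])
    return Action(best)
```

With `epsilon=0.0` the helper always returns the highest-valued action; with `epsilon=1.0` it always samples uniformly. In practice epsilon is usually decayed over the course of training.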

We consider the task solved once the agent reaches an average score of at least 13 over 100 consecutive episodes.
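
One way to check this criterion during training is to keep a rolling window of the most recent episode scores. This is a minimal sketch, assuming the average is taken over the latest 100 episodes and that a score of 13 or more counts as success; `SolvedTracker` is a hypothetical helper, not part of this repository.

```python
from collections import deque

SOLVED_SCORE = 13.0  # target average score from the project description
WINDOW = 100         # number of most recent episodes to average over


class SolvedTracker:
    """Tracks episode scores and reports when the rolling average hits the target."""

    def __init__(self, target=SOLVED_SCORE, window=WINDOW):
        self.target = target
        self.scores = deque(maxlen=window)  # old scores drop out automatically

    def add(self, score):
        """Record one episode score; return True once the task counts as solved."""
        self.scores.append(score)
        return self.is_solved()

    def average(self):
        """Mean of the scores currently in the window (0.0 if empty)."""
        return sum(self.scores) / len(self.scores) if self.scores else 0.0

    def is_solved(self):
        # Require a full window so a few lucky early episodes do not count.
        return len(self.scores) == self.scores.maxlen and self.average() >= self.target
```

A training loop could call `tracker.add(episode_score)` after every episode and stop, or save the model weights, the first time it returns `True`.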

Getting Started

Before running the agent, complete the following steps:

Clone this repository.
Download the Banana Collectors environment build for your operating system (available here).
Place the environment file in the cloned repository folder.
Set up an appropriate Python environment. Instructions are available [here](https://github.com/udacity/deep-reinforcement-learning).

Instructions

To train and run the agent, open Navigation.ipynb. The repository also contains:

banana_hunter.py contains the agent code.
banana_manager.py has the code for training the agent.
