telmo-correa / DRLND-project-1

Implementation of project 1 for Udacity's Deep Reinforcement Learning Nanodegree

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

DRLND-project-1

This repository contains an implementation of project 1 for Udacity's Deep Reinforcement Learning Nanodegree.

Project details

This project implements a Deep Q-Learning Network for solving a Unity environment that can navigate and collect bananas in a large, square world.

Recording of final implementation:

Implemented solution

A reward of +1 is provided for collecting a yellow banana, and a reward of -1 is provided for collecting a blue banana. The goal of the agent is to collect as many yellow bananas as possible, while avoiding the blue bananas.

The state space has 37 dimensions pre-computed by the environment. The action space has four discrete actions, corresponding to moving forward, backward, turning left or turning right.

The task is episodic, and it is considered solved when the agent gets an average score of +13 over 100 consecutive episodes.

Getting started

Dependencies

Dependencies for this project can be setup as per dependencies for the DRL repository. The instructions below walk you through setting up this environment:

  1. Create (and activate) a new environment with Python 3.6.

    • Linux or Mac:
    conda create --name drlnd python=3.6
    source activate drlnd
    
    • Windows:
    conda create --name drlnd python=3.6
    conda activate drlnd
    
  2. Perform a minimal install of the OpenAI gym, as instructed on this repository, or the very helpful instructions at Open AI Spinning Up.

    • Note that openmpi is not natively supported on Windows 10; I had luck installing instead the Microsoft MPI.
  3. Clone this repository, and install its dependencies:

    git clone https://github.com/telmo-correa/DRLND-project-1
    cd DRLND-project-1
    pip install -r requirements.txt
    
  4. Create an IPython kernel for the drlnd environment:

    python -m ipykernel install --user --name drlnd --display-name "drlnd"
    
  5. Run the Jupyter notebook using the newly created kernel:

Selecting kernel on Jupyter

Downloading the Unity environment

Different versions of the Unity environment are required on different operational systems.

Download the corresponding zip file and unpack it. You will need to point to the location of the file within the Navigation.ipynb and Report.ipynb notebooks in order to run them.

Instructions

The problem and the environment are available on the Navigation.ipynb notebook.

The implementation is provided on the Report.ipynbnotebook, along with a discussion of its details.

About

Implementation of project 1 for Udacity's Deep Reinforcement Learning Nanodegree


Languages

Language:Jupyter Notebook 84.2%Language:Python 15.8%