proximal-policy-optimization deep-reinforcement-learning continuous-control pytorch crawler-environment reacher-environment unity-environment

DRLND-project-2

This repository contains an implementation of project 2 for Udacity's Deep Reinforcement Learning Nanodegree.

Project details

Reacher

This project implements PPO for solving a Unity environment for continuous control -- making robotic arms follow rotating waypoints -- on the variant version with 20 agents.

Recording of final implementation:

Rewards are accumulated over time for each agent while the arm extremity is within the target spheres / within a fixed range of the target waypoint. The goal of the agent is to follow the waypoint.

For each agent, the state space has 33 dimensions, and the action space has 4 continuous dimensions.

The task is episodic, and it is considered solved when the agent gets an average score of +30 over 100 consecutive episodes.

Crawler

An optional / extra / challenge part of the project is to control a Crawler.

Rewards are accumulated over time for each agent while each four-legged entity attempts to follow a target objective, based on facing the correct orientation and the speed in that orientation.

The environment features 12 agents, each observing a state with dimension 129, and controlling actions with dimension 20.

This environment is based upon the Unity Crawler environment.

Getting started

Dependencies

Dependencies for this project can be setup as per dependencies for the DRL repository. The instructions below walk you through setting up this environment:

Create (and activate) a new environment with Python 3.6.

Linux or Mac:

conda create --name drlnd python=3.6
source activate drlnd

Windows:

conda create --name drlnd python=3.6
conda activate drlnd

Perform a minimal install of the OpenAI gym, as instructed on this repository, or the very helpful instructions at Open AI Spinning Up.
- Note that openmpi is not natively supported on Windows 10; I had luck installing instead the Microsoft MPI.
Clone this repository, and install its dependencies:
```
git clone https://github.com/telmo-correa/DRLND-project-2
cd DRLND-project-2
pip install -r requirements.txt
```
- Note that there seems to be issues installing unityagents on Windows 10 -- conda looks for a required version of pytorch that does not seem to be available. Commenting out that requirement and installing pytorch separately worked for me.

Create an IPython kernel for the drlnd environment:

python -m ipykernel install --user --name drlnd --display-name "drlnd"

Run the Jupyter notebook using the newly created kernel:

Downloading the Unity environment

Reacher

Different versions of the Unity environment are required on different operational systems.

Version 1: One (1) Agent

Linux: click here
Mac OSX: click here
Windows (32-bit): click here
Windows (64-bit): click here

Version 2: Twenty (20) Agents

Linux: click here
Mac OSX: click here
Windows (32-bit): click here
Windows (64-bit): click here

Download the corresponding zip file and unpack it. You will need to point to the location of the file within the Continuous_Control.ipynb and Report.ipynb notebooks in order to run them.

Crawler

Linux: click here
Mac OSX: click here
Windows (32-bit): click here
Windows (64-bit): click here

Download the corresponding zip file and unpack it. You will need to point to the location of the file within the Crawler.ipynb and Crawler-Report.ipynb notebooks in order to run them.

Instructions

Reacher

The problem and the environment are available on the Continuous_Control.ipynb notebook.

The implementation is provided on the Report.ipynbnotebook, along with a discussion of its details.

Crawler

The extra part of the project is described on the Crawler.ipynb notebook.

The implementation is provided on the Crawler-Report.ipnyb notebook.

About

Implementation of project 2 for Udacity's Deep Reinforcement Learning Nanodegree

proximal-policy-optimization deep-reinforcement-learning continuous-control pytorch crawler-environment reacher-environment unity-environment

Languages

Language:Jupyter Notebook 86.2%Language:Python 13.8%