schmiJo / p3_collabl-compet

This repository contains my solution to the third project of the Udacity reinforcement learning course

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Collab-Compete Project Jonas Schmidt

Welcome to the third Udacity reinforcement Project.
In this Project we train two agents to play tennis.
👇🏼These are the resulting agents
image info

Project Details

This project contains a solution to the third project of the Udacity Deep Reinforcement Learning Course. This Project uses a multi agent DDPG Algorithm to train the agent.

If an agent hits the ball over the net, it receives a reward of 0.1, if however the ball is dropped or is thrown out of bounds it receives an reward of -0.01.

The task is episodic, and in order to solve the environment, the agent must get an average score of +0.5 over 100 consecutive episodes.

State and Action Spaces

The state space consists of 8 dimensions and is continuous. The observations contain velocity and position of the ball and the racket.

The action space consists of 2 continuous actions, which control the racket. These correspond to a vertical and horizontal movement.

The Agents are trained using a DDPG algorithm with a shared replay buffer.
For further information on training please read the Report.md.

Getting Started

###Prerequisites Python 3.6
Unity
Conda

##Installation:

  1. Clone the repository
https://github.com/schmiJo/p3_collabl-compet
  1. Install Jupyter Notebook
pip install jupyter
  1. Create and activate a new environment for Python 3.6
  • Linux or Mac
conda create --name drlnd python=3.6
source activate drlnd
  • Windows
conda create --name drlnd python=3.6
activate drlnd
  1. Install several dependencies
pip install -r requirements.txt
  1. Before running the Tennis.ipynb change the kernel to match the drlnd environment by using the drop down Kernel menu.

Download the unity environment using the following link for macOs:
https://s3-us-west-1.amazonaws.com/udacity-drlnd/P3/Tennis/Tennis.app.zip

More instructions for the installation can be found under:
https://github.com/udacity/deep-reinforcement-learning#dependencies

About

This repository contains my solution to the third project of the Udacity reinforcement learning course


Languages

Language:ASP.NET 87.7%Language:Jupyter Notebook 10.2%Language:Python 2.1%