doctorcorral / DRLND-p3-collabcomp

DRLND project 3 on collaboration and competition

DRLND Collaboration Compete Project



This is the third project in the Udacity Deep Reinforcement Learning Nanodegree.

This project works with the Tennis environment.

Project Details

In this environment, two agents control rackets to bounce a ball over a net. If an agent hits the ball over the net, it receives a reward of +0.1. If an agent lets a ball hit the ground or hits the ball out of bounds, it receives a reward of -0.01. Thus, the goal of each agent is to keep the ball in play.
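As a rough illustration of the reward scheme above, the following sketch sums one agent's rewards over an episode. The event labels `over_net` and `fault` and the helper `episode_return` are hypothetical and are not part of the environment's API; only the two reward values come from the description.

```python
# Reward values taken from the project description above.
REWARD_OVER_NET = 0.1
REWARD_FAULT = -0.01  # ball hits the ground or goes out of bounds


def episode_return(events):
    """Sum the rewards for a sequence of hypothetical event labels."""
    rewards = {"over_net": REWARD_OVER_NET, "fault": REWARD_FAULT}
    return sum(rewards[event] for event in events)


# An agent that returns the ball three times before a fault
# ends up ahead of one that faults immediately.
long_rally = episode_return(["over_net", "over_net", "over_net", "fault"])
short_rally = episode_return(["fault"])
print(long_rally > short_rally)
```

Because each successful hit is worth ten times the magnitude of a fault, longer rallies dominate the return, which is why keeping the ball in play is the effective objective.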

The observation space consists of 8 variables corresponding to the position and velocity of the ball and racket. Each agent receives its own, local observation. Two continuous actions are available, corresponding to movement toward (or away from) the net, and jumping.
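A minimal sketch of the action shapes involved, assuming two agents and action values bounded to [-1, 1]. The bound and the helper names `clip` and `random_actions` are assumptions for illustration; the description above only states that each agent has two continuous actions.

```python
import random

NUM_AGENTS = 2   # two rackets
ACTION_SIZE = 2  # movement toward/away from the net, and jumping


def clip(value, low=-1.0, high=1.0):
    """Clamp a value to the assumed action range."""
    return max(low, min(high, value))


def random_actions(rng):
    """Return one random, clipped action vector per agent."""
    return [
        [clip(rng.gauss(0.0, 1.0)) for _ in range(ACTION_SIZE)]
        for _ in range(NUM_AGENTS)
    ]


actions = random_actions(random.Random(0))
print(len(actions), len(actions[0]))
```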

Getting Started

It is recommended to follow the dependency setup instructions from the Udacity DRLND repository.

This project also needs Unity ML-Agents, NumPy and PyTorch.

A prebuilt simulator must be installed. You need only select the environment that matches your operating system:

Tennis Unity Environment

Linux: click here
Mac OSX: click here
Windows (64-bit): click here

The file needs to be placed in the root directory of the repository and unzipped.

Next, before running the code cell in the notebook that starts the environment, change the file_name parameter to match the location of the prebuilt Unity environment that you downloaded from Udacity.
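One way to choose the `file_name` value is to branch on the operating system. The paths below are assumptions about how the archives unpack, so check them against the folder you actually extracted. The helper `tennis_env_path` is hypothetical, and the `UnityEnvironment` call appears only as a comment so that the snippet runs without the simulator installed.

```python
import platform

# Assumed locations after unzipping in the repository root; adjust as needed.
ENV_PATHS = {
    "Linux": "Tennis_Linux/Tennis.x86_64",
    "Darwin": "Tennis.app",
    "Windows": "Tennis_Windows_x86_64/Tennis.exe",
}


def tennis_env_path(system=None):
    """Return the assumed simulator path for the given (or current) OS."""
    system = system or platform.system()
    try:
        return ENV_PATHS[system]
    except KeyError:
        raise ValueError(f"No prebuilt Tennis environment listed for {system!r}")


# In the notebook, this value would be passed along the lines of:
#   from unityagents import UnityEnvironment
#   env = UnityEnvironment(file_name=tennis_env_path())
print(tennis_env_path("Linux"))
```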

Instructions

Run the Tennis.ipynb notebook using the drlnd kernel to train the DDPG agent.

Once training finishes, the model weights are saved in the same directory in the files checkpoint1_actor0.pth, checkpoint1_actor1.pth and checkpoint1_critic.pth.
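Before moving on to testing, it may help to confirm that all three checkpoint files were written. The sketch below uses only the standard library; the helper `missing_checkpoints` is hypothetical and not part of the repository.

```python
from pathlib import Path

# File names as listed above.
CHECKPOINT_FILES = (
    "checkpoint1_actor0.pth",
    "checkpoint1_actor1.pth",
    "checkpoint1_critic.pth",
)


def missing_checkpoints(directory="."):
    """Return the checkpoint file names not found in `directory`."""
    root = Path(directory)
    return [name for name in CHECKPOINT_FILES if not (root / name).is_file()]


missing = missing_checkpoints()
if missing:
    print("Missing checkpoints:", ", ".join(missing))
else:
    print("All checkpoints present.")
```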

The test_agent.ipynb notebook loads these weights to run the trained agents in the simulator.
