letmelearncode / collaboration-competition

an agent trained on Deep Reinforcement Learning network for playing tennis

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Instructions

You must first download the environment :

Linux

Mac OSX

Windows 32-bit

Windows 64-bit

Train and Watch the agent

Training and watching the agent is done in the MADDPG.ipynb notebook. Detailed instructions are present in this notebook.

Environment

movie

Two agents control rackets to bounce a ball over a net. They receive a reward of +0.1 for hitting the ball over the net and a reward of -0.01 for letting the ball hit the ground or hitting it out of bounds. Thus, each agent wants to keep the ball in play as long as possible. Each agent perceive a state of dimensionality 24 and must take an action in the real square [-1,1]x[-1,1].

The goal is to get a minimum score of 0.5 on average over 100 episodic tasks, where the score for each task is the score of the best agent.

Training Curve

The training allowed to solve this task in 3092 episodes. See report.md file for more details.

training

About

an agent trained on Deep Reinforcement Learning network for playing tennis


Languages

Language:Jupyter Notebook 100.0%