dennis-grinwald / RL_DQN_Autonomous_Lunar_Lander

Applying the DQN-Reinforcement Learning Algorithm to Gym's Lunar Lander environment

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

RL_DQN_Autonomous_Lunar_Lander

Applying the DQN-Reinforcement Learning Algorithm to Gym's Lunar Lander environment. Train an agent to safely land a Lunar Lander on the moon without crashing.

The original paper explaining the Algorithm can be found here: https://www.cs.toronto.edu/~vmnih/docs/dqn.pdf

Prerequisites

What things you need to install the software and how to install them

OpenAIs Gym toolbox

pip install gym

PyTorch

pip install torchvision

Running the experiements

Run the Notebook "Deep-Q-Network.ipynb" to train your agent. The notebook provides an comparison of the agents ability to land the lunar lander before and after training. You should see strong improvements. Optinonally you can skip the training procedure and jump ahead to point 4. in the notebook to load a pretrained model and watch the trained agents performance.

Optimizing tests

Tweak several Hyperparameters in the model.py (Hidden Layer size etc.) to see if you can achieve performance gains. Also you can play around with the exploration parameter epsilon and ReplayBuffer size in the dqn_agent.py

Credits

The project is part of Udacities Nanoprogram: Deep Reinforcement Learning

https://eu.udacity.com/course/deep-reinforcement-learning-nanodegree--nd893

About

Applying the DQN-Reinforcement Learning Algorithm to Gym's Lunar Lander environment


Languages

Language:Python 52.4%Language:Jupyter Notebook 47.6%