aminedassouli/AI_PyRat

PyRat

PyRat is a maze game with two opponents (a rat and a python), pieces of cheese, and optionally obstacles and mud. The goal is to collect as many pieces of cheese as possible.

A picture of the game is shown below. For more information about the game and its source code, see https://github.com/vgripon/PyRat

The AI

The AI was built with deep Q-learning, using a Monte Carlo technique to supervise the training and choose the actions.

The reinforcement learning procedure, including Q-learning, experience replay, and stochastic gradient descent, is described in the rl.py file.
The simulation environment, including the generation of the reward and of the observation fed to the agent, is described in the game.py file.
The Monte Carlo technique used to find an optimised path and choose the actions the agent follows is described in the monte_carlo.py file.
The agent (AI) can be deployed in the game with the numpy_rl_reload.py file available in Ais.
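
The Q-learning and experience replay steps named above can be illustrated with a minimal sketch. This is not the content of rl.py: the names `ReplayBuffer` and `q_target`, the buffer capacity, and the discount factor `gamma` are all assumptions chosen for illustration. The buffer stores past transitions and returns random mini-batches, and `q_target` computes the usual one-step target that the network's prediction would be regressed towards by stochastic gradient descent.

```python
import random
from collections import deque


class ReplayBuffer:
    """Fixed-size memory of past transitions (hypothetical helper, not from rl.py)."""

    def __init__(self, capacity):
        # A deque with maxlen silently drops the oldest transition when full.
        self.memory = deque(maxlen=capacity)

    def push(self, state, action, reward, next_state, done):
        self.memory.append((state, action, reward, next_state, done))

    def sample(self, batch_size, rng=random):
        # Sampling at random breaks the correlation between consecutive steps.
        batch_size = min(batch_size, len(self.memory))
        return rng.sample(list(self.memory), batch_size)

    def __len__(self):
        return len(self.memory)


def q_target(reward, next_q_values, done, gamma=0.95):
    """One-step Q-learning target: r + gamma * max_a' Q(s', a').

    gamma=0.95 is an assumed value, not taken from the repository.
    """
    if done:
        # No future reward after a terminal state.
        return reward
    return reward + gamma * max(next_q_values)


if __name__ == "__main__":
    buffer = ReplayBuffer(capacity=1000)
    buffer.push(state=(0, 0), action="R", reward=1.0, next_state=(1, 0), done=False)
    for state, action, reward, next_state, done in buffer.sample(batch_size=32):
        # In a real agent, next_q_values would come from the Q-network.
        print(q_target(reward, next_q_values=[0.0, 0.5, 0.2, 0.1], done=done))
```

In a full training loop, each sampled target would be compared with the network's current estimate of Q(s, a), and the squared difference would be minimised.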

The code needs some other files in order to work. Unfortunately, I do not have the right to share them.

The TensorFlow model for the neural network that predicts the Q function is defined in the rl.py file.

Here is the curve showing how the winrate evolves during training:

The winrate is about 50% at the end of training. To improve on this, I supervised the Q-learning by using Monte Carlo to choose the actions to learn from, instead of learning from completely random actions.

As a result, the winrate reached 80%.
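
The idea of choosing actions with Monte Carlo rather than at random can be sketched as follows. This is only an illustration under simplifying assumptions, not the code in monte_carlo.py: the maze has no walls, mud, or opponent, and the function names, the number of rollouts, the horizon, and the discount factor are all hypothetical. For each candidate first move, the sketch plays several random continuations and keeps the move with the highest average discounted amount of cheese collected.

```python
import random

# Assumed action set: left, right, up, down.
MOVES = {"L": (-1, 0), "R": (1, 0), "U": (0, 1), "D": (0, -1)}


def step(pos, cheese, action, width, height):
    """Apply one move on an open grid and return (new_pos, remaining_cheese, reward)."""
    dx, dy = MOVES[action]
    x, y = pos[0] + dx, pos[1] + dy
    if not (0 <= x < width and 0 <= y < height):
        x, y = pos  # bumping into the border leaves the player in place
    new_pos = (x, y)
    reward = 1.0 if new_pos in cheese else 0.0
    return new_pos, cheese - {new_pos}, reward


def rollout_return(pos, cheese, first_action, width, height, horizon, gamma, rng):
    """Discounted return of one rollout: the given first move, then random moves."""
    total, discount, action = 0.0, 1.0, first_action
    for _ in range(horizon):
        pos, cheese, reward = step(pos, cheese, action, width, height)
        total += discount * reward
        discount *= gamma
        if not cheese:
            break  # nothing left to collect
        action = rng.choice(list(MOVES))
    return total


def monte_carlo_action(pos, cheese, width, height,
                       n_rollouts=50, horizon=10, gamma=0.9, seed=0):
    """Return the first move with the best average rollout return, plus all scores."""
    rng = random.Random(seed)
    start_cheese = frozenset(cheese)
    scores = {}
    for action in MOVES:
        returns = [
            rollout_return(pos, start_cheese, action, width, height, horizon, gamma, rng)
            for _ in range(n_rollouts)
        ]
        scores[action] = sum(returns) / n_rollouts
    return max(scores, key=scores.get), scores


if __name__ == "__main__":
    best, scores = monte_carlo_action((0, 0), {(1, 0)}, width=3, height=1)
    print(best, scores)
```

The transitions produced by following such moves could then be stored and learned from in place of transitions generated by uniformly random actions, which is one plausible reading of the supervision described above.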

About

Code and files for implementing Deep Q-Learning with Monte Carlo for an AI in a maze game "PyRat" against a basic enemy (using a greedy algorithm)
