avneets2103 / Frozen-lake-Gym-Game

A game environment facilitated by GYM. Solved using RL to an average reward of 0.7 by using basic Q learning algorithm.

Reinforcement Learning:

Q learning algo:
Q-Function
The Q-function uses the Bellman equation and takes state(s) and action(a) as input. The equation simplifies the state values and state-action value calculation.

Q-learning algorithm

Environment used:
Python: 3.8.10
gym: 0.26.2

About

A game environment facilitated by GYM. Solved using RL to an average reward of 0.7 by using basic Q learning algorithm.

Languages

Language:Jupyter Notebook 100.0%