StamatisOrfanos / RL_path_optimization

This is my undergraduate thesis on path optimization in an open, stochastic grid environment using reinforcement learning (RL) methods such as an epsilon-greedy strategy and a Monte Carlo/Temporal Difference hybrid.
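
For readers unfamiliar with the epsilon-greedy strategy named above, here is a minimal sketch of the action-selection rule. It is illustrative only and is not taken from this repository: the function name `epsilon_greedy`, the `q_values` list, and the `rng` parameter are assumptions made for the example.

```python
import random


def epsilon_greedy(q_values, epsilon, rng=random):
    """Return an action index for one state.

    q_values: estimated action values for the current state (hypothetical input).
    epsilon:  probability of exploring, between 0.0 and 1.0.
    rng:      any object with random(), randrange() and choice(); defaults to
              the standard-library random module.
    """
    # Explore: with probability epsilon, pick any action uniformly at random.
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))

    # Exploit: otherwise pick an action with the highest estimated value,
    # breaking ties at random so no action is favoured by its position.
    best = max(q_values)
    candidates = [a for a, q in enumerate(q_values) if q == best]
    return rng.choice(candidates)


if __name__ == "__main__":
    # With epsilon = 0 the rule is purely greedy.
    print(epsilon_greedy([0.1, 0.9, 0.3], epsilon=0.0))
```

In a grid world, `q_values` would typically hold one estimate per move (for example up, down, left, right) for the agent's current cell, and `epsilon` is often decayed over episodes so that exploration gives way to exploitation.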