dennybritz / reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Home Page:http://www.wildml.com/2016/10/learning-reinforcement-learning/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Randomness in optimal epsilon_greedy_policy

levindabhi opened this issue · comments

Why we are not decaying epsilon in epsilon_greedy_policy ?

@levindabhi I am not sure but isn't it a type of a hyperparameter? If you would like to decay it, then you can else you can just assume that the minimum \epsilon allowed is the \epsilon you gave to the algorithm in the first place.