Randomness in optimal epsilon_greedy_policy

Question

Randomness in optimal epsilon_greedy_policy

levindabhi opened this issue 6 years ago · comments

Why we are not decaying epsilon in epsilon_greedy_policy ?

Apoorv Agnihotri · Answer 1 · Thu Jun 20 2019 22:10:41 GMT+0800 (China Standard Time)

@levindabhi I am not sure but isn't it a type of a hyperparameter? If you would like to decay it, then you can else you can just assume that the minimum \epsilon allowed is the \epsilon you gave to the algorithm in the first place.