Code for the numerical experiments in Zhang, Sheng, Zhe Zhang, and Siva Theja Maguluri. "Finite Sample Analysis of Average-Reward TD Learning and Q-Learning."
Home Page: https://openreview.net/pdf?id=1Rxp-demAH0