xiaojianzhang / Average-Reward-TD-Q-Learning

Code for the numerical experiments in Zhang, Sheng, Zhe Zhang, and Siva Theja Maguluri. "Finite Sample Analysis of Average-Reward TD Learning and Q-Learning."

Home Page: https://openreview.net/pdf?id=1Rxp-demAH0
