boyu-ai / Hands-on-RL

https://hrl.boyuai.com/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

2.5公式错误

StevenJokess opened this issue · comments

https://hrl.boyuai.com/chapter/1/%E5%A4%9A%E8%87%82%E8%80%81%E8%99%8E%E6%9C%BA#25-%E4%B8%8A%E7%BD%AE%E4%BF%A1%E7%95%8C%E7%AE%97%E6%B3%95

image

应是 p = 1 / N。

image
total_count就是N

其中,我们用$N$表示到目前为止按压所有臂的次数和,$N_t$代表为目前为止按压第t个臂的次数。

更多可参考我项目:https://github.com/StevenJokess/d2rl/blob/master/chapter/MAB.md#L124-L125
QQ群交个朋友:171097552
付款表达感谢:
收