wangshusen / DRL

Deep Reinforcement Learning

wangshusen/DRL Issues

发现了两个小错误
Updated a year ago
有没有代码示例呢？
Updated a year ago3
经验回放
Updated a year ago1
3.4.1节动作价值函数
Updated a year ago
p101 题3答案B文字错误
Updated 2 years ago3
REINFORCE with Baseline 中的 slides 出现错误
Updated 2 years ago
请问如何cite这本书呢？
Updated 2 years ago3
关于s,S与a,A间的相互转化
Updated 2 years ago
强化学习视频中使用的讲义和该 repo 中的 slides 对不上
Updated 2 years ago
跪求老师更新一节PPO的讲解视频
Updated 2 years ago3
7.3.2证明中的typo
Updated 2 years ago2
10.3.3 小节漏字
Updated 2 years ago
7.3.2 节可能的错误
Closed 2 years ago1
第五章SARSA算法描述是否有误
Updated 2 years ago3
我不清楚这里是否写错了
Updated 2 years ago4
劝你识相点，给我入驻B站（手动狗头）
Updated 2 years ago1
github上的DRL.pdf是最新版本吗？
Updated 2 years ago
Typo in Notes_CN/DRL.pdf: regarding entropy formulae
Updated 2 years ago
建议增加PPO和SAC讲解
Updated 2 years ago1
4.2.1 一术语使用不妥
Closed 2 years ago2
第7章视频没有公开
Updated 3 years ago1
请教，对于多Agent，按既定次序采取动作，而不是同时采取动作的问题，应如何建模，是否有推荐的论文？多谢
Updated 3 years ago1
感谢王先生难能可贵的分享，能否给书籍增加书签目录？
Closed 3 years ago2
第9章笔误及第6章疑问
Closed 3 years ago
Nothing
Closed 3 years ago
6.2.4 使用目标网络：可能的错误
Updated 3 years ago2
可能的错误：6.2.1小节--自举导致偏差的传播
Updated 3 years ago2
8.1节可能的小错误
Updated 3 years ago1
7. Multi-Agent Reinforcement Learning. 视频看不了
Updated 3 years ago1
对前两章基础部分内容的读后反馈
Updated 3 years ago6
习题答案
Updated 3 years ago
前9章读后感
Updated 3 years ago
Double DQN gamma 参数
Updated 3 years ago
3.5 添加相关概念
Closed 3 years ago
4.4 Q 学习算法 P47 落下一个字
Updated 3 years ago1
很不错的书，希望增加目录，还有文中公式，引用的超链接
Updated 3 years ago2
基于强化学习的知识图谱推理
Updated 3 years ago
建议增加值分布强化学习的内容
Updated 3 years ago
阅读反馈
Updated 3 years ago2
ImageNet 在深度学习中的应用
Updated 3 years ago1
More explanations on why Dueling DQN separates Q function
Updated 3 years ago1
确定策略梯度章节的改进建议
Updated 3 years ago2
TRPO中的一个小问题
Updated 3 years ago6
4.3.1算法推导的第一个公式
Updated 3 years ago3
一个小typo
Closed 3 years ago1
Missing right parenthesis in Appendix A
Closed 3 years ago1
41页的参数更新
Closed 3 years ago2
Question About P48
Closed 3 years ago5
第四页有一处错字
Updated 3 years ago3
chapter4: a question about TD
Closed 3 years ago2