YangWulve / Reinforcement_Learning

RLAI_book and DQN_adventure

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

#RLAI书籍阅读计划:

./book  目录下

08_20 QL算法+RLAI chapter2
08_21 RLAI chapter2
.
.
.
09_03 完成RLAI前十三章的阅读

Deep Reinforcement Learning Algorithm
代码:

./Pytorch_basic  目录下

一.deepmind DQN 测试环境:gym cartpole-v0 实现了以下几种DQN变型
1.DQN
2.DoubleCQN
3.PriorityMemoryDQN
4.DuelingDQN
5.DeepRecurrentQN
6...... e.g. AverageDQN NoisyDQN RainBow等等尚未实现

二.openai policyGradient
若干变型 尚未实现

区别理解
1.on-policy off-policy actor-critic区别?
2.value-based policy-based区别?

About

RLAI_book and DQN_adventure


Languages

Language:Jupyter Notebook 100.0%Language:Python 0.0%