MorvanZhou / Reinforcement-learning-with-tensorflow

Simple Reinforcement learning tutorials, 莫烦Python Chinese-language AI tutorials

Home Page: https://mofanpy.com/tutorials/machine-learning/reinforcement-learning/

In actor-critic, can the critic be designed to predict a distribution of action values instead of a single state value?

Hins opened this issue

Then take the value of the corresponding action to compute v and v'.
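
To make the question concrete, here is a minimal sketch of what such a critic might look like. It assumes discrete actions and a linear critic with one output per action, and it uses a SARSA-style TD target (the value of the action actually taken in the next state). The function names `q_values`, `td_error`, and `critic_update` are hypothetical and do not come from this repository.

```python
import numpy as np

def q_values(weights, state):
    """Linear critic head: returns one action value per discrete action."""
    return state @ weights  # shape: (n_actions,)

def td_error(weights, s, a, r, s_next, a_next, gamma=0.9, done=False):
    """TD error using Q(s, a) as v and Q(s', a') as v' (SARSA-style)."""
    v = q_values(weights, s)[a]
    v_next = 0.0 if done else q_values(weights, s_next)[a_next]
    return r + gamma * v_next - v

def critic_update(weights, s, a, delta, lr=0.1):
    """Semi-gradient step: only the column of the taken action changes."""
    new_weights = weights.copy()
    new_weights[:, a] += lr * delta * s
    return new_weights

# Toy example: 4 state features, 3 discrete actions (assumed sizes).
W = np.zeros((4, 3))
s = np.array([1.0, 0.0, 0.5, 0.0])
s_next = np.array([0.0, 1.0, 0.0, 0.5])

delta = td_error(W, s, a=1, r=1.0, s_next=s_next, a_next=2)
W_new = critic_update(W, s, a=1, delta=delta)
```

In this sketch the same `delta` could also serve as the advantage signal for the actor, in the same role the state-value TD error plays in a standard actor-critic setup.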