MorvanZhou / Reinforcement-learning-with-tensorflow

Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学

https://mofanpy.com/tutorials/machine-learning/reinforcement-learning/

请问actor-critic中的critic预测价值，可以设计为预测action value分布吗？

Hins opened this issue 4 years ago · comments

潘晓彤 commented 4 years ago

然后取相应action的value计算v和v'