muupan / dqn-in-the-caffe

An implementation of Deep Q-Network using Caffe

Use one network or two networks?

Alchemist77 opened this issue

Hi, I am modifying this program for my project.
From reading the code, it looks like muupan uses only one network to solve the problem.
With a single network, however, the Q-learning targets are computed from the same weights that are being updated, so you run into the non-stationary targets issue.
I also ran the code for one week (without a GPU) and did not get consistently high scores, although it sometimes scores over 50.
Has anyone implemented DQN with two networks (an online network plus a separate target network) in this code base and tested it?
Please let me know.
Thanks.
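
To make the two-network idea in the question concrete, here is a minimal Python sketch of the usual scheme: a frozen target network supplies the bootstrap value, and its weights are refreshed from the online network every few steps. This is not muupan's Caffe code. `LinearQ`, `td_target`, `train_step`, `sync`, and `SYNC_EVERY` are hypothetical names, and the tiny linear Q-function is only a stand-in for the real network.

```python
import copy


class LinearQ:
    """Tiny linear Q-function, Q(s, a) = w[a] . s (stand-in for the real net)."""

    def __init__(self, n_actions, n_features):
        self.w = [[0.0] * n_features for _ in range(n_actions)]

    def q_values(self, state):
        return [sum(wi * si for wi, si in zip(row, state)) for row in self.w]


def td_target(target_net, reward, next_state, done, gamma=0.99):
    """Bootstrap from the frozen target network, not from the online one."""
    if done:
        return reward
    return reward + gamma * max(target_net.q_values(next_state))


def train_step(online, target_net, transition, lr=0.1, gamma=0.99):
    """One gradient step on the squared TD error; only online weights change."""
    state, action, reward, next_state, done = transition
    y = td_target(target_net, reward, next_state, done, gamma)
    error = y - online.q_values(state)[action]
    online.w[action] = [w + lr * error * s
                        for w, s in zip(online.w[action], state)]
    return error


def sync(online, target_net):
    """Copy the online weights into the target network."""
    target_net.w = copy.deepcopy(online.w)


online = LinearQ(n_actions=2, n_features=2)
target = LinearQ(n_actions=2, n_features=2)
SYNC_EVERY = 4  # assumed value, chosen only for illustration

# One made-up transition: (state, action, reward, next_state, done).
transition = ([1.0, 0.0], 0, 1.0, [1.0, 0.0], False)
for step in range(1, 9):
    train_step(online, target, transition)
    if step % SYNC_EVERY == 0:
        sync(online, target)
```

Between syncs the target values stay fixed while the online weights move, which is what removes the moving-target effect. A single-network setup corresponds to passing the same object as both `online` and `target_net`.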