seungeunrho / minimalRL

Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

DQN why train iterate for 10 times

FeynmanDNA opened this issue · comments

commented

https://github.com/seungeunrho/minimalRL/blob/master/dqn.py

def train(q, q_target, memory, optimizer):

I am wondering why the train method is internally looping 10 times? Shouldn't the policy network train per action?