seungeunrho / minimalRL

Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

DQN why train iterate for 10 times

FeynmanDNA opened this issue 3 years ago · comments

KYY commented 3 years ago

https://github.com/seungeunrho/minimalRL/blob/master/dqn.py

minimalRL/dqn.py

Line 63 in 7597b9a

def train(q, q_target, memory, optimizer):

I am wondering why the train method is internally looping 10 times? Shouldn't the policy network train per action?