How many environment steps per update?

Question

mctigger opened this issue 2 years ago

Hi danijar,
how many environment steps do you run per update?
The paper says 4 (so the agent is updated after every step it takes, because of the action repeat?), but the config here says train_every: 16. Which number is correct?

Best,
Tim

Danijar Hafner · Answer 1 · Wed Feb 02 2022 06:58:13 GMT+0800 (China Standard Time)

Hi Tim, those are both correct. It's 16 env steps or 4 agent steps. The action repeat introduces the factor of 4 between the two.

Tim Joseph · Answer 2 · Wed Feb 02 2022 18:25:54 GMT+0800 (China Standard Time)

Thank you for answering so quickly!

The hyperparameters section of the paper says Environment steps per update: 4. So should the paper actually say Agent steps per update: 4, or Environment steps per update: 16? I just want to make sure I understand you correctly.

Danijar Hafner · Answer 3 · Sat Feb 05 2022 01:31:03 GMT+0800 (China Standard Time)

Ah, yes. It's every 16 frames or 4 actions. I'll update the paper to make it clearer.
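
For readers who want the arithmetic from this thread spelled out, here is a minimal sketch. It assumes, as the answers above state, that train_every is counted in environment steps (frames) and that the action repeat is 4. The function and constant names are hypothetical and are not taken from the repository.

```python
# Illustrative only: names below are made up for this example,
# not identifiers from the actual codebase.

TRAIN_EVERY_ENV_STEPS = 16  # train_every from the config, in env steps (frames)
ACTION_REPEAT = 4           # each agent action is repeated for 4 env steps


def agent_steps_per_update(train_every_env_steps: int, action_repeat: int) -> int:
    """Convert an update interval in env steps to one in agent steps (actions)."""
    if train_every_env_steps % action_repeat != 0:
        raise ValueError("update interval must be a multiple of the action repeat")
    return train_every_env_steps // action_repeat


def env_steps_per_update(agent_steps: int, action_repeat: int) -> int:
    """Convert an update interval in agent steps back to env steps."""
    return agent_steps * action_repeat


if __name__ == "__main__":
    actions = agent_steps_per_update(TRAIN_EVERY_ENV_STEPS, ACTION_REPEAT)
    frames = env_steps_per_update(actions, ACTION_REPEAT)
    print(f"one update every {actions} agent steps = {frames} env steps")
```

With these values, the two numbers in the question describe the same schedule: one update every 4 actions, which is one update every 16 frames.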