How many step does it take to converge?

Question

How many step does it take to converge?

YuZhang10 opened this issue 4 years ago · comments

Yu Zhang commented 4 years ago

Hi, I have trained the policy for over 50M steps, but it seems not converge yet.

I wonder how many steps in total it need to finish trianing?

(BTW, I use 16 way parallel training)

Thanks.

erwincoumans · Answer 1 · Tue Jun 02 2020 12:29:05 GMT+0800 (China Standard Time)

Somewhere around 100 million samples and for some of the more complex motions even more (150-200 million). Note that quality of the policies is not as good as our internal version we used for the paper, it may require a bit more hyper-parameter tuning.

Yu Zhang · Answer 2 · Tue Jun 02 2020 13:39:26 GMT+0800 (China Standard Time)

Thanks for your reply.

I've noticed that the trained lakaigo can already walk smoothly while the vf loss constantly increases. Does it make any sense? or it means I did something wrong?

Akhtyamov Timur · Answer 3 · Mon Jun 15 2020 09:24:54 GMT+0800 (China Standard Time)

Dear @erwincoumans,

Can you please clarify the parameters of your computational setup that you used to training in original paper? And how much time did training take for one motion skill?

erwincoumans · Answer 4 · Mon Jul 13 2020 00:15:23 GMT+0800 (China Standard Time)

Computation was in the order of day(s) typically. The computational setup are a cluster of cloud machines running Linux, similar to Google Cloud.