Issue with num_process while running trpo_mujoco script

Question

balasurajp opened this issue 4 years ago

The terminal freezes without any error message when running the trpo_mujoco script. There appears to be a bug related to multiprocessing in the memory collector class.

Ritchie · Answer 1 · Thu Jan 21 2021 12:35:07 GMT+0800 (China Standard Time)

@surajpedasingu Thanks for pointing out the bug. The memory collector was designed to be compatible with multiprocessing in order to accelerate the sampling step. For now, you can set num_process=1 to verify the algorithm itself. I will try to fix the bug as soon as possible.

Ritchie · Answer 2 · Thu Jan 21 2021 14:35:54 GMT+0800 (China Standard Time)

@surajpedasingu The bug was caused by the multiprocessing package being incompatible with torch. It is fixed now, so you can check it.

Bala Suraj P · Answer 3 · Sat Jan 23 2021 19:00:12 GMT+0800 (China Standard Time)

@RITCHIEHuang Yes, it was working with num_process=1. I don't understand how you are trying to accelerate sampling when the underlying policy and env are single objects and the same objects are passed to multiple processes. In this case, sampling won't be accelerated, because the processes will execute sequentially: the underlying objects will be locked while one process is using them. Correct me if I'm wrong.

Ritchie · Answer 4 · Sun Jan 24 2021 03:56:06 GMT+0800 (China Standard Time)

@surajpedasingu Yeah, here the collector uses multiple processes to collect samples without updating the policy parameters. The policy and env themselves are not changed, so we can still spend less time collecting the same number of samples.
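
To make the point above concrete, here is a minimal sketch of parallel sample collection using only Python's standard multiprocessing module. It is not the repository's memory collector: ToyEnv, policy, collect_samples, split_batch and collect_parallel are all hypothetical names made up for illustration, and the env is a toy random walk rather than MuJoCo. The idea it illustrates is that each worker process operates on its own copy of the env and policy (separate processes do not share Python objects), so no lock forces the workers to run one after another, and because the policy is not updated during sampling, the copies stay consistent.

```python
import multiprocessing as mp
import random


class ToyEnv:
    """Hypothetical stand-in for a MuJoCo env: a noisy 1-D random walk."""

    def __init__(self, seed):
        self.rng = random.Random(seed)
        self.state = 0.0

    def reset(self):
        self.state = 0.0
        return self.state

    def step(self, action):
        self.state += action + self.rng.uniform(-0.1, 0.1)
        reward = -abs(self.state)
        done = abs(self.state) > 5.0
        return self.state, reward, done


def policy(state):
    """Hypothetical fixed policy: push the state back toward zero."""
    return -0.5 * state


def collect_samples(worker_id, n_samples):
    """Runs inside one worker; it builds its own env, so nothing is shared."""
    env = ToyEnv(seed=worker_id)
    state = env.reset()
    batch = []
    for _ in range(n_samples):
        action = policy(state)
        next_state, reward, done = env.step(action)
        batch.append((state, action, reward, next_state, done))
        state = env.reset() if done else next_state
    return batch


def split_batch(total, num_process):
    """Divide `total` samples as evenly as possible across the workers."""
    base, extra = divmod(total, num_process)
    return [base + (1 if i < extra else 0) for i in range(num_process)]


def collect_parallel(total, num_process):
    """Fan the sampling out to `num_process` workers and merge the results."""
    sizes = split_batch(total, num_process)
    with mp.Pool(num_process) as pool:
        parts = pool.starmap(collect_samples, list(enumerate(sizes)))
    return [transition for part in parts for transition in part]


if __name__ == "__main__":
    memory = collect_parallel(total=1000, num_process=4)
    print(len(memory))
```

With num_process=1 the same code degenerates to plain sequential sampling, which matches the workaround suggested earlier in the thread. Whether the speedup is worth it in practice depends on how expensive each env step is compared with the cost of starting the workers and sending the collected batches back to the parent process.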