Batch size in A2C Cartpole

Question

Batch size in A2C Cartpole

akileshbadrinaaraayanan opened this issue 7 years ago · comments

Akilesh Badrinaaraayanan commented 7 years ago

Are there no variants of A2C with mini-batch update instead of training every time step? If yes, could you tell the pros and cons of such an approach?

Thanks,
Akilesh

Woongwon Lee · Answer 1 · Wed Jun 28 2017 00:12:12 GMT+0800 (China Standard Time)

If you want, you can change it from one-step bootstrap to multi-step bootstrap. As I know, one-step bootstrap has lower variance and higher bias than multi-step bootstrap.