Batch size in A2C Cartpole
akileshbadrinaaraayanan opened this issue
Akilesh Badrinaaraayanan commented
Are there variants of A2C that use mini-batch updates instead of training at every time step? If so, could you explain the pros and cons of such an approach?
Thanks,
Akilesh
Woongwon Lee commented
If you want, you can change the update from a one-step bootstrap to a multi-step (n-step) bootstrap, which collects several time steps before each update. As far as I know, the one-step bootstrap has lower variance and higher bias than the multi-step bootstrap.
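To make the difference concrete, here is a minimal sketch of how the return targets for a multi-step update could be computed. It is not the code from this repository: the function name `n_step_returns` and its parameters are hypothetical, and it assumes the critic's estimate for the state after the last collected step is passed in as `bootstrap_value` (use 0.0 if the episode ended there). With a single reward it reduces to the one-step target `r + gamma * V(s')`.

```python
def n_step_returns(rewards, bootstrap_value, gamma=0.99):
    """Return targets for a rollout of len(rewards) steps.

    rewards: rewards collected over the rollout, oldest first.
    bootstrap_value: critic's value estimate for the state reached after
        the last reward (assumed input; pass 0.0 for a terminal state).
    """
    returns = []
    running = bootstrap_value
    # Walk backwards so each step reuses the discounted return of the next one.
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


# Three-step rollout that ended the episode (no bootstrap).
print(n_step_returns([1.0, 1.0, 1.0], 0.0, gamma=0.5))  # [1.75, 1.5, 1.0]

# One-step case: r + gamma * V(s') = 1.0 + 0.5 * 2.0
print(n_step_returns([1.0], 2.0, gamma=0.5))  # [2.0]
```

The advantage for each step would then be the return minus the critic's value for that state, and the whole rollout can be passed to the actor and critic as one batch. Longer rollouts rely less on the critic's estimate (less bias) but sum more sampled rewards (more variance).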