PPO
jaelim opened this issue · comments
Jae Lim commented
HI, Thanks for a great repo. I was going through your code, and I was wondering if you have tried to implement PPO instead of REINFORCE algo? If not yet, any plan on upgrading?
Somshubra Majumdar commented
No plans to upgrade this. This repo was just an experiment to see if I could implement a minimal version that worked.