uvipen / Super-mario-bros-PPO-pytorch

Proximal Policy Optimization (PPO) algorithm for Super Mario Bros

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

training from scratch?

samiulextreem opened this issue · comments

i want to train the model for world 1-1 from scratch. How many update in the network need to get the result which is shown here?

well it seems like the model completes the stage around 900 episode update