Will dropout break out the final loss of ppo algorithm?

Question

ppaanngggg opened this issue 7 years ago · comments

If I add dropout layer to model, will it be a bad idea?

Any experiments there?

ppaanngggg · Answer 1 · Wed Sep 13 2017 11:50:11 GMT+0800 (China Standard Time)

I use eval model when explore environment, and use train model for policy, old policy and value model when training