muupan / dqn-in-the-caffe

An implementation of Deep Q-Network using Caffe


Parameters used to learn Pong?

mhauskn opened this issue

What parameters were used to learn the Pong player shown in the video (https://www.youtube.com/watch?v=p88R2_3yWPA)? Specifically, what gamma, replay memory size, and number of iterations are needed to train the DQN? I attempted to retrain the agent using the default parameters, but it shows no progress after 2 million iterations.

Thanks in advance, and very cool repo!

Thanks for forking my repo! The default parameters in dqn_solver.prototxt were outdated, so I've just updated the file; it now contains the parameters used in the Pong demo. The size of the replay memory was 500,000.
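For anyone reproducing this, here is a minimal sketch of what a Caffe solver config in the spirit of dqn_solver.prototxt might look like. The field names are standard Caffe SolverParameter options, but every concrete value below is an illustrative assumption, not the contents of the updated file; the replay memory size (500,000 per this thread) is set in the training code, not in the solver prototxt.

```
# Hypothetical sketch of a Caffe solver config for DQN training;
# values are placeholders, not the exact updated parameters.
net: "dqn.prototxt"      # Q-network definition (assumed filename)
base_lr: 0.2             # learning rate (assumed value)
lr_policy: "fixed"       # keep the learning rate constant
momentum: 0.95           # assumed value
max_iter: 10000000       # total training iterations (assumed)
snapshot: 100000         # save a model snapshot every 100k iterations
snapshot_prefix: "dqn"
solver_mode: GPU
# Caution: a solver-level `gamma` field in Caffe controls
# learning-rate decay and is unrelated to the RL discount factor;
# the discount factor and the 500,000-entry replay memory are
# configured in the training program itself.
```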

Thanks, it seems to be working much better now :)