PPO Continuous Action Space

Question

raunakdoesdev opened this issue 5 years ago · comments

What changes would be required to employ your ppo algorithm in a continuous action space like Pendulum-v0?

seolhokim · Answer 1 · Sun Nov 10 2019 19:08:49 GMT+0800 (China Standard Time)

It was too late, but I made similar code-style continuous-ppo version and sent pull request. It doesn't perform well, but check it out.

Seungeun Rho · Answer 2 · Thu Nov 12 2020 17:11:15 GMT+0800 (China Standard Time)

Added! Thanx :)