deepolicy's repositories
ppo_parameterized
Proximal Policy Optimization for parameterized action space.
000
gym_goal_platform
An simple environment wrapper for gym_goal and gym_platform.
isc
http://isc.net.cn/ i 收藏:个人网址收藏,为第三方网站提供收藏接口
000
Language:C000
plot.baselines-series
A series of baselines ppo running results on different environments.
ppo_pendulum.old
PPO for pendulum of gym.