Experiment with ICM and PPO bunch for environment with sparse reward signal.
The experiment tests the contribution of intrinsic reward to the agent's ability to solve the sparse-reward environment from Unity ML-Agents Toolkit.
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM) on Pyramid env, Unity ML
Experiment with ICM and PPO bunch for environment with sparse reward signal.
The experiment tests the contribution of intrinsic reward to the agent's ability to solve the sparse-reward environment from Unity ML-Agents Toolkit.
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM) on Pyramid env, Unity ML
MIT License