markub3327 / Dueling-DQN-with-AutoEncoder

Signal novelty detection as an intrinsic reward for robotics

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Signal novelty detection as an intrinsic reward for robotics

Paper: Signal Novelty Detection as an Intrinsic Reward for Robotics

AutoEncoder Architecture

AutoEncoder is one of the deep neural network methods used to detect anomalies in the signals received from the sensors on the robot. The principle is to train on the previously observed states from the experience replay buffer (RB), while the prediction is done over the observed state. In this way, early detection of new previously unobserved states is possible, which will help the agent to explore the environment and try to access previously unvisited states in the agent's state space.

AutoEncoder

Results

The results were obtained by the Dueling Deep Q Network (DQN) algorithm, which represented the agent. All tested tasks were learned without extrinsic reward from the environment using only this novelty intrinsic reward method.

  • Acrobot-v1 AutoEncoder

  • CartPole-v1 AutoEncoder

  • LunarLander-v2 AutoEncoder

  • MountainCar-v0 AutoEncoder

About

Signal novelty detection as an intrinsic reward for robotics

License:MIT License


Languages

Language:Jupyter Notebook 100.0%