anomaly-detection autoencoder deep-neural-networks dueling-dqn keras reinforcement-learning robotics tensorflow

Signal novelty detection as an intrinsic reward for robotics

Paper: Signal Novelty Detection as an Intrinsic Reward for Robotics

AutoEncoder Architecture

An autoencoder is a deep neural network that can be used to detect anomalies in the signals received from the robot's sensors. It is trained on previously observed states sampled from the experience replay buffer (RB), and its prediction is then evaluated on the currently observed state. This makes it possible to detect new, previously unobserved states early, which helps the agent explore the environment and reach unvisited regions of its state space.
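The idea above can be illustrated with a minimal, dependency-free sketch. It is not the repository's Keras/TensorFlow model: the class `TinyAutoEncoder`, its layer sizes, the learning rate, and the toy 4-dimensional states are all hypothetical. It also assumes the novelty score is the mean squared reconstruction error, which is the usual way an autoencoder is used for anomaly detection; the exact reward formula is not stated here.

```python
import math
import random


class TinyAutoEncoder:
    """Hypothetical one-hidden-layer autoencoder (tanh encoder, linear decoder)."""

    def __init__(self, n_inputs, n_hidden, lr=0.05, seed=0):
        rng = random.Random(seed)
        self.lr = lr
        self.w_enc = [[rng.uniform(-0.5, 0.5) for _ in range(n_inputs)]
                      for _ in range(n_hidden)]
        self.w_dec = [[rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]
                      for _ in range(n_inputs)]

    def _forward(self, state):
        hidden = [math.tanh(sum(w * s for w, s in zip(row, state)))
                  for row in self.w_enc]
        recon = [sum(w * h for w, h in zip(row, hidden)) for row in self.w_dec]
        return hidden, recon

    def novelty(self, state):
        """Mean squared reconstruction error (assumed intrinsic reward)."""
        _, recon = self._forward(state)
        return sum((r - s) ** 2 for r, s in zip(recon, state)) / len(state)

    def train_step(self, state):
        """One SGD step on the reconstruction loss for a single state."""
        hidden, recon = self._forward(state)
        n = len(state)
        # Gradient of the mean squared error w.r.t. each reconstructed value.
        d_recon = [2.0 * (r - s) / n for r, s in zip(recon, state)]
        # Backpropagate through the linear decoder and the tanh activation.
        d_hidden = [
            sum(d_recon[i] * self.w_dec[i][j] for i in range(n))
            * (1.0 - hidden[j] ** 2)
            for j in range(len(hidden))
        ]
        for i in range(n):
            for j in range(len(hidden)):
                self.w_dec[i][j] -= self.lr * d_recon[i] * hidden[j]
        for j in range(len(hidden)):
            for k in range(n):
                self.w_enc[j][k] -= self.lr * d_hidden[j] * state[k]


# Toy replay buffer: previously observed states all lie along one direction.
rng = random.Random(1)
replay_buffer = [[t, -t, 0.5 * t, 0.0]
                 for t in (rng.uniform(-1.0, 1.0) for _ in range(50))]

autoencoder = TinyAutoEncoder(n_inputs=4, n_hidden=2)
for _ in range(200):                      # train only on states from the buffer
    for state in replay_buffer:
        autoencoder.train_step(state)

familiar_reward = autoencoder.novelty([0.3, -0.3, 0.15, 0.0])  # resembles the buffer
novel_reward = autoencoder.novelty([1.0, 1.0, -1.0, 1.0])      # unlike anything seen
print(familiar_reward, novel_reward)
```

A state resembling the buffer contents reconstructs well and earns a small reward, while a state unlike anything seen so far reconstructs poorly and earns a larger one, which is what pushes the agent toward unvisited states.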

Results

The results were obtained with a Dueling Deep Q-Network (Dueling DQN) agent. All tested tasks were learned without any extrinsic reward from the environment, using only the novelty-based intrinsic reward.

Acrobot-v1
CartPole-v1
LunarLander-v2
MountainCar-v0
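For readers unfamiliar with the agent used for these results, the sketch below shows the standard dueling aggregation, Q(s, a) = V(s) + A(s, a) - mean(A), together with a one-step Q-learning target driven only by an intrinsic reward. The function names, the discount factor, and the sample numbers are hypothetical, and the repository's networks and hyperparameters may differ.

```python
def dueling_q_values(state_value, advantages):
    """Combine the value and advantage streams into per-action Q-values."""
    mean_advantage = sum(advantages) / len(advantages)
    return [state_value + a - mean_advantage for a in advantages]


def greedy_action(q_values):
    """Index of the action with the highest Q-value."""
    return max(range(len(q_values)), key=q_values.__getitem__)


def td_target(intrinsic_reward, next_q_values, done, gamma=0.99):
    """One-step target that uses the intrinsic reward in place of an extrinsic one."""
    if done:
        return intrinsic_reward
    return intrinsic_reward + gamma * max(next_q_values)


q_values = dueling_q_values(state_value=1.0, advantages=[0.5, -0.5, 0.0])
action = greedy_action(q_values)
target = td_target(intrinsic_reward=0.2, next_q_values=q_values, done=False, gamma=0.9)
print(q_values, action, target)
```

Subtracting the mean advantage keeps the value and advantage streams identifiable; the environment's own reward never enters the target in this setting.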

About

MIT License

Languages

Language: Jupyter Notebook 100.0%