shriramsb / batchRL-SI

Faster convergence in batch-RL with Synaptic Intelligence

batchRL-SI

This project applies Synaptic Intelligence (https://arxiv.org/abs/1703.04200) to batch-RL. A neural network approximates the Q-value function. The network is trained using batch-RL with experience replay and is regularized with Synaptic Intelligence.
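The Synaptic Intelligence regularizer can be sketched as follows. This is a minimal, framework-free illustration of the update rules from the linked paper, not the code in this repository. The class name `SynapticIntelligence`, its method names, the default values of `c` and `xi`, and the choice to consolidate once per batch are all assumptions made for the example. Parameters are kept in a flat Python list so the block runs without any dependencies.

```python
class SynapticIntelligence:
    """Per-parameter importance tracking (hypothetical helper, not the repo's API)."""

    def __init__(self, params, c=0.1, xi=1e-3):
        n = len(params)
        self.c = c                  # regularization strength (assumed default)
        self.xi = xi                # damping term, avoids division by zero
        self.anchor = list(params)  # parameters at the last consolidation
        self.start = list(params)   # parameters at the start of the current batch
        self.path = [0.0] * n       # running path integral per parameter
        self.omega = [0.0] * n      # consolidated importance per parameter

    def accumulate(self, grads, old_params, new_params):
        """Call after each optimizer step with the gradient of the task loss."""
        for k, g in enumerate(grads):
            # Contribution of parameter k to the drop in loss during this step.
            self.path[k] += -g * (new_params[k] - old_params[k])

    def consolidate(self, params):
        """Call when a batch is finished (assumed consolidation point)."""
        for k, p in enumerate(params):
            delta = p - self.start[k]
            # Normalize by the squared total displacement over the batch.
            self.omega[k] += self.path[k] / (delta ** 2 + self.xi)
        self.anchor = list(params)
        self.start = list(params)
        self.path = [0.0] * len(params)

    def penalty(self, params):
        """Quadratic surrogate loss added to the Q-learning loss."""
        return self.c * sum(
            w * (p - a) ** 2 for w, p, a in zip(self.omega, params, self.anchor)
        )

    def penalty_grad(self, params):
        """Gradient of the penalty with respect to each parameter."""
        return [
            2.0 * self.c * w * (p - a)
            for w, p, a in zip(self.omega, params, self.anchor)
        ]
```

In a training loop, `accumulate` would be called after every gradient step on a batch, `consolidate` once the batch is done, and `penalty` (or `penalty_grad`) would be added to the loss (or gradient) while training on the next batch. Parameters that contributed most to reducing the loss on earlier batches are then pulled back most strongly toward their previous values.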

Synaptic Intelligence reduces the amount of experience required per batch. It also improves the rate of convergence of batch-RL.

Languages

Jupyter Notebook 63.7%, Python 36.3%