katalinic / l2rl

TF Learning to Reinforcement Learn Bandit tasks

l2rl

TensorFlow implementation of correlated two-armed bandit tasks reported in Learning to Reinforcement Learn (https://arxiv.org/abs/1611.05763) by Wang et al.

About

TF Learning to Reinforcement Learn Bandit tasks

Languages

Language:Python 100.0%