l2rl
TensorFlow implementation of correlated two-armed bandit tasks reported in Learning to Reinforcement Learn (https://arxiv.org/abs/1611.05763) by Wang et al.
TF Learning to Reinforcement Learn Bandit tasks
TensorFlow implementation of correlated two-armed bandit tasks reported in Learning to Reinforcement Learn (https://arxiv.org/abs/1611.05763) by Wang et al.
TF Learning to Reinforcement Learn Bandit tasks