multi-armed-bandit - Reinforcement Learning | Code to produce the plot in the multi-armed bandit problem