Multi-armed bandit (MAB) testbeds based on the book Reinforcement Learning: An Introduction by Sutton and Barto (2018), created using OpenAI Gym.
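
For orientation, here is a minimal sketch of the kind of k-armed Gaussian testbed the book describes: true action values are drawn from a standard normal distribution, and each reward is drawn from a unit-variance normal centred on the chosen arm's true value. This is not this repository's API. The names `GaussianBanditTestbed` and `run_epsilon_greedy` are hypothetical, and the sketch uses only the Python standard library rather than OpenAI Gym.

```python
import random


class GaussianBanditTestbed:
    """Hypothetical k-armed testbed: q*(a) ~ N(0, 1), reward ~ N(q*(a), 1)."""

    def __init__(self, k=10, seed=None):
        self.k = k
        self.rng = random.Random(seed)
        # True action values, fixed for the lifetime of this testbed.
        self.q_true = [self.rng.gauss(0.0, 1.0) for _ in range(k)]

    def step(self, action):
        # Sample a noisy reward around the true value of the chosen arm.
        return self.rng.gauss(self.q_true[action], 1.0)


def run_epsilon_greedy(bandit, steps=1000, epsilon=0.1, seed=None):
    """Sample-average epsilon-greedy agent (illustrative helper)."""
    rng = random.Random(seed)
    q_est = [0.0] * bandit.k   # estimated value of each arm
    counts = [0] * bandit.k    # number of pulls per arm
    total = 0.0
    for _ in range(steps):
        if rng.random() < epsilon:
            action = rng.randrange(bandit.k)  # explore
        else:
            # Exploit; ties go to the lowest index here for simplicity.
            action = max(range(bandit.k), key=lambda a: q_est[a])
        reward = bandit.step(action)
        counts[action] += 1
        # Incremental sample-average update.
        q_est[action] += (reward - q_est[action]) / counts[action]
        total += reward
    return q_est, counts, total / steps


if __name__ == "__main__":
    bandit = GaussianBanditTestbed(k=10, seed=0)
    q_est, counts, avg_reward = run_epsilon_greedy(bandit, steps=1000, seed=1)
    print("average reward:", avg_reward)
    print("pulls per arm:", counts)
```

Seeding both the testbed and the agent makes a run reproducible, which helps when comparing exploration settings against each other.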
MIT License