Following the examples from Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto.
- Add non-stationary k-armed bandits
- Fix Gibbs solver
Using a range of methods to approach the k-armed bandit problem, inspired by Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto.
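As an illustration of one such method, here is a minimal sketch of an epsilon-greedy agent on a stationary k-armed bandit, using the incremental sample-average update described in the book. The function name `run_epsilon_greedy` and its parameters are hypothetical and are not taken from this repository's code.

```python
import random


def run_epsilon_greedy(k=10, steps=1000, epsilon=0.1, seed=0):
    """Run one epsilon-greedy agent on a stationary k-armed bandit (illustrative sketch)."""
    rng = random.Random(seed)
    # Assumed setup: true action values drawn once from a standard normal.
    true_values = [rng.gauss(0.0, 1.0) for _ in range(k)]
    estimates = [0.0] * k  # Q(a): current value estimate for each arm
    counts = [0] * k       # N(a): number of times each arm was pulled
    total_reward = 0.0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            action = rng.randrange(k)
        else:
            # Exploit: pick the arm with the highest estimate (ties go to the lowest index).
            action = max(range(k), key=lambda a: estimates[a])

        # Reward is the arm's true value plus unit-variance Gaussian noise.
        reward = rng.gauss(true_values[action], 1.0)
        counts[action] += 1
        # Incremental sample-average update: Q <- Q + (R - Q) / N
        estimates[action] += (reward - estimates[action]) / counts[action]
        total_reward += reward

    return true_values, estimates, counts, total_reward


if __name__ == "__main__":
    true_values, estimates, counts, total_reward = run_epsilon_greedy()
    print("average reward:", total_reward / sum(counts))
```

For a non-stationary bandit, the book suggests replacing the `1 / N` step size with a constant step size so that recent rewards carry more weight.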
MIT License