There are 6 repositories under multi-armed-bandits topic.
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
An easy-to-use reinforcement learning library for research and education.
Multi Armed Bandits implementation using the Yahoo! Front Page Today Module User Click Log Dataset
Contextual Bandits in R - simulation and evaluation of Multi-Armed Bandit Policies
A Pythonic microframework for multi-armed bandit problems
Python implementation of UCB, EXP3 and Epsilon greedy algorithms
This project is created for the simulations of the paper: [Wang2021] Wenbo Wang, Amir Leshem, Dusit Niyato and Zhu Han, "Decentralized Learning for Channel Allocation inIoT Networks over Unlicensed Bandwidth as aContextual Multi-player Multi-armed Bandit Game", to appear in IEEE Transactions on Wireless Communications, 2021.
Learning Multi-Armed Bandits by Examples. Currently covering MAB, UCB, Boltzmann Exploration, Thompson Sampling, Contextual MAB.
Online Ranking with Multi-Armed-Bandits
Study of the paper 'Neural Thompson Sampling' published in October 2020
Curated materials for different machine learning related summer schools
Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay
Implementation of provably Rawlsian fair ML algorithms for contextual bandits.
A beer recommendation system using multi-armed bandit approach to solve cold start problems
Adaptive consistency replication with reinforcement learning for large scale globally distributed storage.
[NeurIPS 2022] Supervising the Multi-Fidelity Race of Hyperparameter Configurations
Implementation of the X-armed Bandits algorithm, as detailed in the paper, "X-armed Bandits", Bubeck et al., 2011.
Implementations of the bandit algorithms with unordered and ordered slates that are described in the paper "Non-Stochastic Bandit Slate Problems", by Kale et al. 2010.
Focuses on Reinforcement Learning related concepts, use cases, and learning approaches
An improved version of Turbo algorithm for the Black-box optimization competition organized by NeurIPS 2020
SPGD: Search Party Gradient Descent algorithm, a Simple Gradient-Based Parallel Algorithm for Bound-Constrained Optimization. Link: https://www.mdpi.com/2227-7390/10/5/800
MATLAB Implementation of the CGPRANK algorithm
Multi-Armed Bandit method of accurately estimating the largest parameter out of a set of candidates.
Implementations of methods in book <Reinforcement Learning: an introduction> by Sutton Barto, using Python.
C++ implementation of Multi-Armed Bandits (Gaussian and Bernoulli)
Reinforcement learning project using multi-armed bandits for recommendation system