Repositories under the multi-armed-bandit topic:
Python code, PDFs and resources for the series of posts on Reinforcement Learning which I published on my personal blog
🔬 Research framework for single-player and multi-player 🎰 Multi-Armed Bandit (MAB) algorithms, implementing all the state-of-the-art algorithms for the single-player (UCB, KL-UCB, Thompson Sampling...) and multi-player (MusicalChair, MEGA, rhoRand, MCTopM/RandTopM, etc.) settings. Available on PyPI: https://pypi.org/project/SMPyBandits/, with documentation online.
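The single-player index policies named in the entry above share a common structure; as orientation, here is a minimal UCB1 sketch on a Bernoulli bandit (arm probabilities and horizon are chosen purely for illustration and are not taken from SMPyBandits):

```python
import math
import random

def ucb1(true_probs, steps=10000, seed=0):
    """UCB1 on a Bernoulli bandit: pull the arm maximizing
    empirical mean + sqrt(2 ln t / pulls), after one initial pull per arm."""
    rng = random.Random(seed)
    n_arms = len(true_probs)
    counts = [0] * n_arms    # pulls per arm
    values = [0.0] * n_arms  # running mean reward per arm
    for t in range(1, steps + 1):
        if t <= n_arms:      # play each arm once to initialize
            arm = t - 1
        else:
            arm = max(
                range(n_arms),
                key=lambda a: values[a] + math.sqrt(2 * math.log(t) / counts[a]),
            )
        reward = 1.0 if rng.random() < true_probs[arm] else 0.0
        counts[arm] += 1
        values[arm] += (reward - values[arm]) / counts[arm]  # incremental mean
    return counts

counts = ucb1([0.2, 0.5, 0.8])
```

After 10,000 steps the confidence bonus of the suboptimal arms shrinks only logarithmically, so the best arm (index 2 here) accumulates the overwhelming majority of pulls.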
Papers about recommendation systems that I am interested in
Simple A/B testing library for Clojure
👤 Multi-Armed Bandit Algorithms Library (MAB) 👮
Demo project using multi-armed bandit algorithm
Contextual Bandits in R - simulation and evaluation of Multi-Armed Bandit Policies
Python application to setup and run streaming (contextual) bandit experiments.
Simple implementation of the CGP-UCB algorithm.
More about the exploration-exploitation tradeoff with harder bandits
Contextual Multi-Armed Bandit Platform for Scoring, Ranking & Decisions
Offline evaluation of multi-armed bandit algorithms
Software for the experiments reported in the RecSys 2019 paper "Multi-Armed Recommender System Bandit Ensembles"
COLEMAN (Combinatorial VOlatiLE Multi-Armed BANdit) and companion strategies for the HCS context
Author's implementation of the paper Correlated Age-of-Information Bandits.
A curated list on papers about combinatorial multi-armed bandit problems.
Multi-armed bandit algorithm with tensorflow and 11 policies
Easily Score & Rank Codable Objects with ML
Assignment for a Decision Making course at Aarhus University, using Multi-Armed Bandit algorithms, specifically ε-greedy, to optimize click-through rates in digital advertising by balancing exploration of new ads against exploitation of successful ones.
A short conceptual replication of "Prefrontal cortex as a meta-reinforcement learning system" in Jax.
Implementation of the X-armed Bandits algorithm, as detailed in the paper "X-Armed Bandits" (Bubeck et al., 2011).
Implementation of the greedy, ε-greedy, and Upper Confidence Bound (UCB) algorithms on the multi-armed bandit problem.
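The ε-greedy strategy mentioned in several entries above fits in a few lines; a minimal self-contained sketch (the Bernoulli arm probabilities and ε below are illustrative, not taken from any listed repository):

```python
import random

def epsilon_greedy(true_probs, epsilon=0.1, steps=10000, seed=0):
    """epsilon-greedy on a Bernoulli bandit: with probability epsilon pick a
    random arm (explore), otherwise the arm with the best empirical mean (exploit)."""
    rng = random.Random(seed)
    n_arms = len(true_probs)
    counts = [0] * n_arms    # pulls per arm
    values = [0.0] * n_arms  # running mean reward per arm
    for _ in range(steps):
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)                       # explore
        else:
            arm = max(range(n_arms), key=lambda a: values[a])  # exploit
        reward = 1.0 if rng.random() < true_probs[arm] else 0.0
        counts[arm] += 1
        values[arm] += (reward - values[arm]) / counts[arm]   # incremental mean
    return values, counts

values, counts = epsilon_greedy([0.2, 0.5, 0.8])
```

With ε = 0.1 roughly 10% of pulls remain exploratory forever, so the empirical mean of the best arm converges while some regret is paid indefinitely; that fixed exploration rate is the basic trade-off UCB-style policies avoid.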
Implementations of the bandit algorithms with unordered and ordered slates that are described in the paper "Non-Stochastic Bandit Slate Problems", by Kale et al. 2010.
In this GitHub project you will find part of the material I use to teach the introductory module on Reinforcement Learning.
Contextual Multi-Armed Bandit Item/Reward Tracker & Model Trainer
CUNYBot, an AI that plays complete games of Starcraft.
Multi-Armed Bandit method for accurately estimating the largest parameter out of a set of candidates.
Source code for blog post on Thompson Sampling
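For context on Thompson Sampling itself, a minimal Beta-Bernoulli sketch (the arm probabilities are illustrative; this is not code from the blog post above):

```python
import random

def thompson_sampling(true_probs, steps=10000, seed=0):
    """Beta-Bernoulli Thompson Sampling: keep a Beta posterior per arm,
    sample one value from each posterior, and pull the argmax."""
    rng = random.Random(seed)
    n_arms = len(true_probs)
    successes = [1] * n_arms  # Beta(1, 1) uniform prior
    failures = [1] * n_arms
    pulls = [0] * n_arms
    for _ in range(steps):
        samples = [rng.betavariate(successes[a], failures[a]) for a in range(n_arms)]
        arm = max(range(n_arms), key=lambda a: samples[a])
        if rng.random() < true_probs[arm]:
            successes[arm] += 1
        else:
            failures[arm] += 1
        pulls[arm] += 1
    return pulls

pulls = thompson_sampling([0.2, 0.5, 0.8])
```

Because exploration here comes from posterior uncertainty rather than a fixed ε, suboptimal arms are pulled less and less often as their posteriors concentrate below the best arm's.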
Multi-Player Bandits Revisited [L. Besson & É. Kaufmann]
Code template for multi-armed bandit algorithm
Experiments for paper "Online Learning with Costly Features in Non-stationary Environments"