bandit-algorithms

There are 7 repositories under bandit-algorithms topic.

SMPyBandits
SMPyBandits / SMPyBandits
🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorithms for single-player (UCB, KL-UCB, Thompson...) and multi-player (MusicalChair, MEGA, rhoRand, MCTop/RandTopM etc).. Available on PyPI: https://pypi.org/project/SMPyBandits/ and documentation on
bandit-algorithms cognitive-radio internet-of-things learning-theory multi-arm-bandits multi-armed-bandit open-source python research simulations
Language:Jupyter Notebook 387
c-bata / goptuna
A hyperparameter optimization framework, inspired by Optuna.
bandit-algorithms bayesian-optimization blackbox-optimization evolution-strategies
Language:Go 256
WilliamLwj / PyXAB
PyXAB - A Python Library for X-Armed Bandit and Online Blackbox Optimization Algorithms
algorithm automl bandit-algorithms blackbox-optimization continuous-armed-bandit data-science hyperparameter-optimization hyperparameter-tuning lipschitz-bandit machine-learning machine-learning-algorithms online-learning optimization optimization-algorithms reinforcement-learning x-armed-bandit
Language:Python 155
KKeishiro / Yahoo_recommendation
Yahoo! news article recommendation system by linUCB
bandit-algorithms contextual-bandit linucb recommendation-system
Language:Python 106
gdmarmerola / interactive-intro-rl
Big Data's open seminars: An Interactive Introduction to Reinforcement Learning
bandit-algorithms machine-learning reinforcement-learning
Language:Jupyter Notebook 62
sshkhr / Practical_RL
My solutions to Yandex Practical Reinforcement Learning course in PyTorch and Tensorflow
bandit-algorithms deep-reinforcement-learning evolutionary-algorithms markov-decision-processes monte-carlo-sampling policy-gradient pytorch reinforcement-learning td-learning tensorflow
Language:Jupyter Notebook 54
Alanthink / banditpylib
A lightweight python library for bandit algorithms
bandit-algorithms
Language:Python 29
niffler92 / Bandit
Bandit algorithms
bandit-algorithms contextual-bandit linucb multiarm-bandit simulation thompson-sampling
Language:Python 29
kulinshah98 / Multi-Armed-Bandit-Algorithms
Python implementation of UCB, EXP3 and Epsilon greedy algorithms
multi-armed-bandits bandit-algorithms stochastic-bandit-algorithms upper-confidence-bounds epsilon-greedy adversarial-bandit-algorithms exp3-algorithm
Language:Python 27
doerlbh / MiniVox
Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".
acml bandit-algorithms contextual-bandits interspeech interspeech2020 online-learning online-speaker-diarization paper self-supervised-learning speaker-diarization speaker-recognition
Language:Cuda 25
gdmarmerola / advanced-bandit-problems
More about the exploration-exploitation tradeoff with harder bandits
machine-learning bandit-algorithms multi-armed-bandit
Language:Jupyter Notebook 23
mmalekzadeh / privacy-preserving-bandits
Privacy-Preserving Bandits (MLSys'20)
bandit-algorithms differential-privacy machine-learning online-machine-learning reinforcement-learning contextual-bandits privacy-preserving-machine-learning privacy-preserving-bandits criteo-dataset federated-learning recommender-system recommendation bandit-learning bandit-algorithm differentially-private
Language:Jupyter Notebook 22
ZIYU-DEEP / Awesome-Papers-on-Combinatorial-Semi-Bandit-Problems
A curated list on papers about combinatorial multi-armed bandit problems.
bandit-algorithms combinatorial-bandit combinatorial-optimization multi-armed-bandit thompson-sampling
16
gokceuludogan / interactive-music-recommendation
Personalized and Interactive Music Recommendation with Bandit approach
music-recommendation bandit-algorithms bayes-ucb exploration-exploitation
Language:Jupyter Notebook 10
rssalessio / reading-list
This is a collection of interesting papers that I have read so far or want to read. Note that the list is not up-to-date. Topics: reinforcement learning, deep learning, mathematics, statistics, bandit algorithms, optimization.
bandit-algorithms deep-learning learning machine-learning optimization reading-list reinforcement-learning statistics
10
sparsh-ai / reco-bandit
Building recommender Systems using contextual bandit methods to address cold-start issue and online real-time learning
recommender-system bandit-algorithms contextual-bandits
Language:Jupyter Notebook 10
babaniyi / Deep-contextual-bandits
A benchmark to test decision-making algorithms for contextual-bandits. The library implements a variety of algorithms (many of them based on approximate Bayesian Neural Networks and Thompson sampling), and a number of real and syntethic data problems exhibiting a diverse set of properties.
bandit-algorithms bandits multiarmed-bandits
Language:Python 9
MaxenceGiraud / MachineLearningAlgos
Personal reimplementation of some ML algorithms for learning purposes
machine-learning machine-learning-algorithms deep-learning lda qda knn naive-bayes decision-tree random-forest clustering bandit-algorithms bayesian svm smo convolution gaussian-process-regression kernels reinforcement-learning regression dbscan
Language:Python 9
Naereen / Kullback-Leibler-divergences-and-kl-UCB-indexes
🐍 🔬 Fast Python implementation of various Kullback-Leibler divergences for 1D and 2D parametric distributions. Also provides optimized code for kl-UCB indexes
bandit-algorithms cython divergence kl-ucb kullback-leibler-divergence numba python-library
Language:HTML 9
albertopirillo / ola-project-2023
Pricing and advertising strategy for the e-commerce of an airline company, based on Multi-Armed Bandits (MABs) algorithms and Gaussian Processes. Simulations include non-stationary environments.
bandit-algorithms marketing-automation online-learning reinforcement-learning
Language:Python 7
doerlbh / BanditZoo
Python library of bandits and RL agents in different real-world environments
bandit bandit-algorithms bandits reinforcement-learning simulation
Language:Python 7
jayrcausal / Essential3CRL
Research about Causality-based Reinforcement Learning. This repository includes all needed fundamentals, summary of past work and some most recent development
causal-inference causality reinforcement-learning bandit-algorithms covariate-shift domain-adaptation
Language:Jupyter Notebook 7
GjjvdBurg / ThompsonSampling
Source code for blog post on Thompson Sampling
thompson-sampling bandit-algorithms multi-armed-bandit multiarmed-bandits
Language:JavaScript 6
ngutowski / algossim
This repository aims at learning most popular MAB and CMAB algorithms and watch how they run. It is interesting for those wishing to start learning these topics.
bandit-algorithms artificial-intelligence-algorithms recommendation-system contextual-bandits
Language:Python 6
niravnb / Multi-armed-bandit-algortihms
Implementation of famous Bandits algortihm: Explore then commit, UCB & Thompson sampling in python.
bandit-algorithms
Language:Jupyter Notebook 6
ZiruiYan / awesome-causal-bandit
An list of papers for causal bandit
bandit-algorithms causality causal-bandit
6
duongnhatthang / meta-bandit
Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms
bandit meta-learning python3 partial-monitoring sequential-decisions sequential-decision-making-problems multi-task bandit-algorithms meta-bandit
Language:Python 5
DURUII / Replica-AUCB
🐯REPLICA of "Auction-based combinatorial multi-armed bandit mechanisms with strategic arms"
aution bandit-algorithms bandits cmab mab multi-armed-bandit aucb
Language:Python 5
guptav96 / bandit-algorithms
A short implementation of bandit algorithms - ETC, UCB, MOSS and KL-UCB
reinforcement-learning bandit-algorithms exploration-exploitation
Language:Python 5
MIFA-Lab / LDPbandit2020
Implementation for NeurIPS 2020 paper "Locally Differentially Private (Contextual) Bandits Learning" (https://arxiv.org/abs/2006.00701)
bandit-algorithms differential-privacy numpy
Language:Python 5
nicoleorzan / Multi-armed-bandit-RL
C++ implementation of Multi-Armed Bandits (Gaussian and Bernoulli)
multi-armed-bandits reinforcement-learning softmax-policy bernoulli-bandit gaussian-bandit softmax ucb bandit-algorithms
Language:C++ 5
amirbalef / PS_MOMAB
Multi-Objective Multi-Armed Bandit
bandit-algorithms multi-armed-bandit multi-objective non-stationary ucb-algorithm
Language:Python 4
amirhosein-mesbah / Reinforcement_learning
This repository contains the implementation of a wide variety of Reinforcement Learning Projects in different applications of Bandit Algorithms, MDPs, Distributed RL and Deep RL. These projects include university projects and projects implemented due to interest in Reinforcement Learning.
bandit-algorithms deep-reinforcement-learning deeprl distributed-reinforcement-learning mdp multi-agent-reinforcement-learning network-routing off-policy on-policy reinforcement-learning gym stablebaselines3 q-learning
Language:Jupyter Notebook 4
anishacharya / Bandits-Online-Learning
Simple Implementations of Bandit Algorithms in python
bandit bandit-algorithms bandit-learning bandits multi-armed-bandits online-learning online-learning-algorithms online-learning-python
Language:Jupyter Notebook 4
jia-yi-chen / Bandit-and-Reinforcement-Learning
Python implementation for Reinforcement Learning algorithms -- Bandit algorithms, MDP, Dynamic Programming (value/policy iteration), Model-free Control (off-policy Monte Carlo, Q-learning)
reinforcement-learning bandit-algorithms q-learning monte-carlo dynamic-programming markov-decision-processes grid-world multi-armed-bandit
Language:Python 4
junjiedong / warfarin-bandit
Contextual Bandit algorithms for Warfarin Treatment
bandit bandit-algorithms warfarin
Language:Jupyter Notebook 4

bandit-algorithms

SMPyBandits / SMPyBandits

c-bata / goptuna

WilliamLwj / PyXAB

KKeishiro / Yahoo_recommendation

gdmarmerola / interactive-intro-rl

sshkhr / Practical_RL

Alanthink / banditpylib

niffler92 / Bandit

kulinshah98 / Multi-Armed-Bandit-Algorithms

doerlbh / MiniVox

gdmarmerola / advanced-bandit-problems

mmalekzadeh / privacy-preserving-bandits

ZIYU-DEEP / Awesome-Papers-on-Combinatorial-Semi-Bandit-Problems

gokceuludogan / interactive-music-recommendation

rssalessio / reading-list

sparsh-ai / reco-bandit

babaniyi / Deep-contextual-bandits

MaxenceGiraud / MachineLearningAlgos

Naereen / Kullback-Leibler-divergences-and-kl-UCB-indexes

albertopirillo / ola-project-2023

doerlbh / BanditZoo

jayrcausal / Essential3CRL

GjjvdBurg / ThompsonSampling

ngutowski / algossim

niravnb / Multi-armed-bandit-algortihms

ZiruiYan / awesome-causal-bandit

duongnhatthang / meta-bandit

DURUII / Replica-AUCB

guptav96 / bandit-algorithms

MIFA-Lab / LDPbandit2020

nicoleorzan / Multi-armed-bandit-RL

amirbalef / PS_MOMAB

amirhosein-mesbah / Reinforcement_learning

anishacharya / Bandits-Online-Learning

jia-yi-chen / Bandit-and-Reinforcement-Learning

junjiedong / warfarin-bandit