contextual-bandit

There are 3 repositories under contextual-bandit topic.

KKeishiro / Yahoo_recommendation
Yahoo! news article recommendation system by linUCB
linucb contextual-bandit bandit-algorithms recommendation-system
Language:Python 112
niffler92 / Bandit
Bandit algorithms
multiarm-bandit contextual-bandit bandit-algorithms thompson-sampling simulation linucb
Language:Python 30
Bilkent-CYBORG / ACC-UCB
Implementation of the Adaptive Contextual Combinatorial Upper Confidence Bound (ACC-UCB) algorithm for the contextual combinatorial volatile multi-armed bandit setting.
multiarmed-bandits reinforcement-learning contextual-bandit combinatorial-bandit
Language:Python 18
Digitalized-Energy-Systems / opfgym
A gymnasium-compatible framework to create reinforcement learning (RL) environment for solving the optimal power flow (OPF) problem. Contains five OPF benchmark environments for comparable research.
opf environment environment-design optimal-power-flow reinforcement-learning rl supervised-learning pandapower benchmark energy-system gymnasium power-system contextual-bandit reward-design reward-shaping action-shaping
Language:Python 17
doerlbh / BerlinUCB
Code for our AJCAI 2020 paper: "Online Semi-Supervised Learning in Contextual Bandits with Episodic Reward".
contextual-bandit reinforcement-learning semi-supervised-learning self-supervised-learning nonstationary-environments bandit contextual-bandits bandits paper
Language:MATLAB 4
bsteenwi / ContextualBandit
Contextual bandit implementation using Keras
contextual-bandit keras python
Language:Python 2
ej0cl6 / cbpr
Contextual Bandit with Piled Rewards
contextual-bandit piled-rewards
Language:Python 2
Hins-Hu / Bandit-Algorithms
An illustrative project including some multi-armed bandit algorithms and contextual bandit algorithms
bandit-algorithms multi-armed-bandit contextual-bandit
Language:Python 2
SC5 / bandits
bandits machine-learning reinforcement-learning contextual-bandit
Language:Python
victor-iyi / contextual-bandit
A Reinforcement Learning approach to a contextual bandit problem.
reinforcement-learning-algorithms contextual-bandit markov-decision-processes bandit-learning reinforcement-learning
Language:Jupyter Notebook