There are 3 repositories under the contextual-bandit topic.
Yahoo! news article recommendation system using LinUCB
Implementation of the Adaptive Contextual Combinatorial Upper Confidence Bound (ACC-UCB) algorithm for the contextual combinatorial volatile multi-armed bandit setting.
A Gymnasium-compatible framework for creating reinforcement learning (RL) environments that solve the optimal power flow (OPF) problem. Contains five OPF benchmark environments for comparable research.
Code for our AJCAI 2020 paper: "Online Semi-Supervised Learning in Contextual Bandits with Episodic Reward".
An illustrative project including several multi-armed bandit and contextual bandit algorithms.
A reinforcement learning approach to a contextual bandit problem.