multiarmed-bandits

There are 36 repositories under the multiarmed-bandits topic.

david-cortes / contextualbandits
Python implementations of contextual bandits algorithms
contextual-bandits exploration-exploitation multiarmed-bandits reinforcement-learning
Language:Python 739
alison-carrera / onn
Online Deep Learning: Learning Deep Neural Networks on the Fly / Non-linear Contextual Bandit Algorithm (ONN_THS)
neural-network neural-architecture-search pytorch-implementation machine-learning-library thompson-sampling thompson-algorithm mab multiarmed-bandits contextual-bandits reinforcement-learning-algorithms reinforcement-learning pytorch pytorch-implemention
Language:Python 178
stitchfix / mab
Library for multi-armed bandit selection strategies, including efficient deterministic implementations of Thompson sampling and epsilon-greedy.
multiarmed-bandits golang go experimentation data-science reinforcement-learning thompson-sampling multi-armed-bandit multi-armed-bandits thompson
Language:Go 49
Heewon-Hailey / multi-armed-bandits-for-recommendation-systems
Implements basic and contextual MAB algorithms for recommendation systems
python scikit-learn numpy matplotlib multiarmed-bandits recommendation-system epsilon-greedy upper-confidence-bounds contextual-bandits
Language:Jupyter Notebook 32
irec-org / irec
Interactive Recommender Systems Framework
recommender-system multiarmed-bandits reinforcement-learning interactive-recommender
Language:Python 22
Bilkent-CYBORG / ACC-UCB
Implementation of the Adaptive Contextual Combinatorial Upper Confidence Bound (ACC-UCB) algorithm for the contextual combinatorial volatile multi-armed bandit setting.
multiarmed-bandits reinforcement-learning contextual-bandit combinatorial-bandit
Language:Python 16
rklymentiev / mab_problem
How to tackle the multi-armed bandit problem using different approaches
multiarmed-bandits flask-app ab-testing
Language:HTML 13
paulozip / beer-recommender-mab
A beer recommendation system using a multi-armed bandit approach to solve cold-start problems
multi-armed-bandits multiarmed-bandits python recommendation-system
Language:Python 12
ShreenidhiN / Reinforcement-Learning-based-Movie-Recommendation
Recommender systems are designed to recommend things to the user based on many different factors. These systems predict the products that users are most likely to purchase and be interested in. Recommendations typically speed up searches, make it easier for users to access content they’re interested in, and surprise them with offers they would never have searched for. In this project, we explore the use of reinforcement-learning-based techniques to solve the problem of movie recommendation. We have implemented the following strategies: a multi-armed-bandit-based recommender and an actor-critic-based recommender framework using deep reinforcement learning.
deep-reinforcement-learning multiarmed-bandits reinforcement-learning actor-critic-model
Language:Jupyter Notebook 12
babaniyi / Deep-contextual-bandits
A benchmark to test decision-making algorithms for contextual bandits. The library implements a variety of algorithms (many of them based on approximate Bayesian neural networks and Thompson sampling), and a number of real and synthetic data problems exhibiting a diverse set of properties.
bandit-algorithms bandits multiarmed-bandits
Language:Python 9
formidablae / Batched_Multi-armed_Bandits
Batched Multi-armed Bandits Problem - Critical analysis. Artificial Intelligence course project studying a scientific paper and analyzing its experimental results.
artificial-intelligence armed-bandit multiarmed-bandits data-science pandas numpy matplotlib matlab
Language:Python 8
R4j4n / Maximizing-Revenue-of-an-Online-Retail-Business
thompson-sampling thompson-algorithm revenue-systems multiarmed-bandits
Language:Python 8
GjjvdBurg / ThompsonSampling
Source code for blog post on Thompson Sampling
thompson-sampling bandit-algorithms multi-armed-bandit multiarmed-bandits
Language:JavaScript 6
Nath-R / LEAF
Learning, Evaluation and Avoidance of Failure situations (LEAF) is a tool that prevents failures in a robot's task plan by learning from previous experience.
multiarmed-bandits learning-by-doing ontology robotics
Language:Java 4
prakHr / Reinforcement-Learning-Book
[Book] Andrea Lonza, Reinforcement Learning Algorithms with Python: Learn, understand, and develop smart algorithms for addressing AI challenges (Packt Publishing, 2019)
tensorflow gym-environment gym roboschool pybox2d duckietown-environment policy-iteration value-iteration sarsa-learning dqn actor-critic reinforce reinforcement-learning-algorithms baseline-cnns policy-gradient trpo ddpg td3 dagger multiarmed-bandits
Language:Python 4
raunakkmr / non-monotonic-resource-utilization-in-the-bandits-with-knapsacks-problem-code
This repository contains code for the paper "Non-monotonic Resource Utilization in the Bandits with Knapsacks Problem".
multiarmed-bandits regret-minimization bandits-with-knapsacks knapsack-constraints
Language:Python 4
irec-org / irec-cmdline
The iRec official command line interface
reinforcement-learning multiarmed-bandits recommender-system
Language:Jupyter Notebook 3
k9luo / Deep-Preference-Elicitation
A Comparative Evaluation of Active Learning Methods in Deep Recommendation
python3 tensorflow activelearning multiarmed-bandits upper-confidence-bounds thompson-sampling recommender-system sequential-recommendation active-learning
Language:Jupyter Notebook 3
Shahul-Rahman / MABSearch-Learning-the-learning-rate
MABSearch: The Bandit Way of Learning the Learning Rate - A Harmony Between Reinforcement Learning and Gradient Descent
global-minimum global-optimization global-optimization-algorithms gradient-descent learning-rate metaheuristics multi-armed-bandit multiarm-bandit multiarmed-bandits optimization reinforcement-learning python machine-learning
Language:Jupyter Notebook 3
CavenaghiEmanuele / Multi-armed-bandit
Library on Multi-armed bandit
multiarmed-bandits multiarm-bandit thompson-sampling thompson-algorithm
Language:Python 2
JordiMateoUdL / MAB
MAB Simulator is a Python package that provides a framework for simulating and comparing multi-armed bandit algorithms.
multiarmed-bandits python simulation
Language:Python 2
niazangels / bandits
An introduction to multi arm bandits
bandit-algorithms multiarm-bandit multiarmed-bandits reinforcement-learning
Language:Jupyter Notebook 2
SamueleMeta / data-intelligence-applications
Our project for the "Data Intelligence Applications" exam at Politecnico di Milano. The project was about Social Influence and Pricing online learning techniques applied to networks.
graphs greedy-algorithm multiarmed-bandits online-learning pricing social-influence thompson-sampling
Language:Python 2
StivenMetaj / Data_Intelligence_Applications_Exam_Project
Our project for the "Data Intelligence Applications" exam at Politecnico di Milano. The project was about Social Influence and Pricing techniques applied to networks.
data-intelligence multiarmed-bandits pricing python social-influence thompson-sampling
Language:Python 2
Sushant-ctrl / RL-IMPLEMENTATIONS
This repository contains the code and sources for the various RL algorithms that I have implemented.
dqn montecarlomethod multiarmed-bandits rl tabular-rl temporal-differencing-learning
Language:Python 2
GuilongAaron / beta_distribution_adprediction
This program applies a Thompson sampling bandit algorithm to ad prediction, selecting the ad with the highest probability of being clicked.
beta-distribution multi-arm-bandits multiarmed-bandits thompson-sampling
Language:Python 1
HaniyehBarghi / CooperativeThresholdedLasso
This repository contains the code necessary for generating the figures presented in the paper titled "Cooperative Thresholded Lasso for Sparse Linear Bandit".
multiagent multiarmed-bandits sparselearning
Language:Python 1
hardhik-99 / Thompsom_Sampling_GoF
Thompson Sampling equipped with Goodness of Fit test based active change-point detection in Non-Stationary Bandit environment
reinforcement-learning thompson-sampling goodness-of-fit multiarmed-bandits
Language:Python 1
KalbeDigitalLab / RL4DB-TUTORIAL-PRICAI-2023
This repository contains hands-on code for the PRICAI 2023 tutorials on Reinforcement Learning for Digital Business
inventory-management multiarmed-bandits online-advertising reinforcement-learning
Language:Jupyter Notebook 1
mahdiasdzd / Multi-Armed-Bandits
Multi-Stage-Multi-Armed Bandits (MAB) are a class of reinforcement learning problems where an agent tries to maximize its cumulative reward by sequentially selecting actions from multiple options (arms) and observing the rewards associated with those actions.
multiarmed-bandits multiarmedbandit python3 reinforcement-learning
Language:Jupyter Notebook 1
mobarski / kraken
Contextual Bandit Engine
multi-armed-bandit multi-armed-bandits multiarm-bandit multiarmed-bandits contextual-bandits
Language:Python 1
nisharathod231 / Sustainable-Agriculture-Practices-Recommender
Create a platform that recommends sustainable farming practices to farmers based on their specific location, soil type, crop choice, and climate conditions. Incorporating data on sustainable agriculture methods could help in increasing crop yield, reducing environmental impact, and promoting biodiversity.
machine-learning recommendation-system sustainable-development-goals multiarmed-bandits regret-minimization
Language:Jupyter Notebook 1
showman-sharma / Semi-bandits
We show the performance of various algorithms in the semi-bandit setting and try to solve a real-world problem using them
multiarmed-bandits bandit-algorithms shortest-path-algorithm
Language:Jupyter Notebook 1
theheisenberg10 / Marketing-Mix-for-Leading-Hospitality-Company
Sending personalized marketing offers (called free play in a casino setting) to players by observing data on their gaming behavior and demographic information
abtesting bayesian-neural-networks multiarmed-bandits reinforcement-learning ucb-algorithm
1
Bulbatronik / Machine-Learning
Repo containing lab files for the "Machine Learning" course taken during the summer semester of academic year 2022-2023 in the Master of Telecommunication Engineering program at Politecnico di Milano
machine-learning multiarmed-bandits python reinforcement-learning
Language:Jupyter Notebook
uribalb / RecommSystemWithRL
Music Recommendation system with a Contextual Multi-Armed Bandit
ipywidgets multiarmed-bandits reinforcement-learning scraping selenium
Language:Jupyter Notebook
