thompson-sampling

There are 2 repositories under thompson-sampling topic.

alison-carrera / onn
Online Deep Learning: Learning Deep Neural Networks on the Fly / Non-linear Contextual Bandit Algorithm (ONN_THS)
neural-network neural-architecture-search pytorch-implementation machine-learning-library thompson-sampling thompson-algorithm mab multiarmed-bandits contextual-bandits reinforcement-learning-algorithms reinforcement-learning pytorch pytorch-implemention
Language:Python 171
alison-carrera / mabalgs
:bust_in_silhouette: Multi-Armed Bandit Algorithms Library (MAB) :cop:
multi-armed-bandit mab arm reward thompson-sampling simulation ucb algorithm ranking-algorithm rank ranked-mab monte-carlo montecarlo-simulation contextual-bandits reinforcement-learning reinforcement-learning-algorithms
Language:Python 126
Eric-Bradford / TS-EMO
This repository contains the source code for “Thompson sampling efficient multiobjective optimization” (TSEMO).
bayesian-optimization machine-learning black-box-optimization gaussian-processes matlab expensive-to-evaluate-functions kriging surrogate-based-optimization thompson-sampling spectral-sampling genetic-algorithms multi-objective-optimization
Language:MATLAB 84
mab
stitchfix / mab
Library for multi-armed bandit selection strategies, including efficient deterministic implementations of Thompson sampling and epsilon-greedy.
multiarmed-bandits golang go experimentation data-science reinforcement-learning thompson-sampling multi-armed-bandit multi-armed-bandits thompson
Language:Go 46
andrecianflone / thompson
Thompson Sampling Tutorial
thompson-sampling bandit bandit-algorithm reinforcement-learning
Language:Jupyter Notebook 45
farhanchoudhary / Machine_Learning_A-Z_All_Codes_and_Templates
All codes, both created and optimized for best results from the SuperDataScience Course
machine-learning-az classification neural-networks association-rule-learning clustering-algorithm xgboost-algorithm natural-language-processing naive-bayes-classifier dimensionality-reduction principal-component-analysis clustering grid-search k-fold cross-validation deep-learning reinforcement-learning thompson-sampling upper-confidence-bounds
Language:Python 33
Nikronic / Machine-Learning-Models
In This repository I made some simple to complex methods in machine learning. Here I try to build template style code.
reinforcement-learning nlp-machine-learning pca apriori eclat upper-confidence-bound thompson-sampling lda kernel-pca xgboost linear-regression support-vector-regression logistic-regression k-nn svm naive-bayes random-forest decision-tree ann cnn
Language:Python 31
niffler92 / Bandit
Bandit algorithms
multiarm-bandit contextual-bandit bandit-algorithms thompson-sampling simulation linucb
Language:Python 29
michaelosthege / pyrff
pyrff: Python implementation of random fourier feature approximations for gaussian processes
gaussian-processes thompson-sampling bayesian-optimization
Language:Jupyter Notebook 26
antoine-hochart / bandit_algo_evaluation
Offline evaluation of multi-armed bandit algorithms
multi-armed-bandit epsilon-greedy upper-confidence-bound thompson-sampling policy-evaluation
Language:Python 20
v-i-s-h / MAB.jl
A Julia Package for providing Multi Armed Bandit Experiments
reinforcement-learning reinforcement-learning-algorithms julia-language multi-arm-bandits julia thompson-sampling mab bandit-experiments ucb exp julialang julia-package
Language:Julia 20
nphdang / Bandit-BO
Bayesian Optimization for Categorical and Continuous Inputs
bayesian-optimization categorical-variables continuous-variable thompson-sampling gaussian-processes automated-machine-learning hyperparameter-optimization batch-bayesian-optimization acquisition-functions multi-armed-bandits machine-learning hyperparameter-tuning gpyopt hyperopt smac automl optimization
Language:Python 18
akshaykhadse / reinforcement-learning
Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay
reinforcement-learning reinforcement-learning-excercises reinforcement-learning-analysis multi-armed-bandits multiarm-bandit markovian-epidemic-processes mdps ucb ucb1 kl-divergence epsilon-greedy thompson-sampling linear-programming howards-pi policy-iteration policy-evaluation batch-switching randomised-algorithms randomized-policy-iteration
Language:Python 17
RonyAbecidan / Neural-Thompson-Sampling
Study of the paper 'Neural Thompson Sampling' published in October 2020
neural-network neural-tangent-kernel thompson-sampling neural-thompson-sampling multi-armed-bandits contextual-bandits non-linear-optimization
Language:Jupyter Notebook 17
aijunbai / thompson-sampling
Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs
thompson-sampling pomdps mdp mcts
Language:C++ 13
Correlated-AoI-Bandits
ishank-juneja / Correlated-AoI-Bandits
Author's implementation of the paper Correlated Age-of-Information Bandits.
multi-armed-bandit ucb thompson-sampling age-of-information aoi correlated-multi-armed-bandits correlated-arms aoi-regret
Language:Python 13
sharmaroshan / Ads-Optimization
Optimizing the best Ads using Reinforcement learning Algorithms such as Thompson Sampling and Upper Confidence Bound.
data-science reinforcement-learning upper-confidence-bound thompson-sampling eda beginner data-analysis data-visualization
Language:Jupyter Notebook 13
ZIYU-DEEP / Awesome-Papers-on-Combinatorial-Semi-Bandit-Problems
A curated list on papers about combinatorial multi-armed bandit problems.
bandit-algorithms combinatorial-optimization multi-armed-bandit thompson-sampling combinatorial-bandit
13
atse0612 / Machine-Learning-A-Z
python numpy pandas matplotlib r regression bayesian-statistics random-forest kernel-support logistic-regression naive-bayes-classifier polynomial-regression linear-regression model-building thompson-sampling upper-confidence-bounds random-generation
Language:Jupyter Notebook 12
annieyan / Bandits-using-UCB-algorithm
Thompson Sampling for Bandits using UCB policy
reinforcement-learning ucb bandits thompson-sampling
Language:Python 10
Amshra267 / Thompson-Greedy-Comparison-for-MultiArmed-Bandits
Repository Containing Comparison of two methods for dealing with Exploration-Exploitation dilemma for MultiArmed Bandits
exploration-exploitation thompson-sampling epsilon-greedy optimistic-bayesian-sampling
Language:Python 9
lucko515 / ads-strategy-reinforcement-learning
The example of using reinforcement learning algorithms in the business, specifically finding what ads to use in our campaign.
reinforcement-learning upper-confidence-bounds thompson-sampling machine-learning
Language:Jupyter Notebook 8
nphdang / turbo_bbo_neurips_2020
An improved version of Turbo algorithm for the Black-box optimization competition organized by NeurIPS 2020
bayesian-optimization thompson-sampling gaussian-processes automated-machine-learning hyperparameter-optimization batch-bayesian-optimization decay acquisition-functions multi-armed-bandits machine-learning hyperparameter-tuning classification turbo
Language:Python 8
R4j4n / Maximizing-Revenue-of-an-Online-Retail-Business
thompson-sampling thompson-algorithm revenue-systems multiarmed-bandits
Language:Python 8
rudrajit1729 / Machine-Learning-Codes-And-Templates
Codes and templates for ML algorithms created, modified and optimized in Python and R.
datascience feature-extraction feature-selection regression-models regression-algorithms classification-algorithims kmeans-clustering hierarchical-clustering apriori-algorithm eclat-algorithm ucb thompson-sampling nlp-machine-learning ann cnn-classification kfold-cross-validation dimensionality-reduction xgboost-model parameter-tuning
Language:Python 8
mabby
thetawom / mabby
A multi-armed bandit (MAB) simulation library in Python
multi-armed-bandits probability python reinforcement-learning simulation agent-based-simulation artificial-intelligence epsilon-greedy thompson-sampling
Language:Python 7
GjjvdBurg / ThompsonSampling
Source code for blog post on Thompson Sampling
thompson-sampling bandit-algorithms multi-armed-bandit multiarmed-bandits
Language:JavaScript 6
LukasRinder / bayesian-neural-networks
Different implementations of Bayesian neural networks for uncertainty estimation. The uncertainty estimation is utilized for efficient exploration in reinforcement learning.
bayesian-neural-networks reinforcement-learning uncertainty-estimation thompson-sampling deep-q-network
Language:Python 6
nimily / linear-ts
Codes for simulations in the paper "On Worst-case Regret of Linear Thompson Sampling"
bandit thompson-sampling
Language:Python 6
rssalessio / Parallel-Bayesian-Optimization-Thompson-Sampling
thompson-sampling bayesian-optimization parallel-computing parallel-thompson-sampling parallel-bayesian-optimization black-box-optimization
Language:Python 5
rueian / gobandit
A golang library for solving multi armed bandit problem which can optimize your business choice on the fly without A/B testing
golang thompson-sampling enforcement multi-armed-bandit
Language:Go 5
Suchetaaa / CS747-Assignments
Foundations Of Intelligent Learning Agents (FILA) Assignments
reinforcement-learning multi-armed-bandits bellman-equation linear-programming howards-pi bootstrapping monte-carlo sarsa-learning windy-gridworld temporal-differencing-learning intelligent-learning-agents ucb thompson-sampling kl-ucb
Language:Python 5
vidits-kth / bayesla-link-adaptation
Bayesian Link Adaptation under a BLER Target
wireless-communication thompson-sampling cellular-network throughput-performance linear-programming
Language:Jupyter Notebook 5
Ralami1859 / Stochastic-Multi-Armed-Bandit
Implementation of 9 multi-armed bandit algorithm for the stationary stochastic environment
stochastic-bandit-algorithms ucb thompson-sampling kl-ucb bayes-ucb moss
Language:MATLAB 4
LaurentVeyssier / Maximize-Revenues-with-Thompson-Sampling
Maximize revenues of Online Retail Business with Thompson Sampling algorithm
thompson-sampling thompson-algorithm revenue-management maximization python reinforcement-learning
Language:Jupyter Notebook 3
vmarchaud / ts-mab
Typescript implementation of a multi-armed bandit
mab thompson-sampling typescript
Language:TypeScript 3

thompson-sampling

alison-carrera / onn

alison-carrera / mabalgs

Eric-Bradford / TS-EMO

stitchfix / mab

andrecianflone / thompson

farhanchoudhary / Machine_Learning_A-Z_All_Codes_and_Templates

Nikronic / Machine-Learning-Models

niffler92 / Bandit

michaelosthege / pyrff

antoine-hochart / bandit_algo_evaluation

v-i-s-h / MAB.jl

nphdang / Bandit-BO

akshaykhadse / reinforcement-learning

RonyAbecidan / Neural-Thompson-Sampling

aijunbai / thompson-sampling

ishank-juneja / Correlated-AoI-Bandits

sharmaroshan / Ads-Optimization

ZIYU-DEEP / Awesome-Papers-on-Combinatorial-Semi-Bandit-Problems

atse0612 / Machine-Learning-A-Z

annieyan / Bandits-using-UCB-algorithm

Amshra267 / Thompson-Greedy-Comparison-for-MultiArmed-Bandits

lucko515 / ads-strategy-reinforcement-learning

nphdang / turbo_bbo_neurips_2020

R4j4n / Maximizing-Revenue-of-an-Online-Retail-Business

rudrajit1729 / Machine-Learning-Codes-And-Templates

thetawom / mabby

GjjvdBurg / ThompsonSampling

LukasRinder / bayesian-neural-networks

nimily / linear-ts

rssalessio / Parallel-Bayesian-Optimization-Thompson-Sampling

rueian / gobandit

Suchetaaa / CS747-Assignments

vidits-kth / bayesla-link-adaptation

Ralami1859 / Stochastic-Multi-Armed-Bandit

LaurentVeyssier / Maximize-Revenues-with-Thompson-Sampling

vmarchaud / ts-mab