There are 2 repositories under thompson-sampling topic.
Online Deep Learning: Learning Deep Neural Networks on the Fly / Non-linear Contextual Bandit Algorithm (ONN_THS)
:bust_in_silhouette: Multi-Armed Bandit Algorithms Library (MAB) :cop:
This repository contains the source code for “Thompson sampling efficient multiobjective optimization” (TSEMO).
Thompson Sampling Tutorial
All codes, both created and optimized for best results from the SuperDataScience Course
In This repository I made some simple to complex methods in machine learning. Here I try to build template style code.
pyrff: Python implementation of random fourier feature approximations for gaussian processes
Offline evaluation of multi-armed bandit algorithms
Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay
Study of the paper 'Neural Thompson Sampling' published in October 2020
Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs
Author's implementation of the paper Correlated Age-of-Information Bandits.
Optimizing the best Ads using Reinforcement learning Algorithms such as Thompson Sampling and Upper Confidence Bound.
A curated list on papers about combinatorial multi-armed bandit problems.
Thompson Sampling for Bandits using UCB policy
Repository Containing Comparison of two methods for dealing with Exploration-Exploitation dilemma for MultiArmed Bandits
The example of using reinforcement learning algorithms in the business, specifically finding what ads to use in our campaign.
An improved version of Turbo algorithm for the Black-box optimization competition organized by NeurIPS 2020
Codes and templates for ML algorithms created, modified and optimized in Python and R.
Source code for blog post on Thompson Sampling
Different implementations of Bayesian neural networks for uncertainty estimation. The uncertainty estimation is utilized for efficient exploration in reinforcement learning.
Foundations Of Intelligent Learning Agents (FILA) Assignments
Bayesian Link Adaptation under a BLER Target
Implementation of 9 multi-armed bandit algorithm for the stationary stochastic environment
Maximize revenues of Online Retail Business with Thompson Sampling algorithm