There are 0 repositories under the bandit-learning topic.
Code and datasets for the Tsetlin Machine
Implements the Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, Weighted Tsetlin Machine, and Embedding Tsetlin Machine, with support for continuous features, multigranularity, clause indexing, and literal budget
Contextual Bandits in R - simulation and evaluation of Multi-Armed Bandit Policies
A checkers reinforcement learning AI, and all the tools needed to train it.
Tutorial on the Convolutional Tsetlin Machine
Multi-threaded implementation of the Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, and Weighted Tsetlin Machine, with support for continuous features and multigranularity.
Contextual bandit algorithm LinUCB (Linear Upper Confidence Bound), as proposed by Li, Chu, Langford, and Schapire
Privacy-Preserving Bandits (MLSys'20)
Client that handles the administration of StreamingBandit online, or straight from your desktop. Set up and run streaming (contextual) bandit experiments in your browser.
Some visualizations of bandit algorithm outputs.
Simple implementations of bandit algorithms in Python
Bandit learning on top of Neural Monkey, an open-source tool for sequence learning in NLP built on TensorFlow. Bandit online learning objectives in branch bandits-acl (ACL17) and counterfactual learning objectives in branch acl-2018 (ACL18).
Based on the article "Online Clustering of Bandits" by Gentile, Li, and Zappella
Implementing RL algorithms
Decoder, aligner, and model optimizer for statistical machine translation and other structured prediction models based on (mostly) context-free formalisms
Detailed solutions to the OverTheWire wargames, currently covering Bandit, with more to be added in the future.
A policy gradient approach to a multi-armed bandit problem
This presentation gives a precise yet detailed explanation of the concepts of reinforcement learning.
Randomized Greedy Learning Under Full-bandit Feedback
Implementation of a 10-armed bandit using RLGlue
Here I will explain how to get past each level of the Bandit CTF provided by Over The Wire
A Reinforcement Learning approach to a contextual bandit problem.
Repository of code developed for the course MSSI @FEUP.
Leveling up on the Bandit Wargames