bandit-learning

There are 33 repositories under the bandit-learning topic.

cair / TsetlinMachine
Code and datasets for the Tsetlin Machine
bandit-learning frequent-pattern-mining game-theory learning-automata machine-learning pattern-recognition propositional-logic tsetlin-machine
Language:Cython 451
cair / pyTsetlinMachine
Implements the Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, Weighted Tsetlin Machine, and Embedding Tsetlin Machine, with support for continuous features, multigranularity, clause indexing, and literal budget
rule-based bandit-learning propositional-logic tsetlin-machine convolution regression frequent-pattern-mining interpretable machine-learning classification embedding
Language:C 122
Nth-iteration-labs / contextual
Contextual Bandits in R - simulation and evaluation of Multi-Armed Bandit Policies
contextual bandit simulation statistics multi-armed cmab contextual-bandits bandit-learning bandit-experiments reinforcement-learning reinforcement exploitation exploration evaluation machine-learning multi-armed-bandit contextual-bandit-policies cran offline-bandit multi-armed-bandits
Language:R 79
SamRagusa / Checkers-Reinforcement-Learning
A checkers reinforcement learning AI, and all the tools needed to train it.
adversarial adversarial-learning ai alpha-beta-pruning artificial-intelligence bandit-learning board-game checker checkers checkers-reinforcement-learning draughts dynamic-programming game game-board machine-learning q-learning reinforcement-learning
Language:Python 54
cair / convolutional-tsetlin-machine-tutorial
Tutorial on the Convolutional Tsetlin Machine
tsetlin-machine pattern-recognition convolution interpretable-machine-learning propositional-logic rule-based bandit-learning frequent-pattern-mining
Language:Python 52
cair / pyTsetlinMachineParallel
Multi-threaded implementation of the Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, and Weighted Tsetlin Machine, with support for continuous features and multigranularity.
frequent-pattern-mining bandit-learning propositional-logic tsetlin-machine classification regression convolution machine-learning interpretable-machine-learning rule-based
Language:C 39
thunfischtoast / LinUCB
Contextual bandit algorithm called LinUCB / Linear Upper Confidence Bounds as proposed by Li, Langford and Schapire
java bandit-algorithm bandit-learning contextual-bandits linucb
Language:Java 28
mmalekzadeh / privacy-preserving-bandits
Privacy-Preserving Bandits (MLSys'20)
bandit-algorithms differential-privacy machine-learning online-machine-learning reinforcement-learning contextual-bandits privacy-preserving-machine-learning privacy-preserving-bandits criteo-dataset federated-learning recommender-system recommendation bandit-learning bandit-algorithm differentially-private
Language:Jupyter Notebook 22
Nth-iteration-labs / streamingbandit-ui
Client that handles the administration of StreamingBandit online, or straight from your desktop. Set up and run streaming (contextual) bandit experiments in your browser.
streamingbandit-client react bandit-learning bandit-algorithm contextual-bandits multiarm-bandit javascript client webapp machine-learning
Language:JavaScript 8
etiennekintzler / visualize_bandit_algorithms
Some visualizations of bandit algorithm outputs.
reinforcement-learning-algorithms bandit-learning linucb
Language:Jupyter Notebook 7
anishacharya / Bandits-Online-Learning
Simple Implementations of Bandit Algorithms in python
online-learning online-learning-python online-learning-algorithms bandit bandit-algorithms bandit-learning bandits multi-armed-bandits
Language:Jupyter Notebook 4
crenwick / Swiper
🦊 A series of bandit algorithms in Swift
softmax epsilon swift bandit-learning bandit multi-armed-bandits multi-arm-bandits
Language:Swift 4
juliakreutzer / bandit-neuralmonkey
Bandit learning on top of Neural Monkey, an open-source tool for sequence learning in NLP built on TensorFlow. Bandit online learning objectives in branch bandits-acl (ACL17) and counterfactual learning objectives in branch acl-2018 (ACL18).
bandit-learning machine-translation weak-feedback neural-mt nmt reinforce
Language:Python 4
AntoineG92 / Online-Clustering-of-Bandits-ENSAE
Based on the Gentile-Li-Zappella article "Online Clustering of Bandits"
bandit-learning online-learning clustering graph-algorithms
Language:Jupyter Notebook 3
thiagopbueno / pybayesbandit
Bayesian bandits in Python3.
bandit-learning bayesian belief-planning rl
Language:Python 3
florian / reinforcement-learning
Implementing RL algorithms
reinforcement-learning machine-learning bandit-learning
Language:Jupyter Notebook 2
juliakreutzer / bandit-cdec
Decoder, aligner, and model optimizer for statistical machine translation and other structured prediction models based on (mostly) context-free formalisms
bandit-learning cdec weak-feedback machine-translation
Language:C++ 2
rasros / combo
kotlin kotlin-library optimization bandit-algorithms bandit-learning ab-testing genetic-algorithm
Language:Kotlin 2
shashankp914 / Over-the-wire-wargames-Solutions
Detailed solutions to the OverTheWire wargames, currently covering Bandit, with more to come.
ctf cybersecurity linux open-source overthewire bandit bandit-algorithms bandit-learning
2
0x65-e / Stats-115
Homework Code for UCLA STATS 115 (Probabilistic Decision Making) Fall 22 Offering
bandit-algorithms bandit-learning decision-making-algorithms decision-making-under-uncertainty expectation-maximization markov-decision-processes python3 reinforcement-learning reinforcement-learning-algorithms value-iteration
Language:Python 1
victor-iyi / policy-gradient
A policy gradient approach to a multi-armed bandit problem
reinforcement-learning policy-gradient bandit-learning multi-armed-bandits tensorflow
Language:Jupyter Notebook 1
vwang0 / causal_inference
ab-testing simulation experiment bandit-learning bandit-algorithm
Language:Jupyter Notebook 1
znreza / RL_Best_Presentation
This presentation contains a precise yet detailed explanation of the concepts of a very interesting topic: Reinforcement Learning.
reinforcement-learning reinforcement-learning-algorithms td-learning sarsa sarsa-learning exploration exploitation bandit-algorithm bandit-learning alphago active-learning passive-learning model-based-rl model-free rl-vs-supervised-learning rl-vs-unsupervised-learning
1
fouratifares / RGL
Randomized Greedy Learning Under Full-bandit Feedback
agent bandit-algorithms bandit-learning machine-learning machine-learning-algorithms machinelearning reinforcement-learning reinforcement-learning-algorithms submodular-optimization submodularity
Language:Python 0
jpthanga / 10-Arm-Bandit
Implementation of 10 Arm Bandit using RLGlue
reinforcement-learning bandit-learning cpp
Language:C 0
SFV-CORE / Bandit_OverTheWire
Here I will explain how to get past each level of the Bandit CTF provided by OverTheWire
bandit bandit-learning ctf-challenges ctf-solutions linux
0
ad0x99 / linux-4-fun
My Linux Notes
linux bandit-learning
DenzilFrancisCrasta / bandit
reinforcement-learning bandit-learning
Language:Python
hartikainen / information-theoretic-bandit
information-theory reinforcement-learning perception-action-cycle bandit-learning k-armed-bandit multi-arm-bandits information-to-go value-to-go
Language:Python
jonad / smartcab
Train a SmartCab how to drive using reinforcement learning.
reinforcement-learning pygame markov-decision-processes bandit-learning python
Language:Jupyter Notebook
victor-iyi / contextual-bandit
A Reinforcement Learning approach to a contextual bandit problem.
reinforcement-learning-algorithms contextual-bandit markov-decision-processes bandit-learning reinforcement-learning
Language:Jupyter Notebook
vitorhugo13 / feup-mssi
Repository of code developed for the course MSSI @FEUP.
sumo traci bandit-learning
Language:Python
zeroinfiniti / bandit-wargames
Leveling up on the Bandit Wargames
bandit-learning cybersecurity cybersecurity-training overthewire overthewire-bandit wargame-challenges
