JCK-1096 / contextual_MAB

A simple pure-python framework for dealing with the contextual multi-armed bandit problems

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Contextual multi-armed bandit

This repo contains a review of the contextual multi-armed bandits.
Includes proposed framework for extendible building blocks that form the contextual bandit problem.
You can find the overview of the contextual bandits, dataset, and the framework in the presentation

Structure

  • src/ contains the modules of the framework
    • agent, environment, oracle, policy, data provider
  • train_manual demonstrates how to combine these modules to form a contextual bandit
  • train_grid allows you to run the model for a specified parameter grid

About

A simple pure-python framework for dealing with the contextual multi-armed bandit problems


Languages

Language:HTML 79.6%Language:Jupyter Notebook 18.2%Language:Python 2.1%Language:Dockerfile 0.1%