karthickrajas / Multi-arm-bandit

Introduction to reinforcement learning

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Multi arm bandits

Discusses formulation of MAB problems and its application. There are several methods for solving multi arm bandits. Few of them are dicussed in this git. Contextual MAB problems are interesting and extension of the solutions are also discussed.

  • Epsilon greedy
  • Upper Confidence Bound
  • Thompson sampling
  • Exp3
  • Exp4
  • LinUCB

About

Introduction to reinforcement learning