till2 / RL-Bandit

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

RL-Bandit

Implementation of the Upper Confidence Bound Section from "Introduction to RL" by Sutton & Barto. Just playing around and trying different approaches.

About


Languages

Language:Jupyter Notebook 100.0%