wecet / Experimenations-in-Reinforcement-Learning

Experiment 1: Comparison of key bandit algorithms; Experiment 2: Comparison of Q and SARSA Learning on Taxiv3 environment' ; Experiment 3: Comparison of Q, SARSA and CEM Learning on LunarLanderv2 Environment

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

wecet/Experimenations-in-Reinforcement-Learning Watchers