johnpaulada / ticktocktoes

Tic Tac Toe Experiment with RL in JS

TickTockToes

Bellman

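Q[s][a] = Q[s][a] + alpha * (reward(s, a) + discount * Math.max(...Qs) - Q[s][a])

Here alpha is the learning rate, discount is the discount factor, and Qs is presumably the list of Q-values for the actions available in the next state.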
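
The update rule above can be sketched as a small function. This is a minimal illustration, not the repo's actual code: the name `updateQ`, its parameters, the default values for `alpha` and `discount`, and the choice of a plain object for the table are all assumptions.

```javascript
// Hypothetical helper (not from the repo): applies one Q-learning update in place.
// Q is assumed to be a plain object where Q[state][action] is a number.
function updateQ(Q, s, a, reward, nextState, nextActions, alpha = 0.1, discount = 0.9) {
  Q[s] = Q[s] || {};
  const current = Q[s][a] || 0; // unseen state/action pairs start at 0
  const nextRow = Q[nextState] || {};
  // Q-values of the actions available in the next state.
  const Qs = nextActions.map((next) => nextRow[next] || 0);
  // A terminal state has no actions, so it contributes no future value.
  const bestNext = Qs.length > 0 ? Math.max(...Qs) : 0;
  Q[s][a] = current + alpha * (reward + discount * bestNext - current);
  return Q[s][a];
}
```

Guarding the empty case matters because `Math.max()` with no arguments returns `-Infinity`, which would corrupt the table on the final move of a game.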

Todos

  • Select action from table. If none, randomly select.
  • Add reward/punishment when game is done.
  • [] Convert back to Map
  • [] Compress Repeating Stuff
  • [] Huffman Coding
  • [] Use MessagePack
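
The first two items above could look roughly like the following. This is a sketch under stated assumptions: `selectAction` and `finalReward` are hypothetical names, the table is assumed to be a plain object keyed by state and then by action, and the reward values (+1, -1, 0) are illustrative rather than taken from the repo.

```javascript
// Hypothetical helper: pick the best known action for a state, or a random one.
// `random` is injectable so the fallback can be tested deterministically.
function selectAction(Q, state, availableActions, random = Math.random) {
  const row = Q[state];
  const known = row ? availableActions.filter((a) => a in row) : [];
  if (known.length === 0) {
    // No table entry for this state yet: fall back to a random legal move.
    return availableActions[Math.floor(random() * availableActions.length)];
  }
  // Otherwise take the known action with the highest Q-value.
  return known.reduce((best, a) => (row[a] > row[best] ? a : best));
}

// Hypothetical terminal reward; the values are assumptions, not from the repo.
function finalReward(result) {
  if (result === "win") return 1;
  if (result === "loss") return -1;
  return 0; // draw or unfinished game
}
```

Note that this selection is purely greedy once a state has entries. Mixing in occasional random moves (for example, epsilon-greedy) is a common way to keep exploring, though whether the project does so is not stated.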

Languages

Language:JavaScript 100.0%