Mastering TicTacToe with AlphaZero

An implementation of the AlphaZero algorithm, coded from scratch, that plays TicTacToe and never loses!

Please check out the Medium article for a quick overview.

Demo: https://alphazerotictactoe.herokuapp.com/

To build the training dataset using self-play:

python selfplay.py

To train the value and policy networks:

python retrain.py
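
For intuition, here is a minimal sketch of what a self-play step could produce. It is not the code in selfplay.py: every name below (WIN_LINES, winner, self_play_game) is hypothetical, and a uniform random policy stands in for the search that AlphaZero would normally run. Each game yields (state, policy, value) examples, which is the general shape of data that value and policy networks are trained on.

```python
import random

# Hypothetical sketch; names and data layout are assumptions, not the repo's API.
WIN_LINES = [(0, 1, 2), (3, 4, 5), (6, 7, 8),
             (0, 3, 6), (1, 4, 7), (2, 5, 8),
             (0, 4, 8), (2, 4, 6)]


def winner(board):
    """Return 1 or -1 if that player has three in a row, else 0."""
    for a, b, c in WIN_LINES:
        if board[a] != 0 and board[a] == board[b] == board[c]:
            return board[a]
    return 0


def self_play_game(rng):
    """Play one game and return a list of (state, policy, value) examples."""
    board = [0] * 9          # 0 = empty, 1 = first player, -1 = second player
    player = 1
    history = []             # (state seen by the mover, policy, mover)
    while winner(board) == 0 and 0 in board:
        legal = [i for i, v in enumerate(board) if v == 0]
        # Stand-in policy: uniform over legal moves. AlphaZero would instead
        # derive this distribution from tree-search visit counts.
        policy = [1.0 / len(legal) if i in legal else 0.0 for i in range(9)]
        # Encode the board from the mover's perspective (own stones are +1).
        state = [v * player for v in board]
        history.append((state, policy, player))
        move = rng.choice(legal)
        board[move] = player
        player = -player
    result = winner(board)   # 1, -1, or 0 for a draw
    # Value target: +1 if the mover eventually won, -1 if it lost, 0 for a draw.
    return [(s, p, result * mover) for s, p, mover in history]
```

Calling self_play_game(random.Random(0)) returns between five and nine examples, one per move played in that game.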

AlphaZero (or its more famous predecessor, AlphaGo) achieved one of the most famous breakthroughs in the field of AI. Achieving superhuman performance in the games of chess, shogi and Go, and having a Netflix movie made about its accomplishment (AlphaGo - The Movie), are just some of its accolades.

In this article, I will be discussing the general intuition behind AlphaZero and explaining the various components and processes in the algorithm. I have modified the algorithm from the paper to play TicTacToe.

Shoulders of giants

Mastering the game of Go without human knowledge: https://www.nature.com/articles/nature24270
http://joshvarty.github.io/AlphaZero/
AlphaZero Cheatsheet: https://medium.com/applied-data-science/alphago-zero-explained-in-one-diagram-365f5abf67e0
A Simple Alpha(Go) Zero Tutorial: https://web.stanford.edu/~surag/posts/alphazero.html
https://en.wikipedia.org/wiki/Monte_Carlo_tree_search
https://www.youtube.com/watch?v=MPXGiowUr0o&ab_channel=SkowstertheGeek
