Simple TicTacToe game developed to learn Reinforcement Learning using Q-Learning.
The AI is basic and all was handmade, so obviously the performance is not great.
- Improve AI performance overall
- Support multiple policies (right now, we're using only epsilon-greedy)