3cky/tensorflow-rl-tictactoe

Trains a TensorFlow neural network to play Tic-Tac-Toe using the one-step Q-learning algorithm.
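
For orientation, here is a minimal sketch of the one-step Q-learning update itself. It is not taken from this repository: the function names (`one_step_q_target`, `tabular_update`), the 9-element action vector (one entry per board cell), the default `gamma` and `alpha` values, and the choice to treat the opponent's reply as part of the environment are all assumptions made for illustration. It uses a plain NumPy array in place of the neural network; with a network, the same target would typically serve as the regression label for the chosen action's output.

```python
import numpy as np


def one_step_q_target(reward, next_q_values, legal_next_moves, done, gamma=0.9):
    """Compute the one-step Q-learning target r + gamma * max_a' Q(s', a').

    next_q_values: Q estimates for all 9 cells in the next state (hypothetical layout).
    legal_next_moves: indices of empty cells in the next state.
    """
    # A finished game (or a full board) has no future value to bootstrap from.
    if done or len(legal_next_moves) == 0:
        return float(reward)
    # Only legal moves take part in the max; occupied cells are ignored.
    best_next = max(next_q_values[a] for a in legal_next_moves)
    return float(reward + gamma * best_next)


def tabular_update(q_values, action, target, alpha=0.1):
    """Move Q(s, action) a fraction alpha of the way toward the target."""
    updated = np.array(q_values, dtype=float)  # copy, so the input is left untouched
    updated[action] += alpha * (target - updated[action])
    return updated


# Example: a non-terminal step with zero reward, then a winning terminal step.
next_q = np.array([0.5, 1.0, -1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0])
mid_game_target = one_step_q_target(0.0, next_q, [0, 2], done=False)
win_target = one_step_q_target(1.0, next_q, [], done=True)
q_after = tabular_update(np.zeros(9), action=4, target=win_target)
```

In the example, cell 1 holds the largest estimate but is not among the legal moves, so it does not affect the target.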

Requirements:

References:

Michael L. Littman. Markov games as a framework for multi-agent reinforcement learning. Machine Learning, 11:157–163, 1994.
W. T. Uther and M. Veloso. Adversarial reinforcement learning. School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, 1997.
R. A. C. Bianchi, C. H. C. Ribeiro, and A. H. R. Costa. Heuristic selection of actions in multiagent reinforcement learning. In IJCAI’07, Hyderabad, India, 2007.
