3cky / tensorflow-rl-tictactoe

Training TensorFlow neural network to play Tic-Tac-Toe game using one-step Q-learning algorithm.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Training TensorFlow neural network to play Tic-Tac-Toe game using one-step Q-learning algorithm.

Requirements:

References:

  • Michael L. Littman. Markov games as a framework for multi-agent reinforcement learning. Machine Learning, 11:157–163, 1994.
  • W. T. Uther and M. Veloso. Adversarial reinforcement learning, School Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, 1997.
  • R. A. C. Bianchi, C. H. C. Ribeiro, and A. H. R. Costa. Heuristic selection of actions in multiagent reinforcement learning. In IJCAI’07, Hyderabad, India, 2007.

About

Training TensorFlow neural network to play Tic-Tac-Toe game using one-step Q-learning algorithm.


Languages

Language:Python 100.0%