Thidtc / gobang

Gobang

This is a PyTorch-based implementation of the AlphaGo Zero algorithm for the game of gobang. It also provides an OpenAI Gym-style environment interface for gobang.
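
To illustrate what a Gym-style interface for gobang can look like, here is a minimal sketch in plain Python. It is not the repository's actual environment: the class name `GobangEnvSketch`, the `legal_actions` helper, the board encoding, and the reward scheme are all assumptions made for illustration. It only follows the usual Gym convention of `reset()` returning an observation and `step(action)` returning `(observation, reward, done, info)`.

```python
class GobangEnvSketch:
    """Hypothetical Gym-style gobang environment (illustrative only)."""

    def __init__(self, size=6, n_in_row=4):
        self.size = size
        self.n_in_row = n_in_row
        self.reset()

    def reset(self):
        # Assumed encoding: 0 = empty, 1 = first player, 2 = second player.
        self.board = [[0] * self.size for _ in range(self.size)]
        self.current_player = 1
        self.done = False
        return self._observation()

    def _observation(self):
        # Return a copy so callers cannot mutate the internal board.
        return [row[:] for row in self.board]

    def legal_actions(self):
        # An action is a flat index: row * size + col.
        return [r * self.size + c
                for r in range(self.size)
                for c in range(self.size)
                if self.board[r][c] == 0]

    def step(self, action):
        if self.done:
            raise ValueError("game is over; call reset()")
        if not 0 <= action < self.size * self.size:
            raise ValueError("action out of range")
        row, col = divmod(action, self.size)
        if self.board[row][col] != 0:
            raise ValueError("cell already occupied")

        player = self.current_player
        self.board[row][col] = player
        won = self._wins(row, col, player)
        self.done = won or not self.legal_actions()
        # Assumed reward: 1.0 for the move that wins, otherwise 0.0.
        reward = 1.0 if won else 0.0
        self.current_player = 3 - player  # switch between 1 and 2
        info = {"winner": player if won else None}
        return self._observation(), reward, self.done, info

    def _wins(self, row, col, player):
        # Count stones through (row, col) along the four line directions.
        for dr, dc in ((0, 1), (1, 0), (1, 1), (1, -1)):
            count = 1
            for sign in (1, -1):
                r, c = row + sign * dr, col + sign * dc
                while (0 <= r < self.size and 0 <= c < self.size
                       and self.board[r][c] == player):
                    count += 1
                    r += sign * dr
                    c += sign * dc
            if count >= self.n_in_row:
                return True
        return False
```

A caller would typically loop over `env.step(action)` with actions drawn from `env.legal_actions()` until `done` is true. Checking for a win only around the last move keeps each step cheap compared with rescanning the whole board.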

Environment

  • Python 3.*
  • PyTorch 0.3.*

Usage

Training

python train.py

Result

6×6 board, n_in_row = 4 (four in a row wins)

  • training time

    2~3 hours

  • results

    9:1 versus a pure Monte Carlo tree search algorithm (given 10 times the sample count)

TODO

  • Add an interface for a human player
  • Implement the render_ interface for the gobang environment to provide a UI
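
As a rough idea of what a first text-based rendering could look like, the sketch below prints a board as a grid with row and column indices. It is only an assumption about one possible approach, not the planned implementation: the function name `render_text`, the board encoding (0 = empty, 1 and 2 = the two players), and the symbols are hypothetical.

```python
def render_text(board, symbols=".XO"):
    """Return a text drawing of a square board given as a list of lists.

    Assumed encoding: 0 = empty, 1 = first player, 2 = second player.
    Columns line up for boards of up to 10 cells per side.
    """
    size = len(board)
    lines = ["  " + " ".join(str(c) for c in range(size))]  # column header
    for r, row in enumerate(board):
        lines.append(str(r) + " " + " ".join(symbols[v] for v in row))
    return "\n".join(lines)


if __name__ == "__main__":
    demo = [[0] * 6 for _ in range(6)]
    demo[2][2] = 1
    demo[2][3] = 2
    print(render_text(demo))
```

Returning a string rather than printing directly keeps the function easy to test and lets a human-player interface decide where the output goes.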
