zeal4u / AlphaZero_Gobang

Overview

This is an AlphaZero implementation of Gobang (five in a row) based on PyTorch.

Code

  • Train.py: Runs the training process
  • Run.py: Plays against a human using the trained model
  • Player.py: Base class for the different players
  • RolloutPlayer.py: Player with MCTS using a random rollout policy
  • AlphaZeroPlayer.py: AlphaZero player with MCTS guided by a residual network
  • HumanPlayer.py: Human player
  • MCTS.py: Base class for the different MCTS variants
  • AlphaZeroMCTS.py: MCTS guided by a residual network
  • RolloutMCTS.py: MCTS using a random rollout policy
  • TreeNode.py: MCTS tree node
  • PolicyValueNet.py: Residual network implementation based on PyTorch
  • Board.py: Board class for Gobang
  • Game.py: Game logic for Gobang
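
To make the role of TreeNode.py and AlphaZeroMCTS.py more concrete, here is a minimal sketch of an MCTS tree node that selects children with the PUCT rule commonly used in AlphaZero-style search. It is not taken from this repository: the class layout, method names (`expand`, `select`, `backup`) and the `c_puct` default are assumptions for illustration, and the real TreeNode.py may differ.

```python
import math


class TreeNode:
    """Hypothetical MCTS node; names are illustrative, not copied from TreeNode.py."""

    def __init__(self, parent=None, prior=1.0):
        self.parent = parent
        self.children = {}     # move -> TreeNode
        self.visits = 0        # N: number of times this node was visited
        self.value_sum = 0.0   # W: total value, seen by the player who moved into this node
        self.prior = prior     # P: prior probability from the policy network

    def q(self):
        # Mean value Q = W / N; 0 for an unvisited node.
        return self.value_sum / self.visits if self.visits else 0.0

    def expand(self, move_priors):
        # move_priors: iterable of (move, prior probability) pairs.
        for move, p in move_priors:
            if move not in self.children:
                self.children[move] = TreeNode(self, p)

    def puct_score(self, c_puct):
        # Q + U, where U favours moves with a high prior and few visits.
        u = c_puct * self.prior * math.sqrt(self.parent.visits) / (1 + self.visits)
        return self.q() + u

    def select(self, c_puct=5.0):
        # Return the (move, child) pair with the highest PUCT score.
        return max(self.children.items(), key=lambda kv: kv[1].puct_score(c_puct))

    def backup(self, value):
        # Propagate a leaf evaluation up to the root, flipping the sign
        # at each level because the players alternate.
        node = self
        while node is not None:
            node.visits += 1
            node.value_sum += value
            value = -value
            node = node.parent
```

In a rollout-based search the value passed to `backup` would come from a random playout, while in the AlphaZero variant it would come from the value head of the network, with the priors supplied by the policy head.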

Running Script

Run on Linux Server

nohup python -u Train.py > simpleres_train.log 2>&1 &

Download the trained model

scp root@139.199.21.83:/usr/local/workspace/AlphaZero_Gobang/data/current_policy_resnet_epochs_1500.model /Users/xuetf/Downloads

Upload a file (note that scp takes the port with an uppercase -P)

scp -P 8381 local_file_path root@139.199.21.83:/root/

Reference

AlphaZero in Practice (original title: AlphaZero实战)
