jonathan-laurent / AlphaZero.jl

A generic, simple and fast implementation of Deepmind's AlphaZero algorithm.

Home Page:https://jonathan-laurent.github.io/AlphaZero.jl/stable/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

What next after training an agent?

pepa65 opened this issue · comments

commented

I followed the readme in training an agent, except I chose tictactoe (as connect-four ran out of memory), and it completed!

What can I now do to use the result?

You can do anything you want with the trained network weights.
If you just want to play against the agent or visualize MCTS stats, you can use the explore and play scripts.

commented

OK, this works: julia --project -e 'using AlphaZero; Scripts.play("mancala")'