A generic, simple and fast implementation of Deepmind's AlphaZero algorithm.
Home Page:https://jonathan-laurent.github.io/AlphaZero.jl/stable/
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool
StepHaze opened this issue a year ago · comments
If loss almost isn't changing (~1.06) during learning, but the network is replacing itself, is it normal?