There are 16 repositories under the alphago topic.
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
MuZero
Code and other material for the book "Deep Learning and the Game of Go"
BetaGo: AlphaGo for the masses, live on GitHub.
Xiangqi (Chinese chess) AlphaZero program
A student implementation of AlphaGo Zero
A Deep Learning UCI-Chess Variant Engine written in C++ & Python :parrot:
A Go-playing program implemented in TensorFlow, roughly following the architecture of AlphaGo. Current strength is 3 to 4 amateur dan.
An asynchronous/parallel implementation of the AlphaGo Zero algorithm for Gomoku
Datasets for computer Go
A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games
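AlphaGo-Zero-Gobang is a reinforcement-learning-based Gobang (five in a row) model, a demo intended mainly for understanding how AlphaGo Zero works: how the neural network guides MCTS decisions and how it learns through self-play. Source code + tutorial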
Reinforcing Your Learning of Reinforcement Learning
A "build to learn" AlphaZero implementation using Gradient Boosted Decision Trees (LightGBM)
A minimalist deep learning library in JavaScript using WebGL + asm.js. Run convolutional neural networks in your browser.
AlphaGo Zero implementation using Flux.jl
An implementation of an improved AlphaGo algorithm for the game of Gomoku.
A very fast implementation of AlphaZero, applied to games like Splendor, Santorini, The Little Prince, … Browser version available
:monkey_face: Implementations of an AI chess/board-game framework in many programming languages.
[ICLR 2025] Code for the paper "Implicit Search via Discrete Diffusion: A Study on Chess"
Book repository for AlphaGo Simplified (CRC Press 2024). Implement ideas behind Deep Blue (rule-based AI) and AlphaGo (rule-based AI + Deep Learning) in three simple games: Last Coin Standing, Tic Tac Toe, and Connect Four.
Deep Reinforcement Learning to Play 2048 (with Keras)
Othello AI (AlphaGo's PV-MCTS algorithm)
Tic Tac Toe with the AlphaZero method - My first work
🌲 We introduce LLM-UCT, an MCTS sampling framework designed specifically for large language models (LLMs), to address the unique challenges and requirements of such models.