alphago

There are 16 repositories under alphago topic.

sweetice / Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
a2c a3c actor-critic actor-critic-algorithm algorithm alphago deep-learning deep-reinforcement-learning dqn policy-gradient ppo pytorch reinforce resnet sac sarsa td3 trpo
Language:Python 4487
suragnair / alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
tensorflow pytorch keras gobang gomoku alpha-zero alphago-zero alphago reinforcement-learning self-play mcts monte-carlo-tree-search othello tf deep-learning alphazero neural-network
Language:Jupyter Notebook 4297
junxiaosong / AlphaZero_Gomoku
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
alphago alphago-zero alphazero board-game gobang gomoku mcts monte-carlo-tree-search pytorch reinforcement-learning rl self-learning tensorflow
Language:Python 3543
werner-duvaud / muzero-general
MuZero
alphago alphazero deep-learning deep-reinforcement-learning gym machine-learning mcts model-based-rl monte-carlo-tree-search muzero muzero-general neural-network python3 pytorch reinforcement-learning residual-network rl self-learning tensorboard
Language:Python 2715
maxpumperla / deep_learning_and_the_game_of_go
Code and other material for the book "Deep Learning and the Game of Go"
deep-learning neural-networks machine-learning data-science python games game-of-go alphago alphago-zero
Language:Python 1040
maxpumperla / betago
BetaGo: AlphaGo for the masses, live on GitHub.
alphago betago bot deep-networks game neural-network
Language:Python 691
bupticybee / icyChessZero
** Xiangqi (象棋) AlphaZero program
alphago-zero alphago chinese-chess reinforcement-learning chinese tensorlfow
Language:Jupyter Notebook 413
dylandjian / SuperGo
A student implementation of AlphaGo Zero
alphago alphago-zero machine-learning mcts python3 pytorch reinforcement-learning
Language:Python 281
QueensGambit / CrazyAra
A Deep Learning UCI-Chess Variant Engine written in C++ & Python
python crazyhouse chess-engine deep-learning artificial-intelligence convolutional-neural-network mcts alphazero mxnet gluon open-source machine-learning lichess python-chess alphago mcgs
Language:Jupyter Notebook 278
HardcoreJosh / JoshieGo
A Go-playing program implemented in TensorFlow, roughly following the architecture of AlphaGo. Current strength is 3-4 amateur dan.
alphago tensorflow
Language:Python 216
initial-h / AlphaZero_Gomoku_MPI
An asynchronous/parallel implementation of the AlphaGo Zero algorithm for Gomoku
alphazero alphazero-gomoku parallel mpi4py tensorflow alphago mcts gomoku tensorlayer tree-search algorithm deep-reinforcement-learning dirichlet-distribution
Language:Python 212
yenw / computer-go-dataset
Datasets for computer Go
computer-go go sgf tygem computer-go-dataset alphago alphazero muzero fineart leelazero golaxy minigo elf-opengo phoenixgo
Language:C++ 157
michaelnny / alpha_zero
A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games
alphago-zero alphazero gomoku alphago go reinforcement-learning
Language:Python 153
CGLemon / Sayuri
AlphaZero based engine for the game of Go (圍棋/围棋).
mcts weiqi baduk alphago deeplearning sayuri alphazero gumbel-alphazero
Language:C++ 114
YoujiaZhang / AlphaGo-Zero-Gobang
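AlphaGo-Zero-Gobang is a reinforcement-learning model for Gobang (five in a row), intended mainly as a demo for understanding how AlphaGo Zero works: how the neural network guides MCTS in making decisions, and how it learns through self-play. Source code + tutorial.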
ai alphago alphazero deep-learning gobang gomuku gui mcts residual-networks tensorflow
Language:Python 109
Urinx / ReinforcementLearning
Reinforcing Your Learning of Reinforcement Learning
reinforcement-learning alphago-zero mcts q-learning policy-gradient gomoku frozenlake doom cartpole tic-tac-toe atari-2600 space-invaders ppo advantage-actor-critic dqn alphago ddpg
Language:Python 96
cgreer / alpha-zero-boosted
A "build to learn" Alpha Zero implementation using Gradient Boosted Decision Trees (LightGBM)
alphago alphazero gbdt gradient-boosted-trees lightgbm python xgboost
Language:Python 86
BlinkDL / BlinkDL
A minimalist deep learning library in JavaScript using WebGL + asm.js. Run convolutional neural networks in your browser.
deep-learning deeplearning deep-neural-networks neural-network neural-networks alphago
Language:JavaScript 84
Terkwood / BUGOUT
AI-driven, Multiplayer Go/Weiqi/Baduk for the web 🐛🤖🦀♟
go-game weiqi baduk goban board-game multiplayer-game distributed-systems microservices redis redis-streams katago alphago alphazero artificial-intelligence igo multiplayer boardgame rust distributed-monolith
Language:Rust 79
tejank10 / AlphaGo.jl
AlphaGo Zero implementation using Flux.jl
alpha-zero alphago flux go julia reinforcement-learning
Language:Julia 72
kobanium / TamaGo
Computer Go engine using Monte-Carlo Tree Search, written in Python 3.
baduk go weiqi mcts monte-carlo-tree-search deep-learning go-text-protocol alphago alphago-zero alphagozero gumbel-alphazero reinforcement-learning
Language:Python 71
CGLemon / pyDLGO
A deep-learning-based GTP Go (圍棋/围棋) engine, with a KGS guide and algorithm tutorials.
alphago baduk deep-learning game-of-go goban mcts weiqi
Language:Python 68
GuoYi0 / alphaFive
An AlphaGo-style version of five in a row (gobang, gomoku)
alphago alphago-zero alphazero gomoku reinforcement-learning tensorflow gobang
Language:Python 68
zouyih / AlphaZero_Gomoku-tensorflow
alphazero alphago gomoku tensorflow reinforcement-learning
Language:Python 62
PolyKen / 15_by_15_AlphaGomoku
An implementation of an improved AlphaGo algorithm for the game of Gomoku.
python gomoku alphago
Language:Python 58
Liu8018 / Ghost
A Phantom Go AI based on miniGo; Phantom Go division champion at the 2019 ** Computer Game Competition
alphago alphazero deep-learning minigo phantom-go python
Language:Python 56
cestpasphoto / alpha-zero-general
A very fast implementation of AlphaZero, applied to games like Splendor, Santorini, The Little Prince, … Browser version available
alphago alphago-zero alphazero machikoro minivilles python pytorch reinforcement-learning santorini santorini-game splendor the-little-prince numba self-play
Language:Python 54
yangboz / godpaper
An AI chess/board-game framework with implementations in many programming languages.
actionscript starling ai board-game game-engine flash alphago deeplearning finite-state-machine fuzzy-logic-control dnn policytree mcts cnn docker microservice kubernetes deep-neural-networks deep-reinforcement-learning wiki
Language:HTML 48
HKUNLP / DiffuSearch
[ICLR 2025] Code for the paper "Implicit Search via Discrete Diffusion: A Study on Chess"
diffusion-models alphago alphazero chess-engine discrete-diffusion llms mcts non-autoregressive planning reasoning
Language:Python 33
markhliu / AlphaGoSimplified
Book repository for AlphaGo Simplified (CRC Press 2024). Implement ideas behind Deep Blue (rule-based AI) and AlphaGo (rule-based AI + Deep Learning) in three simple games: Last Coin Standing, Tic Tac Toe, and Connect Four.
actor-critic ai alphago alphazero deep-learning deep-neural-networks deep-reinforcement-learning machine-learning policy-gradient reinforcement-learning rule-based
Language:Jupyter Notebook 30
SergioIommi / DQN-2048
Deep Reinforcement Learning to Play 2048 (with Keras)
2048 2048-game alphago alphago-zero artificial-intelligence convolutional-neural-networks deep-learning deep-neural-networks deep-q-network deep-reinforcement-learning deepmind dqn game intelligent-agent keras-rl monte-carlo-tree-search neural-networks openai openai-gym reinforcement-learning
Language:Python 27
kongjiellx / AlphaZero-Renju
alphago alpha-zero bazel alphazero tensorflow pygame
Language:C++ 19
ladofa / janggi
야매장기 (Yamae Janggi) - a Janggi AI modeled on AlphaGo
janggi deep learning mcts alphago wpf
Language:C# 19
shionhonda / IaGo
Othello AI (AlphaGo's PV-MCTS algorithm)
deep-reinforcement-learning deep-learning reinforcement-learning alphago othello python chainer
Language:Python 18
kekmodel / gym-tictactoe-zero
Tic Tac Toe with the AlphaZero method - my first work
gym openai mcts tictactoe reinforcement alphago alphago-zero alphazero
Language:Python 16
Cotix-AI / Deep-Think
🌲 We introduce LLM-UCT, an MCTS sampling framework specifically designed for large language models (LLMs), to address the unique challenges and requirements of such models.
alphago llm llm-reasoning llms mcts sampling test-time-scaling tree-search llm-mcts
Language:Python 15