There are 3 repositories under self-play topic.
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
A Massively Parallel Large Scale Self-Play Framework
A custom MARL (multi-agent reinforcement learning) environment where multiple agents trade against one another (self-play) in a zero-sum continuous double auction. Ray [RLlib] is used for training.
AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm" by DeepMind.
Train a neural network to PvP in Old School RuneScape using reinforcement learning.
The exact codes used by the team "liveinparis" at the kaggle football competition ranked 6th/1141
TD-Gammon implementation
Backgammon OpenAI Gym
This is the implementation of paper Model Free Episodic Control
A very fast implementation of AlphaZero, applied to games like Splendor, Santorini, The Little Prince, … Browser version available
AI agents for the bavarian card game Schafkopf trained with reinforcement learning
A Self Play reinforcement learning Agent learns to play TicTacToe using the ML-Agents Framework in Unity.
Using self-play, MCTS, and a deep neural network to create a hearthstone ai player
Self-Play Boxing Match made with Unity Machine Learning Agents
Code repository for On the interaction between supervision and self-play in emergent communication (ICLR 2020)
An implementation of the AlphaZero algorithm for adversarial games to be used with the machine learning framework of your choice
Emulator and AI of Shadowverse
My attempt to reproduce a water down version of PBT (Population based training) for MARL (Multi-agent reinforcement learning) using DDPPO (Decentralized & distributed proximal policy optimization) from ray[rllib].
Implementation of TD Gammon algorithm by Gerald Tesauro at IBM's Thomas J. Watson Research Center in Python.
Recreating Bill Seiler's 1985 version of Space War and training RL agents with Self-Play
Implementation of Alpha Go Zero - Reinforcement Learning Project, COL870 @iit-delhi
Implementation of an AlphaGo Zero paper in one C++ header file without any dependencies
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
A Smart Agent using reinforcement learning with CNN + MCTS to learn to play Othello/Reversi