nalgae73's repositories
PPO-for-Beginners
A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.
AlphaZero-Ultimate-TicTacToe
An AlphaZero Implementation of Ultimate Tic-Tac-Toe (with GUI) (since my Git LFS is out of quota, I have to push them without the commits)
PPO-Continuous-Pytorch
A clean and robust Pytorch implementation of PPO on continuous action space.
ppo-implementation-details
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
smtm
It's a game to get money
reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
AlphaZeroSimple
The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with
DRL-code-pytorch
Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.
alpha-zero
Minimal AlphaZero in PyTorch, trained on Connect4 on a 6x6 board.
PPO_continuous
Most Simple, Works Well
RL
RL algorithm implementations from scratch.
ac-ppo
Actor-Critic and openAI clipped PPO in gym cartpole-v0 and pendulum-v0 environment
Sutton-and-Barto-Reinforcement-Learning-An-Introduction
Codes and solutions to exercises from the book Introduction to Reinforcement Learning by Sutton and Barto
ultimate_tic-tac-toe_alphazero-in-keras
I used the AlphaZero algorithm to make a bot that plays ultimate tic-tac-toe.
Sutton-and-Barto-reinforcement_learning_an_introduction
Summary (in Korean) and python implementation of 'Reinforcement Learning: An Introduction' written by Sutton & Barto