Beast code in Giters

nalgae73's repositories

PPO-for-Beginners

A simple and well-styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.

MIT

AlphaZero-Ultimate-TicTacToe

An AlphaZero Implementation of Ultimate Tic-Tac-Toe (with GUI) (since my Git LFS is out of quota, I have to push them without the commits)

GPL-3.0

PPO-Continuous-Pytorch

A clean and robust PyTorch implementation of PPO on continuous action spaces.

MIT

ppo-implementation-details

The source code for the blog post "The 37 Implementation Details of Proximal Policy Optimization"

NOASSERTION

smtm

It's a game to get money

MIT

reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

MIT

AlphaZero

AlphaZeroSimple

The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with

MIT

DRL-code-pytorch

Concise PyTorch implementations of DRL algorithms, including REINFORCE, A2C, DQN, PPO (discrete and continuous), DDPG, TD3, and SAC.

MIT

alpha-zero

Minimal AlphaZero in PyTorch, trained on Connect4 on a 6x6 board.

MIT

Reversi_with_Simple_AlphaZero

MIT

PPO_continuous

Most Simple, Works Well

RL

RL algorithm implementations from scratch.

MIT

ac-ppo

Actor-Critic and OpenAI clipped PPO in the Gym CartPole-v0 and Pendulum-v0 environments

Sutton-and-Barto-Reinforcement-Learning-An-Introduction

Code and solutions to exercises from the book Reinforcement Learning: An Introduction by Sutton and Barto

ultimate_tic-tac-toe_alphazero-in-keras

I used the AlphaZero algorithm to make a bot that plays ultimate tic-tac-toe.

Sutton-and-Barto-reinforcement_learning_an_introduction

Summary (in Korean) and Python implementation of 'Reinforcement Learning: An Introduction' by Sutton & Barto
