nalgae73

nalgae73

Geek Repo

0

followers

0

following

Github PK Tool:Github PK Tool

nalgae73's repositories

PPO-for-Beginners

A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.

License:MITStargazers:0Issues:0Issues:0

AlphaZero-Ultimate-TicTacToe

An AlphaZero Implementation of Ultimate Tic-Tac-Toe (with GUI) (since my Git LFS is out of quota, I have to push them without the commits)

License:GPL-3.0Stargazers:0Issues:0Issues:0

PPO-Continuous-Pytorch

A clean and robust Pytorch implementation of PPO on continuous action space.

License:MITStargazers:0Issues:0Issues:0

ppo-implementation-details

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

License:NOASSERTIONStargazers:0Issues:0Issues:0

smtm

It's a game to get money

License:MITStargazers:0Issues:0Issues:0

reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

License:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

AlphaZeroSimple

The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with

License:MITStargazers:0Issues:0Issues:0

DRL-code-pytorch

Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.

License:MITStargazers:0Issues:0Issues:0

alpha-zero

Minimal AlphaZero in PyTorch, trained on Connect4 on a 6x6 board.

License:MITStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

PPO_continuous

Most Simple, Works Well

Stargazers:0Issues:0Issues:0

RL

RL algorithm implementations from scratch.

License:MITStargazers:0Issues:0Issues:0

ac-ppo

Actor-Critic and openAI clipped PPO in gym cartpole-v0 and pendulum-v0 environment

Stargazers:0Issues:0Issues:0

Sutton-and-Barto-Reinforcement-Learning-An-Introduction

Codes and solutions to exercises from the book Introduction to Reinforcement Learning by Sutton and Barto

Stargazers:0Issues:0Issues:0

ultimate_tic-tac-toe_alphazero-in-keras

I used the AlphaZero algorithm to make a bot that plays ultimate tic-tac-toe.

Stargazers:0Issues:0Issues:0

Sutton-and-Barto-reinforcement_learning_an_introduction

Summary (in Korean) and python implementation of 'Reinforcement Learning: An Introduction' written by Sutton & Barto

Stargazers:0Issues:0Issues:0