initial-h's repositories
AlphaZero_Gomoku_MPI
An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku
FlappyBird_DQN_with_target_network
DQN with freezing target network in tensorflow on pygame FlappyBird
spaceShooter_DQN
DQN with target network for spaceshooter
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language:PythonMIT000
CORL
High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC
Language:PythonApache-2.0000
dreamerv2
Pytorch implementation of Dreamer-v2: Visual Model Based RL Algorithm.
Language:PythonMIT000
rl-rep
Representation Learning (RepL) Methods in Reinforcement Learning and Causal Inference
Language:Python000