initial-h's repositories

AlphaZero_Gomoku_MPI

An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku

FlappyBird_DQN_with_target_network

DQN with freezing target network in tensorflow on pygame FlappyBird

Language:PythonStargazers:11Issues:2Issues:0

CEER

Replay Memory as An Empirical MDP: Combining Conservative Estimation with Experience Replay. ICLR 2023

Language:PythonLicense:MITStargazers:3Issues:1Issues:0

spaceShooter_DQN

DQN with target network for spaceshooter

Language:PythonStargazers:1Issues:0Issues:0

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

CORL

High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

dreamerv2

Pytorch implementation of Dreamer-v2: Visual Model Based RL Algorithm.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

rl-rep

Representation Learning (RepL) Methods in Reinforcement Learning and Causal Inference

Language:PythonStargazers:0Issues:0Issues:0