initial-h

initial-h's repositories

An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku

Language:Python184 10 47

DQN with freezing target network in tensorflow on pygame FlappyBird

Language:Python11 20

Replay Memory as An Empirical MDP: Combining Conservative Estimation with Experience Replay. ICLR 2023

Language:PythonMIT3 10

200

DQN with target network for spaceshooter

Language:Python100

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonMIT000

High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC

Language:PythonApache-2.0000

Pytorch implementation of Dreamer-v2: Visual Model Based RL Algorithm.

Language:PythonMIT000

Representation Learning (RepL) Methods in Reinforcement Learning and Causal Inference

Language:Python000