Brett Daley (brett-daley)

brett-daley

Geek Repo

Company:University of Alberta

Location:Edmonton, AB

Home Page:https://brett-daley.github.io/

Github PK Tool:Github PK Tool

Brett Daley's repositories

dqn-lambda

NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns.

Language:PythonLicense:MITStargazers:23Issues:4Issues:1

gym-classics

Classic environments for reinforcement learning and dynamic programming, implemented in OpenAI Gym and Gymnasium.

Language:PythonLicense:GPL-3.0Stargazers:19Issues:4Issues:4

fast-dqn

A concurrent/synchronized DQN implementation optimized for multi-CPU, single-GPU systems.

Language:PythonLicense:MITStargazers:8Issues:3Issues:0

stratified-experience-replay

Stratified Experience Replay. Correcting Multiplicity Bias in Off-Policy Deep Reinforcement Learning. AAMAS 2021.

Language:PythonStargazers:6Issues:3Issues:0
Language:HTMLLicense:MITStargazers:2Issues:2Issues:0

virtual-replay-cache

Virtual Replay Cache. A modified DQN(λ) implementation with a significantly reduced memory footprint.

Language:PythonLicense:MITStargazers:2Issues:3Issues:0
Language:PythonStargazers:1Issues:3Issues:0

averaging-nstep-returns

ICML 2024: Averaging n-step Returns Reduces Variance in Reinforcement Learning

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

expectigrad

A deep learning optimizer with reliable convergence. Supports Pytorch and TensorFlow 1 & 2.

Language:PythonLicense:MITStargazers:1Issues:3Issues:0

trajectory-aware-etraces

ICML 2023: Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning. https://arxiv.org/abs/2301.11321

Language:PythonLicense:MITStargazers:1Issues:4Issues:0

pfrl

PFRL: a PyTorch-based deep reinforcement learning library

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

recency-heuristic

RLC 2024: Demystifying the Recency Heuristic in Temporal-Difference Learning

Language:PythonStargazers:0Issues:0Issues:0