Beast code in Giters

longzh211's repositories

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language:PythonNOASSERTION000

DeepSpeedExamples

Example models using DeepSpeed

Language:PythonApache-2.0000

how-to-autorl

Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to efficiently tune RL hyperparameters.

Language:PythonApache-2.0000

LA3P

Actor Prioritized Experience Replay

Language:PythonMIT000

seed_rl

SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.

Language:PythonApache-2.0000

SimTPR

Code for the paper "On the Importance of Feature Decorrelation for Unsupervised Representation Learning for RL" (ICML 2023)

Language:Python000