longzh211's repositories
cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language:PythonNOASSERTION000
DeepSpeedExamples
Example models using DeepSpeed
Language:PythonApache-2.0000
how-to-autorl
Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to efficiently tune RL hyperparameters.
Language:PythonApache-2.0000
LA3P
Actor Prioritized Experience Replay
Language:PythonMIT000
seed_rl
SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.
Language:PythonApache-2.0000
SimTPR
Code for the paper "On the Importance of Feature Decorrelation for Unsupervised Representation Learning for RL" (ICML 2023)
Language:Python000