Costa Huang's repositories
portwarden
Create Encrypted Backups of Your Bitwarden Vault with Attachments
ppo-implementation-details
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
invalid-action-masking
Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
PPO-Implementation-Deep-Dive
DEPRECATED - please visit https://github.com/vwxyzjn/ppo-implementation-details
gym-microrts-paper
The source code for the gym-microrts paper.
a2c_is_a_special_case_of_ppo
A2C is a special case of PPO!
vectorized-value-methods
[WIP] Vectorized architecture for value-based methods such as DQN and DDPG
composer
library of algorithms to speed up neural network training
enn-zoo
Collection of entity-gym bindings for different reinforcement learning environments.
entity-gym
Standard interface for entity based reinforcement learning environments.
environment
Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research
gym-docs
Code for Gym documentation website
hyperstate
Opinionated library for managing hyperparameters and mutable state of machine learning training systems.
IsaacGymEnvs
Isaac Gym Reinforcement Learning Environments
jaxrl
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
launcha
Launcha is a simple Docker-based cloud job launcher.
minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
rl_games
RL implementations
rogue-net
Entity Gym compatible ragged batch transformer implementation.