Costa Huang's repositories
PPO-Implementation-Deep-Dive
DEPRECATED - please visit https://github.com/vwxyzjn/ppo-implementation-details
a2c_is_a_special_case_of_ppo
A2C is a special case of PPO!
vectorized-value-methods
[WIP] Vectorized architecture for value-based methods such as DQN and DDPG
Arcade-Learning-Environment
The Arcade Learning Environment (ALE) -- a platform for AI research.
container-apps-store-api-microservice
Sample microservices solution using Azure Container Apps, Dapr, Cosmos DB, and Azure API Management
environment
Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research
gym-microrts-paper-sb3
RL agent to play μRTS with Stable-Baselines3
stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
stable-baselines3-contrib
Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code