vwxyzjn

Costa Huang's repositories

PPO-Implementation-Deep-Dive

DEPRECATED - please visit https://github.com/vwxyzjn/ppo-implementation-details

Language:Python41 2 1

a2c_is_a_special_case_of_ppo

A2C is a special case of PPO!

Language:PythonMIT17 4 2

vectorized-value-methods

[WIP] Vectorized architecture for value-based methods such as DQN and DDPG

Language:PythonMIT3 2 2

launcha

Launcha is a simple Docker-based cloud job launcher.

Language:Python1 20

validate-new-gym-mujoco-envs

Language:PythonMIT1 30

Arcade-Learning-Environment

The Arcade Learning Environment (ALE) -- a platform for AI research.

Language:C++GPL-2.0010

birthday

A Happy Birthday animation design in CSS3, HTML5

Language:CSS010

brax

Massively parallel rigidbody physics simulation on accelerator hardware.

Language:Jupyter NotebookApache-2.0010

composer

library of algorithms to speed up neural network training

Language:PythonNOASSERTION010

container-apps-store-api-microservice

Sample microservices solution using Azure Container Apps, Dapr, Cosmos DB, and Azure API Management

Language:ShellMIT010

draw.io

020

environment

Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research

Language:PythonMIT010

gym

A toolkit for developing and comparing reinforcement learning algorithms.

Language:PythonNOASSERTION010

gym-docs

Code for Gym documentation website

MIT010

gym-microrts-paper-sb3

RL agent to play μRTS with Stable-Baselines3

Language:Python010

gym-microrts-static-files

020

gym-robotics

Language:PythonNOASSERTION010

iclr-blog-track.github.io

Language:HTMLNOASSERTION010

incubator

Collection of in-progress libraries for entity neural networks.

Language:Python010

isort

A Python utility / library to sort imports.

Language:PythonMIT010

launcha-sb3-example

Language:Python020

MA-ALE2

Language:Python010

microrts-sb3

Language:Python020

minihack

MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research

Language:PythonApache-2.0010

MultiAgentObjectCollectorEnv

Language:Python010

nmmo-cleanrl-incubator

MIT020

PPO-Procgen-Reproduction

Language:Python030

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language:PythonMIT010

stable-baselines3-contrib

Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code

Language:PythonMIT010

tianshou

An elegant PyTorch deep reinforcement learning library.

Language:PythonMIT010