Costa Huang (vwxyzjn)

Company: @huggingface

Location: Philadelphia, PA

Home Page: https://costa.sh

Twitter: @vwxyzjn

Costa Huang's repositories

PPO-Implementation-Deep-Dive

DEPRECATED - please visit https://github.com/vwxyzjn/ppo-implementation-details

a2c_is_a_special_case_of_ppo

A2C is a special case of PPO!

Language: Python, License: MIT, Stargazers: 17, Issues: 4, Issues: 2

vectorized-value-methods

[WIP] Vectorized architecture for value-based methods such as DQN and DDPG

Language: Python, License: MIT, Stargazers: 3, Issues: 2, Issues: 2

launcha

Launcha is a simple Docker-based cloud job launcher.

Language: Python, Stargazers: 1, Issues: 2, Issues: 0
Language: Python, License: MIT, Stargazers: 1, Issues: 3, Issues: 0

Arcade-Learning-Environment

The Arcade Learning Environment (ALE) -- a platform for AI research.

Language: C++, License: GPL-2.0, Stargazers: 0, Issues: 1, Issues: 0

birthday

A Happy Birthday animation design in CSS3, HTML5

Language: CSS, Stargazers: 0, Issues: 1, Issues: 0

brax

Massively parallel rigidbody physics simulation on accelerator hardware.

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

composer

library of algorithms to speed up neural network training

Language: Python, License: NOASSERTION, Stargazers: 0, Issues: 1, Issues: 0

container-apps-store-api-microservice

Sample microservices solution using Azure Container Apps, Dapr, Cosmos DB, and Azure API Management

Language: Shell, License: MIT, Stargazers: 0, Issues: 1, Issues: 0
Stargazers: 0, Issues: 2, Issues: 0

environment

Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research

Language: Python, License: MIT, Stargazers: 0, Issues: 1, Issues: 0

gym

A toolkit for developing and comparing reinforcement learning algorithms.

Language: Python, License: NOASSERTION, Stargazers: 0, Issues: 1, Issues: 0

gym-docs

Code for Gym documentation website

License: MIT, Stargazers: 0, Issues: 1, Issues: 0

gym-microrts-paper-sb3

RL agent to play μRTS with Stable-Baselines3

Language: Python, Stargazers: 0, Issues: 1, Issues: 0
Language: Python, License: NOASSERTION, Stargazers: 0, Issues: 1, Issues: 0
Language: HTML, License: NOASSERTION, Stargazers: 0, Issues: 1, Issues: 0

incubator

Collection of in-progress libraries for entity neural networks.

Language: Python, Stargazers: 0, Issues: 1, Issues: 0

isort

A Python utility / library to sort imports.

Language: Python, License: MIT, Stargazers: 0, Issues: 1, Issues: 0
Language: Python, Stargazers: 0, Issues: 2, Issues: 0
Language: Python, Stargazers: 0, Issues: 1, Issues: 0
Language: Python, Stargazers: 0, Issues: 2, Issues: 0

minihack

MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0
Language: Python, Stargazers: 0, Issues: 1, Issues: 0
Language: Python, Stargazers: 0, Issues: 3, Issues: 0

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language: Python, License: MIT, Stargazers: 0, Issues: 1, Issues: 0

stable-baselines3-contrib

Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code

Language: Python, License: MIT, Stargazers: 0, Issues: 1, Issues: 0
Language: Python, Stargazers: 0, Issues: 2, Issues: 0

tianshou

An elegant PyTorch deep reinforcement learning library.

Language: Python, License: MIT, Stargazers: 0, Issues: 1, Issues: 0