Costa Huang (vwxyzjn)

vwxyzjn

Geek Repo

Company:@huggingface

Location:Philadelphia, PA

Home Page:https://costa.sh

Twitter:@vwxyzjn

Github PK Tool:Github PK Tool

Costa Huang's repositories

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language:PythonLicense:NOASSERTIONStargazers:4531Issues:34Issues:170

ppo-implementation-details

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

Language:PythonLicense:NOASSERTIONStargazers:558Issues:3Issues:6

portwarden

Create Encrypted Backups of Your Bitwarden Vault with Attachments

Language:GoLicense:MITStargazers:549Issues:9Issues:28

invalid-action-masking

Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms

Language:PythonLicense:MITStargazers:124Issues:2Issues:3

gym-microrts-paper

The source code for the gym-microrts paper.

gym-pysc2

Gym wrapper for pysc2

Language:PythonLicense:MITStargazers:8Issues:3Issues:0
Language:PythonLicense:MITStargazers:7Issues:2Issues:0
Language:PythonLicense:MITStargazers:4Issues:2Issues:0
Language:JavaLicense:GPL-3.0Stargazers:3Issues:2Issues:0
Language:PythonLicense:MITStargazers:2Issues:3Issues:0

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:1Issues:1Issues:0
Language:PythonLicense:MITStargazers:1Issues:2Issues:0

awesome-reinforcement-learning-lib

GitHub's code repository is all you need

Stargazers:0Issues:1Issues:0

dragonfly

A modern replacement for Redis and Memcached

Language:C++License:NOASSERTIONStargazers:0Issues:1Issues:0
Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

entity-gym

Standard interface for entity based reinforcement learning environments.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

envpool

C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.

Language:C++License:Apache-2.0Stargazers:0Issues:1Issues:0

flax

Flax is a neural network library for JAX that is designed for flexibility.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

Gymnasium

A standard API for reinforcement learning and a diverse set of reference environments (formerly Gym)

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

hyperstate

Opinionated library for managing hyperparameters and mutable state of machine learning training systems.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

IsaacGymEnvs

Isaac Gym Reinforcement Learning Environments

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Stargazers:0Issues:2Issues:0

rl-experiments

Keeping track of RL experiments

License:Apache-2.0Stargazers:0Issues:1Issues:0

rl_games

RL implementations

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

rogue-net

Entity Gym compatible ragged batch transformer implementation.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

Shimmy

An API conversion tool for popular external reinforcement learning environments

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

torchbeast

A PyTorch Platform for Distributed RL

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

v23

Volume 23 of JMLR

Stargazers:0Issues:1Issues:0
Language:HTMLLicense:MITStargazers:0Issues:2Issues:1