Costa Huang (vwxyzjn)

vwxyzjn

Geek Repo

Company:Drexel University

Location:Philadelphia, PA

Home Page:https://costa.sh

Twitter:@vwxyzjn

Github PK Tool:Github PK Tool

ezoic increase your site revenue

Costa Huang's repositories

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language:PythonLicense:NOASSERTIONStargazers:1195Issues:20Issues:81

portwarden

Create Encrypted Backups of Your Bitwarden Vault with Attachments

Language:GoLicense:MITStargazers:388Issues:8Issues:24

ppo-implementation-details

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

Language:PythonLicense:NOASSERTIONStargazers:110Issues:0Issues:0

invalid-action-masking

Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms

Language:PythonLicense:MITStargazers:55Issues:0Issues:0

PPO-Implementation-Deep-Dive

DEPRECATED - please visit https://github.com/vwxyzjn/ppo-implementation-details

gym-microrts-paper

The source code for the gym-microrts paper.

Language:PythonStargazers:26Issues:0Issues:0

a2c_is_a_special_case_of_ppo

A2C is a special case of PPO!

Language:PythonLicense:MITStargazers:13Issues:0Issues:0

gym-pysc2

Gym wrapper for pysc2

Language:PythonLicense:MITStargazers:8Issues:2Issues:0
Language:PythonLicense:MITStargazers:5Issues:0Issues:0

vectorized-value-methods

[WIP] Vectorized architecture for value-based methods such as DQN and DDPG

Language:PythonLicense:MITStargazers:4Issues:0Issues:0
Language:JavaLicense:GPL-3.0Stargazers:3Issues:0Issues:0
Language:PythonLicense:MITStargazers:1Issues:0Issues:0
Language:PythonLicense:MITStargazers:1Issues:0Issues:0
License:MITStargazers:1Issues:0Issues:0

composer

library of algorithms to speed up neural network training

License:NOASSERTIONStargazers:0Issues:0Issues:0
License:NOASSERTIONStargazers:0Issues:0Issues:0

enn-zoo

Collection of entity-gym bindings for different reinforcement learning environments.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

entity-gym

Standard interface for entity based reinforcement learning environments.

License:NOASSERTIONStargazers:0Issues:0Issues:0

environment

Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

gym-docs

Code for Gym documentation website

License:MITStargazers:0Issues:0Issues:0

hyperstate

Opinionated library for managing hyperparameters and mutable state of machine learning training systems.

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:HTMLLicense:NOASSERTIONStargazers:0Issues:0Issues:0

IsaacGymEnvs

Isaac Gym Reinforcement Learning Environments

License:NOASSERTIONStargazers:0Issues:0Issues:0

jaxrl

JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.

License:MITStargazers:0Issues:0Issues:0

launcha

Launcha is a simple Docker-based cloud job launcher.

Language:PythonStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

rl_games

RL implementations

License:MITStargazers:0Issues:0Issues:0

rogue-net

Entity Gym compatible ragged batch transformer implementation.

License:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0