Ming Zhou's repositories
MAgentRender
an interactive pygame client for MAgent
multiagent-particle-envs
Forked from openai, and expand it with more scenarios.
1M-agents-RL
A preliminary platform for up to 1 million reinforcement learning agents
acme
A library of reinforcement learning components and agents
arxiv-vanity
Renders papers from Arxiv as responsive web pages so you don't have to squint at a PDF.
asynchronous_impala_PPO
Multi-Agent Deep Reinforcement Learning by using Asynchronous & Impala Proximal Policy Optimization in Pytorch with some explanation
d4pg-pytorch
PyTorch implementation of Distributed Distributional Deterministic Policy Gradients
grpc
The C based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)
info_geometry
Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"
language-server-protocol
Defines a common protocol for language servers.
lxc-gpu
Enjoy computation resources sharing at your laboratory with lxc-gpu!
malib
A Multi-agent Learning Framework
mdp
Make it easy to specify simple MDPs that are compatible with the OpenAI Gym.
mini-AlphaStar
A mini-source reproduction code of the AlphaStar program which is an AI proposed by DeepMind to play StarCraft II.
phasic-policy-gradient
Code for the paper "Phasic Policy Gradient"
proc-bridge
A lightweight socket-based IPC (Inter-Process Communication) protocol. (Support Java and Python)
rliable
Open-source library for reliable evaluation on reinforcement learning and machine learning benchmarks. See NeurIPS 2021 oral for details.
rllab
rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.
sac
Soft Actor-Critic
scalable_agent
A TensorFlow implementation of Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures.
seed_rl
SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.
slide-template
A template for academic presentation slides in Apex Lab.
smac
SMAC: The StarCraft Multi-Agent Challenge
stocBiO
Example code for paper "Bilevel Optimization: Nonasymptotic Analysis and Faster Algorithms"