Ming Zhou (KornbergFresnel)

KornbergFresnel

Geek Repo

Company:Shanghai AI Lab

Location:Shanghai, China

Home Page:mingzak.com

Twitter:@mzhou_cs

Github PK Tool:Github PK Tool


Organizations
APEXLAB
apexrl
sjtu-marl

Ming Zhou's repositories

ModelRepo

reproduce some RL or Multi-Agent models

MAgentRender

an interactive pygame client for MAgent

Language:PythonLicense:MITStargazers:4Issues:2Issues:0

multiagent-particle-envs

Forked from openai, and expand it with more scenarios.

Language:PythonLicense:MITStargazers:3Issues:3Issues:0

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:1Issues:2Issues:0

dopamine

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1Issues:3Issues:0

1M-agents-RL

A preliminary platform for up to 1 million reinforcement learning agents

Language:PythonStargazers:0Issues:2Issues:0

acme

A library of reinforcement learning components and agents

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

arxiv-vanity

Renders papers from Arxiv as responsive web pages so you don't have to squint at a PDF.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

asynchronous_impala_PPO

Multi-Agent Deep Reinforcement Learning by using Asynchronous & Impala Proximal Policy Optimization in Pytorch with some explanation

License:GPL-3.0Stargazers:0Issues:0Issues:0

d4pg-pytorch

PyTorch implementation of Distributed Distributional Deterministic Policy Gradients

Language:PythonStargazers:0Issues:0Issues:0

ELF

An End-To-End, Lightweight and Flexible Platform for Game Research

Language:C++License:NOASSERTIONStargazers:0Issues:3Issues:0

garage

A toolkit for reproducible reinforcement learning research

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

grpc

The C based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)

License:Apache-2.0Stargazers:0Issues:0Issues:0

info_geometry

Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"

Stargazers:0Issues:0Issues:0

language-server-protocol

Defines a common protocol for language servers.

License:NOASSERTIONStargazers:0Issues:1Issues:0

lxc-gpu

Enjoy computation resources sharing at your laboratory with lxc-gpu!

Language:ShellLicense:MITStargazers:0Issues:0Issues:0

malib

A Multi-agent Learning Framework

License:MITStargazers:0Issues:0Issues:0

mdp

Make it easy to specify simple MDPs that are compatible with the OpenAI Gym.

License:Apache-2.0Stargazers:0Issues:0Issues:0

mini-AlphaStar

A mini-source reproduction code of the AlphaStar program which is an AI proposed by DeepMind to play StarCraft II.

License:Apache-2.0Stargazers:0Issues:0Issues:0

phasic-policy-gradient

Code for the paper "Phasic Policy Gradient"

License:MITStargazers:0Issues:0Issues:0

proc-bridge

A lightweight socket-based IPC (Inter-Process Communication) protocol. (Support Java and Python)

Language:JavaLicense:MITStargazers:0Issues:0Issues:0

rliable

Open-source library for reliable evaluation on reinforcement learning and machine learning benchmarks. See NeurIPS 2021 oral for details.

License:Apache-2.0Stargazers:0Issues:0Issues:0

rllab

rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

sac

Soft Actor-Critic

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

scalable_agent

A TensorFlow implementation of Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures.

License:Apache-2.0Stargazers:0Issues:0Issues:0

seed_rl

SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.

License:Apache-2.0Stargazers:0Issues:0Issues:0

Simulator

Efficient Large-Scale Fleet Management via Multi-Agent Deep Reinforcement Learning

Language:PythonStargazers:0Issues:3Issues:0

slide-template

A template for academic presentation slides in Apex Lab.

Language:TeXStargazers:0Issues:1Issues:0

smac

SMAC: The StarCraft Multi-Agent Challenge

License:MITStargazers:0Issues:0Issues:0

stocBiO

Example code for paper "Bilevel Optimization: Nonasymptotic Analysis and Faster Algorithms"

License:MITStargazers:0Issues:0Issues:0