KornbergFresnel

Ming Zhou's repositories

ModelRepo

Reproductions of some RL and multi-agent models

Language: Python · 35 4 2

MAgentRender

An interactive pygame client for MAgent

Language: Python · License: MIT · 4 20

multiagent-particle-envs

Forked from OpenAI and expanded with more scenarios.

Language: Python · License: MIT · 3 30

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language: Python · License: MIT · 1 20

dopamine

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Language: Jupyter Notebook · License: Apache-2.0 · 1 30

1M-agents-RL

A preliminary platform for up to 1 million reinforcement learning agents

Language: Python · 020

acme

A library of reinforcement learning components and agents

Language: Python · License: Apache-2.0 · 000

arxiv-vanity

Renders papers from Arxiv as responsive web pages so you don't have to squint at a PDF.

Language: Python · License: Apache-2.0 · 000

asynchronous_impala_PPO

Multi-Agent Deep Reinforcement Learning using Asynchronous & IMPALA Proximal Policy Optimization in PyTorch, with some explanation

License: GPL-3.0 · 000

d4pg-pytorch

PyTorch implementation of Distributed Distributional Deterministic Policy Gradients

Language: Python · 000

ELF

An End-To-End, Lightweight and Flexible Platform for Game Research

Language: C++ · License: NOASSERTION · 030

garage

A toolkit for reproducible reinforcement learning research

Language: Python · License: MIT · 010

grpc

The C-based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)

License: Apache-2.0 · 000

info_geometry

Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"

000

language-server-protocol

Defines a common protocol for language servers.

License: NOASSERTION · 010

lxc-gpu

Enjoy sharing computation resources in your laboratory with lxc-gpu!

Language: Shell · License: MIT · 000

malib

A Multi-agent Learning Framework

License: MIT · 000

mdp

Make it easy to specify simple MDPs that are compatible with the OpenAI Gym.

License: Apache-2.0 · 000

mini-AlphaStar

A mini-source reproduction of the AlphaStar program, an AI proposed by DeepMind to play StarCraft II.

License: Apache-2.0 · 000

phasic-policy-gradient

Code for the paper "Phasic Policy Gradient"

License: MIT · 000

proc-bridge

A lightweight socket-based IPC (Inter-Process Communication) protocol. (Supports Java and Python)

Language: Java · License: MIT · 000

rliable

Open-source library for reliable evaluation on reinforcement learning and machine learning benchmarks. See NeurIPS 2021 oral for details.

License: Apache-2.0 · 000

rllab

rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.

Language: Python · License: NOASSERTION · 000

sac

Soft Actor-Critic

Language: Python · License: NOASSERTION · 000

scalable_agent

A TensorFlow implementation of Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures.

License: Apache-2.0 · 000

seed_rl

SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.

License: Apache-2.0 · 000

Simulator

Efficient Large-Scale Fleet Management via Multi-Agent Deep Reinforcement Learning

Language: Python · 030

slide-template

A template for academic presentation slides at Apex Lab.

Language: TeX · 010

smac

SMAC: The StarCraft Multi-Agent Challenge

License: MIT · 000

stocBiO

Example code for the paper "Bilevel Optimization: Nonasymptotic Analysis and Faster Algorithms"

License: MIT · 000