RuanJingqing's starred repositories

multi-agent-emergence-environments

Environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"

Language:PythonLicense:MITStargazers:1651Issues:0Issues:0

mappo

This is the official implementation of Multi-Agent PPO.

Language:PythonLicense:MITStargazers:93Issues:0Issues:0

option-critic-pytorch

PyTorch implementation of the Option-Critic framework, Harb et al. 2016

Language:PythonStargazers:117Issues:0Issues:0

Conventions-ModularPolicy

PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021

Language:PythonStargazers:15Issues:0Issues:0

noisy-mappo

Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)

Language:PythonLicense:MITStargazers:54Issues:0Issues:0

fast_pytorch_kmeans

This is a pytorch implementation of k-means clustering algorithm

Language:PythonLicense:MITStargazers:285Issues:0Issues:0

HiTS

Code for the paper: Hierarchical Reinforcement Learning With Timed Subgoals, published at NeurIPS 2021

Language:PythonLicense:MITStargazers:31Issues:0Issues:0
Language:PythonStargazers:5Issues:0Issues:0

sequential_social_dilemma_games

Repo for reproduction of sequential social dilemmas

Language:PythonLicense:MITStargazers:387Issues:0Issues:0

rl-baselines-zoo

A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.

Language:PythonLicense:MITStargazers:1133Issues:0Issues:0

CASEC-MACO-benchmark

Codes accompanying the paper "Context-Aware Sparse Deep Coordination Graphs (https://arxiv.org/abs/2106.02886).

Language:PythonLicense:Apache-2.0Stargazers:17Issues:0Issues:0

Reinforcement-Learning-of-Spatio-Temporal-Point-Processes

A general framework for learning spatio-temporal point processes via reinforcement learning

Language:PythonStargazers:28Issues:0Issues:0

Learning-Temporal-Point-Processes-via-Reinforcement-Learning

PPG (Point Process Generator) is a Reinforcement Learning framework that is able to produce actions by imitating expert sequences.

Language:PythonStargazers:13Issues:0Issues:0

Learning-Point-Processes-Via-Reinforcement-Learning

code of paper "Learning Temporal Point Processes Via Reinforcement Learning ", NeurIPS 2018

Language:PythonStargazers:9Issues:0Issues:0

torch-neuralpointprocess

(Pytorch ver) Code for "Fully Neural Network based Model for General Temporal Point Process"

Language:PythonStargazers:18Issues:0Issues:0

LOLA-pytorch

Implementing the Learning with Opponent Learning Awareness paper (https://blog.openai.com/learning-to-model-other-minds/)

Language:PythonStargazers:19Issues:0Issues:0

CARE-SMAC-MA_SAC

Multi-task Multi-agent Soft Actor Critic for SMAC

Language:PythonStargazers:12Issues:0Issues:0

mtenv

MultiTask Environments for Reinforcement Learning.

Language:PythonLicense:MITStargazers:74Issues:0Issues:0

mtrl

Multi Task RL Baselines

Language:PythonLicense:MITStargazers:226Issues:0Issues:0

releasing-research-code

Tips for releasing research code in Machine Learning (with official NeurIPS 2020 recommendations)

License:MITStargazers:2586Issues:0Issues:0

prioritized_option_critic

Implementation of the Prioritized Option-Critic on the Four-Rooms Environment

Language:PythonStargazers:15Issues:0Issues:0

Deep-Reinforcement-Learning-Algorithms

This is a reconstruction of previous repository(rl-algorithms).

Language:PythonStargazers:7Issues:0Issues:0

PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Language:PythonLicense:MITStargazers:1719Issues:0Issues:0
Language:PythonStargazers:16Issues:0Issues:0

WeTS

A benchmark for the task of translation suggestion

Language:MaskLicense:UnlicenseStargazers:59Issues:0Issues:0
License:MITStargazers:3Issues:0Issues:0

SMAC

StarCraft II Multi Agent Challenge : QMIX, COMA, LIIR, QTRAN, Central V, ROMA, RODE, DOP, Graph MIX

Language:PythonLicense:Apache-2.0Stargazers:68Issues:0Issues:0

ElegantRL

Massively Parallel Deep Reinforcement Learning. 🔥

Language:PythonLicense:NOASSERTIONStargazers:3739Issues:0Issues:0

CityFlow

A Multi-Agent Reinforcement Learning Environment for Large Scale City Traffic Scenario

Language:C++License:Apache-2.0Stargazers:793Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:71Issues:0Issues:0