Beast code in Giters

RuanJingqing's starred repositories

multi-agent-emergence-environments

Environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"

Language: Python, License: MIT, 165100

mappo

This is the official implementation of Multi-Agent PPO.

Language: Python, License: MIT, 9300

option-critic-pytorch

PyTorch implementation of the Option-Critic framework, Harb et al. 2016

Language: Python, 11700

Conventions-ModularPolicy

PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021

Language: Python, 1500

noisy-mappo

Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)

Language: Python, License: MIT, 5400

fast_pytorch_kmeans

This is a PyTorch implementation of the k-means clustering algorithm

Language: Python, License: MIT, 28500

HiTS

Code for the paper: Hierarchical Reinforcement Learning With Timed Subgoals, published at NeurIPS 2021

Language: Python, License: MIT, 3100

EFA-DWM

Language: Python, 500

sequential_social_dilemma_games

Repo for reproduction of sequential social dilemmas

Language: Python, License: MIT, 38700

rl-baselines-zoo

A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.

Language: Python, License: MIT, 113300

CASEC-MACO-benchmark

Code accompanying the paper "Context-Aware Sparse Deep Coordination Graphs" (https://arxiv.org/abs/2106.02886).

Language: Python, License: Apache-2.0, 1700

Reinforcement-Learning-of-Spatio-Temporal-Point-Processes

A general framework for learning spatio-temporal point processes via reinforcement learning

Language: Python, 2800

Learning-Temporal-Point-Processes-via-Reinforcement-Learning

PPG (Point Process Generator) is a Reinforcement Learning framework that is able to produce actions by imitating expert sequences.

Language: Python, 1300

Learning-Point-Processes-Via-Reinforcement-Learning

Code for the paper "Learning Temporal Point Processes Via Reinforcement Learning", NeurIPS 2018

Language: Python, 900

torch-neuralpointprocess

(PyTorch version) Code for "Fully Neural Network based Model for General Temporal Point Process"

Language: Python, 1800

LOLA-pytorch

Implementing the Learning with Opponent Learning Awareness paper (https://blog.openai.com/learning-to-model-other-minds/)

Language: Python, 1900

CARE-SMAC-MA_SAC

Multi-task Multi-agent Soft Actor Critic for SMAC

Language: Python, 1200

mtenv

MultiTask Environments for Reinforcement Learning.

Language: Python, License: MIT, 7400

mtrl

Multi Task RL Baselines

Language: Python, License: MIT, 22600

releasing-research-code

Tips for releasing research code in Machine Learning (with official NeurIPS 2020 recommendations)

License: MIT, 258600

prioritized_option_critic

Implementation of the Prioritized Option-Critic on the Four-Rooms Environment

Language: Python, 1500

Deep-Reinforcement-Learning-Algorithms

This is a reconstruction of the previous repository (rl-algorithms).

Language: Python, 700

PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Language: Python, License: MIT, 171900

SEQ-SCD

Language: Python, 1600

WeTS

A benchmark for the task of translation suggestion

Language: Mask, License: Unlicense, 5900

on-policy

License: MIT, 300

SMAC

StarCraft II Multi Agent Challenge: QMIX, COMA, LIIR, QTRAN, Central V, ROMA, RODE, DOP, Graph MIX

Language: Python, License: Apache-2.0, 6800

ElegantRL

Massively Parallel Deep Reinforcement Learning. 🔥

Language: Python, License: NOASSERTION, 373900

CityFlow

A Multi-Agent Reinforcement Learning Environment for Large Scale City Traffic Scenario

Language: C++, License: Apache-2.0, 79300

dcg

Language: Python, License: Apache-2.0, 7100