RuanJingqing's starred repositories
multi-agent-emergence-environments
Environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"
option-critic-pytorch
PyTorch implementation of the Option-Critic framework, Harb et al. 2016
Conventions-ModularPolicy
PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021
noisy-mappo
Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)
fast_pytorch_kmeans
This is a pytorch implementation of k-means clustering algorithm
sequential_social_dilemma_games
Repo for reproduction of sequential social dilemmas
rl-baselines-zoo
A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.
CASEC-MACO-benchmark
Codes accompanying the paper "Context-Aware Sparse Deep Coordination Graphs (https://arxiv.org/abs/2106.02886).
Reinforcement-Learning-of-Spatio-Temporal-Point-Processes
A general framework for learning spatio-temporal point processes via reinforcement learning
Learning-Temporal-Point-Processes-via-Reinforcement-Learning
PPG (Point Process Generator) is a Reinforcement Learning framework that is able to produce actions by imitating expert sequences.
Learning-Point-Processes-Via-Reinforcement-Learning
code of paper "Learning Temporal Point Processes Via Reinforcement Learning ", NeurIPS 2018
torch-neuralpointprocess
(Pytorch ver) Code for "Fully Neural Network based Model for General Temporal Point Process"
LOLA-pytorch
Implementing the Learning with Opponent Learning Awareness paper (https://blog.openai.com/learning-to-model-other-minds/)
CARE-SMAC-MA_SAC
Multi-task Multi-agent Soft Actor Critic for SMAC
releasing-research-code
Tips for releasing research code in Machine Learning (with official NeurIPS 2020 recommendations)
prioritized_option_critic
Implementation of the Prioritized Option-Critic on the Four-Rooms Environment
Deep-Reinforcement-Learning-Algorithms
This is a reconstruction of previous repository(rl-algorithms).
PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch