Xiaoyang Yu (Lamperougeyxy)

Company: Beijing Jiaotong University

Xiaoyang Yu's repositories

ai-deadlines

⏰ AI conference deadline countdowns

Language: HTML | Stargazers: 0 | Issues: 1 | Issues: 0

anonymous_github

Anonymous Github is a proxy server that supports anonymous browsing of GitHub repositories for open-science code and data.

Language: HTML | License: GPL-3.0 | Stargazers: 0 | Issues: 1 | Issues: 0

CDS

[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

CityFlow

A Multi-Agent Reinforcement Learning Environment for Large Scale City Traffic Scenario

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

DOP

Code accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

FinRL

FinRL: Financial Reinforcement Learning. 🔥

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

go-explore

Code for Go-Explore: a New Approach for Hard-Exploration Problems

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0

gps

Guided Policy Search

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0

handful-of-trials

Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

Homophily-MARL

Code for "Learning Homophilic Incentives in Sequential Social Dilemmas"

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

MADDPG-1

PyTorch implementation of the MARL algorithm MADDPG, which corresponds to the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments".

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

mbpo

Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

MPE-Multiagent-RL-Algos

Simple verification experiment code for multi-agent RL using the OpenAI MPE environment

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

multi-agent-PPO-on-SMAC

Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

multiagent-particle-envs

Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

pymarl2

Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

PyTorch-VAE

A Collection of Variational Autoencoders (VAE) in PyTorch.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

RODE

Code accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks" (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is a scalable role-based multi-agent learning method that discovers roles by decomposing the joint action space according to action effects, establishing a new state of the art on the StarCraft multi-agent benchmark.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

smac_plus

An open-source benchmark for Multi-Agent Reinforcement Learning

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0
Language: JavaScript | Stargazers: 0 | Issues: 2 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

visualboyadvance-m

The continuing development of the legendary VBA Game Boy Advance emulator.

Language: C++ | Stargazers: 0 | Issues: 1 | Issues: 0

wqmix

Code for Weighted QMIX

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0