YinfengYu (yyf17)

Company: Xinjiang University

Location: Beijing, China

Home Page: https://yyf17.github.io/

YinfengYu's repositories

ROMA

Code accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020, https://arxiv.org/abs/2003.08039)

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

AI-QMIX

Code for "AI-QMIX: Attention and Imagination for Dynamic Multi-Agent Reinforcement Learning"

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

CollaQ

A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0

deeprl_signal_control

Multi-agent deep reinforcement learning for large-scale traffic signal control.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

DOP

Code accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (https://arxiv.org/abs/2007.12322)

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

emix

Energy-based Surprise Minimization for Multi-Agent Value Factorization

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

gitignore

A collection of useful .gitignore templates

License: CC0-1.0 | Stargazers: 0 | Issues: 0 | Issues: 0

habitat-sim

A flexible, high-performance 3D simulator for Embodied AI research.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

ICLR2021-OpenReviewData

Crawl & visualize ICLR papers and reviews.

Language: Jupyter Notebook | Stargazers: 0 | Issues: 1 | Issues: 0

img2latex-mathpix

An image-to-LaTeX tool built with the Mathpix OCR API and JavaFX

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

jps

Code for "Joint Policy Search for Collaborative Multi-agent Incomplete Information Games"

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

LICA

[NeurIPS 2020] PyTorch implementation of "Learning Implicit Credit Assignment for Cooperative Multi-Agent Reinforcement Learning"

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

maddpg

Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

MAVEN

Submission for MAVEN: Multi-Agent Variational Exploration

Stargazers: 0 | Issues: 0 | Issues: 0

multiagent-particle-envs

Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

NDQ

Code accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

QDPP

Multi-Agent Determinantal Q-Learning

Stargazers: 0 | Issues: 0 | Issues: 0

RelationalGraphLearning

[IROS20] Relational graph learning for crowd navigation

Stargazers: 0 | Issues: 0 | Issues: 0

RODE

Code accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks" (https://arxiv.org/abs/2010.01523). RODE is a scalable role-based multi-agent learning method that discovers roles by decomposing the joint action space according to action effects, establishing a new state of the art on the StarCraft multi-agent benchmark.

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Speech-Emotion-Recognition

Speech emotion recognition implemented in Keras (LSTM, CNN, SVM, MLP)

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

wqmix

Code for Weighted QMIX

Stargazers: 0 | Issues: 0 | Issues: 0