Xiaoyang Yu's repositories
bert
TensorFlow code and pre-trained models for BERT
Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
DeepRL
Modularized Implementation of Deep RL Algorithms in PyTorch
deeprl_network
multi-agent deep reinforcement learning for networked system control.
deeprl_signal_control
multi-agent deep reinforcement learning for large-scale traffic signal control.
dreamer-1
Dream to Control: Learning Behaviors by Latent Imagination
EITI-EDTI
Codes accompanying the paper "Influence-Based Multi-Agent Exploration" (ICLR 2020 spotlight)
ghostnet
[CVPR2020] Surpassing MobileNetV3: "GhostNet: More Features from Cheap Operations"
ghostnet.pytorch
[CVPR2020] Surpassing MobileNetV3: "GhostNet: More Features from Cheap Operations"
hierarchical-marl
Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery
jmlr-style-file
LaTeX style file for the Journal of Machine Learning Research
maddpg
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
MAVEN
Submission for MAVEN: Multi-Agent Variational Exploration
MPHRL
Model Primitive Hierarchical Reinforcement Learning
NDQ
Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)
pymoo
NSGA2, NSGA3, R-NSGA3, MOEAD, GA, DE,
pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
ray
A fast and simple framework for building and running distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
Reinforcement-Learning-from-Hierarchical-Critics
Reinforcement Learning from Hierarchical Critics
ROMA
Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
UnsupervisedAttentionMechanism
Code for our paper: "Unsupervised Attention Mechanism across Neural Network Layers".
vscode-rainbow-fart
一个在你编程时疯狂称赞你的 VSCode 扩展插件 | An VSCode extension that keeps giving you compliment while you are coding, it will checks the keywords of code to play suitable sounds.
ZOOpt
A python package of Zeroth-Order Optimization (ZOOpt)