Lamperougeyxy

followers

following

stars

Beijing Jiaotong University

Xiaoyang Yu's repositories

bert

TensorFlow code and pre-trained models for BERT

Apache-2.0000

Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Language:PythonMIT000

DeepRL

Modularized Implementation of Deep RL Algorithms in PyTorch

Language:PythonMIT000

deeprl_network

multi-agent deep reinforcement learning for networked system control.

000

deeprl_signal_control

multi-agent deep reinforcement learning for large-scale traffic signal control.

MIT000

dreamer

Dream to Control: Learning Behaviors by Latent Imagination

Language:PythonApache-2.0010

dreamer-1

Dream to Control: Learning Behaviors by Latent Imagination

MIT000

EITI-EDTI

Codes accompanying the paper "Influence-Based Multi-Agent Exploration" (ICLR 2020 spotlight)

MIT000

ghostnet

[CVPR2020] Surpassing MobileNetV3: "GhostNet: More Features from Cheap Operations"

Language:PythonApache-2.0000

ghostnet.pytorch

[CVPR2020] Surpassing MobileNetV3: "GhostNet: More Features from Cheap Operations"

000

hierarchical-marl

Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery

Language:PythonNOASSERTION010

jmlr-style-file

LaTeX style file for the Journal of Machine Learning Research

Language:TeX010

maddpg

Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

MIT000

MAVEN

Submission for MAVEN: Multi-Agent Variational Exploration

000

mentalRL

Code for our AAMAS 2020 paper: "A Story of Two Streams: Reinforcement Learning Models from Human Behavior and Neuropsychiatry".

Language:Jupyter Notebook010

MPHRL

Model Primitive Hierarchical Reinforcement Learning

MIT000

NDQ

Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)

Apache-2.0000

pymoo

NSGA2, NSGA3, R-NSGA3, MOEAD, GA, DE,

Apache-2.0000

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

MIT000

PyTorch-RL

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Language:PythonMIT000

ray

A fast and simple framework for building and running distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

Apache-2.0000

Reinforcement-Learning-from-Hierarchical-Critics

Reinforcement Learning from Hierarchical Critics

000

RL-Papers

papers about reinforcement learning

010

ROMA

Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)

Language:PythonApache-2.0000

StarCraft

Implementations of QMIX, VDN, COMA, QTRAN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II

Language:Python010

transformers

🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.

Apache-2.0000

UnsupervisedAttentionMechanism

Code for our paper: "Unsupervised Attention Mechanism across Neural Network Layers".

Language:Jupyter Notebook010

VAE-Pytorch

000

vscode-rainbow-fart

一个在你编程时疯狂称赞你的 VSCode 扩展插件 | An VSCode extension that keeps giving you compliment while you are coding, it will checks the keywords of code to play suitable sounds.

Language:VueMIT010

ZOOpt

A python package of Zeroth-Order Optimization (ZOOpt)

MIT000