Xiaoyang Yu (Lamperougeyxy)

Lamperougeyxy

Geek Repo

Company:Beijing Jiaotong University

Github PK Tool:Github PK Tool

Xiaoyang Yu's repositories

bert

TensorFlow code and pre-trained models for BERT

License:Apache-2.0Stargazers:0Issues:0Issues:0

Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

DeepRL

Modularized Implementation of Deep RL Algorithms in PyTorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

deeprl_network

multi-agent deep reinforcement learning for networked system control.

Stargazers:0Issues:0Issues:0

deeprl_signal_control

multi-agent deep reinforcement learning for large-scale traffic signal control.

License:MITStargazers:0Issues:0Issues:0

dreamer

Dream to Control: Learning Behaviors by Latent Imagination

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

dreamer-1

Dream to Control: Learning Behaviors by Latent Imagination

License:MITStargazers:0Issues:0Issues:0

EITI-EDTI

Codes accompanying the paper "Influence-Based Multi-Agent Exploration" (ICLR 2020 spotlight)

License:MITStargazers:0Issues:0Issues:0

ghostnet

[CVPR2020] Surpassing MobileNetV3: "GhostNet: More Features from Cheap Operations"

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

ghostnet.pytorch

[CVPR2020] Surpassing MobileNetV3: "GhostNet: More Features from Cheap Operations"

Stargazers:0Issues:0Issues:0

hierarchical-marl

Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

jmlr-style-file

LaTeX style file for the Journal of Machine Learning Research

Language:TeXStargazers:0Issues:1Issues:0

maddpg

Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

License:MITStargazers:0Issues:0Issues:0

MAVEN

Submission for MAVEN: Multi-Agent Variational Exploration

Stargazers:0Issues:0Issues:0

mentalRL

Code for our AAMAS 2020 paper: "A Story of Two Streams: Reinforcement Learning Models from Human Behavior and Neuropsychiatry".

Language:Jupyter NotebookStargazers:0Issues:1Issues:0

MPHRL

Model Primitive Hierarchical Reinforcement Learning

License:MITStargazers:0Issues:0Issues:0

NDQ

Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)

License:Apache-2.0Stargazers:0Issues:0Issues:0

pymoo

NSGA2, NSGA3, R-NSGA3, MOEAD, GA, DE,

License:Apache-2.0Stargazers:0Issues:0Issues:0

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

License:MITStargazers:0Issues:0Issues:0

PyTorch-RL

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

ray

A fast and simple framework for building and running distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

License:Apache-2.0Stargazers:0Issues:0Issues:0

Reinforcement-Learning-from-Hierarchical-Critics

Reinforcement Learning from Hierarchical Critics

Stargazers:0Issues:0Issues:0

RL-Papers

papers about reinforcement learning

Stargazers:0Issues:1Issues:0

ROMA

Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

StarCraft

Implementations of QMIX, VDN, COMA, QTRAN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II

Language:PythonStargazers:0Issues:1Issues:0

transformers

🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.

License:Apache-2.0Stargazers:0Issues:0Issues:0

UnsupervisedAttentionMechanism

Code for our paper: "Unsupervised Attention Mechanism across Neural Network Layers".

Language:Jupyter NotebookStargazers:0Issues:1Issues:0
Stargazers:0Issues:0Issues:0

vscode-rainbow-fart

一个在你编程时疯狂称赞你的 VSCode 扩展插件 | An VSCode extension that keeps giving you compliment while you are coding, it will checks the keywords of code to play suitable sounds.

Language:VueLicense:MITStargazers:0Issues:1Issues:0

ZOOpt

A python package of Zeroth-Order Optimization (ZOOpt)

License:MITStargazers:0Issues:0Issues:0