XU Zhiwei (deligentfool)

deligentfool

Geek Repo

Company:Institute of Automation, Chinese Academy of Sciences

Location:Beijing, China

Home Page:http://xuleek.tech/

Github PK Tool:Github PK Tool

XU Zhiwei's repositories

dqn_zoo

The implement of all kinds of dqn reinforcement learning with Pytorch

Language:PythonStargazers:84Issues:2Issues:0

policy_based_RL

The implement of the policy gradient RL algorithm with pytorch

Language:PythonStargazers:35Issues:3Issues:0

GAIL_pytorch

The implement of GAIL with pytorch

Language:PythonStargazers:14Issues:2Issues:0

HAVEN

Codes for the paper "HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism"

Language:PythonLicense:Apache-2.0Stargazers:14Issues:2Issues:0

COLA

Codes for the paper "Consensus Learning for Cooperative Multi-Agent Reinforcement Learning"

Language:PythonLicense:Apache-2.0Stargazers:9Issues:3Issues:1

maddpg

Multi-Agent Deep Deterministic Policy Gradient implementation with pytorch

Language:PythonStargazers:9Issues:1Issues:0

mfrl_pytorch

Implementation of Mean Field Multi-Agent Reinforcement Learning in Pytorch

Language:PythonStargazers:7Issues:0Issues:0

SIDE

Codes for the paper "SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning"

Language:PythonLicense:Apache-2.0Stargazers:7Issues:3Issues:0

option_critic

An implement of option-critic architecture (HRL) with pytorch

Language:PythonStargazers:4Issues:2Issues:0

DRQN_pytorch

An implement of DRQN with pytorch.

Language:PythonStargazers:2Issues:2Issues:0

Hiro

The implement of HIRO based TD3 with pytorch

Language:PythonStargazers:2Issues:3Issues:0

MGAN

Codes for the paper "Learning to Coordinate via Multiple Graph Neural Networks"

Language:PythonLicense:Apache-2.0Stargazers:2Issues:3Issues:0

Population_based_training_pytorch

The implement of PBT with pytorch (for Reinforcement Learning)

Language:PythonStargazers:2Issues:1Issues:0

SVPG

A simple implementation of SVPG (Stein Variational Policy Gradient) by using pytorch

Language:PythonStargazers:2Issues:2Issues:0
Language:PythonLicense:MITStargazers:1Issues:0Issues:0

DeepNash

An attempt at implementing DeepNash

Language:PythonStargazers:1Issues:0Issues:0

IMPALA_pytorch

Implement of IMPALA (Importance Weighted Actor-Learner Architectures) with pytorch

Language:PythonStargazers:1Issues:2Issues:0

leduc_nfsp

The implement of Neural Fictitious Self Play with pytorch

Language:PythonStargazers:1Issues:2Issues:0

NFSP_lasertag

The implement of Neural Fictitious Self Play with pytorch

Progressive_Neural_Networks

Implement of Progressive_Neural_Networks for RL case with pytorch

Language:PythonStargazers:1Issues:2Issues:0
Stargazers:0Issues:2Issues:0

deligentfool.github.io

Personal website.

Language:HTMLStargazers:0Issues:2Issues:0

DGM_pytorch

Implementation of deep generative model with pytorch

Language:PythonStargazers:0Issues:2Issues:0

Evolution_Strategies

The implement of Evolution_Strategies (openAI) with pytorch

Language:PythonStargazers:0Issues:2Issues:0
Language:PythonStargazers:0Issues:2Issues:0
Language:PythonStargazers:0Issues:2Issues:0

optimization_algo

The implement of the four basic optimization algorithms with python (sympy package)

Language:PythonStargazers:0Issues:2Issues:0

Pointer_Network_pytorch

The implement of Pointer Network with Pytorch

Language:PythonStargazers:0Issues:2Issues:0

UNREAL_pytorch

The implement of UNREAL reinforcement learning algorithm with pytorch

Language:PythonStargazers:0Issues:1Issues:0

vime_pytorch

The implement of VIME (a curiosity intrinsic reward algorithm for RL) with pytorch

Language:PythonStargazers:0Issues:2Issues:0