Wenshuai Zhao's repositories

MSSU-Net

This code is for the paper "multi-scale supervised 3D U-Net for kidneys and kidney tumor segmentation".

NoisyKukaReacher

A modified environment for robotic arm reaching based on Pybullet Kuka Arm Grasping

Language:PythonStargazers:6Issues:2Issues:0

mappo

This is the official implementation of Multi-Agent PPO (MAPPO).

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

optimappo

This is the code for Optimistic Multi-Agent Policy Gradient.

Language:PythonStargazers:0Issues:1Issues:0
Stargazers:0Issues:1Issues:0

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:1Issues:0