Multi-Agent Reinforcement algorithms with Particle Environment (OpenAI) using pytorch

ddpg.py: bidirectional LSTM actor + LSTM critic + DDPG
model_ddpg.py: bidirectional LSTM actor + LSTM critic + DDPG + estimate next_state + estimate reward <estimate next_state + estimate reward> is a combined method between model-free RL and model-based RL.
It shows significantly improved performance in particle env. simple_spread scenarios
model_rdpg.py: bidirectional LSTM actor + LSTM critic + RDPG (recurrent DPG) + estimate next_state + estimate reward This algorithm currently shows degraded performance than others in particle env. simple_spread scenarios.

About

Multi-Agent Reinforcement Learning with Particle Env. (on going)

GNU General Public License v3.0

Language:Python 100.0%