openai / maddpg

Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

Home Page:https://arxiv.org/pdf/1706.02275.pdf

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

The code does not converged

sjq19960802 opened this issue · comments

I run the environment simple_spread_listener with the code and it does not converged. I haven't changed any code.

It seems the loss didn't backward?