There are 70 repositories under multi-agent-reinforcement-learning topic.
AI enabled pair programmer for Claude, GPT, O Series, Grok, Deepseek, Gemini and 300+ models
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II
ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.
One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)
Multi-Agent Resource Optimization (MARO) platform is an instance of Reinforcement Learning as a Service (RaaS) for real-world resource optimization problems.
🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX
Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools, with 10x faster training through evolutionary hyperparameter optimization.
Unified Reinforcement Learning Framework
BenchMARL is a library for benchmarking Multi-Agent Reinforcement Learning (MARL). BenchMARL allows to quickly compare different MARL algorithms, tasks, and models while being systematically grounded in its two core tenets: reproducibility and standardization.
VMAS is a vectorized differentiable simulator designed for efficient Multi-Agent Reinforcement Learning benchmarking. It is comprised of a vectorized 2D physics engine written in PyTorch and a set of challenging multi-robot scenarios. Additional scenarios can be implemented through a simple and modular interface.
Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment
Multi-Agent Connected Autonomous Driving (MACAD) Gym environments for Deep RL. Code for the paper presented in the Machine Learning for Autonomous Driving Workshop at NeurIPS 2019:
A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm
Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch
📚 List of Top-tier Conference Papers on Reinforcement Learning (RL),including: NeurIPS, ICML, AAAI, IJCAI, AAMAS, ICLR, ICRA, etc.
A Framework for LLM-based Multi-Agent Reinforced Training and Inference
The purpose of this repository is to make prototypes as case study in the context of proof of concept(PoC) and research and development(R&D) that I have written in my website. The main research topics are Auto-Encoders in relation to the representation learning, the statistical machine learning for energy-based models, adversarial generation networks(GANs), Deep Reinforcement Learning such as Deep Q-Networks, semi-supervised learning, and neural network language model for natural language processing.
DI-engine docs (Chinese and English)
Deep Reinforcement Learning in C#
This repository is for an open-source environment for multi-agent active voltage control on power distribution networks (MAPDN).
PyTorch implements multi-agent reinforcement learning algorithms, including QMIX, Independent PPO, Centralized PPO, Grid Wise Control, Grid Wise Control+PPO, Grid Wise Control+DDPG.
[AAAI 2023] Official PyTorch implementation of paper "ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency".
Efficient Large-Scale Fleet Management via Multi-Agent Deep Reinforcement Learning
Lightweight multi-agent gridworld Gym environment
[ICCV 2025 Highlights] Large-scale photo-realistic virtual worlds for embodied AI
Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).
Datasets with baselines for Offline MARL.
Tutorial4RL: Tutorial for Reinforcement Learning. 强化学习入门教程.
Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning
A custom MARL (multi-agent reinforcement learning) environment where multiple agents trade against one another (self-play) in a zero-sum continuous double auction. Ray [RLlib] is used for training.