clustering consensus pinning-control q-learning swarm swarm-intelligence swarm-robotics

Optimal Swarm Structures via Q-Learning

This project implements Q-learning to automatically assemble lattice structures composed of an arbitrary number of agents. The structure is optimized to ensure maximal coverage of landmarks. Each agent learns optimal parameters locally and negotiates consensus with its neighbours. The Q-table grows as new states are discovered and the exploitation rate increases exponentially with time.

Key theoretical developments:

The structure is assembled using consensus-based methods
Pinning control is used to guide the agents
As the agents converge and structure assembles, new pins are selected based on degree centrality
Q-learning is used to adjust the separation between agents
This local separation is what drives the global structure of the swarm
Agents negotiate local separation using a novel consensus-based approach

Mathematical formulations to follow. Here is a plot showing the learning progress:

Here is is an illustration of the agents assembling:

About

This project implements Q-learning to automatically assemble lattice structures composed of an arbitrary number of agents.

clustering consensus pinning-control q-learning swarm swarm-intelligence swarm-robotics

MIT License

Languages

Language:Python 100.0%