gitting-guud / GML_Project

Semi-bandits for Decentralized Optimal Transport on a Graph

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

GML_Project

Semi-bandits for Decentralized Optimal Transport on a Graph

By KILANI AL HOUCEINE and NAOUMI SALMANE, under the supervision of Mr SEZNEC Julien

In this project, we consider a mass (e.g. people) transporting on a resistive graph.From a global perspective, a central planner might be interested in choosing eachroute to minimize the global cost (quadratic in each edge flow). The constraintthat each unit of mass starts from its starting node and successfully reaches itsendpoint can be formulated as a linear constraint. Hence, this is a standard convexoptimization problem. Nonetheless, in a decentralized setting, each player willchoose its route to minimize its local cost (linear in the flow of each visited edge).To do so, players are not allowed to communicate and are hence doomed to tryroutes sequentially. Moreover, at each try, all the players choose a route and onlyobserve the flow in the visited edges (semi-bandit feedback). The cost of a route istherefore dependent on the other players actions. Hence, rewards are not stochastic,but probably not fully adversarial as well.

For the agents :

For the environments, we implemented the following graphs :

  • Random Sparse Graph : a fully connected graph where we discard randomly a fraction of the edges.
  • OW Network
  • Sioux Falls network

We perform comparisons by launching the agents on the environments and get figures and images

About

Semi-bandits for Decentralized Optimal Transport on a Graph


Languages

Language:Jupyter Notebook 99.8%Language:Python 0.2%