• Keine Ergebnisse gefunden

COOPERATIVE LINKER

N/A
N/A
Protected

Academic year: 2022

Aktie "COOPERATIVE LINKER"

Copied!
1
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Results and conclusions

The implementation of the LINKER Architecture and the PBEB in the case of the Barcelona DWN leads to a good solution where all the states are kept within limits with a cost 𝐽∆𝑢 of almost half (53.55%) of the centralized solution. Ten of seventeen (the 58.8%) tanks of the entire system could even follow the desirable reference (that was not mandatory). That means that the system accomplishes the objectives of keeping within the security levels and maintaining a smooth control better than to track the reference. It seems that with a more balanced partitioning the DWN performance could still improve.

This work shows how a Linker agent coordinates a

cooperative MAS

environment to seek a global optimum. The approach is applied to the Barcelona Drinking Water

Network (DWN)

administrated by AGBAR where the main problem was to coordinate the control of three different sectors of the network.

Each part has a local controller (M) to solve the local water demands, but it also has to cooperate with the other agents to satisfy the water demands of the whole network. The cooperative Linker agent (N) implemented, learns by using a Reinforcement Learning algorithm, called PlanningByExploration

Behaviour with penalization (Javalera et al., 2019), to converge towards an optimal (or suboptimal) value of each of the variables that connect the local agents. For the training and simulation of the Linker agents real historical data of the Barcelona DWN provided by AGBAR were used.

The Linker agents (N)

coordinates and optimizes a

cooperative

MAS environment to seek for a

global

optimum.

Ph.D. Valeria JAVALERA-RINCON. IIASA-CONACYT postdoc fellow

Ecosystems Services and Management Program and Advanced Systems Analysis programs . tel: +43(0) 2236 807 610. Email: Javalera@iiasa.ac.at

COOPERATIVE LINKER FOR THE DISTRIBUTED CONTROL OF THE BARCELONA DRINKING WATER

NETWORK

Valeria Javalera-Rincon, Vicenc Puig-Cayuela, Bernardo Morcego-Seix, Fernando Orduña-Cabrera

Keywords: Multi-Agent Systems, Large Scale Systems, Linkage of Models, Reinforcement Learning, Distributed Control, Water Networks, Large Scale Systems.

0 5 10 15 20 25

150 200 250 300 350 400 450

Agent 1 - output x1

0 5 10 15 20 25

300 400 500 600 700 800 900 1000

Agent 1 - output x2

0 5 10 15 20 25

0 500 1000 1500 2000 2500 3000 3500 4000

Agent 2 - output x3

0 5 10 15 20 25

0 500 1000 1500 2000 2500 3000 3500

Agent 2 - output x4

0 5 10 15 20 25

0 5000 10000 15000

Agent 2 - output x5

0 5 10 15 20 25

500 1000 1500 2000 2500 3000 3500

Agent 3 - output x6

0 5 10 15 20 25

1 2 3 4 5 6

7x 104 Agent 3 - output x7

0 5 10 15 20 25

0 2000 4000 6000 8000 10000 12000

Agent 3 - output x8

0 5 10 15 20 25

0 1000 2000 3000 4000 5000 6000 7000 8000

Agent 3 - output x9

0 5 10 15 20 25

2000 4000 6000 8000 10000 12000 14000 16000

Agent 3 - output x10

0 5 10 15 20 25

0 0.2 0.4 0.6 0.8 1 1.2 1.4

0 5 10 15 20 25

0 0.005 0.01 0.015 0.02 0.025

0 5 10 15 20 25

0 0.005 0.01 0.015 0.02 0.025 0.03 0.035

0 5 10 15 20 25

0 0.05 0.1 0.15 0.2 0.25

0 5 10 15 20 25

0 0.2 0.4 0.6 0.8 1 1.2 1.4

0 5 10 15 20 25

0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6

0 5 10 15 20 25

0 0.2 0.4 0.6 0.8 1 1.2 1.4

0 5 10 15 20 25

0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8

0 5 10 15 20 25

0 0.5 1 1.5 2 2.5 3

0 5 10 15 20 25

0 0.05 0.1 0.15 0.2 0.25 0.3 0.35

0 5 10 15 20 25

0 0.2 0.4 0.6 0.8 1 1.2 1.4

0 5 10 15 20 25

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7

Examples of simulations results of tank volume evolutions. From tank x1 to x10. Blue line represents LINKER solution and green line centralized MPC. Doted lines are min and max volumes of tanks and red line is a desired volume (not mandatory).

Evolution of some of the control actions applied by The LINKER (blue) and the centralized MPC (green) during simulation of figure 4. Max value (Red) and min value (Cyan)

The Barcelona (DWN) divided in to tree sectors (red, green and blue), then the Relation diagram shows two linker agents (N1 and N2) related to MPC agents M1.M2 and M3, one for each sector.

For a detailed explanation of the

algorithm applied for this case, please see the PanningByExploration behavior

(PBEB) at :

Javalera-Rincon, V., Cayuela, V., Seix, B.

and Orduña-Cabrera, F.

Reinforcement Learning Approach for Cooperative Control of Multi-Agent

Systems.

In Proceedings of the 11th International Conference on Agents and Artificial

Intelligence (ICAART 2019) - Volume 2,

pages 80-91. ISBN: 978-989-758-350-6

Referenzen

ÄHNLICHE DOKUMENTE

The design techniques for cooperative control schemes examined in this thesis are model- based, which means that a model of the agent dynamics is required to design the con-

Here is the heart of the matter of a political, social, or economic system's capacity to learn: It is the presence of such readily recommittable resources,

In this thesis, we study interpolation-based model order reduction techniques for large-scale linear, bilinear and quadratic-bilinear control systems..

Once the 513 sample points (parameter calibrations) have been generated by the NOLH sampling method these are used to generate corresponding sets of simulated time series using

prove a similar but rather involved small gain theorem for continuous time systems; their small gain condition is not at first sight equivalent to the one in [Tee05], but in

The TSA’s decision might be the best example for this, since the operational planning decision relies only on the number of requests and required capacity as specified by

This work addresses efficient algorithms for distributed top- k queries in wide-area networks where the index lists for the attribute values (or text terms) of a query are

because it operates an own cache or the client is a caching middlebox by itself which supports CREHMA, it must validate the signature value in the Validation-Signature header, as