
AIMS' Journals
Volume X, Number 0X, XX 200X, pp. X–XX

MULTIOBJECTIVE MODEL PREDICTIVE CONTROL FOR STABILIZING COST CRITERIA

Lars Grüne
Chair of Applied Mathematics, Department of Mathematics
University of Bayreuth, 95440 Bayreuth, Germany

Marleen Stieler
Chair of Applied Mathematics, Department of Mathematics
University of Bayreuth, 95440 Bayreuth, Germany

Abstract. In this paper we demonstrate how multiobjective optimal control problems can be solved by means of model predictive control. For our analysis we restrict ourselves to finite-dimensional control systems in discrete time. We show that convergence of the MPC closed-loop trajectory as well as upper bounds on the closed-loop performance for all objectives can be established if the 'right' Pareto-optimal control sequence is chosen in the iterations. It turns out that approximating the whole Pareto front is not necessary for that choice. Moreover, we provide statements on the relation of the MPC performance to the values of Pareto-optimal solutions on the infinite horizon, i.e. we investigate the infinite-horizon optimality of our MPC controller.

1. Introduction. In optimal control, it is a natural idea that not only one but multiple objectives have to be optimized, see e.g. [16]. This inevitably leads to the formulation of a multiobjective (MO) optimal control problem (OCP). For optimal control problems on infinite or indefinitely long horizons, model predictive control (MPC) has by now emerged as one of the most successful algorithmic approaches [7,19]. In MPC, the optimal control problem is solved successively on smaller, moving time horizons. It is not surprising that the connection between multiobjective optimal control and MPC has attracted the attention of many researchers.

The first question to consider is how to deal with the occurring MO optimization problem in each step of the MPC scheme. A first, easy-to-apply method is to define a weighted sum of all objectives, such that the MO optimization problem in the MPC iterations is transformed into a standard single-objective optimization problem, see e.g. [15,19,21] or [6] (in a distributed MPC framework). This strategy is very appealing because the existing theory on MPC can directly be applied. An extension, which

2010 Mathematics Subject Classification. Primary: 93B52, 93C10, 93C55, 91A12; Secondary: 90C29.
Key words and phrases. Model Predictive Control, Cooperative Control, Feedback Synthesis, Nonlinear Systems, Multiobjective Optimization.
The authors are supported by DFG Grant Gr 1569/13-1.


yields comparable results, is the usage of time-varying weights in [1]. As in those approaches, the paper [13] also handles the MO optimization problems by defining a prioritization of objectives. This enables the authors to define a Lyapunov function and thus obtain asymptotic stability. The utopia-tracking approach in [23] is a no-preference method, and thus conceptually different from the previous references, yet the proofs also rely on defining a Lyapunov function.

The references just mentioned typically focus on asymptotic stability and efficient computation. However, a refined performance analysis is not carried out and is also not always possible, see [10]. Moreover, the presented approaches all rely on a specific method to solve the occurring MO optimization problems.

In the works [5,14] the whole Pareto front (the set of all solutions to the MO optimization problem) is approximated in each step of the MPC iteration and a solution is chosen subject to expert decisions (e.g. by a decision maker). To solve the MO optimization problems, neural networks and genetic algorithms are used. The idea of these approaches is to first gain precise insight into the problem and then make a decision. Convergence or performance of the MPC controller cannot be guaranteed.

In [11] the occurring MO optimization problem is interpreted as a game and solved by means of the Nash bargaining framework.

The aim of this paper is to present MPC schemes and conditions on the MO optimal control problem under which the MPC algorithm yields a closed-loop solution that approximates an infinite-horizon Pareto-optimal solution. We perform our analysis in the framework of stabilizing MPC problems, in which the cost functions penalize the distance to a desired equilibrium. The assumptions we impose are relatively straightforward extensions of assumptions that are well established in single-objective MPC. MPC schemes both with and without terminal conditions are covered. The results build upon and extend preliminary results from [9].

In our analysis we do not rely on a specific technique to solve MO optimization problems. Moreover, and in contrast to the references mentioned above, we provide individual performance estimates for all objectives. In particular, we prove that including an additional constraint in the MO optimization problem in each MPC iteration yields performance guarantees for all objectives and convergence of the MPC closed-loop trajectory. Consequently, approximating the whole Pareto front in the iterations is not necessary, which makes our approach well suited for real-time problems.

The paper is organized as follows: In Section 2 we introduce the problems we are considering along with basic definitions and properties from multiobjective optimization as well as a general MPC procedure. In Section 3 we show how multiobjective optimal control problems can be solved by means of MPC including terminal conditions; in Section 4 we move on to MPC without such terminal conditions. In both sections our theoretical findings are illustrated by a numerical example. Section 5 concludes the paper. Finally, some technical proofs for statements in Section 4 are given in Appendix A.

2. Setting and Basic Definitions. In this paper we consider nonlinear control systems in discrete time given by
\[
  x^+ = f(x,u), \qquad f:\mathbb{R}^n\times\mathbb{R}^m\to\mathbb{R}^n, \tag{1}
\]
which is short notation for $x(k+1) = f(x(k),u(k))$, with admissible state and control spaces $X\subseteq\mathbb{R}^n$ and $U\subseteq\mathbb{R}^m$. A solution of system (1) for a control sequence $\mathbf u = (u(0),\dots,u(K-1))\in U^K$ and initial value $x\in X$ is denoted by $x_{\mathbf u}(\cdot,x)$, or by $x(\cdot,x)$ if the respective control sequence is clear from the context. The initial value will also often be skipped.

For given stage costs $\ell_i : X\times U\to\mathbb{R}_{\ge 0}$, $i\in\{1,\dots,s\}$, and horizon $N\in\mathbb{N}$ we define the cost functionals
\[
  J_i^N(x,\mathbf u) := \sum_{k=0}^{N-1}\ell_i(x_{\mathbf u}(k,x),u(k)), \tag{2}
\]
which we aim to minimize with respect to $\mathbf u$ and along a solution of (1). Thus, we obtain the following multiobjective optimal control problem
\[
  \begin{aligned}
  \min_{\mathbf u}\;& \big(J_1^N(x,\mathbf u),\dots,J_s^N(x,\mathbf u)\big) =: J^N(x,\mathbf u)\\
  \text{s.t. } & x(k+1) = f(x(k),u(k)),\quad k=0,\dots,N-1,\\
  & x(k)\in X,\quad k=1,\dots,N,\\
  & \mathbf u\in U^N.
  \end{aligned} \tag{3}
\]
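For concreteness, the accumulation in (2) can be sketched in a few lines of code; the scalar dynamics, the two stage costs and the control sequence below are invented purely for this illustration and are not part of the paper's setting.

```python
# A 1-D illustration of the cost functionals J_i^N in (2). The scalar system f,
# the two stage costs and the horizon are toy assumptions for this sketch only.

def f(x, u):
    return 0.5 * x + u              # a stable toy system

def l1(x, u):
    return x * x                    # penalize the state

def l2(x, u):
    return u * u                    # penalize the control effort

def J(x0, u_seq, stage_cost):
    """Accumulate sum_{k=0}^{N-1} l(x_u(k, x0), u(k)) along x+ = f(x, u)."""
    x, total = x0, 0.0
    for u in u_seq:
        total += stage_cost(x, u)
        x = f(x, u)
    return total

u_seq = [0.0, 0.0, 0.0]             # N = 3, do-nothing control
print(J(1.0, u_seq, l1))            # 1 + 0.25 + 0.0625 = 1.3125
print(J(1.0, u_seq, l2))            # 0.0
```

Each objective is evaluated along the same trajectory of (1), which is exactly what makes the minimization in (3) a vector-valued problem.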

Due to the fact that (3) contains more than one cost functional, in general it is not possible to find an admissible control sequence $\mathbf u$ that minimizes all cost functionals simultaneously. The precise meaning of the "min" will be defined in Definition 2.1 below.

Control sequences $\mathbf u$ that satisfy the constraints in (3) are collected in the set $\mathbb U^N(x) = \{\mathbf u\in U^N \mid x(k+1)=f(x(k),u(k)),\ k=0,\dots,N-1,\ x(k)\in X,\ k=0,\dots,N\}$. Our setting can reflect different situations: either (1) is one system with multiple objectives to be minimized, or (1) is a collection of individual systems
\[
  x^+ = \begin{pmatrix} x_1^+\\ \vdots\\ x_p^+\end{pmatrix}
      = \begin{pmatrix} f_1(x,u)\\ \vdots\\ f_p(x,u)\end{pmatrix} =: f(x,u),
\]
with $f_i:\mathbb{R}^n\times\mathbb{R}^m\to\mathbb{R}^{n_i}$ and $n=\sum_{i=1}^p n_i$, $x_i\in\mathbb{R}^{n_i}$, where each system has at least one cost criterion $\ell_i$ (i.e. $s\ge p$).

By means of the MO OCP (3) we can now generate a feedback law $\mu_N : X\to U$ using model predictive control (MPC), which consists of the following procedure:

Algorithm 1 (Basic MO MPC Algorithm).
1. At time $n\in\mathbb N$ measure the state of the system $x(n)$.
2. Solve (3) with initial value $x = x(n)$ and obtain $\mathbf u^{\star,N}\in\mathbb U^N(x(n))$.
3. Define $\mu_N(x(n)) := u^{\star,N}(0)$ and apply the feedback $\mu_N$ to the system, i.e., set $x(n+1) := f(x(n),\mu_N(x(n)))$. Set $n := n+1$ and go to 1.
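The receding-horizon loop of Algorithm 1 can be sketched as follows; here the Pareto-optimal choice in step 2 is realized by brute-force enumeration over a coarse control grid and a fixed weighted sum of the objectives, which is only one of many admissible selections. All concrete numbers are assumptions made for this sketch, not the paper's example.

```python
# Sketch of Algorithm 1 for a scalar toy system. Step 2 ("solve (3)") is
# replaced by enumeration over a control grid; a weighted sum with positive
# weights singles out one Pareto-optimal sequence.
from itertools import product

def f(x, u):
    return 0.5 * x + u

def costs(x0, u_seq):
    """Return (J_1^N, J_2^N) as in (2) for stage costs x^2 and u^2."""
    x, j1, j2 = x0, 0.0, 0.0
    for u in u_seq:
        j1, j2, x = j1 + x * x, j2 + u * u, f(x, u)
    return j1, j2

def mpc_step(x, N=3, grid=(-1.0, -0.5, 0.0, 0.5, 1.0)):
    # an equal-weight sum picks one Pareto-optimal sequence (step 2)
    best = min(product(grid, repeat=N), key=lambda u: sum(costs(x, u)))
    return best[0]                  # step 3: apply only the first control

x = 2.0
for _ in range(10):                 # closed loop: measure, solve, apply, repeat
    x = f(x, mpc_step(x))
print(abs(x) < 0.1)                 # the closed loop steers x toward 0
```

The question which Pareto-optimal sequence to pick in step 2, here answered arbitrarily by the weighted sum, is exactly the issue the paper addresses in Sections 3 and 4.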

Now we introduce the optimality notion used throughout this paper.

Definition 2.1 (Pareto Optimality, Nondominated Point). A control sequence $\mathbf u^\star\in\mathbb U^N(x)$ is a Pareto-optimal (control) sequence (POS) to (3) of length $N$ for initial value $x\in X$ if there is no $\mathbf u\in\mathbb U^N(x)$ such that
\[
  \forall i\in\{1,\dots,s\}: J_i^N(x,\mathbf u)\le J_i^N(x,\mathbf u^\star)\quad\text{and}\quad
  \exists i\in\{1,\dots,s\}: J_i^N(x,\mathbf u) < J_i^N(x,\mathbf u^\star). \tag{4}
\]
The objective value $J^N(x,\mathbf u^\star) := (J_1^N(x,\mathbf u^\star),\dots,J_s^N(x,\mathbf u^\star))$ is called nondominated. The set of all POSs of length $N$ for initial value $x\in X$ will be denoted by $\mathbb U_P^N(x)$.
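On a finite set of candidate objective vectors, the dominance relation in (4) can be checked directly; the following filter keeps exactly the nondominated points in the sense of Definition 2.1. The sample points are made up, and a finite list is of course only a caricature of the continuum discussed next.

```python
# A minimal nondominance filter implementing the dominance test in (4):
# given finitely many objective vectors J^N(x, u), keep the nondominated ones.

def dominates(a, b):
    """a dominates b: a <= b componentwise with strict inequality somewhere."""
    return all(ai <= bi for ai, bi in zip(a, b)) and \
           any(ai < bi for ai, bi in zip(a, b))

def nondominated(points):
    return [p for p in points if not any(dominates(q, p) for q in points)]

pts = [(1.0, 4.0), (2.0, 2.0), (4.0, 1.0), (3.0, 3.0), (2.0, 2.0)]
print(nondominated(pts))   # (3.0, 3.0) is dominated by (2.0, 2.0)
```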

Usually, there is not only one Pareto-optimal solution to (3). It is rather typical that there exists a continuum of such solutions and thus of nondominated values, as shown in Figure 1 for the case of two objectives. The gray, dashed surface represents the set of admissible values $\mathcal J^N(x) := \{J^N(x,\mathbf u) = (J_1^N(x,\mathbf u),\dots,J_s^N(x,\mathbf u)) \mid \mathbf u\in\mathbb U^N(x)\}$, the black curve the set $\mathcal J_P^N(x) := \{(J_1^N(x,\mathbf u),J_2^N(x,\mathbf u)) \mid \mathbf u\in\mathbb U_P^N(x)\}$ of nondominated values. This set is often referred to as the efficient or nondominated set or Pareto front. Even though all points on the black curve are equally optimal in terms of the optimization problem (3), they are obviously not from each objective's point of view.

[Figure 1. Schematic illustration of a Pareto front for two objectives (axes $J_1$, $J_2$).]

Convention: In the course of this paper, the min-operator is defined as
\[
  \min_{\mathbf u\in\mathbb U^N(x)} J^N(x,\mathbf u) = \mathcal J_P^N(x)
  \quad\text{and, accordingly,}\quad
  \operatorname*{argmin}_{\mathbf u\in\mathbb U^N(x)} J^N(x,\mathbf u) = \mathbb U_P^N(x).
\]

Since only one POS can be applied to the system in step 3 of Algorithm 1, this naturally raises the question of how to choose among the Pareto-optimal solutions in step 2 of Algorithm 1. Our approaches to solving this problem will be presented in Sections 3 and 4.

We now provide basic definitions and relations from the theory of multiobjective optimization, adapted from [4,20] to our setting.

Definition 2.2 (External stability). The set $\mathcal J_P^N(x)$ is called externally stable if for all $j\in\mathcal J^N(x)\setminus\mathcal J_P^N(x)$ there is $j_P\in\mathcal J_P^N(x)$ such that $j\ge j_P$ holds componentwise.

Definition 2.3 (Cone-Compactness). The set $\mathcal J^N(x)$ is called $\mathbb R^s_{\ge 0}$-compact if for all $j\in\mathcal J^N(x)$ the set $(j-\mathbb R^s_{\ge 0})\cap\mathcal J^N(x)$ is compact.

Theorem 2.4. Let a horizon $N\in\mathbb N_{\ge 1}$ and an initial value $x\in X_N$ be given. If $\mathcal J^N(x)\ne\emptyset$ and $\mathcal J^N(x)$ is $\mathbb R^s_{\ge 0}$-compact, then the set $\mathcal J_P^N(x)$ is externally stable.

A proof of this theorem can be found in [4,20]. The next lemma provides easily checkable conditions for external stability, which are satisfied by our example in Sections 3 and 4.

Lemma 2.5. If $U$ is compact, $X$ is closed and $f$ and $\ell_i$ are continuous for all $i\in\{1,\dots,s\}$, then the conditions of Theorem 2.4 are fulfilled for all $x\in X$ and all $N\in\mathbb N$ satisfying $\mathbb U^N(x)\ne\emptyset$.

Proof. Let an initial value $x\in X$ and a horizon $N\in\mathbb N_{\ge 1}$ such that $\mathbb U^N(x)\ne\emptyset$ be given. This implies $\mathcal J^N(x)\ne\emptyset$. It was proven in [3] that (under the given assumptions) the set $\Delta$, which contains all feasible trajectories with respective control sequences $(x_{\mathbf u}(\cdot),\mathbf u)$, is a compact subset of
\[
  Z := \underbrace{\mathbb R^n\times\dots\times\mathbb R^n}_{N\text{ times}}\times\underbrace{\mathbb R^m\times\dots\times\mathbb R^m}_{N-1\text{ times}}.
\]
If we interpret $J^N$ as a function that maps from $Z$ to $\mathbb R^s$, compactness of $\mathcal J^N(x)$ can be concluded from compactness of $\Delta$ and continuity of the $\ell_i$. The cone-compactness required in Theorem 2.4 is an immediate consequence of the stronger property of compactness. ∎

The following classes of functions are used in our paper.

Definition 2.6 (Comparison functions).
\[
\begin{aligned}
  \mathcal L &:= \{\delta:\mathbb R_{\ge 0}\to\mathbb R_{\ge 0} \mid \delta \text{ continuous and decreasing with } \lim_{k\to\infty}\delta(k)=0\},\\
  \mathcal K &:= \{\alpha:\mathbb R_{\ge 0}\to\mathbb R_{\ge 0} \mid \alpha \text{ continuous, strictly increasing with } \alpha(0)=0\},\\
  \mathcal K_\infty &:= \{\alpha\in\mathcal K \mid \alpha \text{ unbounded}\},\\
  \mathcal{KL} &:= \{\beta:\mathbb R_{\ge 0}\times\mathbb R_{\ge 0}\to\mathbb R_{\ge 0} \mid \beta \text{ continuous},\ \beta(\cdot,t)\in\mathcal K,\ \beta(r,\cdot)\in\mathcal L\}.
\end{aligned}
\]
Furthermore, the following notions will be used: for $x\in X$ and $\varepsilon\in\mathbb R_{>0}$ we define the open and closed balls
\[
  \mathcal B_\varepsilon(x) := \{y\in X:\|y-x\|<\varepsilon\}
  \quad\text{and}\quad
  \overline{\mathcal B}_\varepsilon(x) := \{y\in X:\|y-x\|\le\varepsilon\}.
\]

In this paper we will be concerned with a setting that can be seen as a straightforward generalization of 'classical' or 'stabilizing' MPC schemes, given by cost functions satisfying the following assumption.

Assumption 2.7 ('Stabilizing' stage costs). 1. There is an equilibrium pair or steady state $(x^*,u^*)\in X\times U$, i.e., $f(x^*,u^*)=x^*$.
2. There are $\alpha_{\ell,i}\in\mathcal K_\infty$ such that all stage costs $\ell_i$, $i\in\{1,\dots,s\}$, satisfy
\[
  \min_{u\in U}\ell_i(x,u)\ge\alpha_{\ell,i}(\|x-x^*\|)\quad\forall x\in X.
\]

Assumption 2.7 requires that it is favourable for all objectives to steer the system to the same equilibrium. This includes the situation in which the objectives penalize the distance of components of the state to the equilibrium differently, i.e. conflict does not only come from possible constraints, but also from the cost functions.

3. Multiobjective Stabilizing MPC with Terminal Conditions. A standard way to ensure proper functioning of MPC schemes is to add appropriate terminal conditions, see [17] and the references therein, [7, Section 5] or [19]. In this section we analyze MPC schemes with such conditions, which are given by a terminal constraint set $X_0$ and terminal costs $F_i : X_0\to\mathbb R_{\ge 0}$. Thus, the problem we have to solve in the MPC iterations now reads
\[
  \begin{aligned}
  \min_{\mathbf u}\;& \big(J_1^N(x,\mathbf u),\dots,J_s^N(x,\mathbf u)\big) =: J^N(x,\mathbf u)\\
  \text{s.t. } & x(k+1)=f(x(k),u(k)),\quad k=0,\dots,N-1,\\
  & x(k)\in X,\quad k=1,\dots,N-1,\\
  & x(N)\in X_0\subseteq X,\quad \mathbf u\in U^N
  \end{aligned} \tag{5}
\]
for
\[
  J_i^N(x,\mathbf u) = \sum_{k=0}^{N-1}\ell_i(x(k,x),u(k)) + F_i(x(N)).
\]

Since the terminal constraint $x(N)\in X_0$ can generally not be satisfied from all initial values $x\in X$, we define the feasible set $X_N := \{x\in X \mid \exists\mathbf u\in U^N : x(k)\in X,\ k=1,\dots,N-1,\ x(N)\in X_0\}$, cf. [17] and the references therein, or [7, Definition 3.9] and [19, Section 2.3]. For $x\in X_N$ we define the set of admissible controls for the MO optimization problem (5) by $\mathbb U^N(x) := \{\mathbf u\in U^N \mid x(k+1)=f(x(k),u(k)),\ k=0,\dots,N-1,\ x(k)\in X,\ k=1,\dots,N-1,\ x(N)\in X_0\}$.

Assumption 3.1 (Terminal cost). We assume that $x^*$ from Assumption 2.7 is contained in $X_0$, that $F_i(x)\ge 0$ for all $i$ and all $x\in X_0$, and the existence of a local feedback $\kappa : X_0\to U$ satisfying $f(x,\kappa(x))\in X_0$ and
\[
  F_i(f(x,\kappa(x))) + \ell_i(x,\kappa(x)) \le F_i(x)\quad\forall x\in X_0,\ i\in\{1,\dots,s\}.
\]

Imposing Assumption 3.1 ensures that it is always possible to remain within the terminal constraint set $X_0$ and that the cost of this control action is bounded from above by the original terminal cost. The algorithm that we propose for this setting is as follows:

Algorithm 2 (MO MPC with terminal conditions).
(0) At time $n=0$: set $x(n) := x_0$ and choose a POS $\mathbf u^{\star,N}_{x_0}\in\mathbb U_P^N(x_0)$. Go to (2).
(1) Measure $x(n)$. Choose a POS $\mathbf u^{\star,N}_{x(n)}$ such that
\[
  J_i^N\big(x(n),\mathbf u^{\star,N}_{x(n)}\big) \le J_i^N\big(x(n),\mathbf u^N_{x(n)}\big)
\]
holds for all $i\in\{1,\dots,s\}$.
(2) For $x := x_{\mathbf u^{\star,N}_{x(n)}}(N,x(n))$ set
\[
  \mathbf u^N_{x(n+1)} := \big(u^{\star,N}_{x(n)}(1),\dots,u^{\star,N}_{x(n)}(N-1),\kappa(x)\big).
\]
(3) Apply the feedback $\mu_N(x(n)) := u^{\star,N}_{x(n)}(0)$, set $n := n+1$ and go to (1).
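The selection in step (1) can be pictured on a finite candidate set: only nondominated values lying componentwise below the bound given by the prolonged sequence are admissible, and any of them may be picked, e.g. by an arbitrary weighted sum. The candidate values and the bound below are invented for this sketch.

```python
# Finite-candidate illustration of step (1): keep the candidate objective
# vectors that satisfy J_i^N(x(n), u) <= J_i^N(x(n), u^N_{x(n)}) for all i,
# then select any one of them (here via an equal-weight sum).

def admissible(candidates, bound):
    return [J for J in candidates if all(Ji <= bi for Ji, bi in zip(J, bound))]

candidates = [(1.0, 6.0), (2.0, 4.0), (3.0, 2.5), (5.0, 1.5)]  # Pareto points
bound = (3.5, 4.5)            # value of the prolonged sequence u^N_{x(n)}
feas = admissible(candidates, bound)
choice = min(feas, key=sum)   # any admissible point is fine for the analysis
print(feas)                   # [(2.0, 4.0), (3.0, 2.5)]
print(choice)                 # (3.0, 2.5)
```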

Figure 2 visualizes the choice of the POS in step (1) of Algorithm 2. The bound resulting from $\mathbf u^N_{x(n)}$ is visualized by the black circle and determines the set of nondominated points on the red line that may be chosen, namely all points which are below and left of the black point. The basic idea (formalized in Lemma 3.2) is that the control sequence $\mathbf u^N_{x(n)}$ in step (2) is a POS of length $N-1$ prolonged by the local feedback from Assumption 3.1, and that the prolongation reduces the value of the objective functions. Our considerations in Section 2 moreover show that, under appropriate assumptions, there is a POS with smaller objective value than the prolonged sequence (for each $i$). This is formalized in the next lemma.

[Figure 2. Step (1) in Algorithm 2 (axes $J_1$, $J_2$; the point $J^N(x(n),\mathbf u^N_{x(n)})$ is marked).]

Lemma 3.2. If Assumption 3.1 holds and if there is $\mathbf u^{N-1}\in\mathbb U^{N-1}(x)$, $x\in X_N$, then there exists a sequence $\mathbf u^N\in\mathbb U^N(x)$ satisfying
\[
  J_i^N(x,\mathbf u^N)\le J_i^{N-1}(x,\mathbf u^{N-1})\quad\forall i\in\{1,\dots,s\}.
\]

Proof. We define $\mathbf u^N$ via $u^N(k) := u^{N-1}(k)$ for $k=0,\dots,N-2$ and $u^N(N-1) := \kappa(\bar x)$ from Assumption 3.1, where $\bar x := x_{\mathbf u^N}(N-1,x)$. Then $\mathbf u^N$ is feasible because $\mathbf u^{N-1}\in\mathbb U^{N-1}(x)$, and therefore $\bar x\in X_0$. Assumption 3.1 ensures feasibility of $\kappa(\bar x)$ and $f(\bar x,\kappa(\bar x))$. With the definition of $\mathbf u^N$ we obtain the estimates
\[
\begin{aligned}
  J_i^N(x,\mathbf u^N) &= \sum_{k=0}^{N-1}\ell_i(x_{\mathbf u^N}(k,x),u^N(k)) + F_i(x_{\mathbf u^N}(N,x))\\
  &= \sum_{k=0}^{N-2}\ell_i(x_{\mathbf u^N}(k,x),u^N(k)) + \ell_i(\bar x,\kappa(\bar x)) + F_i(f(\bar x,\kappa(\bar x)))\\
  &\le \sum_{k=0}^{N-2}\ell_i(x_{\mathbf u^{N-1}}(k,x),u^{N-1}(k)) + F_i(\bar x)\\
  &= J_i^{N-1}(x,\mathbf u^{N-1}). \qquad ∎
\end{aligned}
\]

We are now ready to give our main result on the performance of the MPC feedback on an infinite horizon.

Theorem 3.3 (MO MPC Performance Theorem). Consider a multiobjective optimal control problem with system dynamics (1), stage costs $\ell_i$, $i\in\{1,\dots,s\}$, and let $N\in\mathbb N_{\ge 2}$ and $x_0\in X_N$. Let Assumptions 2.7 and 3.1 hold and let the set $\mathcal J_P^N(x)$ be externally stable for each $x\in X_N$. Then, the MPC feedback $\mu_N : X\to U$ defined in Algorithm 2 renders the set $X$ forward invariant¹ and has the following infinite-horizon closed-loop performance:
\[
  J_i^\infty\big(x_0,\mu_N\big) := \lim_{K\to\infty}\sum_{k=0}^{K-1}\ell_i\big(x(k),\mu_N(x(k))\big) \le J_i^N\big(x_0,\mathbf u^{\star,N}_{x_0}\big) \tag{6}
\]
for all objectives $i\in\{1,\dots,s\}$, in which $\mathbf u^{\star,N}_{x_0}$ denotes the POS of step (0) in Algorithm 2.

¹The set $X$ is forward invariant for the closed-loop system $x^+ = f(x,\mu_N(x))$ if $f(x,\mu_N(x))\in X$ holds for all $x\in X$.

Proof. Feasibility: The existence of the POSs in steps (0) and (1) follows from external stability of $\mathcal J_P^N(x)$. Feasibility of $\mathbf u^N_{x(n+1)}$ in (2) follows from Assumption 3.1.

Performance: It follows from the definition of the cost functionals that
\[
  J_i^N\big(x(k),\mathbf u^{\star,N}_{x(k)}\big) = \ell_i\big(x(k),u^{\star,N}_{x(k)}(0)\big) + J_i^{N-1}\big(f(x(k),u^{\star,N}_{x(k)}(0)),\mathbf u^{\star,N}_{x(k)}(\cdot+1)\big)
\]
with $\mathbf u^{\star,N}_{x(k)}(\cdot+1) = (u^{\star,N}_{x(k)}(1),\dots,u^{\star,N}_{x(k)}(N-1))$, and hence, for arbitrary $K\in\mathbb N_{\ge 1}$,
\[
\begin{aligned}
  \sum_{k=0}^{K-1}\ell_i(x(k),\mu_N(x(k))) &= \sum_{k=0}^{K-1}\ell_i(x(k),u^{\star,N}_{x(k)}(0))\\
  &= \sum_{k=0}^{K-1}\Big[J_i^N\big(x(k),\mathbf u^{\star,N}_{x(k)}\big) - J_i^{N-1}\big(f(x(k),u^{\star,N}_{x(k)}(0)),\mathbf u^{\star,N}_{x(k)}(\cdot+1)\big)\Big]\\
  &\le \sum_{k=0}^{K-1}\Big[J_i^N\big(x(k),\mathbf u^{\star,N}_{x(k)}\big) - J_i^N\big(f(x(k),u^{\star,N}_{x(k)}(0)),\mathbf u^N_{x(k+1)}\big)\Big],
\end{aligned}
\]
in which the inequality follows from Lemma 3.2 in combination with the fact that $\mathbf u^{\star,N}_{x(k)}(\cdot+1)\in\mathbb U^{N-1}\big(f(x(k),u^{\star,N}_{x(k)}(0))\big)$, and $\mathbf u^{\star,N}_{x(k)}$ is the POS chosen in the algorithm at time $k$. In step (1), $\mathbf u^{\star,N}_{x(k+1)}$ is constructed such that the inequalities $J_i^N\big(x(k+1),\mathbf u^{\star,N}_{x(k+1)}\big)\le J_i^N\big(x(k+1),\mathbf u^N_{x(k+1)}\big)$ hold. Thus, we finally obtain
\[
  \sum_{k=0}^{K-1}\ell_i\big(x(k),\mu_N(x(k))\big) \le J_i^N\big(x_0,\mathbf u^{\star,N}_{x_0}\big) - J_i^N\big(x(K),\mathbf u^N_{x(K)}\big) \le J_i^N\big(x_0,\mathbf u^{\star,N}_{x_0}\big)
\]
because of the nonnegativity of $J_i^N$. The expression on the left-hand side of the inequality is monotonically increasing in $K$ and, due to its boundedness, the limit for $K\to\infty$ exists, which concludes the assertion. ∎

Remark 1. (i) As proven in Theorem 3.3, the upper bound on the performance of our MPC controller defined in Algorithm 2 remains the same no matter which $\mathbf u^{\star,N}_{x(n)}$ we choose in the iterations for $k\ge 1$, as long as the additional constraints are met. This has the important consequence that it is not necessary to approximate the whole Pareto front in the iterations of Algorithm 2, because it is sufficient to calculate only one solution. This can, e.g., be done by optimizing a weighted sum of the objectives with arbitrary weights.
(ii) A closer look at Algorithm 2 reveals that only in step (1), i.e. for $k\ge 1$, the choice of $\mathbf u^{\star,N}_{x(k)}$ is subject to additional constraints. The first POS $\mathbf u^{\star,N}_{x_0}$, which determines the bound on the performance of the algorithm, can be chosen freely in step (0) of Algorithm 2. Thus, the performance bound can be calculated a priori from a multiobjective optimization over horizon $N$.

Corollary 1. Under the assumptions of Theorem 3.3 it holds that the trajectory $x(\cdot)$ driven by the feedback $\mu_N$ from Algorithm 2 converges to the equilibrium $x^*$.

Proof. It follows from Theorem 3.3 that the sum $\sum_{k=0}^\infty\ell_i\big(x(k),\mu_N(x(k))\big)$ converges for each $i\in\{1,\dots,s\}$. Hence, the sequences $\big(\ell_i\big(x(k),\mu_N(x(k))\big)\big)_{k\in\mathbb N_0}$, $i\in\{1,\dots,s\}$, tend to zero. Together with Assumption 2.7, for arbitrary $i$ we obtain
\[
  \forall\varepsilon>0\ \exists K\in\mathbb N_0\ \forall k\ge K:\ \varepsilon > \big|\ell_i\big(x(k),\mu_N(x(k))\big)\big| = \ell_i\big(x(k),\mu_N(x(k))\big) \ge \min_{u\in U}\ell_i(x(k),u) \ge \alpha_{\ell,i}(\|x(k)-x^*\|).
\]
Since $\alpha_{\ell,i}$ is a $\mathcal K_\infty$ function, we conclude
\[
  \alpha_{\ell,i}\Big(\lim_{k\to\infty}\|x(k)-x^*\|\Big) = \lim_{k\to\infty}\alpha_{\ell,i}(\|x(k)-x^*\|) = 0
  \iff \lim_{k\to\infty}\|x(k)-x^*\| = 0. \qquad ∎
\]

We have proven in Theorem 3.3 that the inequalities $J_i^\infty\big(x_0,\mu_N\big)\le J_i^N\big(x_0,\mathbf u^{\star,N}_{x_0}\big)$ hold for the MPC feedback $\mu_N$ from Algorithm 2 and for all $i\in\{1,\dots,s\}$. Usually, one would like to compare the infinite-horizon MPC cost to $J_i^\infty(x_0,\mathbf u^{\star,\infty}_{x_0})$, where $\mathbf u^{\star,\infty}_{x_0}$ is a POS² to the infinite-horizon problem
\[
  \begin{aligned}
  \min_{\mathbf u}\;& \big(J_1^\infty(x_0,\mathbf u),\dots,J_s^\infty(x_0,\mathbf u)\big),\quad\text{with } J_i^\infty(x_0,\mathbf u) := \sum_{k=0}^{\infty}\ell_i(x(k),u(k))\\
  \text{s.t. } & x(k+1)=f(x(k),u(k)),\quad k\in\mathbb N_0,\\
  & x(k)\in X,\quad k\in\mathbb N,\quad \mathbf u\in U^\infty.
  \end{aligned} \tag{7}
\]
We now show how one can relate $J_i^\infty\big(x_0,\mu_N\big)$ to $J_i^\infty(x_0,\mathbf u^{\star,\infty}_{x_0})$. Again, we summarize all constraints in (7) by writing $\mathbf u\in\mathbb U^\infty(x_0)$.

Lemma 3.4. Let $N\in\mathbb N_{\ge 2}$, $x\in X_N$ be given. Let the assumptions of Theorem 3.3 hold and assume furthermore external stability of the set $\mathcal J_P^\infty(x) := \{(J_1^\infty(x,\mathbf u),\dots,J_s^\infty(x,\mathbf u)) \mid \mathbf u\in\mathbb U_P^\infty(x)\}$. Then, for each $\mathbf u^{\star,N}\in\mathbb U_P^N(x)$ there is $\mathbf u^{\star,\infty}\in\mathbb U_P^\infty(x)$ such that the inequalities $J_i^N\big(x,\mathbf u^{\star,N}\big)\ge J_i^\infty(x,\mathbf u^{\star,\infty})$ hold for all $i=1,\dots,s$.

Proof. For $N\in\mathbb N_{\ge 2}$ and $x\in X_N$ fix an arbitrary $\mathbf u^{\star,N}\in\mathbb U_P^N(x)$. Define the MPC feedback $\mu_N$ according to Algorithm 2 and define $\mathbf u\in\mathbb U^\infty(x)$ via $u(k) = \mu_N(x_{\mu_N}(k))$ for $k\in\mathbb N_0$. Then, we have
\[
  J_i^N\big(x,\mathbf u^{\star,N}\big) \overset{\text{Thm. 3.3}}{\ge} J_i^\infty\big(x,\mu_N\big) = J_i^\infty(x,\mathbf u)\quad\forall i.
\]
Since we assume external stability of the set $\mathcal J_P^\infty(x)$, there exists $\mathbf u^{\star,\infty}\in\mathbb U_P^\infty(x)$ satisfying $J_i^\infty(x,\mathbf u)\ge J_i^\infty(x,\mathbf u^{\star,\infty})$ for all $i$. This yields the assertion. ∎

Lemma 3.4 implies that Theorem 3.3 cannot be used to establish the inequality $J_i^\infty\big(x_0,\mu_N\big)\le J_i^\infty(x_0,\mathbf u^{\star,\infty})$. However, we will be able to show an approximate estimate of this form in Theorem 3.6 below. As a preparation, we first show that the trajectory corresponding to any infinite-horizon control sequence with bounded objectives gets arbitrarily close to the equilibrium $x^*$ in a finite number of time steps.

²Necessary and sufficient conditions for the existence of a POS on the infinite horizon can e.g. be found in [12].

Lemma 3.5. Let $\delta>0$, $x\in X$ and $\mathbf u\in\mathbb U^\infty(x)$ be given. Under Assumption 2.7, and if there is $K\in\mathbb R_{\ge 0}$ satisfying $J_i^\infty(x,\mathbf u)\le K$ for all $i\in\{1,\dots,s\}$, then the index $\hat k := \min\{k\in\mathbb N_0 \mid x_{\mathbf u}(k)\in\overline{\mathcal B}_\delta(x^*)\}$ fulfills $\hat k\le \frac{K}{\min_i\alpha_{\ell,i}(\delta)}$. Here, $\overline{\mathcal B}_\delta(x^*) := \{x\in X:\|x-x^*\|\le\delta\}$.

Proof. Assume $\hat k > \frac{K}{\min_i\alpha_{\ell,i}(\delta)}$. Then it holds that
\[
\begin{aligned}
  J_i^\infty(x,\mathbf u) &= \sum_{k=0}^{\hat k-1}\ell_i(x(k),u(k)) + \sum_{k=\hat k}^{\infty}\ell_i(x(k),u(k))\\
  &\ge \sum_{k=0}^{\hat k-1}\alpha_{\ell,i}(\|x(k)-x^*\|) > \sum_{k=0}^{\hat k-1}\alpha_{\ell,i}(\delta) = \hat k\cdot\alpha_{\ell,i}(\delta) > K,
\end{aligned}
\]
contradicting the assumption. ∎

Theorem 3.6. Consider the MO optimal control problem (5) with cost criteria $\ell_i$, $i\in\{1,\dots,s\}$, and the corresponding optimal control problem (7) on the infinite horizon with the same constraints and stage costs. Let Assumptions 2.7 and 3.1 hold and assume furthermore the existence of $\sigma_i\in\mathcal K_\infty$ such that $F_i(x)\le\sigma_i(\|x-x^*\|)$ holds for all $x\in X_0$ and all $i\in\{1,\dots,s\}$. Consider an arbitrary initial value $x\in X_N$ and a sequence $\mathbf u^{\star,\infty}\in\mathbb U_P^\infty(x)$ with $J_i^\infty(x,\mathbf u^{\star,\infty})\le C$ for all $i$, $C\in\mathbb R_{\ge 0}$. Assume there is $\bar N\in\mathbb N$ such that the sets $\mathcal J_P^N(x)$ are externally stable for all $N\ge\bar N$. Then, for each $\varepsilon>0$ there exists $N_0\in\mathbb N$ (depending on $\varepsilon$ and $\bar N$) such that for all $N\ge N_0$ there is $\mathbf u^{\star,N}\in\mathbb U_P^N(x)$ satisfying
\[
  J_i^N\big(x,\mathbf u^{\star,N}\big) \le J_i^\infty(x,\mathbf u^{\star,\infty}) + \varepsilon\quad\forall i. \tag{8}
\]
In particular, $\mathbf u^{\star,\infty}$ can be approximated arbitrarily well by $\mu_N$ in terms of the infinite-horizon performance, that is,
\[
  J_i^\infty\big(x,\mu_N\big) \le J_i^\infty(x,\mathbf u^{\star,\infty}) + \varepsilon. \tag{9}
\]

Proof. Let $\varepsilon>0$ and choose $\delta>0$ such that $\sigma_i(\delta)\le\varepsilon$ for all $i$ and $\overline{\mathcal B}_\delta(x^*)\subseteq X_0$. For the sequence $\mathbf u^{\star,\infty}\in\mathbb U_P^\infty(x)$ it holds that $J_i^\infty(x,\mathbf u^{\star,\infty})\le C$ for all $i$. From Lemma 3.5 we know that the index $\hat k := \min\{k\in\mathbb N_0 \mid x_{\mathbf u^{\star,\infty}}(k)\in\overline{\mathcal B}_\delta(x^*)\}$ satisfies $\hat k\le\frac{C}{\min_i\alpha_{\ell,i}(\delta)}$. Now let us choose $N_0\in\mathbb N$ such that $N_0\ge\max\{\hat k+1,\bar N\}$. For $N\ge N_0$ define the sequence $\mathbf u\in\mathbb U^N(x)$ via
\[
  u(k) = \begin{cases} u^{\star,\infty}(k), & k=0,\dots,\hat k-1,\\ \kappa(x(k)), & k=\hat k,\dots,N-1, \end{cases}
\]

with $\kappa$ from Assumption 3.1. Since $x_{\mathbf u^{\star,\infty}}(\hat k)\in\overline{\mathcal B}_\delta(x^*)\subseteq X_0$, $\kappa$ can be applied and it holds that $x_{\mathbf u}(N)\in X_0$. From the definition of $\mathbf u$ we obtain
\[
\begin{aligned}
  J_i^N(x,\mathbf u) &= \sum_{k=0}^{N-1}\ell_i(x(k),u(k)) + F_i(x(N))\\
  &= \sum_{k=0}^{\hat k-1}\ell_i(x(k),u^{\star,\infty}(k)) + \sum_{k=\hat k}^{N-1}\ell_i(x(k),\kappa(x(k))) + F_i(x(N))\\
  &\le J_i^\infty(x,\mathbf u^{\star,\infty}) + \sum_{k=\hat k}^{N-1}\big[F_i(x(k)) - F_i(f(x(k),\kappa(x(k))))\big] + F_i(x(N))\\
  &= J_i^\infty(x,\mathbf u^{\star,\infty}) + F_i(x(\hat k))\\
  &\le J_i^\infty(x,\mathbf u^{\star,\infty}) + \sigma_i(\underbrace{\|x(\hat k)-x^*\|}_{\le\delta}) \le J_i^\infty(x,\mathbf u^{\star,\infty}) + \varepsilon.
\end{aligned}
\]
Due to external stability of $\mathcal J_P^N(x)$ we conclude the existence of $\mathbf u^{\star,N}\in\mathbb U_P^N(x)$ such that
\[
  J_i^N\big(x,\mathbf u^{\star,N}\big)\le J_i^N(x,\mathbf u)\le J_i^\infty(x,\mathbf u^{\star,\infty})+\varepsilon,
\]
i.e. (8) holds. Choosing $\mathbf u^{\star,N}_{x(n)} = \mathbf u^{\star,N}$ in step (0) of Algorithm 2 and combining the estimates (6) and (8) yields (9). ∎

3.1. Numerical Example. By means of the following example, presented in [18], we illustrate the results of this section. We consider six two-dimensional systems $x_i\in\mathbb R^2$, $i\in\{1,\dots,6\}$, that are dynamically decoupled but coupled through constraints and cost criteria. Each system is steered by a two-dimensional input $u_i\in\mathbb R^2$. The system dynamics and stage cost of system $i\in\{1,\dots,6\}$ are given by
\[
  x_i^+ = \begin{pmatrix} 0.9 & 0.1\\ -0.2 & 0.8 \end{pmatrix} x_i + \begin{pmatrix} 1 & 0\\ 0 & 1 \end{pmatrix} u_i + 0.1\begin{pmatrix} x_{i,2}^2\\ x_{i,1}^2 \end{pmatrix},
\]
\[
  \ell_i(x,u) = x_i^T Q_i x_i + u_i^T R_i u_i + \sum_{j\in\mathcal N_i}(C_i x_i - C_j x_j)^T Q_{ij}(C_i x_i - C_j x_j),
\]
in which $\mathcal N_i = \{i-1,i+1\}$ for $i=2,\dots,5$, $\mathcal N_1 = \{2\}$, $\mathcal N_6 = \{5\}$, and
\[
  Q_i = \begin{pmatrix} 1 & 0\\ 0 & 1\end{pmatrix},\quad R_i = 5Q_i,\quad C_i = Q_i \text{ for all } i,\quad Q_{34} = Q_{43} = 0_{2\times 2},\quad Q_{ij} = 3Q_i \text{ otherwise.}
\]

The states and controls are constrained by $\|x_i\|\le 5$ and $\|u_i\|\le 2$. Moreover, systems three and four are coupled by the constraint $\|x_3-x_4\|\le 4$. In Figure 3 we observe that the accumulated performance of the MPC feedback defined in Algorithm 2 for $N=6$ is indeed bounded from above by $J_i^N(x_0,\mathbf u^{\star,N}_{x_0})$, as stated in Theorem 3.3. In Corollary 1, convergence of the closed-loop trajectories was proven. This behavior is illustrated in Figure 4.
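The example specification above is easy to transcribe into code; the sketch below is our own dictionary-based layout (not the authors' implementation) and only evaluates the dynamics and one stage cost at a sample point.

```python
# A transcription of the example dynamics and stage cost stated above.
import numpy as np

A = np.array([[0.9, 0.1], [-0.2, 0.8]])
Q = np.eye(2)          # Q_i = I_2 for all i
R = 5 * Q              # R_i = 5 Q_i
C = Q                  # C_i = Q_i
neighbors = {1: [2], 2: [1, 3], 3: [2, 4], 4: [3, 5], 5: [4, 6], 6: [5]}

def coupling_weight(i, j):
    """Q_34 = Q_43 = 0, Q_ij = 3 Q_i otherwise."""
    return np.zeros((2, 2)) if {i, j} == {3, 4} else 3 * Q

def step(x, u):
    """x, u: dicts i -> R^2; x_i^+ = A x_i + u_i + 0.1 (x_{i,2}^2, x_{i,1}^2)^T."""
    return {i: A @ x[i] + u[i] + 0.1 * np.array([x[i][1] ** 2, x[i][0] ** 2])
            for i in x}

def stage_cost(i, x, u):
    quad = x[i] @ Q @ x[i] + u[i] @ R @ u[i]
    coup = sum((C @ x[i] - C @ x[j]) @ coupling_weight(i, j) @ (C @ x[i] - C @ x[j])
               for j in neighbors[i])
    return quad + coup

x = {i: np.ones(2) for i in range(1, 7)}   # all subsystems start at (1, 1)
u = {i: np.zeros(2) for i in range(1, 7)}
print(step(x, u)[1])        # next state of system 1: [1.1 0.7]
print(stage_cost(1, x, u))  # identical states: coupling vanishes, cost = 2.0
```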

[Figure 3. Accumulated performance of the six objectives (blue) compared to the value of the Pareto-optimal control sequence $\mathbf u^{\star,N}_{x_0}$ from step (0) of Algorithm 2 (red).]

[Figure 4. Trajectories of the six systems (phase plots).]

In order to illustrate the necessity of the constraints in step (1), we have also run Algorithm 2 for our example without these constraints, i.e., we have chosen an arbitrary Pareto-optimal solution in each iteration. Figure 5 illustrates that the desired performance bound is indeed violated³.

[Figure 5. Performance without the constraints in step (1) of Algorithm 2.]

4. Multiobjective Stabilizing MPC without Terminal Conditions. In this section we aim to develop performance estimates for multiobjective MPC schemes without terminal conditions, i.e. we no longer impose Assumption 3.1. A discussion of why proceeding this way may be advantageous compared to MPC schemes with terminal conditions can be found, e.g., in [7, Sec. 6.1].

Instead of imposing such terminal conditions, we follow the procedure developed in [8] (see also [22]) for scalar-valued MPC and require the following structural property of POSs.

Assumption 4.1 (Bounds on POSs). Let an optimization horizon $N\in\mathbb N$ be given. For all $i\in\{1,\dots,s\}$ there exist $\gamma_i\in\mathbb R_{>1}$ such that the inequalities
\[
\begin{aligned}
  &\forall x\in X,\ \forall\mathbf u^{\star,1}_x\in\mathbb U_P^1(x)\ \exists\mathbf u^{\star,2}_x\in\mathbb U_P^2(x):\ J_i^2(x,\mathbf u^{\star,2}_x)\le\gamma_i\cdot J_i^1(x,\mathbf u^{\star,1}_x),\\
  &\forall k=2,\dots,N,\ \forall x\in X,\ \forall\mathbf u^{\star,k}_x\in\mathbb U_P^k(x):\ J_i^k(x,\mathbf u^{\star,k}_x)\le\gamma_i\cdot\ell_i(x,u^{\star,k}_x(0))
\end{aligned}
\]
hold for all objectives $i\in\{1,\dots,s\}$.

³We observed that the violation in Figure 5 is only visible for sufficiently large horizons $N$, because for small $N$ the terminal constraint becomes so restrictive that it dominates the effect of the constraint in step (1) of Algorithm 2.

We note that the condition $\mathbb U^N(x)\ne\emptyset$ for all $x\in X$ and all $N\in\mathbb N$ is guaranteed by Assumption 4.1. As in the previous section we impose Assumption 2.7. Assumption 4.1 requires that all POSs are, in a sense, structured. The second set of inequalities therein states that the values of all POSs can be expressed in terms of the stage cost of the first piece of the POS for all horizon lengths. The first set of inequalities is mainly needed as a base case for the induction in Lemma 4.4 in order to prove a relation between POSs of horizon lengths $k$ and $k-1$. One possibility to obtain these inequalities is to require exponential controllability of the MO OCP with respect to all $\ell_i$, see [7, Sec. 6.2]. Together with external stability this ensures the existence of POSs and $\gamma_i$ satisfying the inequalities.

The first MPC scheme we propose in this section is the following.

Algorithm 3 (Multiobjective MPC without terminal conditions).
(0) At time $n=0$: set $x(n) := x_0$ and choose a POS $\mathbf u^{\star,N}_{x_0}\in\mathbb U_P^N(x_0)$ to (3). Go to (2).
(1) At time $n\in\mathbb N$: choose a POS $\mathbf u^{\star,N}_{x(n)}$ to (3) so that the inequalities
\[
  J_i^N\big(x(n),\mathbf u^{\star,N}_{x(n)}\big) \le \frac{\gamma_i^{N-2}+(\gamma_i-1)^{N-1}}{\gamma_i^{N-2}}\, J_i^{N-1}\big(x(n),\mathbf u^{N-1}_{x(n)}\big)
\]
are satisfied for all $i\in\{1,\dots,s\}$.
(2) Set $\mathbf u^{N-1}_{x(n+1)} := \mathbf u^{\star,N}_{x(n)}(\cdot+1)$.
(3) Apply the feedback $\mu_N(x(n)) := u^{\star,N}_{x(n)}(0)$, set $n := n+1$ and go to (1).

After giving two auxiliary results as well as a result, which resembles an aspect

17

of the Dynamic Programming Principle (see e.g. [2]), we will prove that the MPC-

18

feedback defined in Algorithm3 guarantees forward invariance and has a bounded

19

infinite-horizon performance for each objective.

20

Lemma 4.2. Let $x\in X$ and $\mathbf u^{\star,k}_x\in\mathbb U_P^k(x)$ for arbitrary $k\in\{2,\dots,N\}$ be given. Under Assumptions 2.7 and 4.1 the inequalities
\[
  J_i^{k-1}\big(f(x,u^{\star,k}_x(0)),\mathbf u^{\star,k}_x(\cdot+1)\big) \le (\gamma_i-1)\,\ell_i\big(x,u^{\star,k}_x(0)\big)
\]
hold for all $i\in\{1,\dots,s\}$ and all $k\in\{2,\dots,N\}$.

Proof. Consider an arbitrary $x\in X$, $k\in\{2,\dots,N\}$ and a POS $\mathbf u^{\star,k}_x\in\mathbb U_P^k(x)$. Then, for all $i\in\{1,\dots,s\}$ it holds that
\[
  J_i^{k-1}\big(f(x,u^{\star,k}_x(0)),\mathbf u^{\star,k}_x(\cdot+1)\big) = J_i^k\big(x,\mathbf u^{\star,k}_x\big) - \ell_i\big(x,u^{\star,k}_x(0)\big) \le \gamma_i\cdot\ell_i\big(x,u^{\star,k}_x(0)\big) - \ell_i\big(x,u^{\star,k}_x(0)\big),
\]
which shows the assertion. ∎

Lemma 4.3 (Tails of POSs are POSs). If $\mathbf u^\star\in\mathbb U_P^N(x)$, then $\mathbf u^{\star,K} := \mathbf u^\star(\cdot+K)\in\mathbb U_P^{N-K}(x_{\mathbf u^\star}(K,x))$ for all $K\in\mathbb N_{<N}$, in which the tail is defined as $\mathbf u^\star(\cdot+K) := (u^\star(K),u^\star(K+1),\dots,u^\star(N-1))$.

Proof. We first note that $\mathbf u^\star\in\mathbb U_P^N(x)\subset\mathbb U^N(x)$ implies $\mathbf u^{\star,K}\in\mathbb U^{N-K}(x_{\mathbf u^\star}(K,x))$, see e.g. [7, Lemma 3.12]. Let us assume that $\mathbf u^{\star,K}$ is not a POS of length $N-K$ for initial value $x_{\mathbf u^\star}(K,x)$. This implies the existence of $\mathbf u\in\mathbb U^{N-K}(x_{\mathbf u^\star}(K,x))$ satisfying
\[
\begin{aligned}
  &\forall i\in\{1,\dots,s\}: J_i^{N-K}(x_{\mathbf u^\star}(K,x),\mathbf u)\le J_i^{N-K}(x_{\mathbf u^\star}(K,x),\mathbf u^{\star,K})\quad\text{and}\\
  &\exists j\in\{1,\dots,s\}: J_j^{N-K}(x_{\mathbf u^\star}(K,x),\mathbf u) < J_j^{N-K}(x_{\mathbf u^\star}(K,x),\mathbf u^{\star,K}).
\end{aligned}
\]
Since by definition
\[
  J_i^N(x,\mathbf u^\star) = \sum_{k=0}^{K-1}\ell_i(x_{\mathbf u^\star}(k,x),u^\star(k)) + J_i^{N-K}\big(x_{\mathbf u^\star}(K,x),\underbrace{\mathbf u^\star(\cdot+K)}_{\mathbf u^{\star,K}}\big)
\]
holds for all $K\in\mathbb N_{\le N}$, we obtain
\[
\begin{aligned}
  \forall i\in\{1,\dots,s\}:\ J_i^N(x,\mathbf u^\star) &\ge \sum_{k=0}^{K-1}\ell_i(x_{\mathbf u^\star}(k,x),u^\star(k)) + J_i^{N-K}(x_{\mathbf u^\star}(K,x),\mathbf u),\\
  \exists j\in\{1,\dots,s\}:\ J_j^N(x,\mathbf u^\star) &= \sum_{k=0}^{K-1}\ell_j(x_{\mathbf u^\star}(k,x),u^\star(k)) + J_j^{N-K}(x_{\mathbf u^\star}(K,x),\mathbf u^{\star,K})\\
  &> \sum_{k=0}^{K-1}\ell_j(x_{\mathbf u^\star}(k,x),u^\star(k)) + J_j^{N-K}(x_{\mathbf u^\star}(K,x),\mathbf u).
\end{aligned}
\]
Using again [7, Lemma 3.12], the concatenated control sequence $\bar{\mathbf u} = (u^\star(0),\dots,u^\star(K-1),\mathbf u)$ is contained in the set $\mathbb U^N(x)$, i.e. we get
\[
  \forall i\in\{1,\dots,s\}: J_i^N(x,\mathbf u^\star)\ge J_i^N(x,\bar{\mathbf u})\quad\text{and}\quad
  \exists j\in\{1,\dots,s\}: J_j^N(x,\mathbf u^\star) > J_j^N(x,\bar{\mathbf u}).
\]
This contradicts the fact that $\mathbf u^\star\in\mathbb U_P^N(x)$. ∎

Lemma 4.4. Let $x\in X$ and $N\in\mathbb N_{\ge 2}$ be given. Let Assumptions 2.7 and 4.1 hold and assume external stability of the sets $\mathcal J_P^k(x)$ for all $k\in\{2,\dots,N\}$. Then, for each $k\in\{2,\dots,N\}$ and each $\mathbf u^{\star,k-1}_x\in\mathbb U_P^{k-1}(x)$ there is $\mathbf u^{\star,k}_x\in\mathbb U_P^k(x)$ such that
\[
  \eta_{k,i}\cdot J_i^k\big(x,\mathbf u^{\star,k}_x\big) \le J_i^{k-1}\big(x,\mathbf u^{\star,k-1}_x\big)
\]
holds for all $i\in\{1,\dots,s\}$, in which $\eta_{k,i}$ is defined as
\[
  \eta_{k,i} = \frac{\gamma_i^{k-2}}{\gamma_i^{k-2}+(\gamma_i-1)^{k-1}}.
\]
The proof of this lemma is given in Appendix A.

Theorem 4.5 (Performance Theorem). Consider a multiobjective OCP with system dynamics (1), cost criteria $\ell_i$, $i\in\{1,\dots,s\}$, and let $N\in\mathbb N_{\ge 2}$ and $x_0\in X$ be given. Let Assumptions 2.7 and 4.1 hold and let the sets $\mathcal J_P^k(x_0)$ be externally stable for all $k\in\{2,\dots,N\}$. Let moreover $(\gamma_i-1)^N < \gamma_i^{N-2}$ hold for all $i\in\{1,\dots,s\}$. Then, the MPC feedback $\mu_N : X\to U$ defined in Algorithm 3 renders $X$ forward invariant and has the infinite-horizon closed-loop performance
\[
  J_i^\infty\big(x_0,\mu_N\big) \le \frac{\gamma_i^{N-2}}{\gamma_i^{N-2}-(\gamma_i-1)^N}\cdot J_i^N\big(x_0,\mathbf u^{\star,N}_{x_0}\big)
\]
for all objectives $i\in\{1,\dots,s\}$ and the POS $\mathbf u^{\star,N}_{x_0}$ from step (0) in Algorithm 3.
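The suboptimality factor $\gamma_i^{N-2}/(\gamma_i^{N-2}-(\gamma_i-1)^N)$ is well defined under the stated condition $(\gamma_i-1)^N<\gamma_i^{N-2}$, and for fixed $\gamma_i>1$ it tends to 1 as the horizon $N$ grows. A small numerical check, with made-up values of $\gamma_i$ and $N$:

```python
# The performance degradation factor of Theorem 4.5:
# gamma^{N-2} / (gamma^{N-2} - (gamma-1)^N), defined when (gamma-1)^N < gamma^{N-2}.

def suboptimality(gamma, N):
    num, sub = gamma ** (N - 2), (gamma - 1) ** N
    assert sub < num, "Theorem 4.5 requires (gamma-1)^N < gamma^{N-2}"
    return num / (num - sub)

for N in (5, 10, 20):
    print(N, suboptimality(1.5, N))   # the factor shrinks toward 1 as N grows
```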

Proof. Existence of the POSs in Algorithm 3 is obtained by Lemma 4.4, and we can thus conclude forward invariance of the closed-loop system. We now prove that the MPC feedback exhibits the stated performance. For $K\in\mathbb N_{\ge 1}$ and all $i\in\{1,\dots,s\}$ it holds that
\[
\begin{aligned}
  \underbrace{\left(1-\frac{(\gamma_i-1)^N}{\gamma_i^{N-2}}\right)}_{>0} J_i^K(x_0,\mu_N)
  &= \left(1-\frac{(\gamma_i-1)^N}{\gamma_i^{N-2}}\right)\sum_{k=0}^{K-1}\ell_i(x(k),\mu_N(x(k)))\\
  &= \sum_{k=0}^{K-1}\left[\ell_i\big(x(k),u^{\star,N}_{x(k)}(0)\big)-\frac{(\gamma_i-1)^N}{\gamma_i^{N-2}}\,\ell_i\big(x(k),u^{\star,N}_{x(k)}(0)\big)\right]\\
  &\le \sum_{k=0}^{K-1}\bigg[J_i^N\big(x(k),\mathbf u^{\star,N}_{x(k)}\big)-\underbrace{\frac{\gamma_i^{N-2}+(\gamma_i-1)^{N-1}}{\gamma_i^{N-2}}}_{=1+\frac{(\gamma_i-1)^{N-1}}{\gamma_i^{N-2}}}\,J_i^{N-1}\big(f(x(k),u^{\star,N}_{x(k)}(0)),\mathbf u^{\star,N}_{x(k)}(\cdot+1)\big)\bigg],
\end{aligned}
\]
in which the inequality is obtained from $\ell_i\big(x(k),u^{\star,N}_{x(k)}(0)\big) = J_i^N\big(x(k),\mathbf u^{\star,N}_{x(k)}\big) - J_i^{N-1}\big(f(x(k),u^{\star,N}_{x(k)}(0)),\mathbf u^{\star,N}_{x(k)}(\cdot+1)\big)$ together with Lemma 4.2. In step (1) the POS $\mathbf u^{\star,N}_{x(k)}$ is chosen such that we obtain the estimates
\[
  \left(1-\frac{(\gamma_i-1)^N}{\gamma_i^{N-2}}\right) J_i^K(x_0,\mu_N) \le J_i^N\big(x_0,\mathbf u^{\star,N}_{x_0}\big)-J_i^N\big(x(K),\mathbf u^{\star,N}_{x(K)}\big) \le J_i^N\big(x_0,\mathbf u^{\star,N}_{x_0}\big)
\]
for all $i\in\{1,\dots,s\}$. This concludes the assertion. ∎

Corollary 2 (Infinite-horizon near optimality). Let the assumptions of Theorem 4.5 hold for $N\in\mathbb N_{\ge 2}$ and $x_0\in X$, and assume that there is a POS $\mathbf u^{\star,\infty}\in\mathbb U_P^\infty(x_0)$ to the MO infinite-horizon OCP (7). Then, the estimates
\[
  J_i^\infty(x_0,\mu_N) \le \frac{\gamma_i^{N-2}}{\gamma_i^{N-2}-(\gamma_i-1)^N}\cdot J_i^\infty(x_0,\mathbf u^{\star,\infty})\quad\forall i\in\{1,\dots,s\}
\]
are obtained by applying Algorithm 3 with a proper initialization in step (0).

Proof. Nonnegativity of the stage costs $\ell_i$ yields $J_i^\infty(x_0,\mathbf u^{\star,\infty})\ge J_i^N(x_0,\mathbf u^{\star,\infty})$ for all $i\in\{1,\dots,s\}$, and external stability of the set $\mathcal J_P^N(x_0)$ guarantees the existence of $\mathbf u^{\star,N}_{x_0}\in\mathbb U_P^N(x_0)$ such that $J_i^N(x_0,\mathbf u^{\star,\infty})\ge J_i^N(x_0,\mathbf u^{\star,N}_{x_0})$ holds for all $i\in\{1,\dots,s\}$. By applying $\mathbf u^{\star,N}_{x_0}$ in step (0) of Algorithm 3 we conclude $J_i^\infty(x_0,\mu_N)\le\frac{\gamma_i^{N-2}}{\gamma_i^{N-2}-(\gamma_i-1)^N}\cdot J_i^\infty(x_0,\mathbf u^{\star,\infty})$ for all objectives $i\in\{1,\dots,s\}$. ∎
