• Keine Ergebnisse gefunden

Upper bounds of the invariance feedback entropy for uncertain systems

In this section, we focus on uncertain control systems. We describe the construction of a weighted directed graph which serves as the basis for computing two upper bounds for the IFE presented later in Theorem 11. Theorem 12 presents the proposed upper bound in Theorem 11 in a much simplified form as the maximum mean weight for any cycle in the graph.

Consider a discrete-time uncertain control system Σ as defined in (2.3) and a nonempty set Q⊆ X. By an invariant partition, we refer to an invariant cover ( ¯A, G) of (Σ, Q) for which ¯Ais a partition ofQ(which is consistent with the terminology used for deterministic systems in Section 4.2).

Given an invariant partition ( ¯A, G), we define a set-valued map T : Q ⇒ Q, T(x) :=

F(x, G(Ax)), where x∈Ax ∈A.¯

We constructG, a directed weighted graph with ¯Aas the set of nodes. ForA1, A2 ∈A,¯ there is an edge in G from A1 toA2 if T(A1)∩A2 6=∅. Let eA1A2 refer to the edge from A1 to A2. We define mapsD: ¯A ⇒A¯and w: ¯A →R≥0 as

D(A1) :=

A∈A |¯ T(A1)∩A6=∅ , (4.6)

w(A1) := log2#D(A1). (4.7)

The weight of edge eA1A2 is defined to bew(A1). We observe that T(A)⊆ [

A∈D(A)ˆ

A.ˆ (4.8)

Given the graph G and τ ∈N, we define sets Wτ(G) :=

(Ai)τ−1i=0 |Ai ∈A,¯ (Ai)τ−1i=0 is a path in G , (4.9) W(G) :=

(Ai)i=0 |Ai ∈A,¯ (Ai)i=0 is a path inG . (4.10) For every (x, u)∈X×U, by assumption, we have F(x, u)6=∅, thus every node in G has an outgoing edge. Therefore, for every τ ∈N, we have

Wτ(G) =

(Ai)τ−1i=0 |(Ai)i=0 ∈W(G) .

Consider a cycle c= (eAiAi+1)ki=1, Ak+1 =A1 in G. The mean cycle weight for cis defined to be the ratio of the sum of the weights and the number of edges in the cycle, i.e.,

wm(c) := 1 k

k

X

i=1

w(Ai).

The maximum mean cycle weight, wm(G), is then defined as wm(G) := maxcwm(c), where the maximum is taken over all cycles in the graphG. The following theorem presents two numerical upper bounds for the IFE.

4.4 Upper bounds of the invariance feedback entropy for uncertain systems 51 Theorem 11. For an uncertain control system Σ as in (2.3), a nonempty setQ⊆X, and an invariant partition ( ¯A, G), the IFE satisfies

hinv(Q,Σ)≤h( ¯A, G) = lim

τ→∞

1

τ max

α∈W(G) τ−2

X

t=0

w(α(t)). (4.11) A rough upper bound for the IFE of (Σ, Q) is

hinv(Q,Σ)≤h( ¯A, G)≤max

A∈A¯w(A).

The entropy of ( ¯A, G) turns out to be equal to the maximum mean cycle weight for the graph G, as described in the next theorem.

Theorem 12. In Theorem 11, let G be the directed weighted graph as defined above. Then h( ¯A, G) =wm(G).

There exist algorithms to compute the maximum mean cycle weight of a directed weighted graph, see e.g. [36].

The rest of this section is devoted to the proofs of the above two theorems. First, we present three propositions that establish some properties of the setWτ(G). Then the proof of Theorem 11 follows. Finally, we present the proof of Theorem 12.

Proposition 4. Wτ(G) is a (τ, Q)-spanning set in ( ¯A, G).

Proof. By assumption we haveF(x, u)6=∅for all (x, u)∈X×Uwhich results inT(A)6=∅ for all A ∈ A. Since ( ¯¯ A, G) is an invariant cover, for every A ∈ A, we have¯ D(A) 6= ∅. Thus, for every A ∈ A, there is ˆ¯ A ∈ A¯ such that T(A)∩Aˆ 6= ∅. This ensures that for every node in G, there exists an outgoing edge. Hence, for allτ ∈N, A∈A¯we have paths of length τ starting from A. Thus,

{α(0)|α∈Wτ(G)}= ¯A.

Consider any α ∈ Wτ(G) and t ∈ [0;τ−1]. From the definition of G, we have an edge fromα(t) to everyA ∈D(α(t)). Thus, for every t∈[0;τ −2] we have

PWτ(G)(α|[0;t]) = D(α(t)). (4.12)

Using (4.8) and (4.12), we conclude that Wτ(G) satisfies the condition in (2.5) to be a (τ, Q)-spanning set in ( ¯A, G).

Proposition 5. For every (τ, Q)-spanning set S in ( ¯A, G), we have Wτ(G)⊆ S.

Proof. Let S be a (τ, Q)-spanning set in ( ¯A, G). Then by definition, PS(α) ={α(0) |α ∈ S} ⊆ A¯ covers Q. Since ¯A is a partition of Q, PS(α) = ¯A. If α ∈ S and t ∈ [0;τ −1], then again from the definition of a (τ, Q)-spanning set in (2.5) it follows that PS(α|[0;t]) covers F(α(t), G(α(t))) =T(α(t)). As ¯A is a partition, D(α(t)), which is defined in (4.6), must be contained in every subset of ¯A that covers T(α(t)), thus PS(α|[0;t]) ⊇ D(α(t)).

Letβ ∈Wτ(G). Thenβ(0)∈A¯={α(0) |α∈ S} which gives the existence of α∈ S with α(0) = β(0). From (4.12), we have PWτ(G)(β(0)) = D(β(0)). Similarly to the arguments above, as ¯A is a partition of Q, D(β(0)) is contained in every subset of ¯A which covers T(β(0)). As S is (τ, Q)-spanning, from (2.5) we know thatT(α(0)) is covered byPS(α(0)) which implies PS(α(0))⊇ D(β(0)). From the definition of the graph G, we obtain β(1) ∈ D(β(0)) leading to β(1) ∈ PS(α(0)). Thus, there exists an α ∈ S with α|[0;1] = β|[0;1]. Inductively, we obtain the existence of α∈ S with α=β, which concludes the proof.

From (2.6) and Proposition 5, we conclude that for every (τ, Q)-spanning set S in ( ¯A, G), we have

N(Wτ(G))≤ N(S).

Let rinv(τ,A, G,¯ Σ) be the minimum of N(S), where S is a (τ, Q)-spanning set in ( ¯A, G).

We observe that

rinv(τ,A, G,¯ Σ) =N(Wτ(G)) for all τ ∈N. (4.13) Proposition 6. The expansion number of the (τ, Q)-spanning set Wτ(G) satisfies

log2N(Wτ(G)) = max Proof. By taking logarithms on both sides of (2.6), we obtain

log2N(Wτ(G)) = max This together with (4.7) concludes the proof.

Now, we have all the ingredients to prove Theorems 11 and 12.

Proof of Theorem 11. From (4.13) and Proposition 6, we have log2rinv(τ,A, G,¯ Σ) = log2N(Wτ(G))

4.4 Upper bounds of the invariance feedback entropy for uncertain systems 53 Therefore, the entropy of invariant partition ( ¯A, G) is

h( ¯A, G) = lim

This proves the first claim in Theorem 11.

For any τ ∈N, consider

Proof of Theorem 12. First we construct a mean-payoff-game (MPG) for which the max-imum of the value function over a given set equals the entropy of the invariant partition ( ¯A, G).

Consider the system in (2.3), a nonempty set Q ⊆ X, an invariant partition ( ¯A, G), the maps T :Q⇒Qand D: ¯A ⇒A¯as defined in Section 4.4. We consider the definition

Consider a play e0e1e2. . . which is an infinitely long sequence of edges. Player 1 wants to minimize the payoff

while player 2 wants to maximize the payoff we denote the set of all plays that start from the positionv and wherein the playerifollows the positional strategyσi. From (A.2) and (A.3) we have the existence of constantsc1 and c2, so that for everyτ ∈N,v ∈V,e∈ P(v, σ1) and ˆe∈ P(v, σ2) we have

In the preceding inequalities, we consider the maximum over the set V1 = ¯Aonly, because, in the later parts of the proof, we will relate the graph of the MPG with the graph G that involves only the elements of V1 as its nodes. Note that, in our construction of the MPG, player 1 always plays with a fixed strategy, σ1, i.e., for every v ∈ V1, the next position selected by player 1 is always σ1(v) = D(v). Thus, the course of any play is dictated by only player 2, and if the player 2 uses a positional strategy then there will be only one play for any given starting position v0 ∈V. This gives |P(v, σ2)|= 1 andP(v, σ2)⊂ P(v, σ1).

4.4 Upper bounds of the invariance feedback entropy for uncertain systems 55

Next, consider the set ˆW(G) which is constituted by all such paths in the graph G that correspond to some play ˆe∈ P(v, σ2), v ∈V1, and is defined as

(G) := {ˆα∈W(G)| ∃ˆe∈ ∪v∈V1P(v, σ2) so that ˆα(t) = ˆv2t ∀t∈[0;∞[}.

The inequalities (4.14) and (4.15) can now be rewritten as max W(G), therefore the above two equations lead to

lim sup GM there exists a corresponding cycle cinG such that, although the length ofcM is twice that of c, the mean weight is the same for both cycles. In an MPG, if one of the player follows a fixed positional strategy, then ν(v) is the maximum mean weight of a cycle in GM reachable from v ∈V, see [85, Sec. 4]. Thus, maxv∈V1ν(v) = wm(G).

In the next section, for deterministic systems, we establish the relationship between the discussed upper bounds of IED and IFE.

4.5 Relationship between the upper bounds for IED