$$\dot{\bar y}(t) + \mathcal{A}\bar y(t) - \mathcal{B}\bar\mu(t) = 0, \qquad \bar y(0) = 0, \tag{2.21a}$$
$$-\dot{\bar p}(t) + \mathcal{A}^\star\bar p(t) + \Big(\sigma_Q + \tfrac{\sigma_w}{\varepsilon^2}\,\mathcal{I}^\star\mathcal{I}\Big)\bar y(t) - \tfrac{\sigma_w}{\varepsilon^2}\,\mathcal{I}^\star\bar\omega(t) = 0, \qquad \bar p(T) = -\sigma_\Omega\,\bar y(T), \tag{2.21b}$$
$$\langle \sigma_u\bar\mu - \mathcal{B}^\star(\bar p + \hat p),\, \tilde\mu - \bar\mu\rangle_U \ge 0 \quad \text{for all } \tilde\mu \in U_{\mathrm{ad}}, \tag{2.21c}$$
$$\langle \tfrac{\sigma_w}{\varepsilon^2}(\bar\omega - \mathcal{I}\bar y),\, \tilde\omega - \bar\omega\rangle_W \ge 0 \quad \text{for all } \tilde\omega \in W_{\mathrm{ad}}. \tag{2.21d}$$
2.2. A-posteriori error estimates
Given an arbitrary admissible suboptimal control $u_p$, we want to estimate the control error $\|u_p - \bar u\|_U$ without any information about the optimal control $\bar u$. We call $\beta: U \to \mathbb{R}$ an error bound if
$$\forall u_p \in U_{\mathrm{ad}}: \quad \|u_p - \bar u\|_U \le \beta(u_p). \tag{2.22}$$
Since we will use error bounds to decide whether to stop a process generating suboptimal control solutions, we require that the error bounds $(\beta(u_p^n))_{n\in\mathbb{N}}$ of a sequence $(u_p^n)_{n\in\mathbb{N}} \subseteq U_{\mathrm{ad}}$ with $\|u_p^n - \bar u\|_U \to 0$ decay to zero as well; in this case, we say that $\beta$ is compatible. If, in addition, the orders of the errors and of the error bounds coincide, we call $\beta$ sharp. Sharp error bounds are desirable since they allow one to stop the generating process as soon as the approximation error reaches the intended accuracy. However, we require numerical efficiency of $\beta$ as well: the evaluation of $\beta$ shall not consume too much calculation time. Indeed, in the context of model order reduction, the calculation of the error bound for a solution to the reduced-order model may cancel out the time savings of the model reduction: in [76], a-posteriori error bounds are developed for solving semilinear optimal control problems with the POD method. There, the evaluation of $\beta$, which requires determining the smallest eigenvalue of the reduced Hessian matrix, takes half the time of solving the full-order problem, whereas the POD control solution itself is obtained in just 0.45% of the full-order calculation time. Alternatively, one may estimate the smallest eigenvalue by an inexact method; in this case, $\beta$ is only exact up to a certain heuristic and no longer rigorous.
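To illustrate the role of a compatible error bound as a stopping criterion, consider the following minimal Python sketch. All names are hypothetical: `reduced_solve` stands for any process generating suboptimal controls (e.g. solving successively enriched reduced-order models), `beta` for an evaluation of the bound.

```python
def solve_with_certified_accuracy(reduced_solve, beta, tol, max_iter=100):
    """Generate suboptimal controls u_p and stop once the compatible
    a-posteriori bound certifies ||u_p - u_bar|| <= beta(u_p) <= tol."""
    u_p, bound = None, float("inf")
    for n in range(1, max_iter + 1):
        u_p = reduced_solve(n)   # suboptimal control from the n-th model
        bound = beta(u_p)        # a-posteriori error bound as in (2.22)
        if bound <= tol:         # certified: the true error is at most tol
            break
    return u_p, bound
```

Compatibility of $\beta$ is exactly what guarantees that this loop terminates whenever the generating process converges.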
2.2.1. Lavrentiev regularization
In the following, we modify the a-posteriori error estimator for control constrained problems presented in [133], Sec. 3, so that it is applicable to the transformed state constrained problems (2.2). The general idea is to interpret the suboptimal control $u_p$ as a perturbation of the optimal one. It turns out that a suitable perturbation variable $\zeta_p$ can be calculated without knowing anything about $\bar u$ except that $\bar u$ satisfies the variational optimality condition; $\beta(u_p)$ then essentially consists of the term $\|\zeta_p\|_U$. This idea was already used for the determination of error bounds in the context of optimal control of ordinary differential equations [90] and is applicable in various situations since no limitations on the construction of the suboptimal controls are required. We will see that these error estimation techniques, applied in the context of POD model order reduction, are sharp (Thm. 2.8). Our numerical results further indicate numerical efficiency for linear-quadratic optimal control problems, see Fig. 4.28 & Fig. 4.23.
Theorem 2.5. (Gubisch & Volkwein 2014 [61])
Let $u_p \in U_{\mathrm{ad}}$ be an arbitrary control with corresponding transformed control $v_p = \mathcal{F}u_p$ and corresponding adjoint state $p_p = \mathcal{T}u_p$. Let $\zeta_p = \zeta(u_p) \in U$ satisfy the perturbed variational inequality
$$\forall \tilde v \in V_{\mathrm{ad}}: \quad \langle \mathcal{F}^{-\star}(\sigma_u u_p - \mathcal{B}^\star(p_p + \hat p)) + \zeta_p,\, \tilde v - v_p\rangle_U \ge 0. \tag{2.23}$$
Then the following a-posteriori error estimate holds true:
$$\|\bar u - u_p\|_U \le \beta(u_p) = \frac{1}{\sigma_u}\|\mathcal{F}^\star\zeta_p\|_U. \tag{2.24}$$
Proof. Choosing $\tilde v = v_p$ in (2.14c) and $\tilde v = \bar v$ in (2.23), we get
$$\langle \mathcal{F}^{-\star}(\sigma_u u_p - \mathcal{B}^\star(p_p + \hat p)) + \zeta_p,\, \bar v - v_p\rangle_U \ge 0,$$
$$\langle \mathcal{F}^{-\star}(\sigma_u \bar u - \mathcal{B}^\star(\bar p + \hat p)),\, v_p - \bar v\rangle_U \ge 0.$$
Together, this implies
$$0 \le \langle \sigma_u(u_p - \bar u) - \mathcal{B}^\star(p_p - \bar p) + \mathcal{F}^\star\zeta_p,\, \bar u - u_p\rangle_U$$
$$= -\sigma_u\|\bar u - u_p\|_U^2 + \langle \mathcal{B}^\star\mathcal{T}(\bar u - u_p),\, \bar u - u_p\rangle_U + \langle \mathcal{F}^\star\zeta_p,\, \bar u - u_p\rangle_U$$
$$= -\sigma_u\|\bar u - u_p\|_U^2 - \langle (\mathcal{S}^\star\Xi\mathcal{S})(\bar u - u_p),\, \bar u - u_p\rangle_U + \langle \mathcal{F}^\star\zeta_p,\, \bar u - u_p\rangle_U$$
$$\le -\sigma_u\|\bar u - u_p\|_U^2 + \|\mathcal{F}^\star\zeta_p\|_U\,\|\bar u - u_p\|_U,$$
which implies the assertion.
Remark 2.6. Since the perturbed variational inequality (2.23) is equivalent to
$$\forall \tilde v \in V_{\mathrm{ad}}: \quad \langle \tilde J'(v_p) + \zeta_p,\, \tilde v - v_p\rangle_U \ge 0, \tag{2.25}$$
the a-posteriori error analysis for pure control constraints [133] immediately provides an error bound for the transformed control $v_p$:
$$\|v_p - \bar v\|_U \le \frac{1}{\sigma_u}\|\zeta_p\|_U$$
and therefore also a (possibly less accurate) a-posteriori error estimate for $u_p$:
$$\|u_p - \bar u\|_U = \|\mathcal{F}^{-1}(v_p - \bar v)\|_U \le \frac{\|\mathcal{F}^{-1}\|_{L_b}}{\sigma_u}\|\zeta_p\|_U. \qquad\diamond$$
Next we construct a suitable perturbation $\zeta_p$ which satisfies the perturbed variational inequality (2.23) and which is computable in practice in the case of the concrete control space $U = L^2(\Theta,\mathbb{R}^m)$. In this setting, the ordering relation $\le$ on $U$ is interpreted componentwise:
$$u \le \tilde u \quad :\Longleftrightarrow\quad \forall i = 1,\dots,m:\ u_i(t) \le \tilde u_i(t) \text{ for almost all } t \in \Theta.$$
Of course, $\zeta_p = -\mathcal{F}^{-\star}(\sigma_u u_p - \mathcal{B}^\star(p_p + \hat p))$ is computable and satisfies (2.23). However, the resulting error estimator $\beta$ would only give a consistent error bound if $\bar u$ is an interior point of $U_{\mathrm{ad}}$. This problem can be remedied by modifying the negative gradient at the points where the constraints are active:
Theorem 2.7. Let $u_p \in U_{\mathrm{ad}}$ with $v_p = \mathcal{F}u_p \in V_{\mathrm{ad}}$, $p_p = \mathcal{T}u_p$ and corresponding gradient $\xi_p = \mathcal{F}^{-\star}(\sigma_u u_p - \mathcal{B}^\star(p_p + \hat p)) \in U$. Then the perturbation $\zeta_p \in U$, defined as
$$\zeta_p^i(t) = \begin{cases} -\min(0, \xi_p^i(t)) & \text{a.e. in } A_a^{i,p} = \{t \in \Theta \mid v_p^i(t) = \hat y_a^i(t)\}, \\ -\max(0, \xi_p^i(t)) & \text{a.e. in } A_b^{i,p} = \{t \in \Theta \mid v_p^i(t) = \hat y_b^i(t)\}, \\ -\xi_p^i(t) & \text{elsewise}, \end{cases} \tag{2.26}$$
satisfies the perturbed variational inequality (2.23).
Proof. We have to show that for all $\tilde v \in [\hat y_a, \hat y_b]$ we get
$$\langle \xi_p + \zeta_p,\, \tilde v - v_p\rangle_U = \sum_{i=1}^m \int_\Theta \big(\xi_p^i(t) + \zeta_p^i(t)\big)\big(\tilde v^i(t) - v_p^i(t)\big)\,\mathrm{d}t \ge 0. \tag{2.27}$$
1. If $v_p^i(t) = \hat y_a^i(t)$ holds for $i \in \{1,\dots,m\}$, $t \in \Theta$, then $\zeta_p^i(t) = -\min(0, \xi_p^i(t))$, i.e.
$$\xi_p^i(t) + \zeta_p^i(t) \ge 0 \quad \text{and} \quad \tilde v^i(t) - v_p^i(t) \ge 0.$$
2. If $v_p^i(t) = \hat y_b^i(t)$ holds for $i \in \{1,\dots,m\}$, $t \in \Theta$, then $\zeta_p^i(t) = -\max(0, \xi_p^i(t))$, i.e.
$$\xi_p^i(t) + \zeta_p^i(t) \le 0 \quad \text{and} \quad \tilde v^i(t) - v_p^i(t) \le 0.$$
3. If $\hat y_a^i(t) < v_p^i(t) < \hat y_b^i(t)$ for $i \in \{1,\dots,m\}$, $t \in \Theta$, then $\zeta_p^i(t) = -\xi_p^i(t)$, i.e.
$$\xi_p^i(t) + \zeta_p^i(t) = 0.$$
Altogether, $\zeta_p$ satisfies the inequality (2.27).
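On a time grid, the construction (2.26) can be sketched as follows for a single component $i$. The array layout and the tolerance used to detect the active sets are discretization assumptions, not part of the theorem.

```python
import numpy as np

def perturbation(xi, v, y_a, y_b, tol=1e-12):
    """Evaluate the perturbation of (2.26) for one component i on a grid.

    xi, v, y_a, y_b: arrays sampling xi_p^i, v_p^i, hat y_a^i, hat y_b^i.
    On the lower active set {v = y_a} the negative part of xi is dropped,
    on the upper active set {v = y_b} the positive part; elsewhere the
    whole gradient is cancelled, zeta = -xi."""
    zeta = -xi.copy()                  # inactive points
    lower = np.abs(v - y_a) <= tol     # active set for the lower bound
    upper = np.abs(v - y_b) <= tol     # active set for the upper bound
    zeta[lower] = -np.minimum(0.0, xi[lower])
    zeta[upper] = -np.maximum(0.0, xi[upper])
    return zeta
```

The bound (2.24) is then obtained by applying $\mathcal{F}^\star$ to the assembled components and approximating $\|\cdot\|_U$ by a quadrature rule.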
The next theorem states that the error bound $\beta$ is sharp if the $\zeta$-function is constructed as proposed in Thm. 2.7:
Theorem 2.8. Suppose that $(u_n)_{n\in\mathbb{N}} \subseteq U_{\mathrm{ad}}$ converges towards $\bar u$ in $U$. If $\zeta_n = \zeta(u_n)$ is chosen as described in (2.26), then $\beta(u_n) = \frac{1}{\sigma_u}\|\mathcal{F}^\star\zeta_n\|_U$ converges towards zero.

Proof. We follow the arguments of Thm. 4.11 in [133]. Assume that there is some $\varepsilon_0 > 0$ such that for all $N \in \mathbb{N}$ there exists $n(N) > N$ with $\|\zeta_{n(N)}\|_U > \varepsilon_0$. Since $\mathcal{T}, \mathcal{F}$ are bounded, $u_n \to \bar u$ in $U$ implies $\xi_n \to \bar\xi$ for $n \to \infty$, in particular $\xi_{n(N)} \to \bar\xi$ for $N \to \infty$. According to the Lebesgue selection theorem [121], Thm. VIII.4.9, $(\xi_{n(N)})_{N\in\mathbb{N}}$ admits a subsequence $(\xi_{n(N)_k})_{k\in\mathbb{N}}$ such that $\xi_{n(N)_k}(t) \to \bar\xi(t)$ for almost all $t \in \Theta$ as $k \to \infty$. Due to the pointwise definition of $\zeta$ in (2.26), this implies $\zeta_{n(N)_k}(t) \to 0$ for almost all $t \in \Theta$, and the Lebesgue convergence theorem [121], Thm. VIII.2.5, states that $\|\zeta_{n(N)_k}\|_U \to 0$. In particular, there exists some $K \in \mathbb{N}$ with $\|\zeta_{n(N)_k}\|_U < \varepsilon_0$ for all $k \ge K$, in contradiction to the primary assumption. Finally, the boundedness of $\mathcal{F}^\star$ implies $\beta(u_n) \to 0$.
Remark 2.9. Let $u \in U_{\mathrm{ad}}$. To compute the gradient $\xi = \xi(u) \in U$, given implicitly by
$$\mathcal{F}^\star\xi = \sigma_u u - \mathcal{B}^\star(\mathcal{T}u + \hat p),$$
we apply the operator equation $\mathcal{S}^\star\mathcal{I}^\star = \mathcal{B}^\star\tilde{\mathcal{T}}$ with adjoint solution operator $\tilde{\mathcal{T}}: U \to Y$,
$$-\dot{\tilde p}(t) + \mathcal{A}\tilde p(t) = \mathcal{I}^\star\tilde\xi(t), \qquad \tilde p(T) = 0 \quad\rightarrow\quad \tilde p = \tilde{\mathcal{T}}\tilde\xi, \tag{2.28}$$
and get with $p = \mathcal{T}u$ the (still implicit) equation
$$\mathcal{F}^\star\xi = (\varepsilon + \mathcal{S}^\star\mathcal{I}^\star)\xi = (\varepsilon + \mathcal{B}^\star\tilde{\mathcal{T}})\xi \quad\Longrightarrow\quad \xi = \frac{1}{\varepsilon}\big(\sigma_u u - \mathcal{B}^\star(p + \tilde{\mathcal{T}}\xi + \hat p)\big).$$
We insert this representation of $\xi$ into (2.28):
$$-\dot{\tilde p}(t) + \mathcal{A}\tilde p(t) + \frac{1}{\varepsilon}\mathcal{I}^\star\mathcal{B}^\star\tilde p(t) = \frac{1}{\varepsilon}\mathcal{I}^\star\big(\sigma_u u - \mathcal{B}^\star(p + \hat p)\big), \qquad \tilde p(T) = 0. \tag{2.29}$$
The well-posedness of (2.29) is ensured by Thm. 2.1 and we obtain the explicit formula
$$\xi = \frac{1}{\varepsilon}\big(\sigma_u u - \mathcal{B}^\star(p + \tilde p + \hat p)\big). \qquad\diamond$$
In the following, we propose a modification which involves a-priori error estimates and does not require solving the augmented adjoint equation (2.29):
Remark 2.10. Assume that a suitable estimate for the operator norm $\|\mathcal{F}\|_{L_b(U,U)} \le C_{\mathcal{F}}$ is available. Then the modified a-posteriori error estimator
$$\tilde\beta(u) = \frac{C_{\mathcal{F}}}{\sigma_u}\|\zeta(u)\|_U \tag{2.30}$$
can be evaluated without requiring further PDE solves if the adjoint states $p = \mathcal{T}u$ and $\hat p$, which are needed to build up $\zeta$, are already available. This is usually the case if $u$ is determined in an iterative solution procedure for the optimality system (2.14) where the gradient $\xi$ is involved anyway, in contrast to the solution $\tilde p$ of (2.29), which appears only in the error estimator.
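A minimal sketch of evaluating the modified estimator (2.30), assuming the perturbation $\zeta(u)$ is available as grid values and the $U$-norm is the $L^2(\Theta)$-norm, here approximated by the trapezoidal rule:

```python
import numpy as np

def beta_tilde(zeta, t, C_F, sigma_u):
    """Modified a-posteriori bound (2.30): (C_F / sigma_u) * ||zeta||_U,
    where ||.||_U is the L2(Theta)-norm of the grid values `zeta` on the
    (possibly non-uniform) time grid `t` (trapezoidal quadrature)."""
    sq = zeta**2
    norm_sq = float(np.sum(0.5 * (sq[:-1] + sq[1:]) * np.diff(t)))
    return C_F / sigma_u * np.sqrt(norm_sq)
```

The point of (2.30) is visible here: no PDE solve enters the evaluation, only quantities already produced by the optimization loop.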
1. We choose
$$C_{\mathcal{F}} = \varepsilon + \|\mathcal{I}\|_{L_b(L^2(\Theta,V),U)}\,\|\mathcal{S}\|_{L_b(U,L^2(\Theta,V))}. \tag{2.31}$$
An a-priori estimate for $\|\mathcal{S}u\|_{L^2(\Theta,V)}$ can be derived directly from (2.10) in the proof of Thm. 2.1, choosing $C = 0$ and $w = \mathcal{B}u$. To get a smaller bound $\tilde C$ there, we select the weights used for Young's inequality within this proof in a more appropriate way: From
$$\frac{1}{2}\frac{\mathrm{d}}{\mathrm{d}t}\|y(t)\|_H^2 + \langle \mathcal{A}y(t), y(t)\rangle_{V',V} = \langle \mathcal{B}u(t), y(t)\rangle_{V',V}$$
we derive, choosing $\delta \in (0, 2\alpha_1)$, the estimate (2.32) by Young's inequality [48], Thm. B.2.c. The coefficient $(2\alpha_1 - \delta)$ is positive and Gronwall's inequality [48], Thm. B.2.j, implies (2.33). By combining (2.32) & (2.33), we get a bound of the form $\|\mathcal{S}u\|_{L^2(\Theta,V)}^2 \le \varphi(\delta)\|\mathcal{B}u\|_{L^2(\Theta,V')}^2$; since $\varphi$ is continuous and bounded from below, $\delta$ can be chosen as the minimum of $\varphi$ on $[0, 2\alpha_1]$.
2. If $\mathcal{B}$ is even bounded as a mapping into $L^2(\Theta,H)$, we replace (2.31) by
$$C_{\mathcal{F}} = \varepsilon + \|\mathcal{I}\|_{L_b(L^2(\Theta,V),U)}\,\|\mathcal{S}\|_{L_b(U,L^2(\Theta,H))}. \tag{2.34}$$
To get a-priori bounds for $\|\mathcal{S}\|_{L_b(U,L^2(\Theta,H))}$, we replace (2.32) by
$$\frac{\mathrm{d}}{\mathrm{d}t}\|y(t)\|_H^2 \le (2\alpha_2 + \delta)\|y(t)\|_H^2 - 2\alpha_1\|y(t)\|_V^2 + \frac{1}{\delta}\|\mathcal{B}u(t)\|_H^2 \tag{2.35}$$
$$\le \Big(\delta + 2\alpha_2 - \frac{2\alpha_1}{C_V^2}\Big)\|y(t)\|_H^2 + \frac{1}{\delta}\|\mathcal{B}u(t)\|_H^2.$$
With $\tilde C(\delta) = \delta + 2\alpha_2 - \frac{2\alpha_1}{C_V^2}$, the application of Gronwall's inequality results in
$$\|y(t)\|_H^2 \le \tilde\varphi(\delta)\|\mathcal{B}u\|_{L^2(\Theta,H)}^2 \quad\Longrightarrow\quad \|\mathcal{S}\|_{L_b(U,L^2(\Theta,H))}^2 \le \tilde\varphi(\delta)\|\mathcal{B}\|_{L_b(U,L^2(\Theta,H))}^2$$
with $\tilde\varphi(\delta) = \frac{1}{\delta\tilde C(\delta)}\big(e^{\tilde C(\delta)T} - 1\big)$, and we select $\delta$ as the minimum of $\tilde\varphi$ on $[0, 2\alpha_2]$.
3. If $C_V^2\alpha_2 \le \alpha_1$ holds, then $\delta \mapsto \tilde C(\delta)$ has a zero at $\delta_\circ = \frac{2\alpha_1}{C_V^2} - 2\alpha_2$. In this case, $\tilde\varphi$ can be continuously extended by $\tilde\varphi(\delta_\circ) = \frac{T}{\delta_\circ}$ according to l'Hôpital's rule, [50], Thm. 16.10. $\diamond$
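The selection of the Young weight $\delta$ can be sketched numerically. The functions below assume $\tilde C(\delta) = \delta + 2\alpha_2 - 2\alpha_1/C_V^2$ and $\tilde\varphi(\delta) = (e^{\tilde C(\delta)T} - 1)/(\delta\tilde C(\delta))$ with the l'Hôpital limit $T/\delta$ at the removable singularity; the grid search itself is an implementation choice, not part of the source.

```python
import math

def C_tilde(delta, alpha1, alpha2, C_V):
    """Gronwall coefficient: C(delta) = delta + 2*alpha2 - 2*alpha1/C_V^2."""
    return delta + 2.0 * alpha2 - 2.0 * alpha1 / C_V**2

def phi_tilde(delta, alpha1, alpha2, C_V, T):
    """phi(delta) = (exp(C(delta)*T) - 1) / (delta*C(delta)); the removable
    singularity at C(delta) = 0 is filled with the l'Hopital limit T/delta."""
    C = C_tilde(delta, alpha1, alpha2, C_V)
    if abs(C) < 1e-12:
        return T / delta
    return (math.exp(C * T) - 1.0) / (delta * C)

def best_delta(alpha1, alpha2, C_V, T, n=2000):
    """Minimize phi over a grid in (0, 2*alpha2], mirroring the choice of
    delta in the text (a simple grid search)."""
    grid = [2.0 * alpha2 * k / n for k in range(1, n + 1)]
    return min(grid, key=lambda d: phi_tilde(d, alpha1, alpha2, C_V, T))
```

The smaller $\tilde\varphi(\delta)$, the smaller the operator-norm bound $C_{\mathcal{F}}$ in (2.34), and hence the sharper the cheap estimator (2.30).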
2.2.2. Penalization
We apply the a-posteriori error analysis of the preceding section to the situation where we regularize by penalization:
Theorem 2.11. (Grimm, Gubisch & Volkwein 2015 [59])
Let $(u_p, w_p) \in U_{\mathrm{ad}} \times W_{\mathrm{ad}}$ be an arbitrary control-penalty pair with the corresponding transformed variable $(\mu_p, \omega_p) = \mathcal{F}(u_p, w_p)$, the state $y_p = \mathcal{S}u_p$ and the adjoint state $p_p = \mathcal{T}u_p + \frac{\sigma_u}{\varepsilon^2}\big(\mathcal{T}_U u_p + \mathcal{T}_W(\varepsilon w_p + \mathcal{I}y_p)\big)$. Let $(\zeta_{up}, \zeta_{wp}) = \zeta(u_p, w_p) \in U \times W$ satisfy the perturbed variational inequality
$$\forall (\tilde\mu, \tilde\omega) \in U_{\mathrm{ad}} \times W_{\mathrm{ad}}: \quad \langle \sigma_u\mu_p - \mathcal{B}^\star(p_p + \hat p) + \zeta_{up},\, \tilde\mu - \mu_p\rangle_U + \big\langle \tfrac{\sigma_u}{\varepsilon^2}(\omega_p - \mathcal{I}y_p) + \zeta_{wp},\, \tilde\omega - \omega_p\big\rangle_W \ge 0 \tag{2.36}$$
and let $\sigma_u = \sigma_w$. Then the following a-posteriori error estimate holds true:
$$\|(\bar u - u_p, \bar w - w_p)\|_{U\times W} \le \beta(u_p, w_p) = \frac{1}{\sigma_u}\|(\zeta_{up} + \mathcal{B}^\star\mathcal{T}_W\zeta_{wp},\, \varepsilon\zeta_{wp})\|_{U\times W}. \tag{2.37}$$
Proof. Recall that the transformation $\mathcal{F}$ and its inverse $\mathcal{F}^{-1}$ are explicitly given by
$$\mathcal{F}(u, w) = (u, \varepsilon w + \mathcal{I}\mathcal{S}u), \qquad \mathcal{F}^{-1}(\mu, \omega) = (\mu, \varepsilon^{-1}(\omega - \mathcal{I}\mathcal{S}\mu)).$$
Hence, since $\mathcal{S}^\star\mathcal{I}^\star = \mathcal{B}^\star\mathcal{T}_W$ holds by (2.19), explicit representations of the adjoint operators $\mathcal{F}^\star, \mathcal{F}^{-\star}$ are available as well:
$$\mathcal{F}^\star(u, w) = (u + \mathcal{B}^\star\mathcal{T}_W w, \varepsilon w), \qquad \mathcal{F}^{-\star}(\mu, \omega) = (\mu - \varepsilon^{-1}\mathcal{B}^\star\mathcal{T}_W\omega, \varepsilon^{-1}\omega).$$
Combining (2.17) & (2.18), one finds $\mathcal{T}_U = -\mathcal{T}_W\mathcal{I}\mathcal{S}$ and we get (2.38). Now the perturbed variational inequality (2.36) has the representation
$$\forall (\tilde\mu, \tilde\omega) \in U_{\mathrm{ad}} \times W_{\mathrm{ad}}: \quad \langle \tilde J'(\mu_p, \omega_p) + (\zeta_{up}, \zeta_{wp}),\, (\tilde\mu - \mu_p, \tilde\omega - \omega_p)\rangle_{U\times W} \ge 0 \tag{2.39}$$
and with $\sigma_u = \sigma_w$, (2.25) and Thm. 2.5 we get
$$\|(\bar u, \bar w) - (u_p, w_p)\|_{U\times W} \le \beta(u_p, w_p) = \frac{1}{\sigma_u}\|\mathcal{F}^\star(\zeta_{up}, \zeta_{wp})\|_{U\times W}. \tag{2.40}$$
Using the explicit representation of $\mathcal{F}^\star$ gives (2.37).
Remark 2.12. The condition $\sigma_u = \sigma_w$ is no restriction for our model since the impact of $\sigma_w$ can be included in $\varepsilon$. Actually, we mainly focus on problems with low control costs (i.e. small values of $\sigma_u$) and strong penalization (large $\sigma_w$ or small $\varepsilon$). $\diamond$

As before, we obtain a consistent perturbation $\zeta_p$ from the negative gradient of the transformed objective functional $\tilde J$ if we respect the active and inactive domains. Let the control space be given by $U = L^2(\Theta,\mathbb{R}^m)$ and the penalty space by $W = L^2(\Theta,\mathbb{R}^n)$; then we have
Theorem 2.13. Consider the admissible control $u_p \in U_{\mathrm{ad}}$ and penalty $w_p \in W_{\mathrm{ad}}$ with transformed variables $(\mu_p, \omega_p) = \mathcal{F}(u_p, w_p) \in V_{\mathrm{ad}}$ and corresponding gradient. Then the a-posteriori error estimate (2.37) holds true and the error bound $\beta$ is consistent.
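Assuming all ingredients are available as grid values (in particular $\mathcal{B}^\star\mathcal{T}_W\zeta_{wp}$, which in practice requires one adjoint solve), the bound (2.37) can be evaluated as in the following sketch; the uniform grid and the rectangle quadrature for the product norm are discretization assumptions.

```python
import numpy as np

def beta_penalized(zeta_u, BTW_zeta_w, zeta_w, eps, sigma_u, dt):
    """Evaluate the bound (2.37) on a uniform time grid with spacing dt.

    F*(zeta_u, zeta_w) = (zeta_u + B* T_W zeta_w, eps * zeta_w); the product
    norm is ||(a, b)||^2 = ||a||_U^2 + ||b||_W^2, approximated by a rectangle
    rule.  BTW_zeta_w holds precomputed grid values of B* T_W zeta_w."""
    first = zeta_u + BTW_zeta_w        # first component of F*(zeta_u, zeta_w)
    second = eps * zeta_w              # second component
    norm_sq = dt * (np.sum(first**2) + np.sum(second**2))
    return np.sqrt(norm_sq) / sigma_u
```

Note that for small $\varepsilon$ the second component is damped by the factor $\varepsilon$, so the bound is dominated by the control part $\zeta_{up} + \mathcal{B}^\star\mathcal{T}_W\zeta_{wp}$.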