Discretization-Optimization Methods for Nonlinear Elliptic Optimal Control Problems with State Constraints
I. Chryssoverghi1, J. Geiser2, J. Al-Hawasy1
(1) Department of Mathematics, School of Applied Mathematics and Physics National Technical University of Athens (NTUA)
Zografou Campus, 15780 Athens, Greece e-mail: ichris@central.ntua.gr
(2) Weierstrass Institute for Applied Analysis and Stochastics (WIAS) Mohrenstrasse 39, D-10117 Berlin, Germany
e-mail: geiser@wias-berlin.de
Abstract
We consider an optimal control problem described by a second order elliptic boundary value problem, jointly nonlinear in the state and control, with control and state constraints, where the state constraints and cost functionals involve also the state gradient. Since this problem may have no classical solutions, it is also formulated in the relaxed form. The classical problem is discretized by using a finite element method for state approximation, while the controls are approximated by elementwise constant, or linear, or multilinear, controls. Various necessary conditions for optimality are given for the classical and the relaxed problem, in the continuous and the discrete case. We then study the behavior in the limit of discrete optimality, and of discrete extremality and admissibility. Next, we apply a penalized gradient projection method to each discrete problem, and also a progressively refining version of this method to the continuous classical problem. We prove that accumulation points of sequences generated by the first method are extremal for the discrete problem, and that strong classical (resp. relaxed) accumulation points of sequences of discrete controls generated by the second method are admissible and weakly extremal classical (resp. relaxed) for the continuous classical (resp. relaxed) problem.
Finally, numerical examples are given.
Keywords. Optimal control, nonlinear elliptic systems, state constraints, discretization, finite elements, discrete penalized gradient projection method, progressive refining.
1 Introduction
We consider an optimal control problem described by a second order elliptic boundary value problem, which is jointly nonlinear in the state and control, with control and state constraints, where the state constraints and cost functionals involve also the gradient of the state. The problem is discretized by using a Galerkin finite element method with continuous elementwise linear basis functions for state approximation, while the controls are approximated by (not necessarily continuous) elementwise constant, or linear, or multilinear, controls. Various necessary conditions for optimality are given for the classical and the relaxed problem, in the continuous and the discrete case. Under appropriate assumptions, we prove that strong accumulation points in of sequences of optimal (resp. admissible and extremal) discrete controls are optimal (resp. admissible and weakly extremal classical) for the continuous classical problem, and that relaxed accumulation points of sequences of optimal (resp. admissible and extremal) discrete controls are optimal (resp. admissible and weakly extremal relaxed) for the continuous relaxed problem. We then apply a penalized gradient projection method to each discrete problem, and also a corresponding discrete method to the continuous classical problem, which progressively refines the discretization during the iterations, thus reducing computing
L2
time and memory. We prove that accumulation points of sequences generated by the first method are extremal for the discrete problem, and that strong classical (resp.
relaxed) accumulation points of sequences of discrete controls generated by the second method are admissible and weakly extremal classical (resp. relaxed) for the continuous classical (resp. relaxed) problem. Finally, numerical examples are given.
For approximation and optimization methods applied to distributed optimal control problems, see e.g. [2], [5,6], [8-12], [16-18], and the references therein.
2. The continuous optimal control problems
Let be a bounded domain in , with Lipschitz boundary Γ. Consider the nonlinear elliptic state equation
Ω \d
(2.1) Ay+ f x y x w x( , ( ), ( )) 0= in Ω, (2.2) y x( ) 0= on , Γ
where is the formal second order elliptic differential operator A (2.3)
1 1
: d d ( / i)[ ( )ij / j
j i
Ay x a x y x
= =
= −
∑∑
∂ ∂ ∂ ∂ ].The constraints on the control are ( )w x ∈U in Ω, where U is a compact subset of
\ν, the state constraints are
(2.4) G wm( ) : gm( , ( ),x y x y x w x dx( ), ( )) 0, m p
=
∫
Ω ∇ = =1,..., ,(2.5) G wm( ) : gm( , ( ),x y x y x w x dx( ), ( )) 0,
=
∫
Ω ∇ ≤ m= +p 1,..., ,q, and the cost functional is
(2.6) G w0( ) : g x y x0( , ( ), y x w x dx( ), ( )) .
=
∫
Ω ∇The state equation will be interpreted in the following weak form
(2.7) y V∈ , and a y v( , ) b x y x w x v x dx( , ( ), ( )) ( ) f x w x v x dx( , ( )) ( ) ,
Ω Ω
+
∫
=∫
∀ ∈v Vwhere a( , )⋅ ⋅ is the usual bilinear form associated with and defined on A V V× (2.8)
, 1
( , ) : d ij( ) .
i j i j
y v
a y v a x dx
x x
= Ω
= ∂ ∂
∑ ∫
∂ ∂Defining the set of classical controls
(2.9) W: { := w x6w x w( ) measurable from Ω to }U ⊂L∞( )Ω ⊂L2( )Ω ,
the continuous classical optimal control problem is to minimize subject to and to the above state constraints.
P G0
w W∈
It is well known that such nonconvex optimal control problems may have no classical solutions, but reformulated in the so-called relaxed form, they have a solution in an extended space under weak assumptions. Next, we define the set of relaxed controls (or Young measures; for the relevant theory, see [19], [15])
(2.10) R: { := r Ω →M U1( ) r weakly measurable}⊂L∞w( ,Ω M U( ))≡L1( , ( ))*Ω C U , where ( )M U (resp. M U1( )) is the set of Radon (resp. probability) measures on U. The set R is endowed with the relative weak star topology, and R is convex, metrizable and compact. If each classical control ( )w ⋅ is identified with its associated Dirac relaxed control r( ) :⋅ =δw( )⋅ , then W may also be considered as a subset of R,
and W is thus dense in R. For a given φ∈L1( ; ( ))Ω C U =L1( ; ( ))Ω C U (or ( , ; )
B U
φ∈ Ω \ , where ( , ; )B ΩU \ is the set of Caratheodory functions in the sense of Warga [19]) and r∈L∞w( ,Ω M U( )) (in particular, for r∈R), we shall use the notation (2.11) ( , ( )) : ( , ) ( )( ),
x r x U x u r x du
φ =
∫
φand ( , ( ))φ x r x is thus linear (under convex combinations, for ) in . A sequence ( converges to r in
r∈R r
k)
r ∈R R iff
(2.12) lim ( , ( ))k ( , ( ))
k φ x r x dx φ x r x dx
Ω Ω
→∞
∫
=∫
,for every φ∈L1( ; ( ))Ω C U , or φ∈ ΩB( , ; )U \ , or φ∈ Ω×C( U).
We denote by ⋅ the Euclidean norm in \n, by ⋅ ∞ the norm in , by and
( , n) L∞ Ω \ ( , )⋅ ⋅ ⋅ the inner product and norm in L2( ;Ω \n), and by ( , )⋅ ⋅ 1 and ⋅ 1 the inner product and norm in the Sobolev space V:=H01(Ω). We can now formulate the relaxed problem as follows. The relaxed state equation (in weak form) is given by (2.13) y V∈ and a y v( , ) f x y x r x v x dx( , ( ), ( )) ( ) 0,
+
∫
Ω = ∀ ∈v V,the control constraint is r∈R, and the relaxed functionals are (2.14) G rm( ) : gm( , ( ),x y x y x r x dx( ), ( )) ,
=
∫
Ω ∇ m=0,... .qThe continuous relaxed optimal control Problem P is to minimize subject to the constraints
0( ) G r (2.15) r∈R, G rm( ) 0,= m=1,..., ,p G rm( ) 0,≤ 1,..., .m= +p q
In the sequel, we shall make some of the following assumptions.
Assumptions 2.1 The coefficients aij satisfy the ellipticity condition
(2.16) 0 2
, 1 1
( ) ,
d d
ij i j i
i j i
a x z z α z
= =
∑
≥∑
∀z zi, j∈\, x∈Ω,with α0 >0, aij∈L∞( )Ω , which implies that
(2.17) a y v( , ) ≤α1 y1 v1, a v v( , )≥α2 v12, ∀y v V, ∈ , for some α1≥0,α2>0.
Assumptions 2.2 The functions f and fy are defined on Ω× ×\ U , measurable for fixed y u, , continuous for fixed x, and satisfy
(2.18) f x( ,0, )u ≤φ0( ),x ( , )∀ x u ∈Ω×U,
where φ0∈Ls( )Ω , with s≥2, s≥n/ 2 (e.g. s=2, for n=1, 2,3), and (2.19) 0≤ fy( , , )x y u ≤φ1( ) (x η1 y), ( , , )∀ x y u ∈Ω× ×\ U,
where η1 is an increasing function from [0,+∞) to [0,+∞), if the functionals depend on , and
1 L ( )
φ ∈ ∞ Ω Gm ∇y φ1∈Ls( )Ω otherwise.
Assumptions 2.3 The functions gm are defined on Ω×\d+1×U, measurable for fixed , continuous for fixed
, ',
y y u x, and satisfy
(2.20) gm( , , ', )x y y u ≤ψ0m( )x +β0m y' ,2 ( , , ', )x y y u d+1 U
∀ ∈Ω×\ × with y ≤C',
where C'>C, ψ0m∈L1( )Ω , β0m≥0.
Assumptions 2.4 The functions gm,gmy,gmy' are defined on Ω×\d+1×U', where is an open set containing the compact set U, measurable on for fixed
, continuous on
' U Ω
( , ', )y y u ∈\d+1×U' \d+1×U' for fixed x∈Ω, and gmy,gmy' satisfy (2.21) gmy( , , ', )x y y u 1m( )x 1m y'2( 1),
ρ
ψ β ρ−
≤ +
(2.22) gmy'( , , ', )x y y u ≤ψ2m( )x +β2m y' , (2.23) gmu( , , ', )x y y u ≤ψ3m( )x +β3m y' ,
( , , ', )x y y u d+1 U'
∀ ∈Ω×\ × , with y ≤C',
where C'<C, ψim∈L2( )Ω , 0βim ≥ , [1, )ρ∈ ∞ if n=1 or 2, 2
: 2
n ρ σ< = n
− if n≥3. The following theorem follows directly form Theorem 3.1 in [3].
Theorem 2.1 Under Assumptions 2.1-2, for every relaxed control , the state equation has a unique solution
r∈R : r
y = y ∈ ∩V Cα( )Ω , for some α∈(0,1). Moreover, there exists constants ,C C such that
(2.24) yr 1+ yr ∞ ≤C, yr Cα ≤C, for every r∈R.
The following proposition is a simple generalization of Proposition 2.1 in [7], and will be useful in the sequel. It can be proved by using the (possible) convergence
, the fact that a converging sequence in
rk →r Ls is dominated (in norm a.e. in Ω,
and up to a subsequence) by a fixed function in Ls, Hölder’s inequality, Egorov’s theorem, and Lebesgue’s dominated convergence theorem.
Proposition 2.1 For i=1,...,K, K ≥0, let si∈ +∞[1, ], [0, ]σi∈ si if si< +∞, σi: 0= if si = +∞, with
0 1
1 K i 1
i i
s s
σ
=
+
∑
≤ , 1 : 0si = if si = +∞. Let be a function defined on , measurable for every fixed, continuous for every
F ( N)K U
Ω× \ × y u, x fixed, and
satisfying (2.25)
1
( , , ) ( ) ( ) ( )
K
i i
i
F x y u x x ξ y
=
≤ Φ + Ψ
∏
,for every ( , , )x y u ∈Ω×(\N)K×U, with yi ≤Ci if si = +∞,
where y: ( ,...,= y1 yK), Φ ∈L1( )Ω , Ψ ∈Ls0( )Ω , ξi( yi ) := yi σi if si < +∞, ( ) :
i yi
ξ =1 if si = +∞. If ( converges to in strongly, , with
)
k
yi yi Lsi( ;Ω \N) i=1,...,K
k
i i
y ∞ ≤C (for sufficiently large) if k si = +∞, and ( )rk converges to in r R, then
(2.26) lim ( , k( ), ( ))k ( , ( ), ( )) .
k F x y x r x dx F x y x r x dx
Ω Ω
→∞
∫
=∫
Theorem 2.2 Under Assumptions 2.1-3, the operator r6 yr (resp. w6 yw), from R (resp. W with the relative topology of L2( ;Ω \ν), hence of L∞( ;Ω \ν)) to V , and
to C0( )Ω , and the functionals r 6G rm( ) on R (resp. on W with the same topologies) are continuous. If the relaxed problem has an admissible control (i.e.
satisfying all the constraints), then it has a solution.
m( ) w6G w
Proof. Let be a sequence that converges to in W, with the relative topology of
( )wk w
2( , )
L Ω \ν . Since the corresponding sequence of states is bounded in V and in
( )yk
0( )
Cα Ω , for some α∈(0,1), and since the injection of C0α( )Ω into C0( )Ω is compact, there exists a subsequence (same notation) converging to some in V weakly and in
y
0( )
C Ω strongly. Let any v V∈ be given. By the state equation (2.27) a y v( , )k f x y x w x vdx( , ( ),k k( )) 0
+
∫
Ω = .By the mean value theorem and since η1 is increasing, we have, for every with y y ≤C ( defined in Theorem 2.1), and for some C µ( ) [0,1]x ∈
(2.28) f x y u v( , , ) ≤ f x( ,0, )u v + f x y u v( , , ) − f x( ,0, )u v
0 1 1
( ,0, ) y( , ( ) , ) ( ) ( ) ( ( ) )
f x u v f x µ x y u yv φ x φ x vη µ x y y
= + ≤ +
0( )x 1( )x v 1( )C C
φ φ η
≤ + ,
Since φ0∈Ls, φ1∈Ls or L∞, v V∈ ⊂L2, and yk →y in L∞, we can apply Proposition 2.1 to pass to the limit in the state equation for and find that . Next, we have
yk y=yw
(2.29) 2 2
1 ( ,
n n n
y y a y y y y
α − ≤ − − )
y
( ( ,f y wn n),yn) a y y( , n) a y( n y y, ) 0,
= − − − − →
since yk → in V weakly and ( ( ,f y wn n),yn)→( ( , ), )f y w y by Proposition 2.1, which shows that in V strongly. The convergence of the original sequence follows from the uniqueness of the limit. The continuity of the functionals follows then from Proposition 2.1. The proofs for and are similar. The existence of an optimal relaxed control follows from the compactness of
yn →y
Gm
r6 yr Gm:R→\
R and the continuity of the functionals Gm (the set of admissible controls is a closed subset of
R).
Note that the classical problem may have no classical solution, and since , we generally have
W ⊂R
(2.30) 0 0 ,
constraints on constraints on
: min ( ) inf ( ) :
R W
r w
c = G r ≤ G w =c
where the equality holds, in particular, if there are no state constraints, as W is dense in R. Since usually approximation methods slightly violate the state constraints, approximating an optimal relaxed control by a relaxed or a classical control, hence the possibly lower relaxed optimal cost , is not a drawback in practice (see [19], p.
259).
cR
Lemma 2.1 Under Assumptions 2.1-4, dropping the index m in , , for , the functional G, defined on
gm Gm , '
r r ∈R R (resp. W, with U convex) is l-
differentiable at (resp. ) for every integer l, i.e. for every and any choice of l controls (resp. ),
r w l
ri∈R wi∈W i=1,...,l, we have
(2.31)
1 1
( l i( i )) ( ) l ( , i ) i ( i
i i
G r r r G r DG r r r o
1
),
l
i
ε ε ε
= =
+
∑
− − =∑
− +∑
=(resp.
1 1
( l i( i )) ( ) l ( , i ) i ( i
i i
G w w w G w DG w w w o
1
)
l
i
ε ε ε
= =
+
∑
− − =∑
− +∑
= ),for 0,εi ≥
1
1,
l i i
ε
=
∑
≤with DG r r( , i r) : H x y( , , y z r x, , ( )i r x dx( )) ,
− =
∫
Ω ∇ −(resp. DG w w( , i w) : H x yu( , , y z w w x, , )( ( )i w x dx( )) ),
− =
∫
Ω ∇ −where the Hamiltonian is defined by
(2.32) H x y y z u( , , ', , ) := −z f y x u( , , )+g x y y u( , , ', ),
and the adjoint state z:= ∈zr V satisfies the linear adjoint equation (2.33) a v z( , ) ( ( , ) , ) ( ( ,+ fy y r z v = gy y ∇y r v, ), ) (+ gy'( ,y ∇y r, ),∇v),
∇ (resp. a v z( , ) ( ( , ) , ) ( ( ,+ fy y w z v = gy y ∇y w v, ), ) (+ gy'( ,y ∇y w, ), v) ),
,
∀ ∈v V with y:= yr (resp. y:= yw).
In particular, the directional derivative of the functional G defined on R (resp. W, with U convex) is given by
(2.34)
0
( ( )) (
( , ) limG r r r G r
DG r r r
α
α ) α
→ +
+ − −
− =
( , ( ), ( ), ( ), '( ) ( )) H x y x y x z x r x r x dx
=
∫
Ω ∇ − ,(resp.
0
( ( )) (
( , ) lim G r w w G w
DG r w w
α
α ) α
→ +
+ − −
− =
( , ( ), ( ), ( ), ( ))( ( ) ( )) H x y xu y x z x w x w x w x dx
=
∫
Ω ∇ − ).Moreover, the operator r6zr, from R to V (resp. , from W to V ), and the functional
w6zw
( , )r r 6DG r r( , −r), on R R× (resp. ( , )w w 6DG r w w( , − ), on W ), are continuous.
×W Proof. We shall prove the l-differentiability for classical controls only; we could also prove the Fréchet differentiability in this case, but the proof will be thus similar to the proof for relaxed ones. We first remark that, by our assumptions and since the injection V ⊂Lρ is continuous, the functional
(2.35) v6( ( ,gy y ∇y w v, ), ) (+ gy'( ,y ∇y w, ),∇v)
belongs to the dual V* of V , and fy( , )y w ∈Ls( )Ω , 2≤ ≤ ∞s , . Hence the linear adjoint equation has a unique solution
( , ) 0 fy y w ≥
z∈V, for every w W∈ , by the Lax- Milgram theorem (if s= ∞), or by Lemma 3.2 in [3] (if s< ∞, no in ). Now let
, ,
'
y g
w W∈ wi∈W εi∈(0,1), i=1,...,l, ε: ( ,..., )= ε1 εl , with
1
: l i 1
i
ε ε
=
=
∑
≤ , and set(2.36) ,
1
: l i( i )
i
wε w ε w w
=
= +
∑
− δwi:=wi−w, :y = yw, :yε = ywε , :δεy = yε −y. From the state equation, we have(2.37) (a δy vε, ) ( ( ,+ f y wε ε)− f y w v( , ), )
( , ) ( ( , ) ( , ), ) ( ( , ) ( , ), ) a δy vε f y wε ε f y wε v f y wε f y w v 0
= + − + − = .
Using the mean value theorem, we see that δyε satisfies the linear equation
(2.38) , ,
1
( , ) ( (y ) , )) l ( i u( , ) , )
i
a δy vε f y µδyε δy vε ε f y w µδwε δw v
=
+ + = −
∑
+ i ∀ ∈v Vwhere the functions
(2.39) a:= fy(y+µδyε) (with a≥0),
1
: ( , )
l
i u i
i
f ε f y w µδwε δw
=
= −
∑
+ ,belong to L∞( )Ω (or Ls) and ( )Ls Ω , respectively, by our assumptions. It then follows from Lemma 3.2 in [3] that
(2.40) 1 s ' .
y y c f L c
ε ε
δ + δ ∞ ≤ ≤ ε
Now, by our assumptions, the functional on the open subset Y×L2( ,Ω \d)×W' of ( ) 2( , d) ( , v)
L∞ Ω ×L Ω \ ×L∞ Ω \
(2.41) ( , ', ) :y y w g x y y w dx( , , ', ) , Φ =
∫
Ωwhere
(2.42) Y:=
{
φ∈L∞( )Ω φ ∞ <C'}
, W' :={
ψ∈L∞( )Ω ψ :Ω →U'}
,has the Fréchet derivative defined by (2.43) '( , , ', )(Φ x y y w δ δy, y',δw)
' .
[ ( , , ', )gy x y y wδy gy( , , ', )x y y wδy' g x y y wu( , , ', ) w dx]
=
∫
Ω + + δThis can be shown under our assumptions by using the mean value theorem in max- form, the Cauchy-Schwartz inequality, and Proposition 2.1. Using then the above estimate on δεy, we have
(2.44)
1
( l i i )
i
o δεy ∞ δεy ε δw o( )
= ∞
+ ∇ +
∑
= εε
, hence
(2.45) G w( )ε G w( ) gy( ,y y w, )δεydx gy'( ,y y w, ) δ ydx
Ω Ω
− =
∫
∇ +∫
∇ ∇1
( , , ) ( )
l
i u i
i
g y y w w dx o
ε δ ε
= Ω
+
∑ ∫
∇ + .Similarly, the state equation, for v:=z, yields by linearization (2.46)
1
( , ) ( ( , ) , ) ( ( , ) , ) ( ) 0
l
y i u i
i
a δεy z f y w δεy z ε f y w δw z o
=
+ +
∑
+ ε = .On the other hand, the adjoint equation, for :v =δyε, yields
(2.47) a(δεy z, ) ( ( , ) ,+ fy y w z δεy) ( ( , ),= gy y w δεy) (+ gy'( , ),y w ∇δεy).
Gathering the above results, we obtain (2.48)
1
( ) ( ) [ ( , ) ( , , )] ( )
l
i u u i
i
G wε G w ε z f y w g y y w δw dx o ε
= Ω
− =
∑ ∫
− + ∇ +1
( , ( ), ( ), ( ), ( )) ( ).
l
i u i
i
H x y x y x z x w x w dx o
ε δ ε
= Ω
=
∑ ∫
∇ +Finally, the continuity of the operator is proved by using the continuity of , from W to , the compact injection , and Proposition 2.1. The continuity of the functional
w6zw
w6 yw L∞ V ⊂L2
( , )w w 6DG r w w( , − ) follows from the above continuities. The continuity proofs for relaxed controls are similar.
The following theorem states various continuous necessary conditions for optimality.
Theorem 2.3 Under Assumptions 2.1-4, if r∈R (resp. w W∈ , with U convex) is optimal for Problem P or (resp. Problem ), then (resp. ) is strongly extremal relaxed (resp. weakly extremal classical), i.e. there exist multipliers
P P r w
λm∈\, , with
0,...,
m= q
(2.49) λ0 ≥0, 0λm≥ , m= +p 1,...,q,
0
1
q m m
λ
=
∑
= , such that(2.50)
0
( , ) 0,
q
m m
m
DG r r r λ
=
∑
− ≥ ∀ ∈r R,(2.51) ( ) 0,λmG rm = m= +p 1,...,q (relaxed transversality conditions).
(resp.
(2.52)
0
( , ) 0,
q
m m
m
DG w w w λ
=
− ≥
∑
∀ ∈w W ,(2.53) ( ) 0,λmG wm = m= +p 1,...,q (classical transversality conditions) ).
The global condition (2.50) is equivalent to the strong relaxed pointwise minimum principle
(2.54) ( , ( ), ( ), ( ), ( )) min ( , ( ), ( ), ( ), ),
u U
H x y x y x z x r x H x y x y x z x u
∇ = ∈ ∇ a.e. in Ω,
where the complete Hamiltonian and adjoint , are defined with replaced by . If U is convex, then this principle implies the weak relaxed pointwise minimum principle
H z g
0 q
m m
m
λ g
∑
=(2.55) H x y z r x r xu( , , , ( )) ( ) minH x y z r xu( , , , ( )) ( , ( )),x r x
φ φ
= a.e. in Ω
where the minimum is taken over the set ( , ; )B ΩU U of Caratheodory functions (see [18]), which in turn implies the global weak relaxed condition
(2.56) H x y z r xu( , , , ( ))[ ( , ( ))φ x r x r x dx( )] 0,
Ω − ≥
∫
∀ ∈ Ωφ B( , ; )U U .A control satisfying this condition and the above transversality conditions is called weakly extremal relaxed. The global condition (2.52) is equivalent to the weak classical pointwise minimum principle
r
(2.57) ( , ( ),u ( ), ( ), ( )) ( ) min u( , ( ), ( ), ( ), ( )) , H x y x y x z x w x w x u U H x y x y x z x w x u
∇ = ∈ ∇
a.e. in Ω.
Proof. The functionals Gm, m=0,...,q, defined on R (resp. W) are continuous (Theorem 2.1) and, by Lemma 2.2, (p+1)-differentiable (cost and p equality state constraints) at (resp. ). The global condition (i) (resp. (iii)) and the transversality conditions (ii) (resp. (iv)) follow then from the general multiplier theorem V.2.3 (resp.
V.3.2) in [19] ( depends here on the control only, since or is unique for every or ). The equivalence of the global and pointwise conditions is standard, in both cases (see e.g. [19]) since U is closed (it has a dense denumerable subset). Now, the strong relaxed pointwise minimum principle can be written, for a.a. ,
r w
Gm yr yw
r w
x∈Ω x fixed
(2.58) ( ) ( ) ( ), .
UH u r du ≤H u
∫
∀ ∈u ULet φ∈ ΩB( , ; )U U be any Caratheodory function. Since U is convex here, we have (2.59) ( ) ( ) ( ( ( ) )),
UH u r du ≤H u+ε φ u −u
∫
∀ ∈u U, [0,1]∀ ∈ε ,hence
(2.60) ( ) ( ) ( ( ( ) )) ( ).
UH u r du ≤ UH u+ε φ u −u r du
∫ ∫
By the Mean Value Theorem and the uniform continuity of H in u
(2.61) ( ( ( ) )) ( )
0 (
U
H u u u H u
r du) ε φ
ε
+ − −
≤
∫
( ( )( ( ) ))( ( ) ) ( )
UH uu εµ u φ u u φ u u r du
=
∫
+ − − ( 0≤µ( ) 1u ≤ )( )( ( ) ) ( ) ( )
UH uu φ u u r du α ε
=
∫
− + ,where ( )α ε →0 as ε →0, hence
(2.62) u( )( ( ) ) ( ) ' ( )( ( )u ) 0,
UH u φ u −u r du =H r φ r −r
∫
≥for every φ∈ ΩB( , ; )U U , a.e. in Ω, which is the weak relaxed minimum principle.
By integration, we get the global weak relaxed condition (2.63) H ru( )( ( )φ r r dxdt) 0,
Ω − ≥
∫
∀ ∈ Ωφ B( , ; )U U .Remark. In the absence of equality state constraints, it can be shown that if the optimal control is regular, i.e. there exists w w'∈W such that
(2.64) G wm( )+DG w wm( , '−w) 0< , m= +p 1,...,q,
(Slater condition), then λ0 ≠0 for any multipliers as in Theorem 2.3.
3 Discretizations and behavior in the limit
We suppose in Sections 3 and 4 that Ω is a polyhedron (for simplicity). For each integer n≥0, let { }Ein Ni=1n be an admissible regular partition of Ω into elements (e.g.
-simplices), with as . Let be the
subspace of functions that are continuous on
d hn =max [diam(i Ein)]→0 n→ ∞ Vn ⊂V
Ω and linear (or multilinear) on each element . The set of discrete controls is defined as the subset of (not necessarily continuous) controls that are (optionally) constant, or linear, or multilinear, on each element , and (optionally) such that
n
Ei Wn ⊂W
wn n
Ei wn
∞ L
∇ ≤ , with independent of (this reduces to a finite number of linear constraints on the coefficients defining ). We endow with the Euclidean topology.
L n
wn Wn
Remark. If Ω has an appropriately piecewise C1 boundary Γ, one can approximate by a polyhedral one , with domain
Γ Γn Ωn, up to ; the results of this section
still hold in this case, with slight modifications in the definitions of and in the proof of Lemma 3.2 (interpolation inside
( )n o h
, V Wn n
Ωn and zero values on Γn).
The following assumptions are stronger than Assumptions 2.2-4.
Assumptions 3.1 The functions , ,f fy fu (resp ) are defined on ' (resp. on ), with
, , ',
m my my mu
g g g g
Ω× ×\ U Ω×\d+1×U' U'⊃U open, measurable for fixed (resp. ), continuous for fixed
, y u , ',
y y u x, and satisfy
(3.1) f x y u( , , ) ≤c1(1+ yρ −1), (3.2) 0≤ f x y uy( , , )≤c2(1+ yρ −2), (3.3) f x y uu( , , ) ≤c3(1+ yρ −1),
( , , )x y u U',
∀ ∈Ω× ×\
(3.4) gm( , , ', )x y y u ≤c4(1+ yρ+ y' ),2 (3.5) gmy( , , ', )x y y u c5(1 y 1 y'2( 1)),
ρ ρ
ρ
− −
≤ + +
(3.6) gmy'( , , ', )x y y u c6(1 y 2 y' ),
≤ + ρ + (3.7) gmu( , , ', )x y y u c7(1 y 2 y' ),
≤ + ρ + ( , , ', )x y y u d+1 U',
∀ ∈Ω×\ ×
where , ci ≥0 ρ∈ ∞[1, ) if n=1 or 2, 2
: 2
n ρ σ< =n
− if . Note that each of the above inequalities is also satisfied if it holds for some
3 n≥
i 0
c ≥ and ρ∈[1, )ρ .
For a given discrete control , the discrete state is the solution of the discrete state equation
wn∈Wn n
n n
v V
: n
n n
y = yw ∈V (3.8) a y v( , ) ( ( ,n n + f y wn n), ) 0,vn = ∀ ∈ .
The following theorem can be proved by using the techniques in [13] (via Brouwer’s fixed point theorem), under our coercivity, monotonicity and continuity assumptions.
Theorem 3.1 Under Assumptions 2.1 and 3.1 (on ,f fy), for every control , the discrete state equation has a unique solution
n n
w ∈W
n n
y ∈V .
The discrete state equation, which is a nonlinear system, can be solved by iterative methods. The discrete functionals, defined on Wn, are given by
(3.9) G rmn( )n gm( ,yn y w dxn, n) ,
=
∫
Ω ∇ m=0,..., .qThe discrete control constraint is and the discrete state constraints are either of the two following ones
wn∈Wn (3.10) Case (a) G wmn( n) ≤εmn, m=1,..., ,p (3.11) Case (b) G wmn( n)=εmn, 1,..., ,m= p and
(3.12) (G wmn n)≤εmn, εmn ≥0, m= +p 1,..., ,q
where the feasibility perturbations εmn are chosen numbers converging to zero, to be defined later. The discrete relaxed optimal control Problem Pan (resp. Pbn) is to
minimize subject to and the above state constraints, Case (a) (resp.
Case (b)).
( )
n n
G wm wn∈Wn
The proof of the following theorem parallels that of Theorem 2.1, noting that all norms are equivalent in the finite dimensional space Vn.
Theorem 3.2 Under Assumptions 2.1 and 3.1 (on ,f fy), the operator , from to , are continuous. Under assumptions 2.1 and 3.1 (on
wn 6 yn
Wn Vn f f, ,y gm), the
functionals , on , are continuous, and for every n, if Problem , or , is feasible, then it has a solution.
( )
n n
w 6G wm n Wn Pan
n
Pb
The proofs of the following lemma and theorem also parallel the continuous case.
Lemma 3.1 Under Assumptions 2.1 and 3.1, dropping in the functionals, is l- differentiable for every l, and its directional derivative is given for
m Gn
,
n n
w w ∈Wn by (3.13) DG w wn( n, n wn) H x yu( , n, y z wn, ,n n)(wn w dxn)
− =
∫
Ω ∇ − ,n
∇ zn
where the discrete adjoint state satisfies the linear discrete adjoint equation
: n
n n
z =z w ∈V
(3.14) a z v( , ) (n n + z fn y( ,y wn n), ) ( ( ,vn = gy yn ∇y wn, n), ) (vn + gy'( ,yn ∇y wn, n), vn), ,
n n
v V
∀ ∈ where yn:=ynwn.
Moreover, the operator wn 6 , from Wn to Vn, and the functional (w wn, n)6DG w wn( n, n−wn), on Wn×Wn, are continuous.
n
q
Theorem 3.3 Under Assumptions 2.1 and 3.1, if is optimal for Problem , then is weakly discrete extremal classical (or discrete extremal), i.e. there exist
multipliers , , with , ,
wn∈W Pbn
wn
n
λm∈\ m=0,..., λmn ≥0 λmn ≥0 m= +p 1,...,q,
0
1
q n m m
λ
=
∑
= , such that(3.15)
0
( , ) ( , , , ) 0,
q
n n n n n n n n n n n
m m
m
DG w w w H y y z w w dx
λ Ω
=
− = ∇ − ≥
∑ ∫
∀ ∈wn Wn,(3.16) (λmn G wm( n)−εmn) 0,= m= +p 1,..., ,q
where and are defined with replaced by . The global condition (3.17) is equivalent to the strong discrete classical elementwise minimum principle
Hn zn g
0 q
n
m m
m
λ g
∑
=(3.18) n ( , , , ) min n ( , , , )
i i
n n n n n n n n n n n
u u
E H y y z w w dx u U E H y y z w udx
∇ = ∈ ∇
∫ ∫
, i=1,...,Nn.Let Wn be the set of elementwise constant discrete controls. Clearly, Wn ⊂Wn in all cases. The following control approximation result (i) (resp. (ii)) is proved similarly to the corresponding result in [8] (resp. [13]).
Proposition 3.2 (i) For every r∈R, there exists a sequence (wn∈Wn) of discrete classical controls, considered as relaxed ones, that converges to in r R.
(ii) For every w W∈ , there exists a sequence (wn∈Wn) of discrete classical controls, considered as relaxed ones, that converges to in w L2 strongly.
The following key lemma gives consistency results.
Lemma 3.2 We suppose that Assumptions 2.1 and 3.1 are satisfied and drop m in the functionals.
(i) If the sequence (wn∈Wn) converges to r∈R in R (resp. to in
strongly), then (resp. ) in V strongly, (resp.
), and (resp. ) in
w W∈ L2
n
y → yr
z z →
n
y →yw G wn( n)→G r( )
( ) ( )
n n
G w →G w zn → r n zw Lρ( )Ω strongly (and in V strongly, if the functionals do not depend on ∇y).
(ii) If the sequences (wn∈Wn) and (wn∈Wn) converge to and w w, respectively, in W, then
(3.19) DG w wn( n, n −wn)→DG w w w( , − ).
Proof. (i) Suppose that wn →r in R. From the discrete state equation, we have (3.20) a y( ,n yn) ( ( ,+ f y wn n) ( (0,− f wn),yn− = −0) ( (0,f wn),yn),
and since f is increasing in y (3.21) 2 2
1 ( , ) ( (0, ), ) (0, ) 1,
n n n n n n n
y a y y f w y f w s y c y
α ≤ ≤ ≤ ≤ n
which shows that the sequence is bounded in V . By Alaoglu’s theorem, there exists a subsequence (same notation) that converges weakly in V to some , and since the injection of V in
( )yn
y∈V ( )
Lρ Ω is compact (see Ref. 20), we can suppose that in strongly. For any given
yn →y Lρ( )Ω v∈C01( )Ω , let Vn be the sequence of interpolates of v at the vertices of the partition of
(vn∈ )
Ω. This sequence converges to in
v C10( )Ω (hence in V ) strongly. We have (3.22) a y v( , ) ( ( ,n n + f y wn n), ) 0.vn =
Since wn →r in R and in V strongly, hence in strongly, by Proposition 2.1 and our assumptions, we can pass to the limit in this equation and find
yn → y Lρ( )Ω
(3.23) a y v( , ) ( ( , ), ) 0,+ f y r v =
which holds also for every v V∈ ⊂Ls, as C01( )Ω is dense in V . Therefore . The convergence in strongly of the initial sequence follows then from the uniqueness of the limit. Next, we have
y= yr
( ) Lρ Ω
(3.24) 2 2
1 ( , ) ( ( , ), ) ( , ) ( ,
n n n n n n n n
y y a y y y y f y w y a y y a y y y
α − ≤ − − = − − − − ).
y
By Proposition 2.1 and the above convergences of , the last expression converges to zero; hence in V strongly. The convergence follows from the above convergences and the same proposition. From the adjoint equation, we have
( )yn
yn → G wn( n)→G r( )
(3.25) 2 2
1 ( , ) ( ( , ) , )
n n n n n n
z a z z fy y r z z
α ≤ + n
( ( ,gy yn y z rn, , ), )n n zn (gy'( ,yn y z rn, , ),n n zn)
≤ ∇ + ∇ ∇
2( 1)
1 2
4 5
( (1 ), ) ( (1 ), )
p p
n p n p n n n
c y y z c y y z
− −
≤ + + ∇ + + + ∇ n