• Keine Ergebnisse gefunden

V4E2 - Numerical Simulation

N/A
N/A
Protected

Academic year: 2021

Aktie "V4E2 - Numerical Simulation"

Copied!
2
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

V4E2 - Numerical Simulation

Sommersemester 2018 Prof. Dr. J. Garcke

Teaching assistant: Biagio Paparella Tutor: Marko Rajkovi´c (marko.rajkovic@uni-bonn.de)

Exercise sheet 5.

To be handed in on Tuesday, 29.05.2018.

In this problem set we develop an extension of the HBJ equation to the infinite-horizon setting and solve some optimal control problem with cost only at the end (i.e. l = 0 ). Finally, we check compatibility between two definitions ofconsistency for a numerical scheme.

Exercise 1. (Optimal control in infinite time setting)

Assumef andlsatisfying our assumptions (A5) and (A6). Given a pointx∈Rnand a control belonging to A .

={α: [0,∞) →A|α(·) is measurable }, letx(·) be the unique solution to the

ODE: (

˙

x(s) =f(x(s), α(s)) s >0 x(0) =x

Fix λ >0 and define the discounted cost:

Cx[α(·)] .

= Z

0

e−λsl(x(s), α(s))ds And so the value function:

V(x) .

= inf

α(·)∈ACx[α(·)]

Prove the following properties:

1 V is bounded and if λ > Lip(f), then V is Lipschitz continuous (where Lip(f) .

= supx,y∈Dom(f),x6=y

|f(x)−f(y)|

|x−y| );

2 If 0< λ≤Lip(f), then V is Hoelder continuous for some exponent 0< β <1;

3 The value functionV is a viscosity solution of the PDE:

λu−min

a∈A{f(x, a)·Du+l(x, a)}= 0

in Rn (to clarify, if v ∈ C1(Rn) and V −v has a local maximum at x0, then λV − mina∈A{f(x, a)·Dv +l(x, a)} ≤ 0 at x0, and conversely for the minimum case. We specified in order to avoid any possible confusion due to the reversed inequality appearing in the finite-time case (because of the terminal condition) ).

(8 Punkte) In the following exercises we try to solve some optimal control situation with the help of the Pontryagin Maximum Principle (abbr. PMP). It can be directly applied when the considered cost is only at the end, but an extension to the general case is possible (here not discussed).

Theorem (Pontryagin Maximum Principle). Consider the control system:

˙

x=f(x, α), α(t)∈A, t∈[0, T], x(0) =x0

Let t 7→ α(t) be an optimal control and t 7→ x(t) = x(t, α) be the corresponding optimal trajectory for the maximization problem:

maxα∈Aψ(x(T, α))

1

(2)

Define the (row) vector t7→p(t) as the solution to the linear adjoint system:

˙

p=−p(t)B(t), B(t) .

=Dxf(x(t), α(t)) with terminal condition:

p(T) =∇ψ(x(T))

Then, for almost every τ ∈[0, T] the following maximality condition holds:

p(τ)·f(x(τ), α(τ)) = max

a∈A{p(τ)·f(x(τ), a)}

This result provides a necessary (but generallynot sufficient) condition on an optimal strategy α. It can be actively used for building solutions by hands, by following this algorithm.

First, forget the time dependence t 7→ α(t). Instead, find α as a function of x and p by exploiting the equality:

α(x, p) = argmax

a∈A

{p·f(x, a)}

Once you find (x, p)7→α(x, p), plug this solution into the system:

(x˙ =f(x, α(x, p))

˙

p=−p·Dxf(x, α(x, p)) with conditions x(0) =x0,p(T) =∇ψ(x(T)).

It becomes a system with only variablest, x, p, that once solved gives us two functionst7→x(t) and t7→p(t). Finally, use them to construct t7→α(t) via the compositiont7→α(x(t), p(t)).

Exercise 2. (Linear Pendulum)

Let q(t) be the position of a linearized pendulum with unit mass, controlled by an external force with magnitude α(t)∈[−1,1]. Thenq(·) satisfies the second order ODE:

¨

q(t) +q(t) =α(t) q(0) = ˙q(0) = 0 α(t)∈[−1,1]

Problem: find an optimal strategy α that maximizes the terminal displacement q(T).

(Hint: introduce variables x1 =q, x2 = ˙q...we seek maxx1(T, α)...the adjoint system inp can be solved without involving x or α...you can also avoid to solve t7→ x(t)...because maybe the max in the Pontryagin equation depends only on p2...)

(4 Punkte) Exercise 3. (The principle gives a condition necessary, but not sufficient)

Consider the following system inR2:

(x˙1

˙ x2=x21

With initial conditions x1(0) =x2(0) = 0 andα(t)∈[−1,1].

1 Find an optimal strategy to maximizex2(T).

2 Consider the control α(t) = 0 for each time. Does it respect the Pontryagin Maximum Principle? Is it an optimal control?

(4 Punkte) Exercise 4. (Compatibility between two definitions of consistency)

Check that if a scheme in differenced form is consistent w.r.t definition (26), then it is w.r.t.

definition (19).

(4 Punkte)

2

Referenzen

ÄHNLICHE DOKUMENTE

Prove monotonicity and consistency for

When the Taylor principle is insufficient - A benchmark for the fiscal theory of the price level in a monetary union.. by

The SLLN yields an idea called the Monte Carlo Method of direct sim- ulation.. (Interestingly, it is often much easier to find and simulate such an X than to compute

The maximal cliques of G are independent sets of the complement graph G and so the Bron–Kerbosch algorithm can be used to set up the independent set 0–1 LP formulation of the

1 Head and Neck - Craniomaxillofacial Surgery Section, General Surgery Department, Hospital Italiano de Buenos Aires, Juan D. Perón 4190, C1181ACH Buenos

Show that, as a consequence of the boundary condition, the energies are quantized and give an expression for the

Later in Section 12, an extended theoretical framework for the conduction of monetary policy will be presented, based on the fact that the slope of the money demand (or

The cointegration test, shown in Table 9, (see Engle &amp; Granger, 1987; Engle and Yoo, 1987, Table 2), shows that in the two cases with monthly data (models 5 and 6),