• Keine Ergebnisse gefunden

Numerical Appromixation of the Value of a Stochastic Differential Game with Asymmetric Information

N/A
N/A
Protected

Academic year: 2022

Aktie "Numerical Appromixation of the Value of a Stochastic Differential Game with Asymmetric Information"

Copied!
28
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Center for

Mathematical Economics

Working Papers

630

December 2019

Numerical Appromixation of the Value of a Stochastic Differential Game with Asymmetric Information

Lubomir Baˇ nas, Giorgio Ferrari, and Tsiry A. Randrianasolo

Center for Mathematical Economics (IMW) Bielefeld University

Universit¨atsstraße 25 D-33615 Bielefeld·Germany e-mail: imw@uni-bielefeld.de http://www.imw.uni-bielefeld.de/wp/

ISSN: 0931-6558

(2)

NUMERICAL APPROXIMATION OF THE VALUE OF A STOCHASTIC DIFFERENTIAL GAME WITH ASYMMETRIC

INFORMATION

‰UBOMÍR BAŒAS, GIORGIO FERRARI, AND TSIRY A. RANDRIANASOLO

Abstract. We consider a convexity constrained Hamilton-Jacobi-Bellman-type ob- stacle problem for the value function of a zero-sum dierential game with asymmetric information. We propose a convexity-preserving probabilistic numerical scheme for the approximation of the value function which is discrete w.r.t. the time and convex- ity variables, and show that the scheme converges to the unique viscosity solution of the considered problem. Furthermore, we generalize the semi-discrete scheme to obtain an implementable fully discrete numerical approximation of the value function and present numerical experiments to demonstrate the properties of the proposed numerical scheme.

1. Introduction

In this paper we consider the Hamilton-Jacobi-Bellman-type obstacle problem min

"

BtV `1

2TrpσσTpt, xqD2xVq `Hpt, x, DxV, pq, λmin

ˆ p,B2V

Bp2

˙*

“0, VpT, x, pq “ xp, gy,

(1.1)

where V ”Vpt,x,pq, pt,x,pq P r0,Ts ˆRdˆ∆pIq, ∆pIq denotes the set of probability vectors p“ pp1,...,pIq P p0,1qI that satisfy řI

i“1pi“1 and the Hamiltonian H will be specied below. The convexity of the solution V with respect to the variable p is enforced via the obstacle term λmin

´ p,BBp2V2

¯, which is the minimal eigenvalue of the Hessian matrix BBp2V2 on the tangent cone to∆pIq. More precisely, for a symmetricIˆI matrix A we denote

λminpp, Aq:“ min

zPT∆pIqppqzt0u

xAz, zy

|z|2 , whereT∆pIqppq“Ť

δą0p∆pIq ´pq{δ is the tangent cone to ∆pIq at pP∆pIq, cf. [4].

(‰. Ba¬as) Faculty of mathematics, Bielefeld university, Universitätsstraÿe 25, D- 33615 Bielefeld

(G. Ferrari) Center for Mathematical Economics, Bielefeld university, Univer- sitätsstraÿe 25, D-33615 Bielefeld

(T.A. Randrianasolo) Faculty of mathematics, Bielefeld university, Univer- sitätsstraÿe 25, D-33615 Bielefeld

Date: December 30, 2019.

1991 Mathematics Subject Classication. 65K15,65C20,49N70,49L25,35F21,52A27,52B55.

Key words and phrases. zero-sum stochastic dierential games; asymmetric information; proba- bilistic numerical approximation; discrete convex envelope; convexity constrained Hamilton-Jacobi- Bellmann equation; viscosity solution.

This research was supported by the German Research Foundation as part of the Collaborative Research Center SFB 1283.

1

(3)

Problem (1.1) describes the value of a class of zero-sum stochastic dierential games with asymmetric information, cf. [15]. Since the seminal work by Aumann and Maschler (see [1]) in the framework of repeated games, the literature on games with asymmet- ric information experienced an increasing interest ([12, 19, 22], among many others), recently also in continuous-time dierential settings (see, e.g., [7,8, 9, 13,16,24,25]).

As in [15], in our game both players can adjust the dynamics of a non degenerate Itô-diusion by controlling the drift via regular controls taking values in some compact subset of a nite dimensional space. However, one player has more information than the other in the following sense (cf. [1] and [7]). Before the game starts, the payos of the game are chosen randomly with some probability p from a nite collection of size I, and the information on which payos have been realized is transmitted only to one player. Since we assume that both players can observe the actions of the other one, the uninformed player infers which game is actually played through the moves of the informed one. It turns out that it is optimal for the informed player to release information to the uninformed one in a sophisticated way aiming at manipulating the beliefs of the latter player (see [7]).

The numerical analysis of our paper hinges on the theoretical results of [7]. There it is shown (in a setting actually more general than ours) that the previously described game has a value V, whenever the so-called Isaacs conditions are satised and addi- tional technical requirements on the problem's data area fullled. The value function V depends on time t, on the state variable x, and on a probability vector pP∆pIq;

this latter variable describes the initial value of the beliefs of the uninformed player about the game she is playing. Moreover, it is shown in [4], that V can be character- ized as the unique continuous viscosity solution (in the dual sense) to a second-order partial dierential equation complemented by a convexity constraint with respect the parameter p.

There exist only few results on numerical approximation of dierential games with incomplete information. Numerical approximation of (deterministic) dierential games with incomplete information was rst studied in [5] and generalized to games with incomplete information on both sides in [23]. As far as we are aware the only work on numerical approximation of stochastic dierential games with incomplete information is [15]. We note that all three aforementioned works only consider semi-discretization in the time-variable and the remaining variables are kept continuous, hence, the schemes are not implementable.

In this paper we generalize the probabilistic numerical approximation of [15] to include the discretization of the convex envelope, i.e., we propose a structure preserving probabilistic numerical approximation that is discrete in time and in the variable p and preserves the convexity of the solution. We show that the proposed numerical approximation converges to the unique viscosity solution of (1.1). The discretization in the probability variablepis constructed by approximating the lower convex envelope of the semi-discrete solution in p by its nite-dimensional counterpart. The discrete lower convex envelope is computed over a nite set of values which coincide with nodes of a simplicial partition of ∆pIq. The resulting approximation is monotone and inherits the Lipschitz continuity properties of the solution. To further reduce the complexity of the numerical approximation we employ random walk instead of the usual Wiener increments to simulate the associated Itô-diusion process. Furthermore,

(4)

we propose an implementable fully discrete numerical scheme by combining the semi- discrete probabilistic approximation in time and p with a spatial discretization that employs linear interpolation in the state variable xover a simplicial partition.

The paper is organized as follows. In Section 2 we collect basic denitions and assumptions on the considered problem. In Section3 we introduce a probabilistic nu- merical scheme for the approximation of (1.1) which is discrete in the time variable t and the convexity variable p and summarize the regularity properties of the numeri- cal approximation in Section 4. Convergence of the numerical approximation to the viscositiy solution is shown in Section 5. Finally, an implementable fully discrete nu- merical approximation of the problem is introduced in Section6along with numerical studies which demonstrate the practicability of the proposed approach.

2. Assumptions and preliminaries

Throughout the paper, the scalar product of two vectors x“ px1,...,xdq and y“ py1,...,ydq of Rd is denoted by xx, yy:“řd

i“1xiyi and the`1-norm is denoted by |x|:“ řd

i“1|xi|; furthermore, we use | ¨ |8 and } ¨ }8 to respectively denote the `8-norm and the L8pRdq-norm.

2.1. Description of the game. Since the aim of this paper is to provide a numeri- cal approximation of the solution to (1.1), we only provide here a brief and informal description of the stochastic dierential game related to the problem (1.1) and simply refer to [15] for detailed discussion of the game and further references. We consider a two-player zero-sum dierential game where two players control thed-dimensional Itô process dened fortP r0, Ts, xPRd as

(2.1) dXst, x, u, v “bps, Xst, x, u, v, us, vsqds`σps, Xst, x, u, vqdBs sP rt, Ts, Xtt, x, u, v “x.

HereB“ Bs:sP rt, Ts(

is ad-dimensional Brownian motion on a complete probability space, b and σ are suitable Borel-Measurable functions and the controls pu, vq PUˆV and U,Vare compact subsets of some nite dimensional spaces.

The game is characterized byI congurations with respective running costsp`iqiPt1,...,Iu: r0, Ts ˆRdˆUˆVÑR and terminal payos pgiqiPt1,...,Iu:RdÑR and is played as fol- lows. Before the game starts, one congurationiP t1,...,Iu is chosen with probability pi and the choice of i is communicated to Player 1. Player 2 only knows the probabil- ity distribution pP∆pIq of the respective congurations. Once the game has started, both players adjust their control to minimize, for the Player 1, and to maximize, for the Player 2, the expected payo, cf. [4, Section 6.3]. We assume that both players observe their opponent's control.

2.2. General assumptions. The drift term b, the diusion term σ:“ pσk,lqk,l, the running cost p`iqiPt1,...,Iu, the terminal payo g:“ pgiqiPt1,...,Iu, and the Hamiltonian H, cf. (1.1), satisfy the following standing assumptions:

(A1q b:r0, Ts ˆRdˆUˆVÑRd is bounded and continuous in all its variables and Lipschitz continuous with respect to pt, xquniformly in pu, vq PUˆV.

(A2q For 1ďk,lďd the function σk,l:r0, Ts ˆRdÑR is bounded and Lipschitz con- tinuous with respect to pt, xq. For any pt, xq P r0, Ts ˆRd the matrix pσTq´1, where the superscript T means transpose, is non-singular, bounded, and Lips- chitz continuous with respect to pt, xq.

(5)

(A3q p`iqiPt1,...,Iu:r0, Ts ˆRdˆUˆVÑR is bounded and continuous in all its vari- ables and Lipschitz continuous with respect topt, xq uniformly in pu, vq.

pgiqiPt1,...,Iu:RdÑR is bounded and uniformly Lipschitz continuous.

(A4q Isaacs condition: for all pt, x, z, pq P rt0, Ts ˆRdˆRdˆ∆pIq Hpt, x, z, pq:“inf

uPUsup

vPV

"

xbpt, x, u, vq, zy `

I

ÿ

i“1

pi`ipt, x, u, vq

*

“sup

vPV uPUinf

"

xbpt, x, u, vq, zy `

I

ÿ

i“1

pi`ipt, x, u, vq

* .

(A5q In addition, there exists a constant Cą0 such that for all t,t1P r0, Ts, x,x1P Rd, z,z1PRd, p,p1P∆pIq, the following hold

(2.2) |Hpt, x, z, pq|ďCp1`|z|q,

|Hpt, x, z, pq ´Hpt1, x1, z1, p1q|ďCp1`|z|qp|x´x1|`|t´t1|q

`C|z´z1|`C|p´p1|.

(2.3)

2.3. Viscosity solution of (1.1). Under the assumptions in the previous section Cardaliaguet [4, 7] established that there exists a unique uniformly bounded viscos- ity solution of problem (1.1), which is convex and uniformly Lipschitz continuous in p. We recall the notion of viscosity solution as well as the corresponding notions of subsolutions and supersolutions to (1.1) below, cf. [4], [6].

Denition 2.1. We say that V is a subsolution of (1.1) if V “Vpt, x, pq is upper semicontinuous and if, for any smooth test function φ:p0, Tq ˆRdˆ∆pIq ÑR such that V ´φ has a local maximum on r0, Ts ˆRdˆ∆pIq at some point pst,sx,pq P r0, Ts s ˆ Rdˆ∆pIq, one has

(2.4) min

"

Btφ`1

2TrpσσTpt, xqDx2φq `Hpt, x, Dxφ, pq, λmin

ˆ p,B2φ

Bp2

˙*

ě0, at pt, x, pq “ pst,sx,pq.s

We say that V is a supersolution of (1.1) if V “Vpt, x, pq is lower semicontinuous and if, for any smooth test function φ:p0, Tq ˆRdˆ∆pIq ÑR such that V ´φ has a local minimum onr0, Ts ˆRdˆ∆pIqat some pointpst,sx,pq P r0, Ts s ˆRdˆ∆pIq, one has

(2.5) min

"

Btφ`1

2TrpσσTpt, xqDx2φq `Hpt, x, Dxφ, pq, λmin ˆ

p,B2φ Bp2

˙*

ď0, at pt, x, pq “ pst,sx,pq.s

We say that V is a viscosity solution of (1.1) if V is a sub- and a supersolution of (1.1).

Remark 2.2. For a smooth test function φ:p0, Tq ˆRdˆ∆pIq ÑR such that V ´φ has a local maximum on r0, Ts ˆRdˆ∆pIq at some point pst,sx,pq P r0, Ts s ˆRdˆ∆pIq, we have that (2.4) is equivalent to

Btφ`1

2TrpσσTpt, xqD2xφq `Hpt, x, Dxφ, pq ě0 and λmin ˆ

p,B2φ Bp2

˙ ě0;

(2.6)

(6)

and for the smooth test function φ:p0, Tq ˆRdˆ∆pIq ÑR such that V ´φ has a lo- cal minimum on r0, Ts ˆRdˆ∆pIq at some point pst,x,s pq P r0, Ts s ˆRdˆ∆pIq, (2.5) is equivalent to

Btφ`1

2TrpσσTpt, xqDx2φq `Hpt, x, Dxφ, pq ď0 or λmin ˆ

p,B2φ Bp2

˙ ď0.

(2.7)

3. Numerical approximation

To simplify the subsequent numerical approximation, we perform a change of measure via the Girsanov transform in the spirit of, for instance, [14, Lemma 3.8] or [17], and instead of the controlled process (2.1) we consider the simpler process

(3.1) dXst, x“σps, Xst, xqdBs sP rt,Ts, Xtt, x“x,

fortP r0, TsandxPRd. Notice that the dynamics in (3.1) is independent of the players' controls.

For a xed NPN and a step size τ:“T{N we introduce an equidistant partition Πτ:“ tn(N

n“0, tn“nτ of the time interval r0, Ts. We dene the discrete process pXsnn1, xqn1“n,..., N as the weak Euler approximation of (3.1), that is

(3.2) Xsnn1, x“x`

n´1

ÿ

j“n1

σptj,Xsjn1, xj? τ , where ξn?

τ“ pξn1,..., ξndq?

τ, n“1,...,N is a suitable approximation of the Rd-valued Wiener increment rWptnq ´Wptn´1qs „Np0,τq. Here, we take ξn to be a Rd-valued binomial random walk, i.e. ξn1,...,ξnd are i.i.d. random variables with the law Ppξnk

˘1q “1{2, for everyk“1,...,d; the analysis below can be easily modied to cover other choices such as, e.g., a trinomial random walk or the discrete Wiener increments. In the following we abbreviateσnpxq:“σptn, xq andσn´Tpxq:“ pσTptn, xqq´1. The approx- imation obtained after one step of the Euler scheme (3.2) will be denoted as

(3.3) Xsn`1x :“Xsn`1n, x “x`σnpxqξn?

τ xPRd.

Let tMhuhą0 be a family of regular partitions of ∆pIq into open pI´1q-simplices K (i.e., line segments, triangles, tetrahedra for I“2,3,4, respectively) with mesh-size h“maxKPMhtdiampKqu such that∆pIq “ YKPMhK. The set of vertices of all KPMh is denoted by Nh:“ tp1,...,pMu.

The approximation of the value function Vptn, x, pmq is denoted by Vnmpxq for tnP Πτ, xPRd, pmPNh. The discrete numerical solution Vnmpxq, xPRd, n“0,...,N´1, m“1,...,M is obtained by the following algorithm.

Algorithm 3.1. ForxPRdsetVNmpxq “ xpm, gpxqyforpmPNh,m“1,...,M, setVNpxq “ VN1pxq,...,VNMpxq(

and proceed for n“N´1,...,0 as follows:

(1) Forward step: for xPRd compute:

(3.4) Xsn`1x “x`σnpxqξn? τ; (2) Backward step: for xPRd and m“1,...,M set

Zsnmpxq “1 τE“

Vn`1m pXsn`1x´Tn pxqξn

?τ‰ (3.5) ,

Ysnmpxq “E“

Vn`1m pXsn`1x q‰

`τ H`

tn, x,Zsnmpxq, pm˘ (3.6) ;

(7)

(3) Convexication: forxPRdcompute the discrete lower convex envelope Vn1pxq,...,VnMpxq( of Ysn1pxq,...,YsnMpxq(

as (3.7) Vnmpxq “Vexp

Ysn1pxq,...,YsnMpxq‰

ppmq pmPNh, m“1,...,M.

We note that for pP∆pIq the lower convex envelope in (3.7) is the solution of the minimization problem, cf. [10],

Vexp

Ysn1pxq,...,YsnMpxq‰

ppq:“min

" I

ÿ

k“1

Ysnkpxqλk;

I

ÿ

k“1

λk“1, λkě0,

I

ÿ

k“1

λkpk“p

* . (3.8)

We will discuss ecient algorithms for the computation of the discrete lower convex envelope (3.8) in Section6.1.

It is well known that the piecewise linear interpolation does not preserve the convex- ity of the interpolated data. Nevertheless, cf. [10, Corollary 2.3.], there exists a data de- pendent (regular) simplicial partitionMhn,xof∆pIqwith nodesNn,xh :“ tπ1n,x,...,πMn,xn,xu Ď Nh such that the piecewise linear interpolant of the data values at the nodesNn,xh over the partition Mhn,x (for a precise denition see (3.9) below) agrees with the discrete data values `

pm,Vnmpxq˘(M

m“1,pmPNh of the discrete lower convex envelope (3.7) for xed 0ďnďN, xPRd. We note that the partition Mhn,x does not necessarily coincide with the original mesh Mh.

We consider the set of piecewise linear Lagrange basis functionstψin,x, i“1,...,Mn,xu associated with the set of nodes Nn,xh of the partition Mhn,x. We recall the following properties of the the Lagrange basis functions which will be frequently used throughout the paper: aqψin,xkn,xq “δik, where δik is the Kronecker delta and bqřMn,x

i“1 ψn,xi ppq “1 for anypP∆pIq. We note thataqimplies that at any pointpP∆pIqthere are at mostI basis functions with non-zero value at this point, hence the sum in bqreduces tořI

i“1. We dene the convex piecewise linear interpolantVnhpx,¨q,xPRdof the discrete lower convex envelope Vn1pxq,...,VnMpxq(

over the convexity preserving partitionMhn,x as

(3.9) Vnhpx, pq:“

Mn,x

ÿ

i“1

Vmpπ

in,xq

n pxqψin,xppq, where mpπn,xi q PN is the index of πin,x inNh, i.e. πin,x“pmpπi

n,xq for some pmpπi

n,xqPNh. We note that by construction Vnhpx, pmq “Vnmpxq for all pmPNh. For the analysis below we also consider the (possibly non-convex) interpolant over the xed partition Mh

(3.10) Vrnhpx, pq:“

M

ÿ

m“1

Vnmpxqψmppq,

where tψm, m“1,...,Muis the linear Lagrange basis associated with the set of nodes Nh. By a slight abuse of notation in (3.8), we observe that

(3.11) Vnhpx, pq “Vexp

Vrnhpx,¨q‰ ppq.

Furthermore, we dene the time interpolant Vτhpt, x, pq of (3.9) which is continuous on r0,Ts as

(3.12) Vτhpt, x, pq:“Vnhpx, pq

´tn`1´t τ

¯

`Vn`1h px, pq

´t´tn τ

¯

, for tP rtn, tn`1s.

(8)

4. Regularity properties of the numerical approximation

In this section we study regularity properties of the numerical approximation ob- tained by Algorithm3.1. We establish uniform boundedness, almost Hölder continuity in time and Lipschitz continuity in p, and x, respectively, of the numerical solution.

Furthermore, we show that the numerical approximation satises a monotonicity prop- erty.

We recall the following properties of the discrete lower convex envelope which are a simple consequence of its denition (3.8).

Lemma 4.1. We consider the setN:“ p1,...,pM(

Ă∆pIqwith associated scalar values Uppmq,Vppmq, such thatUppmq ďVppmq, m“1,...,M. We denoteV“ Vpp1q,...,VppMq(

P RM and U“ Upp1q,...,UppMq(

PRM. The discrete lower convex envelope Vexp satis- es the following properties for pP∆pIq:

i) Monotonicity: VexprUsppq ďVexprVsppq,

ii) Constant preservation: VexprV`θsppq “VexprVsppq `θ for any θPR.

4.1. Lipschitz continuity in p.

Lemma 4.2. There exists a constantCą0(which only depends onAssumptions (A1q (A4q) such that for n“0,...,N and all xPRd the numerical solution is Lipschitz con- tinuous in the variable p, i.e.,

(4.1) |Vnhpx, pq ´Vnhpx, qq|ďC|p´q| @p, qP∆pIq.

Proof. Forn“N, by linearity the functionVNhpx, pq:“ xp, gpxqy is Lipschitz continuous inp for any xPRd with a Lipschitz constantLN that only depends on g.

We proceed by induction and assume that Vn`1h px, pq is Lipschitz continuous in p with a Lipschitz constant Ln`1 for some nďN´2. We consider p, q P∆pIq and as- sume without loss of generality thatVnhpx, qq ´Vnhpx, pq ě0, otherwise p and q can be commuted.

LetpPKp, whereKp“ rπ1n,xppq,...,πn,xI ppqs ĂMhn,x is the simplex given by the nodes π1n,xppq,...,πn,xI ppq PNn,xh . Hence, p“řI

i“1πin,xppqψin,xppq, where ψn,xi ppq, i“1,...,I are the linear Lagrange basis functions onrπn,x1 ppq,...,πn,xI ppqs.

We note thatřI

i“1ψn,xi ppq “1andψin,xppq ě0fori“1,...,I. Hence, by [18, Lemma 8.2.], there exist vectors ωn,x1 ,...,ωn,xI (

P∆pIq(the vectors depend on p,q,Mhn,x and are not necessarily in Nh) such that q“řI

i“1ωn,xi pqqψin,xppq and

(4.2) |p´q|“

I

ÿ

i“1

n,xi ppq ´ωin,xn,xi ppq.

By the convexity of Vnh, since q“řI

i“1ωn,xi ψin,xppq it directly follows that Vnhpx, qq ď

I

ÿ

i“1

Vnhpx, ωin,xn,xi ppq.

(9)

Using the above inequality, (4.2), the representationVnhppq|Kp“řI

i“1Vnhpx, πn,xiin,xppq and Vexprfs ďf we get on recalling (3.6) that

0ďVnhpx, qq ´Vnhpx, pq ď

I

ÿ

i“1

`Vnhpx, ωin,xq ´Vnhpx, πn,xi

ψin,xppq

ď

I

ÿ

i“1

´ E“

Vn`1h pXsn`1x , ωn,xi q‰

`τ H`

tn, x, Znhpx, ωin,xq, ωn,xi ˘

´E“

Vn`1h pXsn`1x , πn,xi q‰

´τ H`

tn, x, Znhpx, πn,xi q, πn,xi ˘¯

ψin,xppq, (4.3)

where we used that Vnhpx, πin,xq “VexprYnpxqspπn,xi q, i“1,...,I.

By the Lipschitz continuity ofVn`1h , it follows from (4.3) using (4.2) and [15, Lemma 3.6]

that

|Vnhpx, qq ´Vnhpx, pq| ďLn`1

I

ÿ

i“1

n,xi ´ωin,xn,xi ppq “Ln`1|p´q|, (4.4)

where Ln“Ln`1p1`Cτq `Cτ.

Recursively, we get that Ln“LN`Ctn`CτřN

i“n`1Ln, n“1,...,N´1 and by the discrete Gronwall lemma it follows LnďL0ď pLN`CTqexppCTq. Hence Vnh is uni- formly Lipschitz continuous in p with a Lispchitz constant L0 which only depends on

the Assumptions (A1q(A5q.

4.2. Lipschitz continuity in x. The next lemma can be show analogically to [15, Lemma 3.3].

Lemma 4.3. Let φ:RdÑR be a uniformly Lipschitz continuous function with a Lisp- schitz constant L. Then, there exists a constant Cą0, depending only on the data of Assumptions (A1q(A5q, such that the following inequality holds for n“0,...,N´1

E“

φpXsn`1x q‰

`τ H

´

tn, x, 1 τE“

φpXsn`1xn´Tpxqξn? τ‰

, p

¯

´E“

φpXsn`1x1 q‰

´τ H

´

tn, x1, 1 τE“

φpXsn`1x1n´Tpx1n

?τ‰ , p

¯

ďCτ,L|x´x1|, where Cτ,L:“Lp1`Cτq `Cτ.

Lemma 4.4 (Lipschitz continuity in x). For n“0,...,N the interpolant Vnh is (i) Lipschitz continuous in x:

|Vnhpx, pq ´Vnhpx1, pq|ďC|x´x1| for all x,x1PRd, pP∆pIq, (ii) uniformly bounded:

|Vnhpx, pq|ďC for all xPRd, pP∆pIq,

where Cą0 is a constant which depends only on Assumptions (A1q(A5q.

From the subsequent proof it follows that the non-convex interpolant Vrnh dened in (3.10) enjoys the same boundedness and Lipschitz continuity properties as Vnh.

(10)

Proof. We xpP∆pIqand considerx,x1PRd. For n“N we haveVrNhpx, pq “VNhpx, pq “ xp, gpxqy and piq, piiqhold since

VNhpx, pq ´VNhpx1, pq

xp, gpxq ´gpx1qy

ďLN|x´x1|, VNhpx, pq

xp, gpxqy ďCN,

whereLN and CN are positive constant which depend only on g.

We proceed by induction. We assume that Vn`1h px, pq, Vrn`1h px, pq are Lipschitz con- tinuous in x with a Lipschitz constant Ln`1 and bounded by Cn`1. We show that Vnhpx, pq, Vrnhpx, pq are Lipschitz continuous with a Lipschitz constantLn and bounded by a constantCn. On recalling (3.7), (3.11) we may write

Vnhpx,pq “Vexp“ Vrnhp¨q‰

ppq “Vexp

„ E“

Vrn`1h pXsn`1x ,¨q‰

´τ H`

tn, x,Zrnhpx,¨q,¨˘

 ppq, (4.5)

whereZrnhpx, pq:“1τE“

Vrn`1h pXsn`1x , pqσ´Tn pxqξn? τ‰

. Moreover, we recall that forpmPNh it holds by denition

Vrnhppmq “Vexp

„ E“

Vrn`1h pXsn`1x ,¨q‰

´τ H`

tn, x,Zrnhpx,¨q,¨˘

 ppmq. (4.6)

i) Lipschitz continuity. By Lemma 4.3 we have for pmPNh E“

Vrn`1h pXsn`1x ,pmq‰

`τ H`

tn, x,Zrnhpx,pmq,pm˘ ďE“

Vrn`1h pXsn`1x1 ,pmq‰

´τ H`

tn, x1,Zrnhpx1,pmq,pm˘

`Ln`1|x´x1|, (4.7)

withLn`1:“Lnp1`Cτq `Cτ. On noting (4.6) it follows from (4.7) byLemma 4.1that Vrnhpx,pmq ´Vrnhpx1,pmq ďLn`1|x´x1|

(4.8)

We recall (3.10) and obtain from (4.8) (note Vrnhpx,pmq “Vnmpxq) that for any pP∆pIq it holds

Vrnhpx,pq ´Vrnhpx1,pq ď

M

ÿ

m“1

´

Vnmpxq ´Vnmpx1q

¯

ψmppq ďLn`1|x´x1|, (4.9)

where we used thatřM

m“1ψm”1,ψmě0for any0ďnďN. Consequently by (4.5) and Lemma 4.1 it also follows that

Vnhpx,pq ´Vnhpx1,pq ďLn`1|x´x1|. (4.10)

After commuting the role of x and x1 and repeating the above steps we obtain for any pP∆pIq

|Vnhpx,pq ´Vnhpx1,pq|ďLn`1|x´x1|.

(4.11)

Hence, we get recursively that LnďLN`Ctn`CτřN

i“n`1Li. By the discrete Gron- wall lemma, it follows that Lnď pLN`CTqexppCTq. Consequently, Vnh, Vrnh, n“ 0,...,N are Lipschitz continuous inx, with a Lipschitz constantL:“ pLN`CTqexppCTq depending only on the data in Assumptions (A1q(A5q.

ii) boundedness. LetKp“ rπ1n, xppq,...,πIn, xppqsbe a simplex ofMhn, xsuch thatpPKp, i.e. p“řI

i“1πin, xppqψn, xi ppq, where tψn, xi ppq:i“1,...,Iu is the Lagrange polynomial

(11)

basis onKp. In particular,řI

i“1ψn, xi ppq “1andψin, xppq ě0fori“1,...,I. On recalling (3.9) we may write

Vnhpx, pq “

I

ÿ

i“1

´ E“

Vn`1h pXsn`1x , πn, xi ppqq‰

`τ H`

tn, x, Znhpx, πn, xi ppqq, πn, xi ppq˘¯

ψin, xppq, (4.12)

where Znhpx, pq:“1τE“

Vn`1h pXsn`1x , pqσn´Tpxqξn? τ‰

. By (2.2), since Vn`1h is bounded by Cn`1 we estimate the right hand side of (4.12)

E“

Vn`1h pXsn`1x , πin, xppqq‰

`τ H`

tn, x, Znhpx, πn, xi ppqq, πn, xi ppq˘

ďCn`1`τ C`

1`|Znhpx, πin,xppqq|˘ (4.13) .

Next, we show that Znhpx, πn,xi ppqq is bounded. SinceVn`1h is uniformly Lipschitz con- tinuous in the variable x. On recalling (3.3), by the generalized mean value theorem [11, Theorem 2.3.7 ] there exists ΘPRd with |Θ|8ďC such that

(4.14) Vn`1h pXsn`1x , πin, xppqq “Vn`1h px, πn,xi ppqq ` xΘ, σnpxqξn? τy.

We multiply (4.14) with p1{τqσn´Tpxqξn and take the expectation to get Znhpx, πn, xi ppqq “1

τE“

Vn`1h pXsn`1x , πin, xppqqσ´Tn pxqξn? τ‰

“1 τE“

Vn`1h px, πn,xi ppqqσ´Tn pxqξn?

τ` xΘ, σnpxqξn?

τyσn´Tpxqξn? τ‰

“1 τE“

xΘ, σnpxqξn?

τyσn´Tpxqξn? τ‰ (4.15) .

ByAssumption (A2n and σ´Tn are uniformly bounded. Hence, it follows from (4.15) that

|Znhpx, πn,xi ppqq|ď1

τ}σn}8´Tn }8E“

|Θ|8n? τ|2

ďC.

(4.16)

We substitute (4.16), (4.13) into (4.12) and get that

|Vnhpx, pq| ď

I

ÿ

i“1

Cnψn, xi ppq “Cn,

where Cn:“Cn`1`τ C. Consequently, Vnhpx, pq, n“0,...,N is uniformly bounded by

C:“CN`CT.

4.3. Almost Hölder continuity in t.

Lemma 4.5. For any τą0, hą0 and xPRd, pP∆pIq the interpolant Vτh dened in (3.12) satises the following inequality

|Vτhps,x, pq ´Vτhpt, x, pq|ďC|s´t|1{2`Cτ1{2 @s,tP r0, Ts, where the constant Cą0 depends only on Assumptions (A1q(A4q.

Proof. We consider the piecewise linear Lagrange basis functions associated with Πτ as

χnptq “

$

’’

&

’’

%

t´tn´1

τ , fortP rtn´1,tns, tn`1´t

τ , fortP rtn,tn`1s, 0 otherwise,

(12)

forn“0,...,N and note that řN

n“0χnptq “1 for tP r0,Ts.

For s,tP r0, Ts lettP rtn, tn`1s and sP rtn`n1, tn`n1`1s. Hence, we deduce from (3.12) that

Vτhpt, x,pq ´Vτhps, x,pq “

1

ÿ

k“0

Vn`kh px,pqχn`kptq ´

1

ÿ

k1“0

Vn`nh 1`k1px,pqχn`n1`k1psq

1

ÿ

k“0 1

ÿ

k1“0

`Vn`kh px,pq ´Vn`nh 1`k1px,pq˘

χn`kptqχn`n1`k1psq.

(4.17)

forxPRd and pP∆pIq.

Since Vexprfs ďf we get for pmPNh, recall (3.6), (3.7), that Vn`kh px,pmq ďE“

Vn`k`1h pXsn`k`1n`k, x,pmq‰

`τ H`

tn`k, x, Zn`kh px,pmq,pm˘ (4.18) ,

withZn`kh px,pmq:“1τE“

Vn`k`1h pXsn`k`1n`k, x,pm´Tn`kξn`k? τ‰

. Using (4.16),Assumption (A5q we obtain from (4.18) that

Vn`kh px,pmq ´Vn`nh 1`k1px,pmq ďE“

Vn`k`1h pXsn`k`1n`k, x,pmq ´Vn`nh 1`k1px,pmq‰

`τ H`

tn`k, x, Zn`kh px,pmq,pm˘ ďE“

Vn`k`1h pXsn`k`1n`k, x,pmq ´Vn`nh 1`k1px,pmq‰

`Cτ`

1`C|Zn`kh px,pmq|˘ ďE“

Vn`k`1h pXsn`k`1n`k, x,pmq ´Vn`nh 1`k1px,pmq‰

`Cτ.

(4.19)

Recursively, we estimate the rst term on the right-hand side above using the corre- sponding analogue of (4.18) as

Vn`kh pXsn`k`1n`k, x,pmq ´Vn`nh 1`k1px,pmq ďE

Vn`k`2h pXsn`k`1,Xs

n`k, x n`k`1

n`k`2 ,pmq ´Vn`nh 1`k1px,pmq ı

`Cτ.

(4.20)

We substitute (4.20) into (4.19) and obtain on noting Xsn`k`2n`k, x “Xsn`k`1,Xs

n`k, x n`k`1

n`k`2 (cf.

(3.2)) that

Vn`kh px,pmq ´Vn`nh 1`k1px,pmq ďE

Vn`k`2h pXsn`k`2n`k, x,pmq ´Vn`nh 1`k1px,pmq ı

`C2τ.

Consequently, we get afterpn1`k1´k´2q recursive steps that Vn`kh px,pmq ´Vn`nh 1`k1px,pmq

ď E“

Vn`nh 1`k1pXsn`nn`k, x1`k1,pmq ´Vn`nh 1`k1px,pmq‰

` pn1`k1´kqCτ.

(4.21)

ByLemma 4.4and Assumption (A2qwe estimate the rst term on the right hand side of (4.21) as

E“

Vn`nh 1`k1px,pmq ´Vn`nh 1`k1pXsn`nn`k,x1`k1,pmq‰

ďC

E“

x´Xsn`nn`k,x1`k1

ďC

n`n1`k1´1

ÿ

j“n`k

E

σjpXsjn`k, xq

2

τ

1{2

ďC|tn`k´tn`n1`k1|1{2, and get

Vn`kh px,pmq ´Vn`nh 1`k1px,pmq ďC|tn`k´tn`n1`k1|1{2`Cptn`n1`k1´tn`kq.

SincetP rtn, tn`1s and sP rtn`n1, tn`n1`1s it follows for k, k1“0,1 that (4.22) Vn`kh px,pmq ´Vn`nh 1`k1px,pmq ďCT1{2|t´s|1{2`Cτ1{2,

(13)

for pmPNh.

On recalling that Vnhpx,pmq “Vrnhpx,pmqfor pmPNh, Vnhpx,pq “Vexp

Vrnhpx,¨q‰ ppqfor pP∆pIq, we deduce analogically as in the proof of Lemma 4.4 (cf. (4.9), (4.10)), that the inequality (4.22) holds for any pP∆pIq. Hence, substituting (4.22) into (4.17) for pP∆pIq implies the inequality

(4.23) Vτhpt,x,pq ´Vτhps,x,pq ďC|t´s|1{2`Cτ1{2.

After reverting the role of s and t and repeating the above steps we get the statement

of the lemma.

4.4. Monotonicity.

Lemma 4.6. Let φ1, φ2:RdÑR be two uniformly Lipschitz continuous functions that satisfy 0ď pφ1´φ2q ďC. Then for any xPRd, pP∆pIq, τą0, n“0,...,N´1 it holds that

E“

φ1pXsn`1x q‰

`τ H

´

tn, x,1 τE“

φ1pXsn`1xn´Tpxqξn? τ‰

, p

¯

ěE“

φ2pXsn`1x q‰

`τ H

´

tn, x, 1 τE“

φ2pXsn`1xn´Tpxqξn? τ‰

, p

¯

´Cτ? τ , where Cą0 is a constant which depends only on Assumptions (A1q(A5q.

Proof. We set H:“E“

1´φ2qpXsn`1x q‰

`τ H

´

tn, x,1 τE“

φ1pXsn`1x´Tn pxqξn? τ‰

, p

¯

´τ H

´

tn, x, 1 τE“

φ2pXsn`1xn´Tpxqξn

?τ‰ , p

¯ .

By Assumption (A5q H is uniformly Lipschitz continuous in the third variable, hence using the generalized mean value theorem [11, Theorem 2.3.7 ] there exists a ΘPRd,

|Θ|8ďC such that H“E“

1´φ2qpXsn`1x q` 1`@

Θ,σn´Tpxqξn? τD˘ı

“E“

1 Ckσ´1k8n|? τě1

(pφ1´φ2qpXsn`1x q` 1`@

Θ, σn´Tpxqξn? τD˘ı

`E“

1 Ckσ´1k8n|?

τă1(pφ1´φ2qpXsn`1x q` 1`@

Θ, σn´Tpxqξn? τD˘ı

. (4.24)

Next, we show that the second term of the right hand side of (4.24) is positive. Since pφ1´φ2q ě0, it remains to examine the term p1`@

Θ, σn´Tpxqξn? τD˘

. We note that 1`@

Θ, σ´Tn pxqξn? τD

ě1´ }Θσ´1n }8n|?

τě1´Ckσ´1n k8n|? τ . For Ckσ´1k8n|?

τă1 it holds` 1`@

Θ, σ´Tn pxqξn

?τD˘

ą0 and hence (4.25) E“

1 Ckσ´1k8n|?

τă1(pφ1´φ2qpXsn`1x q` 1`@

Θ, σn´Tpxqξn? τD˘ı

ě0.

Using(4.25) we deduce from (4.24) that HěE

” 1

n

?τ|2ě1{R(

@Θ,pφ1´φ2qpXsn`1x´Tn pxqξn? τDı

, where R:“C2´1k28.

(14)

On noting that |ξn|“|ξn1|`...`|ξnd|“ p1`...`1q “d we deduce E

” 1

n|2τě1{R(|ξn|? τ

ı

“1 d2τě1{R(d?

τ“1 d2τě1{R(R1 Rd?

τď1 d2τě1{R(Rτ d3?

τďRτ d3? τ . Since ´pφ1´φ2q ě ´C we conclude

Hě ´E

” 1

n

?τ|2ě1{R(pφ1´φ2qpXsn`1x qkΘσ´1k8 ξn?

τ ı

ě ´CE

” 1

n

?τ|2ě1{R(|ξn? τ|ı

“ ´CRd3τ? τ .

5. Convergence of the numerical approximation

In this section we prove the convergence of numerical approximation, see Theo- rem 5.1below, in several steps. First, we show that, up to a subsequence, the sequence tVτhuh, τą0 admits a limit denoted byw. We then prove the viscosity super/sub-solution property of every accumulation pointw. Hence, by the uniqueness property of the vis- cosity solution, see [4], we may conclude that the whole sequence tVτhuh, τą0 converges to the viscosity solution.

Theorem 5.1. UnderAssumptions (A1q(A5q the numerical solutionVτh converges to the viscosity solution of (1.1) (uniformly on compact subsets of r0, Ts ˆRdˆ∆pIq) in the sense that for all pt1, x1, p1q Ñ pt, x, pq it holds that

τ,hÑ0lim Vτhpt1,x1,p1q “Vpt, x, pq,

where V is the unique uniformly bounded and continuous viscosity solution to (1.1) which is convex and uniformly Lipschitz continuous in p.

5.1. Existence of a limit.

Lemma 5.2. The sequencetVτhuτ, hą0 admits a subsequence which converges uniformly on every compact subset of r0, Ts ˆRdˆ∆pIq to a uniformly bounded and continuous function w which is convex and uniformly Lipschitz continuous in p.

Proof. The proof is a consequence of a slight modication of the the ArzelàAscoli theorem, [26, Section III.3]. The equi-boundedness is granted by Lemma 4.4 and the equi-continuity is granted byLemmas 4.2, 4.4 and 4.5.

5.2. Viscosity solution properties and uniqueness of the limit. Below, we show that every accumulation point w of tVτhuh, τą0 from Lemma 5.2 is a viscosity sub- and super-solution to (1.1) which by uniqueness of the viscosity solution V implies Theorem 5.1.

5.2.1. Viscosity subsolution property ofw.

Proposition 5.3. Every accumulation point wof the sequence tVτhuτ, hą0 is a viscosity subsolution of (1.1) on r0, Ts ˆRdˆ∆pIq.

Referenzen

ÄHNLICHE DOKUMENTE

Given a Hamilton-Jacobi equation, a general result due to Barles-Souganidis [3] says that any \reasonable" approximation scheme (based f.e. on nite dierences, nite elements,

Overall we implemented ten levels of information, a com- pletely uninformed trader (random trader), I0 and nine informed traders with different levels of information from I1 to

The model is reduced to a closed-loop differential game ([Krasovskii, Subbotin, 1988], [Krasovskii, 1985], [Kleimenov, 1993]) and analysed via methods of the theory of

In optimal control this corresponds t o the case when the optimal trajectory of the Bolza problem is unique for every initial state and the optimal feedback is a n

The purpose of this paper is to characterize classical and lower semicon- tinuous solutions to the Hamilton-Jacobi-Isaacs partial differential equa- tions associated with

In this paper the author makes stronger assumptions which elim- inate the discrimination of t h e control v ( t ) and make it possible to define the

The problem of minimizing nonconvex nondifferentiable func- tions poses a considerable challenge to specialists in mathe- matical programming.. Most of the difficulties arise from

disturbing phenomenon of “not just the confusion but the pixilation of war and games on the same screen.” Today we explore these arguments and their implications for