SFB 649 Discussion Paper 2005-038
Discretisation of Stochastic Control
Problems for Continuous Time Dynamics with
Delay
Markus Fischer*
Markus Reiss*
* Weierstraß-Institut für Angewandte Analysis und Stochastik (WIAS), Germany
This research was supported by the Deutsche
Forschungsgemeinschaft through the SFB 649 "Economic Risk".
http://sfb649.wiwi.hu-berlin.de ISSN 1860-5664
SFB 649, Humboldt-Universität zu Berlin
S FB 6 4 9 E C O N O M I C R I S K B E R L I N
time dynamis with delay
∗
Markus Fisher
Weierstraÿ-Institutfür Angewandte
Analysis und Stohastik (WIAS)
Mohrenstr. 39
10117 Berlin
Germany
Markus Reiÿ
Weierstraÿ-Institutfür Angewandte
Analysisund Stohastik (WIAS)
Mohrenstr. 39
10117Berlin
Germany
August 2,2005
Abstrat
Asamain stepin thenumerialsolutionofontrol problemsin ontinuoustime,
theontrolledproessisapproximatedbysequenesofontrolledMarkovhains,thus
disretizingtimeandspae. Anewfeaturein thisontextistoallowfordelayin the
dynamis. The existene of an optimalstrategy with respet to theost funtional
an beguaranteedin thelassofrelaxedontrols. Weakonvergeneof theapproxi-
matingextended Markovhains to theoriginalproess togetherwithonvergeneof
theassoiatedoptimalstrategiesisestablished.
1 Introdution
Ageneralstrategyforrenderingontrolproblemsinontinuoustimeaessibletonumerial
omputation isthe following: Taking asa startingpoint the original dynamis, onstrut
afamilyofontrol problemsin disretetime with disretestate spaeanddisretizedost
funtional. Standard numerial shemes an be applied to ndan optimal ontrol and to
alulatetheminimalostsfor eahofthedisreteontrolproblems. Theimportantpoint
to establish is then whether the disrete optimal ontrols and minimal osts onverge to
theontinuous-time limitasthemeshsizeofthe disretisationtendstozero. Ifthatisthe
ase,thenthe disreteontrolproblemsareavalidapproximationtothe originalproblem.
Thedynamisofthe ontrolproblemweareinterestedinaredesribedbyastohasti
delaydierentialequation(SDDE).Thus,thefutureevolutionofthedynamismaydepend
not only on the present state, but also on the past evolution. For an exposition of the
general theory of SDDEs see Mohammed (1984) or Mao (1997). The development of
∗
FinanialsupportbytheDFG-Sonderforshungsbereih649EonomiRiskisgratefullyaknowledged.
Hu et al. (2004) and the referenes therein. In Calzolari et al. (2003), segmentwise Euler
shemes are used in a non-linear ltering problem for approximating the state proess,
whih isgiven byan SDDE. Numerialproedures for deterministi ontrol with delayed
dynamis have already been used in appliations, see Bouekkine et al. (2005) for the
analysis of an eonomi growth model. The algorithm proposed there is based on the
disretisationmethodstudied here, but no formalproofof onvergene isgiven.
Also the mathemati al analysis of stohasti ontrol problems with time delay in the
state equation has been the objet of reent works, see e.g. Elsanosi et al. (2000) for
ertain expliitly available solutions, Øksendal and Sulem (2001) for the derivation of a
maximum prinipleandLarssen(2002)forthedynamiprogramming approah. Although
one an invoke the dynami programmin g priniple to derive a Hamilton-Jaobi-Bellman
equationforthevaluefuntion,suhanequationwillingeneralbeanon-lineardierential
equation with innite-dim ensional state spae. A dierent approah to treat stohasti
ontrol problems with delay is based on representing the state equation as an evolution
equationin Hilbertspae, seeBensoussan et al. (1992).
The lass of ontrol problems is speied in Setion 2. In Setion 3 we prove the
existeneofoptimalstrategiesforthoseproblemsinthe lassofrelaxedontrols. Setion4
introduestheapproximatingproessesandprovidesatightnessresult. Finally,inSetion5
the disrete ontrol problems are dened and the onvergene of the minimal osts and
optimalstrategies isshown.
2 The ontrol problem
We onsider the ontrol of a dynamial system given by a one-dimensional stohasti
delay dierential equation (SDDE) driven by a Wiener proess. Both drift and diusion
oeient may depend on the solution's history a ertain amount of time into the past.
Let
r > 0
denote the delay length , i.e. the maximal length of dependene on the past.For simpliity, we restritattention to the ase, where onlythe driftterm an be diretly
ontrolled.
Typially, the solutionproessof anSDDEdoesnotenjoythe Markovproperty, while
the segment proess assoiated with thatsolution does. For a real-valuedàdlàg funtion
(i.e.,right-ontinuous funtionwith left-handlimits)
ψ
livingonthe time interval[ − r, ∞ )
the segment attime
t ∈ [0, ∞ )
isdened to be the funtionψ t : [ − r, 0] → R , ψ t (s) := ψ(t+s).
Thus,thesegmentproess
(X t ) t≥0
assoiatedwithareal-valuedàdlàgproess(X(t)) t≥−r
takes its values in
D 0 := D([ − r, 0])
, the spae of all real-valued àdlàg funtions on the interval[ − r, 0]
. There are two natural topologies onD 0
. The rst is the one indued bythe supremum norm, whih we denote by
k . k ∞
. The seond is the Skorohod topology ofand the uniform topology lies in the dierent evaluationof onvergene of funtions with
jumps,whihappearnaturallyasinitialsegmentsanddisretizedproesses. Forontinuous
funtions both topologies oinide. Similar statements hold for
D ∞ := D([ − r, ∞ ))
andD ˜ ∞ := D([0, ∞ ))
, the spaes of all real-valued àdlàg funtions on the intervals[ − r, ∞ )
and
[0, ∞ )
, respetively. The spaesD ∞
andD ˜ ∞
will always be supposed to arry theSkorohodtopology, while
D 0
will anoniallybeequipped with the uniform topology.Let
(Γ, d Γ )
be a ompat metri spae, the spae of ontrol ations. Denote byb
the drift oeient of the ontrolled dynamis, and by
σ
the diusion oeient. Let(W (t)) t≥0
be a one-dimensional standard Wiener proess on a ltered probability spae(Ω, F , ( F t ) t≥0 , P)
satisfyingtheusualonditions,andlet(u(t)) t≥0
beaontrolproess,i.e.an
( F t )
-adapted measurable proess with values inΓ
. Consider the ontrolledSDDE(1)
dX(t) = b X t , u(t)
dt + σ(X t ) dW (t), t ≥ 0.
Theontrol proess
u(.)
together with its stohastibasisinludingthe Wiener proessisalled an admissible ontrol if, for every deterministi initial ondition
ϕ ∈ D 0
, equation(1)hasa uniquesolution whih isalso weakly unique. Write
U ad
for the setof admissibleontrolsofequation(1). Thestohastibasisoming withan admissibleontrolwill often
be omittedin the notation.
Asolutioninthesenseusedhereisanadaptedàdlàgproessdenedonthe stohasti
basisoftheontrolproesssuhthattheintegral versionofequation(1)issatised. Given
a ontrol proess together with a standard Wiener proess, a solution to equation (1) is
unique ifit isindistinguishable from anyother solutionalmost surely satisfying the same
initial ondition. Asolution is weakly unique ifit hasthe same lawasanyother solution
with the same initial distribution and satisfying equation (1) for a ontrol proess on
a possibly dierent stohasti basis so that the joint distribution of ontrol and driving
Wienerproessisthe sameforbothsolutions. Letus speifyregularityassumptionsto be
imposedon the oeients
b
andσ
:(A1) Càdlàg funtionals: the mappings
(ψ, γ) 7→
t 7→ b(ψ t , γ), t ≥ 0
, ψ 7→
t 7→ σ(ψ t ), t ≥ 0
dene measurable funtionals
D ∞ × Γ → D ˜ ∞
andD ∞ → D ˜ ∞
, respetively, whereD ∞
,D ˜ ∞
areequipped with the Borelσ
-algebras.(A2) Continuityof the drift oeient: there isa ountable subset of
[ − r, 0]
, denoted byI ev
, suh thatfor everyt ≥ 0
the funtion dened byD ∞ × Γ ∋ (ψ, γ) 7→ b(ψ t , γ)
is ontinuouson
D ev (t) × Γ
uniformlyinγ ∈ Γ
, whereD ev (t) := { ψ ∈ D ∞ | ψ
isontinuous att + s
for alls ∈ I ev } .
(A3) Global boundedness:
| b |
,| σ |
are bounded byaonstantK > 0
.(A4) UniformLipshitzondition: Thereisaonstant
K L > 0
suhthatforallϕ, ψ ∈ D 0
,all
γ ∈ Γ
| b(ϕ, γ) − b(ψ, γ) | + | σ(ϕ) − σ(ψ) | ≤ K L k ϕ − ψ k ∞ .
(A5) Elliptiity of the diusion oeient:
σ(ϕ) ≥ σ 0
for allϕ ∈ D 0
, whereσ 0 > 0
is apositive onstant.
Assumptions(A1)and (A4)onthe oeientsallowus toinvokeTheorem V.7in Protter
(2003:p.253), whihguarantees the existeneofa uniquesolution tothe ontrolledSDDE
(1)for every pieewise onstant ontrol attaining only nitely manydierent values. The
boundednessAssumption(A3)posesnolimitationexeptfortheinitialonditions,beause
the state evolution will be stopped when the state proess leaves a bounded interval.
Assumption(A2) allows us to use segmentwise approximations of the solution proess,
see the proof of Proposition 1. The assumptions imposed on the drift oeient
b
aresatised,for example,by
b(ϕ, γ) := f ϕ(r 1 ), . . . , ϕ(r n ), Z 0
−r
ϕ(s)w(s) ds
· g(γ),
where
r 1 , . . . , r n ∈ [ − r, 0]
arexed,f
,g
arebounded ontinuous funtionsandf
is Lips-hitz, and the weight funtion
w
liesinL 1 ([ − r, 0])
.Weonsiderontrolproblemsin theweakformulation (f.YongandZhou,1999:p.64).
Given an admissible ontrol
u(.)
and a deterministi initial segmentϕ ∈ D 0
, denote byX ϕ,u
the unique solution to equation (1). LetI
be a ompat interval with non-emptyinterior. Denethestopping time
τ ϕ,u T ¯
ofrstexitfromtheinteriorofI
beforetimeT > ¯ 0
by
(2)
τ ϕ,u T ¯ := inf { t ≥ 0 | X ϕ,u (t) ∈ / int(I) } ∧ T . ¯
In order to dene the osts, we presribe a ost rate
k : R × Γ → [0, ∞ )
and a boundaryost
g : R → [0, ∞ )
, whih are (jointly) ontinuous bounded funtions. Letβ ≥ 0
denotethe exponential disount rate. Thendene the ost funtional on
D 0 × U ad
by(3)
J (ϕ, u) := E Z τ
0
exp( − βs) · k X ϕ,u (s), u(s)
ds + g X ϕ,u (τ )
,
where
τ = τ ϕ,u T ¯
. Our aimis to minimizeJ (ϕ, .)
. We introdue the value funtion(4)
V (ϕ) := inf { J (ϕ, u) | u ∈ U ad } , ϕ ∈ D 0 .
The ontrol problem now onsists in alulating the funtion
V
and nding admissibleontrols that minimize
J
. Suh ontrol proesses are alled optimal ontrols or optimalstrategies.
In the lass
U ad
of admissible ontrols it may happen that there is no optimal ontrol(Kushnerand Dupuis, 2001:p.86). A way out isto enlarge the lassof ontrols, allowing
forso-alled relaxedontrols,sothatthe existeneofanoptimal(relaxed)ontrolisguar-
anteed, whiletheinmumofthe ostsoverthe newlassoinides withthe valuefuntion
V
asgiven by(4).A deterministi relaxed ontrol is a positive measure
ρ
on the Borelσ
-algebraB (Γ × [0, ∞ ))
suh that(5)
ρ(Γ × [0, t]) = t
for allt ≥ 0.
For eah
G ∈ B (Γ)
, the funtiont 7→ ρ(G × [0, t])
is absolutely ontinuous with respettoLebesgue measureon
[0, ∞ )
byvirtueof property(5). Denote byρ(., G) ˙
anyLebesguedensityof
ρ(G × [0, .])
. Thefamilyofdensitiesρ(., G) ˙
,G ∈ B (Γ)
, anbehosenina Borelmeasurable waysuhthat
ρ(t, .) ˙
isa probabilitymeasureonB (Γ)
foreaht ≥ 0
, andρ(B ) =
Z ∞
0
Z
Γ
1 {(γ,t)∈B} ρ(t, dγ) ˙ dt
for allB ∈ B (Γ × [0, ∞ )).
Denoteby
R
the spaeofdeterministi relaxedontrolswhih isequipped withthe weak- ompat topology indued by the following notion of onvergene: a sequene(ρ n ) n∈
Nof
relaxedontrolsonverges to
ρ ∈ R
ifZ
Γ×[0,∞)
g(γ, t) dρ n (γ, t) n→∞ −→
Z
Γ×[0,∞)
g(γ, t) dρ(γ, t)
for allg ∈ C c (Γ × [0, ∞ )),
where
C c (Γ × [0, ∞ ))
isthespaeofallreal-valuedontinuousfuntionsonΓ × [0, ∞ )
havingompatsupport. Underthe weak-ompattopology,
R
isa(sequentially)ompat spae.Suppose
(ρ n ) n∈
Nis a onvergent sequene in
R
with limitρ
. GivenT > 0
, letρ n|T
denote the restrition of
ρ n
to the Borelσ
-algebra onΓ × [0, T ]
, and denote byρ |T
therestrition of
ρ
toB (Γ × [0, T ])
. Thenρ n|T
,n ∈ N
,ρ |T
areall nite measures,and(ρ n|T )
onvergesweaklyto
ρ |T
.A relaxed ontrol proess is an
R
-valued random variableR
suh that the mappingω 7→ R(G × [0, t])(ω)
isF t
-measurable for allt ≥ 0
,G ∈ B (Γ)
. For a relaxed ontrolproess
R
equation (1)takes onthe form(6)
dX (t) = Z
Γ
b(X t , γ) ˙ R(t, dγ )
dt + σ(X t ) dW (t), t ≥ 0,
where
( ˙ R(t, .)) t≥0
is the family of derivative measures assoiated withR
. The family( ˙ R(t, .))
an be onstruted in a measurable way (f. Kushner, 1990:p.52). A relaxedontrol proess together with its stohasti basis inluding the Wiener proess is alled
admissible relaxed ontrol if, for every deterministi initial ondition, equation (6) has
a unique solution whih is also weakly unique. Any ordinary ontrol proess
u
an berepresented asa relaxedontrol proessbysetting
R(B ) :=
Z ∞
0
Z
Γ
1 {(γ,t)∈B} δ u(t) (dγ) dt, B ∈ B (Γ × [0, ∞ )),
where
δ γ
is the Dirameasureatγ ∈ Γ
.Denoteby
U ˆ ad
the setofalladmissible relaxedontrols. Insteadof (3)wedeneaostfuntionalon
D 0 × U ˆ ad
by(7)
J(ϕ, R) := ˆ E Z τ
0
Z
Γ
exp( − βs) · k X ϕ,R (s), γ R(s, dγ) ˙ ds + g X ϕ,R (τ )
,
where
X ϕ,R
is the solution to equation (6) with initial segmentϕ
andτ
is dened inanalogyto (2). Instead of (4) asvalue funtionwehave
(8)
V ˆ (ϕ) := inf { J ˆ (ϕ, R) | R ∈ U ˆ ad } , ϕ ∈ D 0 .
Theostfuntional
J ˆ
dependsonly onthe joint distributionof the solutionX ϕ,R
and theunderlying ontrol proess
R
, sineτ
, the time horizon,is a deterministi funtion of the solution. The distribution ofX ϕ,R
, in turn, is determined bythe initial onditionϕ
andthejointdistribution oftheontrolproessandits aompanying Wienerproess. Letting
thetime horizonvary,we mayregard
J ˆ
asafuntionofthe lawof(X, R, W, τ )
, thatis, tobe dened on a subset of the set of probability measures on
B (D ∞ × R × D ˜ ∞ × [0, ∞ ])
.Thedomainof denitionof
J ˆ
isdetermined bythe lassofadmissible relaxedontrolsforequation(6),thedenitionofthetimehorizonandthedistributionsoftheinitialsegments
X 0
.The idea in proving existene of an optimal strategy is to hekthat
J ˆ (ϕ, .)
is a (se-quentially)lowersemi-ontinuou sfuntiondenedona(sequentially)ompatset. Itthen
follows fromatheorem byWeierstraÿ (f. Yong andZhou, 1999:p.65) that
J ˆ (ϕ, .)
attainsits minimum at some point of its ompat domain. The following proposition gives the
analogueofTheorem10.1.1inKushnerandDupuis(2001:pp.271-275)foroursetting. We
present the proof in detail,beause the identiation ofthe limit proess isdierent from
the lassialase.
Proposition1. Assume(A1)(A4). Let
((R n , W n )) n∈
Nbeanysequeneofadmissiblere-
laxedontrolsforequation (6),dened onalteredprobability spae
(Ω n , F n , ( F t n ) t≥0 , P n )
.Let
X n
bea solutiontoequation (6)underontrol(R n , W n )
withdeterministi initialon- ditionϕ n ∈ D 0
, and assume that(ϕ n )
tends toϕ
uniformly for someϕ ∈ D 0
. For eahn ∈ N
, letτ n
be an( F t n )
-stopping time. Then((X n , R n , W n , τ n )) n∈
Nis tight.
Denote by
(X, R, W, τ )
a limit point of the sequene((X n , R n , W n , τ )) n∈
N. Dene
a ltration by
F t := σ(X(s), R(s), W (s), τ 1 {τ≤t} , s ≤ t)
,t ≥ 0
. ThenW (.)
is an( F t )
-adaptedWienerproess,
τ
isan( F t )
-stoppingtime,(R, W )
isanadmissiblerelaxedontrol,and
X
isa solution to (6) under(R, W )
withinitial onditionϕ
.Proof. Tightness of
(X n )
follows from the Aldous riterion (f. Billingsley, 1999:pp.176 - 179): givenn ∈ N
, anybounded( F t n )
-stoppingtimeν
andδ > 0
we haveE n
X n (ν + δ) − X n (ν)
2 F ν
≤ 2K 2 δ(δ + 1)
asaonsequene ofAssumption(A3)andthe Itisometry. Notiethatwehave
X n (0) → X(0)
asn → ∞
byhypothesis. The sequenes(R n )
and(τ n )
are tight,beause the valuespaes
R
and[0, ∞ ]
, respetively, are ompat. The sequene(W n )
is tight, sine allW n
indue the same measure. Finally, omponentw ise tightness implies tightness of the produt (f.Billingsley, 1999:p.65).Byabuseofnotation,wedonotdistinguishbetweentheonvergentsubsequeneandthe
originalsequeneandweassumethat
((X n , R n , W n , τ n ))
onvergesweaklyto(X, R, W, τ )
.The random time
τ
is an( F t )
-stopping time by onstrution of the ltration. Likewise,R
is( F t )
-adapted by onstrution, and it is indeed a relaxed ontrol proess, beauseR(t, Γ) = t
,t ≥ 0
,P
-almost surely by weak onvergene of the relaxed ontrol proesses(R n )
toR
. TheproessW
hasWiener distributionandontinuouspathswith probability one, being the limit of standard Wiener proesses. To hek thatW
is an( F t )
-Wienerproess,weusethe martingaleproblemharaterization ofBrownian motion. Tothis end,
for
g ∈ C c (Γ × [0, ∞ ))
,ρ ∈ R
dene the pairing(g, ρ)(t) :=
Z
Γ×[0,t]
g(γ, s) dρ(γ, s), t ≥ 0.
Notie that real-valued ontinuous funtions on
R
an be approximated by funtions of the formR ∋ ρ 7→ H ˜ (g j , ρ)(t i ), (i, j) ∈ N p × N q
∈ R ,
where
p
,q
are natural numbers,{ t i | i ∈ N p } ⊂ [0, ∞ )
, andH ˜
,g j
,j ∈ N q
, are suitable ontinuous funtions with ompat support andN N := { 1, . . . , N }
for anyN ∈ N
. Lett ≥ 0
,t 1 , . . . , t p ∈ [0, t]
,h ≥ 0
,g 1 , . . . , g q
be funtions inC c (Γ × [0, ∞ ))
, andH
be aontinuous funtion of
2p + p · q + 1
arguments with ompat support. SineW n
is an( F t n )
-Wiener proess for eahn ∈ N
, we havefor allf ∈ C 2 c ( R )
E n
H X n (t i ), (g j , R n )(t i ), W n (t i ), τ n 1 {τ n
≤t} , (i, j) ∈ N p × N q
·
f W n (t + h)
− f W n (t)
− 1 2
Z t+h
t
∂ 2 f
∂x 2 W n (s) ds
= 0.
Bythe weak onvergene of
((X n , R n , W n , τ n )) n∈
Nto
(X, W, R, τ )
we seethatE
H X(t i ), (g j , R)(t i ), W (t i ), τ 1 {τ≤t} , (i, j) ∈ N p × N q
·
f W (t + h)
− f W (t)
− 1 2
Z t+h
t
∂ 2 f
∂x 2 W (s) ds
= 0
for all
f ∈ C 2 c ( R )
. AsH
,p
,q
,t i
,g j
vary over all possibilities, the orresponding random variablesH(X(t i ), (g j , R)(t i ), W (t i ), τ 1 {τ≤t} , (i, j) ∈ N p × N q )
indue theσ
-algebraF t
.Sine
t ≥ 0
,h ≥ 0
werearbitrary, itfollowsthatf W (t)
− f W (0)
− 1 2
Z t
0
∂ 2 f
∂x 2 W (s)
ds, t ≥ 0,
isan
( F t )
-martingale for everyf ∈ C 2
c ( R )
. Consequently,W
isan( F t )
-Wiener proess.Itremains toshowthat
X
solvesequation(6)under ontrol(R, W )
withinitial ondi-tion
ϕ
. NotiethatX
hasontinuouspathson[0, ∞ ) P
-almostsurely,beausetheproess(X(t)) t≥0
is the weak limit inD ˜ ∞
of ontinuous proesses. FixT > 0
. We have to hekthat
P
-almost surelyX(t) = ϕ(0) +
Z t 0
Z
Γ
b(X s , γ) ˙ R(s, dγ) ds + Z t
0
σ(X s ) dW (s)
for allt ∈ [0, T ].
By virtue of the Skorohod representation theorem (f. Billingsley, 1999:p.70) we may
assume that the proesses
(X n , R n , W n )
,n ∈ N
, are all dened on the same probability spae(Ω, F , P)
as(X, R, W )
and that onvergene of((X n , R n , W n ))
to(X, R, W )
isP
-almostsure. Sine
X
,W
have ontinuous paths on[0, T ]
and(ϕ n )
onverges toϕ
in theuniform topology, onends
Ω ˜ ∈ F
withP( ˜ Ω) = 1
suh thatfor allω ∈ Ω ˜ sup
t∈[−r,T ]
X n (t)(ω) − X(t)(ω)
n→∞ −→ 0, sup
t∈[−r,T ]
W n (t)(ω) − W (t)(ω)
n→∞ −→ 0,
andalso
R n (ω) → R(ω)
inR
. Letω ∈ Ω ˜
. Werst showthatZ t
0
Z
Γ
b X s n (ω), γ R ˙ n (s, dγ)(ω) ds n→∞ → Z t
0
Z
Γ
b X s (ω), γ R(s, dγ)(ω) ˙ ds
uniformlyin
t ∈ [0, T ]
. Asaonsequene ofAssumption(A4), the uniform onvergeneofthe trajetories on
[ − r, T ]
and property(5) ofthe relaxed ontrols, we haveZ
Γ×[0,T]
b X s n (ω), γ
− b X s (ω), γ
dR n (γ, s)(ω) n→∞ → 0.
ByAssumption(A2), we nd aountable set
A ω ⊂ [0, T ]
suhthatthe mapping(γ, s) 7→
b(X s (ω), γ)
is ontinuous in all(γ, s) ∈ Γ × ([0, T ] \ A ω )
. SineA ω
is ountable we haveR(ω)(Γ × A ω ) = 0
. Hene,bythegeneralizedmappingtheorem(f.Billingsley,1999:p.21), we obtainZ
Γ×[0,t]
b X s (ω), γ
dR n (γ, s)(ω) n→∞ → Z
Γ×[0,t]
b X s (ω), γ
dR(γ, s)(ω).
Theonvergene is again uniform in
t ∈ [0, T ]
, asb
is bounded andR n
,n ∈ N
,R
areallpositive measures with mass
T
onΓ × [0, T ]
.Denote by
( ˆ X(t)) t≥−r
the unique solution to equation (6) under ontrol(R, W )
withinitial ondition
ϕ
. If we an showthat(9)
sup
t∈[0,T ]
X n (t) − X(t) ˆ
n→∞ −→ 0
in probabilityP,
then
X
willbeindistinguishablefromX ˆ
on[ − r, T ]
andwillsolve(6)aswell. Letusdeneàdlàgproesses
C n
,n ∈ N
, on[0, ∞ )
byC n (t) := ϕ n (0) +
Z
Γ×[0,t]
b(X s n , γ) dR n (γ, s), t ≥ 0,
anddene
C
in analogytoC n
. WealreadyknowthatC n (t) → C(t)
holdsuniformlyovert ∈ [0, T ]
foranyT > 0
withprobabilityone. DeneoperatorsF n
,n ∈ N
, mappingàdlàg proesses to àdlàgproesses byF n (Y )(t)(ω) := σ
[ − r, 0] ∋ s 7→
Y (t+s)(ω)
ift ≥ − s, ϕ n (t+s)
else
, t ≥ 0, ω ∈ Ω,
and dene
F
in the same way asF n
. Assumption (A4) and the uniform onvergene of(ϕ n )
toϕ
imply thatF n ( ˆ X)
onverges toF( ˆ X)
uniformly on ompats in probability (onvergene in up). Observing thatX n
solvesX n (t) = C n (t) + Z t
0
F n (X n )(s − ) dW n (s), t ≥ 0,
andanalogouslyfor
X ˆ
, TheoremV.15inProtter(2003:p.265 )assertsthat(X n )
onvergesto
X ˆ
in upand (9)follows.If the time horizon were deterministi, then the existene of optimal strategies in the
lassofrelaxedontrolswouldbelear. Givenaninitialondition
ϕ ∈ D 0
,onewouldseletasequene
((R n , W n )) n∈
Nsuhthat
(J (ϕ, R n ))
onvergestoitsinmum. ByProposition1, asuitablesubsequeneof((R n , W n ))
andtheassoiatedsolutionproesseswouldonvergeweaklyto
(R, W )
andtheassoiated solutionto equation(6). Takingintoaount (7), thedenitionof the osts,this in turn wouldimply that
J (ϕ, .)
attainsits minimum value atR
or,more preisely,(X, R, W )
.Asimilarargumentisstillvalid,ifthetimehorizon dependsontinuouslyonthepaths
with probability one under every possible solution. Thatis to say, the mapping
ˆ
τ : D ∞ → [0, ∞ ], τ ˆ (ψ) := inf { t ≥ 0 | ψ(t) ∈ / int(I) } ∧ T ¯
(10)
is Skorohod ontinuous with probability one under the measure indued by any solution
X ϕ,R
,R
anyrelaxedontrol. Thisisindeedtheaseifthediusionoeientσ
isboundedawayfrom zeroasrequired byAssumption(A5).
Byintroduingrelaxedontrols, we haveenlarged the lassofpossible strategies. The
inmumoftheosts,however,remainsthesameforthenewlass. Thisisaonsequeneof
onstant ordinarystohastiontrolswhihattainonly anitenumberofdierentontrol
values. A proof of this assertion is given in Kushner (1990:pp.59-60) in ase the time
horizon is nite, and extended to the ase of ontrol up to an exit time in Kushner and
Dupuis(2001:pp.28 2-286). Notiethatnothinghingesonthepreseneor abseneofdelay
in the ontrolled dynamis. Letus summarize our ndings.
Theorem 1. Assume (A1)(A5). Given any deterministi initial ondition
ϕ ∈ D 0
, therelaxed ontrol problem determined by (6) and (7) possesses an optimal strategy, and the
minimalosts are the same as for the original ontrol problem as dened by (1) and (3).
4 Approximating hains
In order to onstrut nite-dimensional approximations to our ontrol problem, we dis-
retize time and state spae. Denote by
h > 0
the mesh size of an equidistant time disretization starting at zero. LetS h := √
h Z
be the orresponding state spae, and setI h := I ∩ S h
. Notie thatS h
is ountable andI h
is nite. LetΛ h : R → S h
be around-o funtion. We will simplify things even further by onsidering only mesh sizes
h = M r
for someM ∈ N
, wherer
is the delay length. The numberM
will be referred toasdisretizationdegree .
Theadmissibleontrolsforthenite-dimen sio nalontrolproblemsorrespondtopiee-
wise onstant proesses in ontinuous time. A time-disrete proess
u = (u(n)) n∈
N0
on(Ω, F , P)
with values inΓ
is a disrete admissible ontrol of degreeM
ifu
takes on onlynitely manydierent values in
Γ
andu(n)
isF nh
-measurable for alln ∈ N 0
. Denote by(˜ u(t)) t≥0
the pieewise onstant àdlàginterpolatio n tou
.We allatime-disrete proess
(ξ(n)) n∈{−M,...,0}∪
N on(Ω, F , P)
adisrete hain of de-gree
M
if(ξ(n))
takesitsvaluesinS h
andξ(n)
isF nh
-measurableforalln ∈ N 0
. Inanalogy tou ˜
,write( ˜ ξ(t)) t≥−r
fortheàdlàginterpolatio ntothedisretehain(ξ(n)) n∈{−M,...,0}∪
N.We denoteby
ξ ˜ t
theD 0
-valued segment ofξ(.) ˜
at timet ≥ 0
.Let
ϕ ∈ D 0
beadeterministiinitialondition, andsupposewearegivenasequene of disrete admissibleontrols(u M ) M ∈
N,thatisu M
isadisrete admissibleontrolofdegreeM
on a stohasti basis(Ω M , F M , ( F t M ), P M )
for eahM ∈ N
. In addition, suppose thatthe sequene(˜ u M )
ofinterpolated disreteontrolsonverges weaklyto somerelaxed ontrolR
. We are then looking for a sequene approximatingthe solutionX
of equation(1)under ontrol
(R, W )
with initial onditionϕ
, where the WienerproessW
has to beonstrutedfrom the approximatingsequene.
Given
M
-step or extended Markov transition funtionsp M : S M h +1 × Γ × S h → [0, 1]
,M ∈ N
, we dene a sequene of approximating hains assoiated withϕ
and(u M )
as afamily
(ξ M ) M ∈
Nof proessessuh that
ξ M
is adisrete hainof degreeM
dened on thesamestohastibasisas
u M
,providedthefollowingonditionsarefullledforh = h M := M r
tendingto zero:
(i) Initial ondition:
ξ M (n) = Λ h (ϕ(nh))
for alln ∈ {− M, . . . , 0 }
.(ii) Extended Markovproperty: for all
n ∈ N 0
, allx ∈ S h P M ξ M (n+1) = x
F nh M
= p M ξ M (n − M ), . . . , ξ M (n), u M (n), x .
(iii) Loal onsistenywith the drift oeient:
µ ξ M (n) := E M ξ M (n+1) − ξ M (n) F nh M
= h · b ξ ˜ nh M , u M (n)
+ o(h) =: h · b h ξ ˜ nh M , u M (n) .
(iv) Loal onsistenywith the diusion oeient:
E M ξ M (n+1) − ξ M (n) − µ ξ M (n) 2 F nh M
= h · σ 2 ( ˜ ξ M nh ) + o(h) =: h · σ h 2 ( ˜ ξ nh M ).
(v) Jump heights: thereis apositive number
N ¯
, independent ofM
, suh thatsup
n | ξ M (n + 1) − ξ M (n) | ≤ N ¯ p h M .
It is straightforward, under Assumptions (A3) and (A5), to onstrut a sequene of ex-
tendedtransitionfuntionssuhthatthe jumpheight andtheloalonsistenyonditions
arefullled.
We will represent the interpolatio n
ξ ˜ M
as a solution to an equation orresponding to equation(1) with ontrol proessu ˜ M
and initial onditionϕ
. Dene the disrete proess(L M (n)) n∈
N0
byL M (0) := 0
andξ M (n) = ϕ(0) +
n−1
X
i=0
h · b h ξ ˜ ih M , u M (i)
+ L M (n), n ∈ N .
Observe that
L M
is a martingale in disrete time with respet to the ltration( F nh M )
.Setting
ε M 1 (t) :=
⌊ h t ⌋−1
X
i=0
h · b h ξ ˜ ih M , u ˜ M (ih)
− Z t
0
b ξ ˜ M s , u ˜ M (s)
ds, t ≥ 0,
the interpolated proess
ξ ˜ M
an be represented assolutiontoξ ˜ M (t) = ϕ(0) + Z t
0
b ξ ˜ s M , u ˜ M (s)
ds + L M ( ⌊ h t ⌋ ) + ε M 1 (t), t ≥ 0.
For the error termwehave
E M | ε M 1 (t) |
≤
⌊ h t ⌋−1
X
i=0
h E M
b h ξ ˜ ih M , u M (i)
− b ξ ˜ ih M , u M (i)
+ K · h
+
Z h⌊ h t ⌋
0
E M
b ξ ˜ h⌊ M s
h ⌋ , u ˜ M (s)
− b ξ ˜ s M , u ˜ M (s)
ds,
whih tends to zero as
M → ∞
uniformly int ∈ [0, T ]
by Assumptions (A2), (A3), dominatedonvergene andthe deningproperties of(ξ M )
. The disrete-timemartingaleL M
an be rewritten asdisrete stohastiintegral. Dene(W M (n)) n∈
N0
byW M (0) := 0
and
W M (n) :=
n−1
X
i=0
1
σ( ˜ ξ ih M ) L M (i+1) − L M (i)
, n ∈ N .
Usingthe pieewiseonstant interpolation
W ˜ M
ofW M
, the proessξ ˜ M
an be expressedassolutionto
(11)
ξ ˜ M (t) = ϕ(0) + Z t
0
b ξ ˜ M s , u ˜ M (s) ds +
Z t 0
σ( ˜ ξ h⌊ M s −
h ⌋ ) d W ˜ M (s) + ε M 2 (t)
for
t ≥ 0
, where the error terms(ε M 2 )
onverge to zeroas(ε M 1 )
before.Wearenowpreparedfortheonvergeneresult,whihshouldbeomparedtoTheorem
10.4.1in KushnerandDupuis(2001:p.290). Theproofissimilarto thatof Proposition 1.
We merelypoint out the main dierenes.
Proposition 2. Assume (A1)(A5). For eah
M ∈ N
, letτ M
be a stopping time withrespet to the
σ
-algebra generated by( ˜ ξ M (s), u ˜ M (s), W ˜ M (s))
,s ≤ t
. If( ˜ ξ 0 M )
onvergesto
ϕ
in the uniform topology, then(( ˜ ξ M , R M , W ˜ M , τ M )) M ∈
N istight. For any limit point(X, R, W, τ )
deneF t := σ X(s), R(s), W (s), τ 1 {τ≤t} , s ≤ t
, t ≥ 0.
Then
W
is an( F t )
-adapted Wiener proess,τ
is an( F t )
-stopping time,(R, W )
is anadmissible relaxed ontrol, and
X
is a solution to (6) under(R, W )
withinitial onditionϕ
.Proof. For the rst part, the only dierene is the proof of tightness for
( ˜ W M )
and theidentiationofthelimitpoints. Wealulatetheorderofonvergeneforthedisrete-time
previsiblequadrati variations of
(W M )
:h W M i n =
n−1
X
i=0
E (W M (i+1) − W M (i)) 2 F ih M
= nh + o(h)
n−1
X
i=0
1 σ 2 ( ˜ ξ ih M )
for all
M ∈ N
,n ∈ N 0
. Taking into aount Assumption (A5) and the denition of the time-ontinuous proessesW ˜ M
, we see thath W ˜ M i
tends toId [0,∞)
in probability forM → ∞
. By Theorem VIII.3.11 of Jaod and Shiryaev (1987:p.432) we onludethat
( ˜ W M )
onverges weakly to a standard Wiener proessW
. ThatW
has independent inrements with respet to the ltration( F t )
an be seen by onsidering the rst andseond onditional moments of the inrements of
W M
for eahM ∈ N
and applying the onditionson loalonsisteny and the jumpheightsof(ξ M )
.ByvirtueofSkorohod'stheorem,wemayagainworkunder
P
-almostsureonvergene.Theremaining slightly dierent partis the identiation of
X
assolution to equation (6)under
(R, W )
withinitialonditionϕ
. NotiethatX
isontinuouson[0, ∞ )
beauseoftheonditionon the jumps of the
ξ M
, f.Theorem 3.10.2 in Ethier and Kurtz (1986:p.148).Letus dene àdlàgproesses
C M , C
on[0, ∞ )
byC M (t) := ϕ M (0) +
Z t 0
b ξ ˜ s M , u ˜ M (s)
ds + ε M 2 (t), t ≥ 0, C(t) := ϕ(0) +
Z
Γ×[0,t]
b(X s , γ) dR(γ, s), t ≥ 0.
We then infer that
C M → C
in up as before. Dene operatorsF M
, mapping àdlàgproesses to àdlàgproesses,by
F M (Y )(t) := σ
[ − r, 0] ∋ s 7→
Y h ⌊ h t ⌋ +s
if
t ≥ − s, ξ ˜ M h ⌊ h t ⌋ +s
else
, t ≥ 0,
anddene
F
asintheproofofProposition1. Denoteby( ˆ X(t)) t≥−r
theuniquesolutiontoequation(6)under ontrol
(R, W )
with initialonditionϕ
. Assumption(A4),the uniformonvergene of
( ˜ ξ M 0 )
toϕ
and the right-ontinui ty ofϕ
imply thatF M ( ˆ X)
onverges toF( ˆ X)
in up. NotiethatX ˆ
isontinuouson[0, ∞ )
assolutionto (6).ξ ˜ M
solvesξ ˜ M (t) = C M (t) + Z t
0
F M ( ˜ ξ M )(s − ) d W ¯ M (s), t ≥ 0,
wherewehavetakenaontinuousmartingaleinterpolatio n
W ¯ M
ofW M
insteadofW ˜ M
asintegratorin the stohastiintegral, whihyields anidentialresultsine the integrandis
a pure jump proess with jump times at
kh
,k ∈ N 0
.( ¯ W M )
also onverges toW
in up,suh that by Exerise IV.4.14 in Revuz and Yor (1999) onvergene in the semimartin-
galetopology follows. We onlude again byTheorem V.15 in Protter (2003) that
(X M )
onvergesto
X ˆ
in upand thatX
andX ˆ
areindistinguishable.5 Convergene of the minimal osts
Theobjetivebehindtheintrodutionofsequenes ofapproximatinghainswastoobtain
adevieforapproximatingthevaluefuntion
V
oftheoriginalproblem. Theideanowistodene,foreahdisretizationdegree
M ∈ N
,adisreteontrolproblemwithostfuntionalJ M
sothatJ M
isanapproximationofthe ostfuntionalJ
oftheoriginalproblemin thefollowing sense: Given an initial segment
ϕ ∈ D 0
and a sequene of disrete admissibleontrols
(u M )
suh that(˜ u M )
weakly onverges tou ˜
, we haveJ M (ϕ, u M ) → J (ϕ, u) ˜
asM → ∞
. Under the assumptions introdued above, it will follow that also the value funtions assoiated with the disrete ost funtionals onverge to the value funtion ofthe originalproblem.
Fix
M ∈ N
, and leth := M r
. Denote byU ad M
the set of disrete admissible ontrols ofdegree
M
. Denethe ost funtionalof degreeM
by(12)
J M ϕ, u := E
N h −1
X
n=0
exp( − βnh) · k ξ(n), u(n)
· h + g ξ(N h )
!
,
where
ϕ ∈ D 0
,u ∈ U ad M
is dened on the stohasti basis(Ω, F , ( F t ), P)
and(ξ(n))
is adisrete hain of degree
M
dened aording top M
andu
with initial onditionϕ
. Thedisreteexit time step
N h
is given by(13)
N h := min { n ∈ N 0 | ξ(n) ∈ / I h } ∧ ⌊ T h ¯ ⌋ .
Denote by
˜ τ M := h · N h
the exit time for the orresponding interpolated proesses. The valuefuntion of degreeM
isdened as(14)
V M (ϕ) := inf
J ϕ, u u ∈ U ad M , ϕ ∈ D 0 .
We are now in a position to state the result about onvergene of the minimal osts.
Proposition 3 and Theorem 2 are omparable to Theorems 10.5.1 and 10.5.2 in Kushner
andDupuis (2001:pp.292-295).
Proposition 3. Assume (A1)(A5). If the sequene
( ˜ ξ M , u ˜ M , W ˜ M , τ ˜ M )
of interpolated proesses onverges weakly to a limit point(X, R, W, τ )
, thenX
is a solution to equation(6)under relaxed ontrol
(R, W )
withinitialonditionϕ
,τ
isthe exit timeforX
as givenby (2), andwe have
J M (ϕ, u M ) M −→ →∞ J ˆ (ϕ, R).
Proof. The onvergene assertion for the ostsisa onsequene of Proposition 2,the fat
that,byvirtueofAssumption(A5),the exittime
τ ˆ
denedin(10)isSkorohod-onti nuous, andthe denition ofJ M
andJ
(orJ ˆ
).Theorem2. Assume(A1)(A5). Thenwehave
lim M→∞ V M (ϕ) = V (ϕ)
for allϕ ∈ D 0
.Proof. First notie that
lim inf M→∞ V M (ϕ) ≥ V (ϕ)
as a onsequene of Proposition 2.In order to show
lim sup M→∞ V M (ϕ) ≤ V (ϕ)
hoose a relaxed ontrol(R, W )
so thatJ(ϕ, R) = ˆ V (ϕ)
by Proposition 1. Givenε > 0
, one an onstrut a sequene of disreteadmissibleontrols
(u M )
suhthat(( ˜ ξ M , u ˜ M , W ˜ M , τ ˜ M ))
isweaklyonvergent,where( ˜ ξ M )
,( ˜ W M )
,(˜ τ M )
areonstruted asabove,andlim sup M →∞ | J M (ϕ, u M ) − J ˆ (ϕ, R) | ≤ ε
. Theexisteneofsuhasequeneofdisreteadmissibleontrolsisguaranteed,f.the disussion
at the end of Setion 3. By denition,
V M (ϕ) ≤ J M (ϕ, u M )
for eahM ∈ N
. Using Proposition 3 we ndthatlim sup
M→∞
V M (ϕ) ≤ lim sup
M→∞
J M (ϕ, u M ) ≤ V (ϕ) + ε,
andsine
ε
wasarbitrary, the assertion follows.AllainBensoussan,GiuseppeDaPrato,MihelC.Delfour,andSanjoyK.Mitter.Represen-
tation and Control of Innite-Dimensional Systems , volume 1 of Systems& Control.
Birkhäuser, Boston,1992.
Patrik Billingsley. Convergene of probability measures. John Wiley & Sons, NewYork,
2nd edition, 1999.
RaoufBouekkine, Omar Liandro, LuisA. Puh, and Fernandodel Rio. Vintageapital
and the dynamis of the AKmodel. Journalof Eonomi Theory ,120:3972,2005.
Evelyn Bukwar. Introdution to the numerial analysis of stohasti delay dierential
equations. J. Comput.Appl. Math., 125(1-2):297307, 2000.
Antonella Calzolari, Patrik Florhinger, and Giovanna Nappo. Convergene in nonlin-
ear ltering for stohasti delay systems. Tehnial report, Università di Roma La
Sapienza, 2003.
IsmailElsanosi,BerntØksendal,andAgnèsSulem. Somesolvablestohastiontrolprob-
lems with delay. Stohastis and Stohasti Reports,71(1-2):6989, 2000.
Stewart N. Ethier and Thomas G. Kurtz. Markov Proesses. Charaterization and Con-
vergene. John Wiley&Sons, NewYork,1986.
YaozhongHu,Salah-EldinA.Mohammed,andFengYan.Disrete-timeapproximationsof
stohastidelayequations: theMilstein sheme. Ann. Probab.,32(1A):265314, 2004.
JeanJaodand AlbertN.Shiryaev. Limit Theorems for Stohasti Proesses, volume 288
of Grundlehren der mathematishen Wissenshaften. Springer, Berlin,1987.
HaroldJ.Kushner.WeakConvergeneMethodsandSingularlyPerturbedStohastiControl
and Filtering Problems ,volume 3 ofSystems& Control. Birkhäuser,Boston, 1990.
HaroldJ. Kushnerand Paul Dupuis. Numerial Methods for Stohasti Control Problems
in Continuous time . Springer, NewYork,2nd edition, 2001.
Bjørnar Larssen. Dynami programmin g in stohasti ontrol of systems with delay.
Stohastis and Stohastis Reports,74(3-4):651673, 2002.
XuerongMao. Stohasti Dierential Equations andtheir Appliations. Horwood, Chih-
ester, 1997.
Salah-Eldi n A.Mohammed. Stohasti funtional dierential equations. Pitman Publish-
ing, London, 1984.
Bernt ØksendalandAgnès Sulem. A maximum priniplefor optimal ontrol of stohasti
systems with delay, with appliations to nane. In J.L. Menaldi, E. Rofman, and
A. Sulem, editors, Optimal Control and PDE., pages 6479. IOS Press, Amsterdam,
2001.
PhilipE. Protter. Stohasti Integrationand Dierential Equations. Springer,Berlin,2nd
edition, 2003.
Grundlehrender MathematishenWissenshaften. Springer,Berlin,3rdedition,1999.
Jiongmin Yong and Xun Yu Zhou. Stohasti Controls. Hamiltonian Systems and HJB
Equations. Springer, NewYork,1999.