
Theoretical Background

3.4 Markov chains

Stochastic processes capture the probabilistic notion of a dynamic system evolving in time. Classical time-discrete dynamical systems are represented by maps f that evolve their next state x_{n+1} = f(x_n) deterministically, depending solely on their current state x_n (the very definition of state).

Arguably, a Markov chain is what comes closest to a map for time-discrete stochastic processes, as its state X_{n+1} depends solely on the last state X_n (rather than on the whole past of the process).


3.4.1 Markov chains on countable spaces

Definition 3.23 ([123, 127, 128]). Let S be a countable set, and let X = {X_n : n ∈ N} be a discrete-time stochastic process with values in S. Such a process X is a Markov chain if its random state X_n for each n > 0 depends on the past only through the previous state X_{n-1} (Markov property), that is

P{X_n = j | X_0 = i_0, …, X_{n-1} = i_{n-1}} = P{X_n = j | X_{n-1} = i_{n-1}}

for all n > 0 and j, i_0, …, i_{n-1} ∈ S. For all x, y ∈ S, let P(x, y) denote the transition probability from state x to state y,

P(x, y) = P{X_n = y | X_{n-1} = x},

such that P(x, y) ≥ 0 and ∑_{z ∈ S} P(x, z) = 1 for x, y ∈ S. Recursively define the n-step transition matrix as

P^n(x, z) = ∑_{y ∈ S} P(x, y) P^{n-1}(y, z)

with P^0 defined as the identity, P^0(y, z) = δ_{yz}, so that for all x, y ∈ S, P^n(x, y) = P{X_n = y | X_0 = x}. The probability measure µ(x) = P{X_0 = x} is the initial distribution of the chain.
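For a finite state space, the recursion above is just repeated matrix multiplication. A minimal Python sketch (the two-state kernel below is an illustrative assumption, not from the text):

```python
import numpy as np

# Hypothetical two-state kernel; row x holds the distribution of the next state.
P = np.array([[0.9, 0.1],
              [0.4, 0.6]])

def n_step(P, n):
    """n-step transition matrix via P^n(x,z) = sum_y P(x,y) P^{n-1}(y,z)."""
    Pn = np.eye(P.shape[0])  # P^0 is the identity (Kronecker delta)
    for _ in range(n):
        Pn = Pn @ P
    return Pn

P3 = n_step(P, 3)
# Every row of P^n is again a probability distribution: rows sum to 1.
assert np.allclose(P3.sum(axis=1), 1.0)
```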

Theorem 3.24 ([128]). Let S be a countable space, let µ : S → [0, 1] be an initial probability measure on S, and further let P(x, y) be transition probabilities such that P(x, y) ≥ 0 and ∑_{z ∈ S} P(x, z) = 1 for x, y ∈ S. Then there is a Markov chain X_n such that

P{X_n = y | X_{n-1} = x, …, X_0 = x_0} = P(x, y) for n > 0 and x, y, x_0 ∈ S,

and P{X_0 = x_0} = µ(x_0) for x_0 ∈ S.

Theorem 3.25 ([123, p. 8]). Let S be a countable space, let S′ be a Polish space, let f : S × S′ → S be measurable, and let X_n be a time-discrete stochastic process on S such that

X_n = f(X_{n-1}, Y_n), n > 0,

with Y_1, Y_2, … iid random variables with values in S′ independent of X_0. Then X_n is a Markov chain with transition probabilities P(x, y) = P{f(x, Y_1) = y}.
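Theorem 3.25 is constructive: drawing iid noise Y_n and applying a fixed map f simulates a Markov chain. A sketch under assumed values (the map f, the state space {0, 1}, and the flip probability p = 0.3 are all made up for illustration):

```python
import random

def f(x, y):
    """Illustrative map f : S x S' -> S on S = {0, 1}: flip the state when y == 1."""
    return x if y == 0 else 1 - x

def simulate(x0, n, p=0.3, seed=1):
    """Run X_n = f(X_{n-1}, Y_n) with Y_n iid Bernoulli(p)."""
    rng = random.Random(seed)
    x, path = x0, [x0]
    for _ in range(n):
        y = 1 if rng.random() < p else 0
        x = f(x, y)
        path.append(x)
    return path

# Here P(x, y) = P{f(x, Y_1) = y} gives P(0,1) = P(1,0) = p = 0.3.
path = simulate(0, 10_000)
flips = sum(a != b for a, b in zip(path, path[1:]))
est = flips / (len(path) - 1)  # empirical flip frequency, close to p
```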

3.4.2 Markov chains on general spaces

Theorem 3.26 ([128]). Let (S, S) be a Polish space (in fact, S could be any space endowed with a countably generated σ-algebra S). Let µ be a probability measure on S, and let P(x, B) be a probability kernel for all x ∈ S, B ∈ S. Then there exists a discrete-time stochastic process X_n such that for n > 0 and B_0, …, B_n ∈ S

P{X_0 ∈ B_0, …, X_n ∈ B_n} = ∫_{x_0 ∈ B_0} ⋯ ∫_{x_{n-1} ∈ B_{n-1}} µ(dx_0) P(x_0, dx_1) ⋯ P(x_{n-1}, B_n)

and P{X_0 ∈ B_0} = µ(B_0) for B_0 ∈ S.


Definition 3.27 ([128]). Such a discrete-time stochastic process X_n is called a Markov chain on (S, S) with transition probability kernel P(x, B) and initial distribution µ. Recursively define the n-step transition probability kernel as

P^n(x, B) = ∫_S P(x, dy) P^{n-1}(y, B), x ∈ S, B ∈ S,

with P^0(x, B) = δ_x(B), so that for all x ∈ S and B ∈ S we have P{X_n ∈ B | X_0 = x} = P^n(x, B).

3.4.3 First passages and returns

The following definitions are formulated for Markov chains on general Polish spaces. The definitions easily transfer to Markov chains on a countable space by regarding single elements y instead of Borel sets B, where applicable.

Definition 3.28 ([127, 128]). Let X_n be a discrete-time stochastic process on a Polish space S, and let B ∈ S. The occupation number η_B is the (possibly infinite) random number of visits of X to B:

η_B = ∑_{n=1}^{∞} δ_{X_n}(B).

The event that the process visits the set B ∈ S infinitely often after starting at x ∈ S has the probability

Q(x, B) = P{η_B = ∞ | X_0 = x}.

For n > 0 define the first-passage-time probability (kernel) f_n : S × S → [0, 1] from state x ∈ S to set B ∈ S as the probability that n is the smallest i for which X_i ∈ B, given that X_0 = x:

f_n(x, B) = P{X_n ∈ B, X_{n-1} ∉ B, …, X_1 ∉ B | X_0 = x}

with f_1(x, B) = P(x, B). Furthermore, for n > 0, let F_n : S × S → [0, 1] be the probability (kernel) that the process starting at x ∈ S visits a set B ∈ S between times 1 and n, inclusive:

F_n(x, B) = ∑_{i=1}^{n} f_i(x, B).

The first return time τ_B is the random time after 0 when the process first enters B (or when it first returns to B, if X_0 ∈ B):

τ_B = min{n > 0 : X_n ∈ B}.

Given that the process starts in x, the probability distribution of τ_B is

P{τ_B = n | X_0 = x} = f_n(x, B).

For x ∈ S and B ∈ S, define the return probability as the probability to return to B (in finite time) when starting in x:

L(x, B) = P{τ_B < ∞ | X_0 = x} = ∑_{n=1}^{∞} P{τ_B = n | X_0 = x} = F(x, B),

where F(x, B) = lim_{n→∞} F_n(x, B).
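For a finite chain, the first-return distribution f_n(x, B) and the return probability L(x, B) can be estimated by sampling τ_B. A simulation sketch, reusing a hypothetical two-state kernel (all numbers are illustrative assumptions):

```python
import random

# Hypothetical two-state kernel: P[x][y] is the transition probability x -> y.
P = [[0.9, 0.1],
     [0.4, 0.6]]

def first_return_time(x0, B, rng, max_steps=10_000):
    """Sample tau_B = min{n > 0 : X_n in B} starting from X_0 = x0."""
    x = x0
    for n in range(1, max_steps + 1):
        x = 0 if rng.random() < P[x][0] else 1
        if x in B:
            return n
    return None  # no return observed within max_steps

rng = random.Random(0)
samples = [first_return_time(0, {0}, rng) for _ in range(5_000)]
# L(0, {0}) = P{tau_B < infinity | X_0 = 0}; this chain is finite and
# irreducible, so the empirical return fraction should be (close to) 1.
L_est = sum(s is not None for s in samples) / len(samples)
```

For this kernel the exact mean return time is E[τ_{{0}} | X_0 = 0] = 0.9·1 + 0.1·(1 + 1/0.4) = 1.25, which the sample mean reproduces.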


Definition 3.29 ([128]). Let X_n be a Markov chain on a Polish space S with n-step transition probability kernel P^n. Define the auxiliary kernel U : S × S → [0, ∞] as

U(x, B) = ∑_{n=1}^{∞} P^n(x, B), x ∈ S.

For all x ∈ S and B ∈ S, the expected number of visits to B after starting at x is E[η_B | X_0 = x] = U(x, B).
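On a finite state space, U restricted to the transient states of an absorbing chain has a closed form: with Q the substochastic block among the transient states, ∑_{n≥1} Q^n = (I − Q)^{−1} − I. A sketch with a made-up three-state chain (one absorbing state):

```python
import numpy as np

# Hypothetical absorbing chain on {0, 1, 2}; state 2 is absorbing.
P = np.array([[0.5, 0.3, 0.2],
              [0.2, 0.5, 0.3],
              [0.0, 0.0, 1.0]])

Q = P[:2, :2]                      # substochastic block on transient states {0, 1}
N = np.linalg.inv(np.eye(2) - Q)   # fundamental matrix: sum_{n>=0} Q^n
U = N - np.eye(2)                  # U(x, {y}) = sum_{n>=1} P^n(x, {y})
# All entries of U are finite, confirming that {0, 1} is uniformly transient here.
```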

3.4.4 Irreducibility

Irreducibility of a Markov chain guarantees that the chain eventually visits all regions of its state space:

Definition 3.30 ([128]). Let S be a Polish space. A Markov chain X_n on S is ϕ-irreducible if there is a measure ϕ on S such that for all x ∈ S, B ∈ S:

ϕ(B) > 0 ⇒ L(x, B) > 0.

Theorem 3.31 ([128]). Let X_n be a Markov chain on a Polish space S. The following statements are equivalent:

1. X is ϕ-irreducible.

2. ϕ(B) > 0 ⇒ U(x, B) > 0 for all x ∈ S, B ∈ S.
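For a finite state space with ϕ the counting measure, ϕ-irreducibility amounts to U(x, {y}) > 0 for all x, y, i.e. strong connectivity of the transition graph. A reachability sketch (the example kernels are made up):

```python
import numpy as np

def is_irreducible(P):
    """Check U(x, {y}) > 0 for all x, y: y reachable from x in some n >= 1 steps.
    Paths of length 1..k (k = number of states) suffice for reachability."""
    k = P.shape[0]
    A = (P > 0).astype(int)   # adjacency matrix of the transition graph
    reach = A.copy()
    power = A.copy()
    for _ in range(k - 1):
        power = (power @ A > 0).astype(int)
        reach = reach | power
    return bool((reach > 0).all())

P_irred = np.array([[0.0, 1.0],
                    [0.5, 0.5]])
P_red = np.array([[1.0, 0.0],
                  [0.5, 0.5]])   # state 0 is absorbing: state 1 is unreachable
```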

Theorem 3.32 ([128]). Let X be a ϕ-irreducible Markov chain on a Polish space S for some measure ϕ. Then there exists an “essentially unique maximal” irreducibility measure ψ on S such that

1. X is ψ-irreducible.

2. ψ(B) = 0 ⇒ ψ{x ∈ S : L(x, B) > 0} = 0 for all B ∈ S.

3. ψ(S \ B) = 0 ⇒ B = B′ ∪ N with ψ(N) = 0 and P(x, B′) = 1 for all x ∈ B′ (B′ is absorbing).

Definition 3.33 ([128]). A Markov chain X is ψ-irreducible if it is ϕ-irreducible for some measure ϕ and if the measure ψ is a maximal measure according to the preceding theorem. Define the family of sets of positive ψ-measure as

S+ = {B ∈ S : ψ(B) > 0}.

The set S+ is the same for different maximal irreducibility measures, and hence S+ is well-defined [128]. For a countable state space S, the maximal irreducibility measure is the counting measure.

3.4.5 Transience and recurrence

Recurrence is a weak notion of stability of a Markov chain X. A recurrent chain X visits every set of positive measure infinitely often. In contrast, a transient chain visits bounded sets only a finite number of times, and eventually leaves any such set. Specifically, we consider recurrence and transience in terms of the occupation number random variable η_B.


Definition 3.34 ([128]). Let X be a Markov chain on a Polish space S. A set B ∈ S is uniformly transient if there exists an upper bound M < ∞ such that U(x, B) ≤ M for all x ∈ B. A set B ∈ S is recurrent if U(x, B) = ∞ for all x ∈ B. A set B ∈ S is transient if there is a countable cover of B by uniformly transient sets.

Definition 3.35 ([128]). Let X be a ψ-irreducible Markov chain on a Polish space S. The chain X is recurrent if every set B ∈ S+ is recurrent. The chain X is transient if S is transient.

Theorem 3.36 ([128]). Let X be a ψ-irreducible Markov chain on a Polish space S. Then X is either recurrent or transient.

Definition 3.37 ([128]). Let X be a Markov chain on a Polish space S. A set B ∈ S is Harris recurrent if Q(x, B) = 1 for all x ∈ B. The chain X is Harris recurrent if it is ψ-irreducible and every set B ∈ S+ is Harris recurrent (or, equivalently, if L(x, B) = 1 for all x ∈ S and B ∈ S+).

Hence, Harris recurrence is stronger than recurrence: the expected number of visits to a recurrent set is infinite, while a Harris recurrent set is visited infinitely often almost surely.
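The simple random walk on Z is the standard illustration of the recurrence/transience dichotomy: it is recurrent for p = 1/2 and transient otherwise. The following simulation only suggests the distinction (no finite run can prove recurrence); the horizon, trial count, and p-values are illustrative choices:

```python
import random

def returned(p, horizon, rng):
    """Does S_n = S_{n-1} +/- 1 (step +1 with prob. p) return to 0 by `horizon`?"""
    s = 0
    for _ in range(horizon):
        s += 1 if rng.random() < p else -1
        if s == 0:
            return True
    return False

rng = random.Random(42)
trials = 2_000
frac_sym = sum(returned(0.5, 1_000, rng) for _ in range(trials)) / trials
frac_bias = sum(returned(0.9, 1_000, rng) for _ in range(trials)) / trials
# frac_sym approaches 1 as the horizon grows (recurrent case), while
# frac_bias stays bounded away from 1 (transient case: L(0, {0}) = 0.2 here).
```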

Theorem 3.38 ([128]). Let X be a recurrent Markov chain on a Polish space S. Then

S = H ∪ N

with an absorbing and nonempty set H and a transient set N with ψ(N) = 0. Every subset of H in S+ is Harris recurrent.

The theorem implies that the restriction of a recurrent chain X to H differs from the original chain only by a ψ-null set. At the same time, the restriction to H yields stronger stability results in terms of Harris recurrence. For a countable state space S, the set N is empty: a recurrent chain on a countable state space is also Harris recurrent.

3.4.6 Stochastic recursive sequences

Stochastic recursive sequences generalize the notion of Markov chains to discrete-time stochastic processes. Rather than by a sequence of iid random variables, they are driven by an arbitrary random sequence:

Definition 3.39 ([129, p. 507]). Let S and S′ be two Polish spaces. Let ξ_n be a sequence of random elements on S′. Let f : S × S′ → S be a deterministic measurable function. A time-discrete stochastic process X_n on S is a stochastic recursive sequence driven by the sequence ξ_n if X_n satisfies the relation

X_n = f(X_{n-1}, ξ_n), n > 0,

with X_0 independent of ξ_n.

As Borovkov [129, p. 17] points out, each Markov chain is a stochastic recursive sequence driven by iid ξ_n. Furthermore, there is a notion of renovating events of the process X_n, from which on only the driving sequence ξ_n determines the evolution of the process, rather than the states X_n before the event. The notion of renovating events is weaker than renewals, but nevertheless allows one to infer long-term behavior and ergodic properties. [129]
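A classical stochastic recursive sequence is Lindley's waiting-time recursion with f(w, ξ) = max(0, w + ξ). When the driving increments ξ_n are dependent (here, an illustrative autoregressive sequence, not from the text), X_n is still an SRS although it need not be a Markov chain:

```python
import random

def lindley(xs, w0=0.0):
    """Stochastic recursive sequence X_n = f(X_{n-1}, xi_n) with
    f(w, xi) = max(0, w + xi) (Lindley's recursion)."""
    w, path = w0, [w0]
    for xi in xs:
        w = max(0.0, w + xi)
        path.append(w)
    return path

rng = random.Random(7)
# Illustrative driving sequence: dependent (autoregressive) increments with
# negative mean, so the recursion keeps returning to 0.
xi, xs = 0.0, []
for _ in range(1_000):
    xi = 0.5 * xi + rng.uniform(-1.0, 0.5)
    xs.append(xi)

path = lindley(xs)
```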

4 Discrete-Event Systems