
6.3 Bounds on the total variation distance between the finite time and asymptotic degree distributions

6.3.2 The general case

Let $S_J(\omega) = S_{J_T}(\omega)$ as in the pure birth case. Furthermore, let, for the time being, the individuals be numbered as in the proof of Theorem 4.3.1, and let

\[
U(\omega) = 1_{\{J_T(\omega) < Y_T(\omega)\}}\,F\bigl(A_{J_T}(T,\omega)\bigr) + 1_{\{J_T(\omega) = Y_T(\omega)\}}\,\bar F\bigl(A_{J_T}(T,\omega)\bigr),
\]
where
\[
F(t) = \frac{1 - e^{-\lambda t} - e^{-(\lambda-\mu)T}\bigl(1 - e^{-\mu t}\bigr)}{1 - e^{-(\lambda-\mu)T}}
\qquad\text{and}\qquad
\bar F(t) = \frac{\lambda\bigl(1 - e^{-\mu t}\bigr) - \mu\bigl(1 - e^{-\lambda t}\bigr)}{\lambda - \mu}\,1_{[0,T)}(t) + 1_{[T,\infty)}(t).
\]
Then $A_{J_T} = A$, where $A$ is the random variable from Corollary 4.3.3, i.e. $A = 1_{\{J_T < Y_T\}}F^{-1}(U) + 1_{\{J_T = Y_T\}}(\bar F)^{-1}(U)$ (cf. the proof of Corollary 4.3.3). Moreover, let $A_{J_\infty} = Z$, where $Z$ is the random variable from Corollary 4.3.3, i.e. $A_{J_\infty}(\omega) = F_\infty^{-1}(U(\omega))$, where $F_\infty$ denotes the distribution function of the $\mathrm{Exp}(\lambda)$ distribution. Note that, given $Y_T > 0$, we have $A_{J_\infty} \sim \mathrm{Exp}(\lambda)$.
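This construction is an inverse transform coupling: both ages are driven by the same uniform variable $U$, so they are close as soon as the two distribution functions are close. The following minimal Python sketch (ours; the function names are our own, the generic bisection inverter is only an illustration, and F_bar uses the distribution function as reconstructed above) couples a finite-time age draw with a draw from the limiting $\mathrm{Exp}(\lambda)$ distribution.

    import math, random

    def quantile(cdf, u, lo=0.0, hi=1e3, tol=1e-10):
        """Generalized inverse cdf^{-1}(u) by bisection; cdf must be nondecreasing."""
        while hi - lo > tol:
            mid = 0.5 * (lo + hi)
            if cdf(mid) < u:
                lo = mid
            else:
                hi = mid
        return 0.5 * (lo + hi)

    lam, mu, T = 1.0, 0.5, 10.0

    def F_inf(t):   # distribution function of Exp(lambda), the asymptotic age
        return 1.0 - math.exp(-lam * t)

    def F_bar(t):   # second age distribution function, with an atom at t = T
        if t >= T:
            return 1.0
        return (lam * (1 - math.exp(-mu * t)) - mu * (1 - math.exp(-lam * t))) / (lam - mu)

    u = random.random()                    # one shared uniform drives both draws
    age_T   = quantile(F_bar, u, hi=T)     # coupled age at the finite time T
    age_inf = quantile(F_inf, u)           # coupled asymptotic age
    print(age_T, age_inf)                  # the draws agree closely for large T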

Recall the definition of the parameter random variable $\Lambda_T$ from (6.6), and let $\bar\Lambda_T$ be a random variable with $\mathcal L(\bar\Lambda_T) = \mathcal L(\Lambda_T \mid Y_T > 0)$ as before. Moreover, set

\[
M := \frac{\alpha}{\beta+\mu}\,\bigl(S_J + E(S)\bigr)\bigl(1 - e^{-(\beta+\mu)A_{J_\infty}}\bigr), \qquad (6.15)
\]
and let $\bar M$ be a random variable with $\mathcal L(\bar M) = \mathcal L(M \mid Y_T > 0)$.
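For intuition, the asymptotic mixing variable $M$ is straightforward to sample once $S_J$ and $A_{J_\infty}$ are available, and a degree drawn from MixPo($M$) is then simply a Poisson draw with the sampled random mean. A minimal sketch (ours; the parameter values are arbitrary illustrations, and $S$ is taken deterministic):

    import math, random

    alpha, beta, lam, mu = 1.0, 1.0, 1.0, 0.5
    ES = 1.0                                   # E(S); here S is deterministic, S = 1

    def sample_M():
        S_J = 1.0                              # social index of the picked node
        A = random.expovariate(lam)            # asymptotic age A_{J_infty} ~ Exp(lambda)
        return alpha / (beta + mu) * (S_J + ES) * (1.0 - math.exp(-(beta + mu) * A))

    def sample_mixpo_degree():
        """One degree from MixPo(M): draw the random mean M, then a Poisson count."""
        m, k, t = sample_M(), 0, random.expovariate(1.0)
        while t <= m:                          # unit-rate Poisson process on [0, m]
            k += 1
            t += random.expovariate(1.0)
        return k

    degrees = [sample_mixpo_degree() for _ in range(10000)]
    print(sum(degrees) / len(degrees))         # empirical mean of the degree distribution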

We already know that MixPo($\bar\Lambda_T$) is the degree distribution at time $T$ in the general case. Theorem 6.3.4 below implies that MixPo($\bar\Lambda_T$) converges at a rate of just a bit over $e^{-\frac{1}{6}(\lambda-\mu)T}$ to the MixPo($\bar M$) distribution, which is the asymptotic degree distribution stated in Section 3.2 of [BL10], as $T \to \infty$. Note again that this theorem is much more powerful since it gives an exact bound on the total variation distance for finite $T$.

6.3.4 Theorem

Let $\sigma_S < \infty$ be the standard deviation of $S$. Then for $T \ge \frac{2\log\bigl(4\lambda(\lambda-\mu)^{-1}\bigr)}{\lambda-\mu}$, we have
\[
\begin{aligned}
d_{TV}\bigl(\mathrm{MixPo}(\bar\Lambda_T),\,\mathrm{MixPo}(\bar M)\bigr)
&\le \alpha\left(\frac{5\sqrt{2}}{6}\,\frac{\lambda}{\lambda-\mu} + \frac{2}{5} + \frac{\beta+\mu}{2}\left(\frac{229}{5(\lambda-\mu)} + \frac{2}{\lambda+\mu}\right)E(S) + \frac{27}{10}\sqrt{2}\,\sigma_S\right)(\lambda-\mu)\,T^2 e^{-\frac{1}{6}(\lambda-\mu)T} \\
&\quad + \alpha\beta\left(\frac{1}{2} + \frac{\mu}{4\beta}\left(\frac{\mu}{\lambda} + 8\right)\left(\mu + \frac{4}{\beta}\right)\frac{\lambda^3(\lambda+\mu)}{(\lambda-\mu)^3} + 6T + \frac{\sqrt{6}}{2} + 3\beta(\mu T + 3) + \frac{5\lambda^2}{\beta^2}\right)E(S)\,T^2 e^{-(\lambda-\mu)T} \\
&\quad + \frac{3}{2}\,\alpha\lambda\sigma_S\,T^2 e^{-(\lambda-\mu)T}.
\end{aligned}
\]

Note that the right-hand side is of order $O\bigl(T^2 e^{-\frac{1}{6}(\lambda-\mu)T}\bigr)$ as $T \to \infty$.

The main idea of the proof of this theorem is the same as in the pure birth case: we make use of Theorem 3.4.1 to obtain $E(|\Lambda_T - M| \mid Y_T > 0)$ as an upper bound for $d_{TV}(\mathrm{MixPo}(\bar\Lambda_T), \mathrm{MixPo}(\bar M))$ and establish a further bound for this expected value. In order to do so, we use that the expected value of the first summand of the right-hand side of (6.6) converges quickly to zero, since the time $T - T_{B_T + D_T}$ since the last event before $T$ converges quickly to zero, and we compare the remaining summand of the right-hand side of (6.6) with the right-hand side of (6.15). For this comparison, we make vital use of the fact that the average of the social indices of the nodes living at time $T$ is close to $E(S)$ by the Law of Large Numbers, and that, given $T_l$ for some large $l \in \mathbb N$, the proportion of the nodes living at time $T_l$ that survive up to time $T$ is approximately $e^{-\mu(T - T_l)}$, again by the Law of Large Numbers (see Lemma 6.3.19). The latter approximation is illustrated by the simulation sketch below.
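As a quick plausibility check of this survival approximation, one can exploit the memorylessness of the exponential lifetimes (a minimal sketch of ours; the population size and times are arbitrary illustrative choices, not values from the proof):

    import math, random

    mu = 0.5
    T_l, T = 4.0, 7.0        # an event time T_l and the horizon T
    n = 10000                # number of nodes alive at time T_l

    # A node alive at T_l survives to T iff its residual Exp(mu) lifetime exceeds T - T_l.
    survivors = sum(random.expovariate(mu) > T - T_l for _ in range(n))
    print(survivors / n, math.exp(-mu * (T - T_l)))   # empirical vs. e^{-mu (T - T_l)}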

A further important ingredient is that the reciprocal of the node process $(Y_t)_{t\ge0}$ conditioned on survival is a supermartingale (see Corollary 6.3.9), which makes it easy to deal with the expected value of its maximum. Finally, we also use that the age $A_{J_T}(T)$ of the randomly picked individual converges quickly to $A_{J_\infty}$ by Corollary 4.3.3.

The fact that nodes may die complicates the procedure considerably, since the population size after a fixed number of events is random in this case and additional dependencies have to be treated (e.g. the inter-event times depend on the random population size at the previous event time).

In order to cope with these additional dependencies, we introduce a deterministic number $\kappa(T)$ that depends on $T$ in such a way that the probability of more than $\kappa(T)$ events up to time $T$ decreases exponentially in $T$ (see Definition 6.3.11 and Lemma 6.3.12(ii) below). Essentially, we substitute the number of events $B_T + D_T$ up to time $T$ by $\kappa(T)$ in the proof of Lemma 6.3.21 below, since it is difficult to treat the dependencies between the number of events up to time $T$ and the event times.

In order to cope with additional dependencies on the random index $J_T$, we note that the probability that the node that is randomly picked at time $T$ was born up to time $T/2$ decreases exponentially in $T$ (see Lemma 6.3.12(i) below). Thus we essentially only have to consider the time interval $[T/2, \infty)$ instead of the interval $[T_{r(J_T)}, \infty)$, where $T_{r(J_T)}$ is the birth time of the randomly picked individual. The choice of $T/2$ as the left endpoint of the interval makes sure that we always have a large number of individuals in the time interval with high probability.

Some lemmas of general interest

Before we prove Theorem 6.3.4 in detail, we formulate several important results that are used in the proof and could also be useful in other situations. Together with the core results from Chapter 2 and the more specialized results in the next paragraph, the reader obtains a comprehensive body of knowledge on linear birth and death processes.

Our first result gives an expression for the extinction probability given that the process survives up to time $T$. In order to simplify the notation, we define $Y_\infty := \lim_{t\to\infty} Y_t$. Note that $Y_\infty \in \{0, \infty\}$ almost surely.

6.3.5 Lemma

For the conditioned extinction probability given $Y_T > 0$, we have
\[
P(Y_\infty = 0 \mid Y_T > 0) = \frac{\mu}{\lambda}\,e^{-(\lambda-\mu)T}.
\]
Proof: By conditioning on the population size, we obtain
\[
P(Y_\infty = 0 \mid Y_T > 0) = E\bigl(P(Y_\infty = 0 \mid Y_T) \bigm| Y_T > 0\bigr) = E\Bigl(\bigl(\tfrac{\mu}{\lambda}\bigr)^{Y_T} \Bigm| Y_T > 0\Bigr),
\]
since $(\mu/\lambda)^m$ is the extinction probability of a linear birth and death process with birth rate $\lambda$, death rate $\mu$ and initial value $m$ (see Remark 2.2.2).

On the one hand, we have
\[
E\Bigl(\bigl(\tfrac{\mu}{\lambda}\bigr)^{Y_T} \Bigm| Y_T > 0\Bigr) = \frac{E\bigl((\mu/\lambda)^{Y_T}\bigr) - P(Y_T = 0)}{P(Y_T > 0)}.
\]
On the other hand, we can make use of the known formula for the probability generating function of $Y_T$ (see Section III.5 of [AN72]) to obtain
\[
E\bigl((\mu/\lambda)^{Y_T}\bigr) = \frac{\mu}{\lambda}.
\]
Combining these two identities with the formula for $p_0(T) = P(Y_T = 0)$ from Section 2.2 yields
\[
E\Bigl(\bigl(\tfrac{\mu}{\lambda}\bigr)^{Y_T} \Bigm| Y_T > 0\Bigr)
= \frac{\mu/\lambda - p_0(T)}{1 - p_0(T)}
= \frac{\mu(\lambda-\mu)}{\lambda\bigl(\lambda e^{(\lambda-\mu)T} - \mu\bigr)} \cdot \frac{\lambda e^{(\lambda-\mu)T} - \mu}{(\lambda-\mu)\,e^{(\lambda-\mu)T}}
= \frac{\mu}{\lambda}\,e^{-(\lambda-\mu)T}. \qquad\Box
\]
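The identity can be checked by direct simulation of the birth and death process; the following sketch (ours, with arbitrary parameter values and a finite horizon standing in for $t \to \infty$) estimates $P(Y_\infty = 0 \mid Y_T > 0)$ and compares it with $(\mu/\lambda)e^{-(\lambda-\mu)T}$.

    import math, random

    lam, mu, T = 1.0, 0.6, 2.0
    T_LONG = 10.0        # proxy for t -> infinity; extinction after that is negligible here

    def simulate():
        """One path started at Y_0 = 1; returns (Y_T > 0, extinct by T_LONG)."""
        y, t, alive_at_T = 1, 0.0, None
        while y > 0 and t < T_LONG:
            t += random.expovariate((lam + mu) * y)      # time of the next event
            if alive_at_T is None and t > T:
                alive_at_T = True                        # state y > 0 persists at time T
            if t >= T_LONG:
                break
            y += 1 if random.random() < lam / (lam + mu) else -1
        if alive_at_T is None:
            alive_at_T = y > 0
        return alive_at_T, y == 0

    runs = [simulate() for _ in range(20000)]
    extinct_given_survival = [ext for ok, ext in runs if ok]
    print(sum(extinct_given_survival) / len(extinct_given_survival),
          (mu / lam) * math.exp(-(lam - mu) * T))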

For the probability of $Y_T = 1$ conditioned on $Y_T > 0$, we have the following lemma.

6.3.6 Lemma

We have for $T \ge \frac{1}{\lambda-\mu}\log(2)$
\[
P(Y_T = 1 \mid Y_T > 0) \le \frac{2(\lambda-\mu)}{\lambda}\,e^{-(\lambda-\mu)T}.
\]
Proof: Using the probability mass functions $p_n$ and the function $\tilde p$ from Section 2.2, for $T \ge \frac{1}{\lambda-\mu}\log(2)$, we obtain
\[
P(Y_T = 1 \mid Y_T > 0) = \frac{p_1(T)}{1 - p_0(T)} = 1 - \lambda\tilde p(T) = \frac{\lambda-\mu}{\lambda e^{(\lambda-\mu)T} - \mu} \le \frac{2(\lambda-\mu)}{\lambda e^{(\lambda-\mu)T}}. \qquad\Box
\]
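A quick numerical comparison of the exact conditional probability and the bound (our illustration, with arbitrary parameter values):

    import math

    lam, mu = 1.0, 0.5
    for T in [2.0, 4.0, 8.0]:
        exact = (lam - mu) / (lam * math.exp((lam - mu) * T) - mu)
        bound = 2 * (lam - mu) / lam * math.exp(-(lam - mu) * T)
        print(T, exact, bound)   # the bound dominates for T >= log(2)/(lam - mu)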

We next consider sub- and supermartingale properties of conditioned processes.

6.3.7 Lemma

(i) $(Y_t)_{t\ge0}$ conditioned on $Y_\infty = 0$ is a supermartingale.

(ii) $(Y_t)_{t\ge0}$ conditioned on $Y_\infty > 0$ is a submartingale.

Proof:

(i) Consider a subcritical linear birth and death process $(\tilde Y_t)_{t\ge0}$ with birth rate $\mu$, death rate $\lambda$ and initial value one. Then $(\tilde Y_t)_{t\ge0}$ has the same law as $(Y_t)_{t\ge0}$ conditioned on $Y_\infty = 0$ (see e.g. page 78 in [Lam08]), which can be proved by using the embedded generation process (see e.g. Section IV.3 of [AN72] or Subsection 6(a) of [Wau58]) and the corresponding result for discrete-time branching processes (see e.g. Theorem 3 in Section I.12 of [AN72]), or the general result from [Wau58] for conditional Markov processes (cf. Subsection 6(a) of [Wau58]). We know from Corollary 2.2.8 that a subcritical linear birth and death process is a supermartingale, which yields the result.

(ii) Consider a process $(\hat Y_t)_{t\ge0}$ that has the law of $(Y_t)_{t\ge0}$ conditioned on $Y_\infty > 0$. Note that $(\hat Y_t)_{t\ge0}$ inherits the Markov property from $(Y_t)_{t\ge0}$. Furthermore, we know from Corollary 2.2.8 that $(Y_t)_{t\ge0}$ is a submartingale. Thus we obtain for $t > s \ge 0$ and $y_s \in \mathbb N$
\[
\begin{aligned}
E(\hat Y_t \mid \hat Y_s = y_s) &= E(Y_t \mid Y_s = y_s,\, Y_\infty > 0) \\
&= \frac{E\bigl(1_{\{Y_\infty>0\}} Y_t \bigm| Y_s = y_s\bigr)}{P(Y_\infty > 0 \mid Y_s = y_s)} \\
&= \frac{E(Y_t \mid Y_s = y_s) - E\bigl(1_{\{Y_\infty=0\}} Y_t \bigm| Y_s = y_s\bigr)}{P(Y_\infty > 0 \mid Y_s = y_s)} \\
&\ge \frac{y_s - P(Y_\infty = 0 \mid Y_s = y_s)\, E(\tilde Y_t \mid \tilde Y_s = y_s)}{P(Y_\infty > 0 \mid Y_s = y_s)} \\
&\ge \frac{y_s - P(Y_\infty = 0 \mid Y_s = y_s)\, y_s}{P(Y_\infty > 0 \mid Y_s = y_s)} = y_s,
\end{aligned}
\]
where $(\tilde Y_t)_{t\ge0}$ is the supermartingale from (i), which also inherits the Markov property from $(Y_t)_{t\ge0}$. Thus $(Y_t)_{t\ge0}$ conditioned on ultimate survival is a submartingale. $\Box$

Since a more explicit proof of the analogous results for the embedded jump chain is easier, we also state and prove these here.

6.3.8 Lemma

(i) $(Y_{T_k})_{k\in\mathbb N}$ conditioned on $Y_\infty = 0$ is a supermartingale.

(ii) $(Y_{T_k})_{k\in\mathbb N}$ conditioned on $Y_\infty > 0$ is a submartingale.

Proof:

(i) Consider a linear birth and death process $(\tilde Y_t)_{t\ge0}$ with birth rate $\mu$, death rate $\lambda$ and initial value one. Then $(\tilde Y_t)_{t\ge0}$ has the same law as $(Y_t)_{t\ge0}$ conditioned on $Y_\infty = 0$ (see the proof of Lemma 6.3.7). Thus it remains to show that $(\tilde Y_{\tilde T_k})_{k\in\mathbb N}$ is a supermartingale in order to show (i), where $(\tilde T_k)_{k\in\mathbb N}$ are the event times of $(\tilde Y_t)_{t\ge0}$.

Since $(\tilde Y_t)_{t\ge0}$ is a birth and death process, we have
\[
\begin{aligned}
E\bigl(\tilde Y_{\tilde T_{k+1}} \bigm| \tilde Y_{\tilde T_1},\dots,\tilde Y_{\tilde T_k}\bigr)
&= \tilde Y_{\tilde T_k} + P\bigl(\tilde Y_{\tilde T_{k+1}} = \tilde Y_{\tilde T_k} + 1 \bigm| \tilde Y_{\tilde T_1},\dots,\tilde Y_{\tilde T_k}\bigr) - P\bigl(\tilde Y_{\tilde T_{k+1}} = \tilde Y_{\tilde T_k} - 1 \bigm| \tilde Y_{\tilde T_1},\dots,\tilde Y_{\tilde T_k}\bigr) \\
&= \tilde Y_{\tilde T_k} + \frac{\mu}{\lambda+\mu} - \frac{\lambda}{\lambda+\mu} \ \le\ \tilde Y_{\tilde T_k}.
\end{aligned}
\]
Thus $(Y_{T_k})_{k\in\mathbb N}$ conditioned on $Y_\infty = 0$ is a supermartingale.

(ii) Consider a process $(\hat Y_t)_{t\ge0}$ that has the law of $(Y_t)_{t\ge0}$ conditioned on $Y_\infty > 0$ and let the event times of $(\hat Y_t)_{t\ge0}$ be denoted by $(\hat T_k)_{k\in\mathbb N}$. Then we have
\[
E\bigl(\hat Y_{\hat T_{k+1}} \bigm| \hat Y_{\hat T_1},\dots,\hat Y_{\hat T_k}\bigr)
= \hat Y_{\hat T_k} + P\bigl(\hat Y_{\hat T_{k+1}} = \hat Y_{\hat T_k} + 1 \bigm| \hat Y_{\hat T_1},\dots,\hat Y_{\hat T_k}\bigr) - P\bigl(\hat Y_{\hat T_{k+1}} = \hat Y_{\hat T_k} - 1 \bigm| \hat Y_{\hat T_1},\dots,\hat Y_{\hat T_k}\bigr).
\]
Consequently, it remains to show that
\[
P\bigl(\hat Y_{\hat T_{k+1}} = \hat Y_{\hat T_k} + 1 \bigm| \hat Y_{\hat T_1},\dots,\hat Y_{\hat T_k}\bigr) - P\bigl(\hat Y_{\hat T_{k+1}} = \hat Y_{\hat T_k} - 1 \bigm| \hat Y_{\hat T_1},\dots,\hat Y_{\hat T_k}\bigr) \ge 0.
\]
In order to do so, we compute
\[
\begin{aligned}
P\bigl(\hat Y_{\hat T_{k+1}} = \hat Y_{\hat T_k} + 1 \bigm| \hat Y_{\hat T_1},\dots,\hat Y_{\hat T_k}\bigr)
&= P(Y_{T_{k+1}} = Y_{T_k} + 1 \mid Y_{T_1},\dots,Y_{T_k},\, Y_\infty > 0) \\
&= \frac{P(Y_{T_{k+1}} = Y_{T_k} + 1,\, Y_\infty > 0 \mid Y_{T_1},\dots,Y_{T_k})}{P(Y_\infty > 0 \mid Y_{T_1},\dots,Y_{T_k})} \\
&= \frac{P(Y_{T_{k+1}} = Y_{T_k} + 1 \mid Y_{T_1},\dots,Y_{T_k}) - P(Y_{T_{k+1}} = Y_{T_k} + 1,\, Y_\infty = 0 \mid Y_{T_1},\dots,Y_{T_k})}{P(Y_\infty > 0 \mid Y_{T_1},\dots,Y_{T_k})} \\
&= P(Y_\infty > 0 \mid Y_{T_1},\dots,Y_{T_k})^{-1}\Bigl(P(Y_{T_{k+1}} = Y_{T_k} + 1 \mid Y_{T_1},\dots,Y_{T_k}) \\
&\qquad - P(Y_{T_{k+1}} = Y_{T_k} + 1 \mid Y_{T_1},\dots,Y_{T_k},\, Y_\infty = 0)\, P(Y_\infty = 0 \mid Y_{T_k})\Bigr). \qquad (6.17)
\end{aligned}
\]
Since
\[
P(Y_{T_{k+1}} = Y_{T_k} + 1 \mid Y_{T_1},\dots,Y_{T_k},\, Y_\infty = 0) = P\bigl(\tilde Y_{\tilde T_{k+1}} = \tilde Y_{\tilde T_k} + 1 \bigm| \tilde Y_{\tilde T_1},\dots,\tilde Y_{\tilde T_k}\bigr) = \frac{\mu}{\lambda+\mu}
\]
and
\[
P(Y_{T_{k+1}} = Y_{T_k} + 1 \mid Y_{T_1},\dots,Y_{T_k}) = \frac{\lambda}{\lambda+\mu},
\]
the right-hand side of (6.17) is equal to
\[
\frac{\frac{\lambda}{\lambda+\mu} - \frac{\mu}{\lambda+\mu}\, P(Y_\infty = 0 \mid Y_{T_k})}{P(Y_\infty > 0 \mid Y_{T_1},\dots,Y_{T_k})}
\ \ge\
\frac{\frac{\mu}{\lambda+\mu} - \frac{\lambda}{\lambda+\mu}\, P(Y_\infty = 0 \mid Y_{T_k})}{P(Y_\infty > 0 \mid Y_{T_1},\dots,Y_{T_k})}. \qquad (6.18)
\]
Since
\[
\frac{\lambda}{\lambda+\mu} = P\bigl(\tilde Y_{\tilde T_{k+1}} = \tilde Y_{\tilde T_k} - 1 \bigm| \tilde Y_{\tilde T_1},\dots,\tilde Y_{\tilde T_k}\bigr) = P(Y_{T_{k+1}} = Y_{T_k} - 1 \mid Y_{T_1},\dots,Y_{T_k},\, Y_\infty = 0)
\]
and
\[
\frac{\mu}{\lambda+\mu} = P(Y_{T_{k+1}} = Y_{T_k} - 1 \mid Y_{T_1},\dots,Y_{T_k}),
\]
the right-hand side of (6.18) is equal to
\[
\begin{aligned}
&\frac{P(Y_{T_{k+1}} = Y_{T_k} - 1 \mid Y_{T_1},\dots,Y_{T_k}) - P(Y_{T_{k+1}} = Y_{T_k} - 1 \mid Y_{T_1},\dots,Y_{T_k},\, Y_\infty = 0)\, P(Y_\infty = 0 \mid Y_{T_k})}{P(Y_\infty > 0 \mid Y_{T_1},\dots,Y_{T_k})} \\
&\quad= \frac{P(Y_{T_{k+1}} = Y_{T_k} - 1 \mid Y_{T_1},\dots,Y_{T_k}) - P(Y_{T_{k+1}} = Y_{T_k} - 1,\, Y_\infty = 0 \mid Y_{T_1},\dots,Y_{T_k})}{P(Y_\infty > 0 \mid Y_{T_1},\dots,Y_{T_k})} \\
&\quad= \frac{P(Y_{T_{k+1}} = Y_{T_k} - 1,\, Y_\infty > 0 \mid Y_{T_1},\dots,Y_{T_k})}{P(Y_\infty > 0 \mid Y_{T_1},\dots,Y_{T_k})} \\
&\quad= P(Y_{T_{k+1}} = Y_{T_k} - 1 \mid Y_{T_1},\dots,Y_{T_k},\, Y_\infty > 0) \\
&\quad= P\bigl(\hat Y_{\hat T_{k+1}} = \hat Y_{\hat T_k} - 1 \bigm| \hat Y_{\hat T_1},\dots,\hat Y_{\hat T_k}\bigr).
\end{aligned}
\]
Thus $(Y_{T_k})_{k\in\mathbb N}$ conditioned on ultimate survival is a submartingale. $\Box$

Lemma 6.3.7 yields a useful result about $(Y_t^{-1})_{t\ge0}$ conditioned on ultimate survival:

6.3.9 Corollary

$(Y_t^{-1})_{t\ge0}$ conditioned on $Y_\infty > 0$ is a supermartingale.

Proof: In general, for a submartingale $(Z_t)_{t\ge0}$ with respect to a filtration $(\mathcal F_t)_{t\ge0}$ with $Z_t \ge 1$ for all $t \ge 0$, we have for $t > s \ge 0$
\[
E\Bigl(\frac{1}{Z_t} - \frac{1}{Z_s} \Bigm| \mathcal F_s\Bigr) = E\Bigl(\frac{Z_s - Z_t}{Z_s Z_t} \Bigm| \mathcal F_s\Bigr) \le E(Z_s - Z_t \mid \mathcal F_s) \le 0.
\]
Thus $(Z_t^{-1})_{t\ge0}$ is a supermartingale. Consequently, Lemma 6.3.7 implies that $(Y_t^{-1})_{t\ge0}$ conditioned on ultimate survival is a supermartingale. $\Box$

6.3.10 Remark

It can be proved analogously to Corollary 6.3.9 that $(Y_{T_k}^{-1})_{k\in\mathbb N}$ conditioned on $Y_\infty > 0$ is a supermartingale.
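From the proof of Lemma 6.3.8(ii), the conditioned jump chain moves from state $y$ up with probability $\bigl(\frac{\lambda}{\lambda+\mu} - \frac{\mu}{\lambda+\mu}(\frac{\mu}{\lambda})^y\bigr)\big/\bigl(1 - (\frac{\mu}{\lambda})^y\bigr)$ and down with the complementary probability, using $P(Y_\infty = 0 \mid Y_{T_k} = y) = (\mu/\lambda)^y$ from Remark 2.2.2. The following sketch (ours, with illustrative parameter values) checks the submartingale property of Lemma 6.3.8(ii) and the supermartingale property of Remark 6.3.10 numerically over a range of states.

    lam, mu = 1.0, 0.5
    q = mu / lam                     # extinction probability from a single individual

    def s(y):                        # survival probability from state y: 1 - (mu/lam)^y
        return 1.0 - q ** y

    for y in range(2, 51):
        p_up   = lam * s(y + 1) / ((lam + mu) * s(y))   # conditioned up-step probability
        p_down = mu  * s(y - 1) / ((lam + mu) * s(y))   # conditioned down-step probability
        assert abs(p_up + p_down - 1.0) < 1e-12
        # Lemma 6.3.8(ii): the conditioned jump chain is a submartingale ...
        assert p_up * (y + 1) + p_down * (y - 1) >= y - 1e-12
        # Remark 6.3.10: ... and its reciprocal is a supermartingale.
        assert p_up / (y + 1) + p_down / (y - 1) <= 1.0 / y + 1e-12
    print("checks passed")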

Now we introduce the deterministic number $\kappa(T)$ mentioned before, which helps us to cope with dependencies. Furthermore, we define the random index $K(T)$ of the last event time before $T/2$, so that $r(J_T) > K(T)$ is equivalent to $T_{r(J_T)} \ge T/2$, and we define $M_T$ as the sum of the number of births and the number of deaths, to simplify notation.

6.3.11 Definition

Let $\kappa(T) := \lfloor e^{\frac{3}{2}(\lambda+\mu)T}\rfloor$ and $K(T) := \max\{k : T_k < T/2\}$, such that $Y_{T_{K(T)}} = Y_{T/2}$ almost surely. Moreover, let $M_T := B_T + D_T$ be the number of events up to time $T$.

The following lemma implies the properties of κ(T) and K(T) that were stated before and that we need to prove the convergence rate for the degree distribution in the general case.

6.3.12 Lemma

(i) For the probability that the randomly picked node $J_T$ was born not later than the $K(T)$th event, we have for $T \ge \frac{1}{\lambda-\mu}\log(2)$
\[
P\bigl(r(J_T) \le K(T) \bigm| Y_T > 0\bigr) \le e^{-\frac{1}{2}\lambda T} + \frac{2(\lambda-\mu)}{\lambda}\,e^{-(\lambda-\mu)T}.
\]
(ii) For $T \ge \frac{2(\log(4\lambda)-\log(\lambda-\mu))}{\lambda-\mu}$, we have
\[
P\bigl(\kappa(T) \le M_T \bigm| Y_T > 0\bigr) \le \frac{60\lambda^3(\lambda+\mu)}{(\lambda-\mu)^4}\,e^{-(\lambda+\mu)T}.
\]
Proof:

(i) By Remark 4.3.9, we have
\[
P\bigl(r(J_T) \le K(T) \bigm| Y_T > 0\bigr) = P\bigl(T_{r(J_T)} \le T_{K(T)} \bigm| Y_T > 0\bigr) = P\bigl(T_{r(J_T)} < T/2 \bigm| Y_T > 0\bigr). \qquad (6.19)
\]
By Proposition 2.2.4, the right-hand side of (6.19) is smaller than or equal to
\[
e^{-\frac{1}{2}\lambda T} + \frac{2(\lambda-\mu)}{\lambda}\,e^{-(\lambda-\mu)T}.
\]

(ii) Using Proposition 2.3.1, we compute
\[
\kappa(T) - 2E(B_T) \ge e^{\frac{3}{2}(\lambda+\mu)T} - 1 - \frac{2\lambda}{\lambda-\mu}\bigl(e^{(\lambda-\mu)T} - 1\bigr). \qquad (6.20)
\]

Since $T \ge \frac{2(\log(4\lambda)-\log(\lambda-\mu))}{\lambda-\mu}$, we have
\[
\frac{1}{2}\,e^{2\mu T}e^{\frac{1}{2}(\lambda+\mu)T} \ge \frac{2\lambda}{\lambda-\mu}. \qquad (6.21)
\]
Thus for $T \ge \frac{2(\log(4\lambda)-\log(\lambda-\mu))}{\lambda-\mu}$, the right-hand side of (6.20) is larger than or equal to
\[
e^{(\lambda-\mu)T}\,\frac{1}{2}\,e^{2\mu T}e^{\frac{1}{2}(\lambda+\mu)T} + \frac{3\mu-\lambda}{\lambda-\mu}. \qquad (6.22)
\]
Since for $T \ge \frac{2(\log(4\lambda)-\log(\lambda-\mu))}{\lambda-\mu}$ we have
\[
e^{(\lambda-\mu)T} \ge 1 + (\lambda-\mu)T \ge 1 + 2\log\Bigl(\frac{4\lambda}{\lambda-\mu}\Bigr) \ge 1 + 2\log(4) \ge 2, \qquad (6.23)
\]
Inequality (6.21) implies that for $T \ge \frac{2(\log(4\lambda)-\log(\lambda-\mu))}{\lambda-\mu}$, the expression (6.22) is bounded from below by
\[
e^{(\lambda-\mu)T}\,\frac{3}{8}\,e^{2\mu T}e^{\frac{1}{2}(\lambda+\mu)T} + \frac{3\mu}{\lambda-\mu} \ \ge\ \frac{3}{8}\,e^{\frac{3}{2}(\lambda+\mu)T}.
\]
Thus for $T \ge \frac{2(\log(4\lambda)-\log(\lambda-\mu))}{\lambda-\mu}$, we have
\[
\kappa(T) - 2E(B_T) \ge \frac{3}{8}\,e^{\frac{3}{2}(\lambda+\mu)T} > 0. \qquad (6.24)
\]

Consequently, we can apply Chebyshev's inequality and obtain
\[
\begin{aligned}
P\bigl(\kappa(T) \le M_T \bigm| Y_T > 0\bigr) &\le \frac{P(\kappa(T) \le M_T)}{P(Y_T > 0)} \le \frac{P(\kappa(T) \le 2B_T)}{P(Y_T > 0)} \\
&= \frac{1}{P(Y_T > 0)}\,P\bigl(\kappa(T) - 2E(B_T) \le 2B_T - 2E(B_T)\bigr) \\
&\le \frac{1}{P(Y_T > 0)}\,\frac{4\,\mathrm{Var}(B_T)}{\bigl(\kappa(T) - 2E(B_T)\bigr)^2}, \qquad (6.25)
\end{aligned}
\]
where $T \ge \frac{2(\log(4\lambda)-\log(\lambda-\mu))}{\lambda-\mu}$. By $P(Y_T > 0) \ge P(Y_\infty > 0) = \frac{\lambda-\mu}{\lambda}$ and Inequality (6.24), for $T \ge \frac{2(\log(4\lambda)-\log(\lambda-\mu))}{\lambda-\mu}$, the right-hand side of (6.25) is smaller than or equal to
\[
\frac{256}{9}\,\frac{\lambda}{\lambda-\mu}\,\frac{\mathrm{Var}(B_T)}{e^{3(\lambda+\mu)T}} \le \frac{30\lambda}{\lambda-\mu}\left(\frac{\lambda^2(\lambda+\mu)}{(\lambda-\mu)^3}\,e^{-(\lambda+\mu)T} + \frac{2\lambda^2\mu}{(\lambda-\mu)^3}\,e^{-2(\lambda+\mu)T}\right), \qquad (6.26)
\]
where we used Proposition 2.3.1 in order to obtain the upper bound on the right-hand side. Since $e^{-(\lambda+\mu)T} \le 1/2$ by (6.23), the right-hand side of (6.26) is smaller than or equal to
\[
\frac{60\lambda^3(\lambda+\mu)}{(\lambda-\mu)^4}\,e^{-(\lambda+\mu)T}. \qquad\Box
\]
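To illustrate how generously $\kappa(T)$ dominates the number of events, one can compare it with simulated event counts (a sketch of ours with illustrative parameters; the expected number of births $E(B_T) = \lambda(e^{(\lambda-\mu)T}-1)/(\lambda-\mu)$ used below follows from $E(Y_t) = e^{(\lambda-\mu)t}$):

    import math, random

    lam, mu, T = 1.0, 0.5, 4.0
    kappa = math.floor(math.exp(1.5 * (lam + mu) * T))

    def count_events():
        """Total number of births and deaths up to time T (M_T), started at Y_0 = 1."""
        y, t, events = 1, 0.0, 0
        while y > 0:
            t += random.expovariate((lam + mu) * y)
            if t > T:
                break
            events += 1
            y += 1 if random.random() < lam / (lam + mu) else -1
        return events

    samples = [count_events() for _ in range(1000)]
    EB = lam * (math.exp((lam - mu) * T) - 1) / (lam - mu)
    print(max(samples), 2 * EB, kappa)   # even the largest M_T stays far below kappa(T)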

Further lemmas

In what follows, we give some more specialized results that are used in the proof of Theorem 6.3.4.

The following lemma states that the time $T - T_{M_T}$ since the last event becomes small quickly.

6.3.13 Lemma

Proof: Let $X$ have the cumulative distribution function
\[
G(t) = 1_{\{Y_T > 1\}}\bigl(1 - e^{-(Y_T - 1)\lambda t}\bigr) + 1_{\{Y_T \le 1\}}\,1_{\{t \ge T\}}.
\]
Then the right-hand side of (6.28) is smaller than or equal to the conditional expectation bounded in Lemma 6.3.14 below.

The following lemma gives us an upper bound for the conditional expectation on the right-hand side of (6.27) and is proved similarly to Proposition 2.2.4.

6.3.14 Lemma

In the proof of Proposition 2.2.4, we computed an upper bound for the series in (6.29), which yields that the right-hand side of (6.29) is smaller than or equal to the asserted bound.

In order to prove our next lemma, we use the supermartingale from Corollary 6.3.9.

6.3.15 Lemma

where the last line follows from Corollary 6.3.9 and the submartingale inequality (2) in Theorem 6.14 on page 99 in [Yeh95] applied to $(-Y_t^{-1})_{t\ge0}$ (i.e. we use the continuous-time analogue of the supermartingale inequality (1) in Corollary 6.8 on page 94 in [Yeh95]).

We use the following lemma to bound the conditional expectation in the second summand of the right-hand side of Lemma 6.3.15 from above.

6.3.16 Lemma

where the second line follows from Proposition 2.2.4 and the last inequality holds for $T \ge \frac{2\log(2)}{\lambda-\mu}$.

Before we continue to state and prove further useful results, we introduce another random variable.

6.3.17 Definition

Let $R_{T_l,T}$ be the number of nodes that are alive at time $T_l$ and survive up to time $T$.

The following lemma gives us the first two conditional moments of $R_{T_l,T}$.

6.3.18 Lemma

We have
\[
E(R_{T_l,T} \mid Y_{T_l}, T_l, T_{l+1}) = Y_{T_l}e^{-\mu(T-T_{l+1})} - P(Y_{T_{l+1}} = Y_{T_l} - 1 \mid Y_{T_l}, T_l, T_{l+1})\,e^{-\mu(T-T_{l+1})}
\]
and
\[
\begin{aligned}
E(R_{T_l,T}^2 \mid Y_{T_l}, T_l, T_{l+1}) &= Y_{T_l}e^{-\mu(T-T_{l+1})} - Y_{T_l}e^{-2\mu(T-T_{l+1})} + Y_{T_l}^2 e^{-2\mu(T-T_{l+1})} \\
&\quad - P(Y_{T_{l+1}} = Y_{T_l} - 1 \mid Y_{T_l}, T_l, T_{l+1})\bigl(2(Y_{T_l} - 1)e^{-2\mu(T-T_{l+1})} + e^{-\mu(T-T_{l+1})}\bigr).
\end{aligned}
\]
Proof: Firstly, we determine the conditional expectation of $R_{T_l,T}$:

\[
\begin{aligned}
E(R_{T_l,T} \mid Y_{T_l}, T_l, T_{l+1}) &= p_+\,E(R_{T_l,T} \mid Y_{T_l}, T_l, T_{l+1}, Y_{T_{l+1}} = Y_{T_l} + 1) \\
&\quad + p_-\,E(R_{T_l,T} \mid Y_{T_l}, T_l, T_{l+1}, Y_{T_{l+1}} = Y_{T_l} - 1), \qquad (6.30)
\end{aligned}
\]
where $p_+ := P(Y_{T_{l+1}} = Y_{T_l} + 1 \mid Y_{T_l}, T_l, T_{l+1})$ and $p_- := P(Y_{T_{l+1}} = Y_{T_l} - 1 \mid Y_{T_l}, T_l, T_{l+1})$. (We have $p_+ = \frac{\lambda}{\lambda+\mu}$ and $p_- = \frac{\mu}{\lambda+\mu}$; however, we do not need these explicit expressions for our purposes.) Let $A_{l+1}$ denote the event that the node born at time $T_{l+1}$ survives up to time $T$, in case a birth occurs at time $T_{l+1}$.

With
\[
E(R_{T_l,T} \mid Y_{T_l}, T_l, T_{l+1}, Y_{T_{l+1}} = Y_{T_l} + 1) = E(R_{T_{l+1},T} - 1_{A_{l+1}} \mid Y_{T_l}, T_l, T_{l+1}, Y_{T_{l+1}} = Y_{T_l} + 1) = (Y_{T_l} + 1)e^{-\mu(T-T_{l+1})} - e^{-\mu(T-T_{l+1})} = Y_{T_l}e^{-\mu(T-T_{l+1})} \qquad (6.31)
\]
and
\[
E(R_{T_l,T} \mid Y_{T_l}, T_l, T_{l+1}, Y_{T_{l+1}} = Y_{T_l} - 1) = E(R_{T_{l+1},T} \mid Y_{T_l}, T_l, T_{l+1}, Y_{T_{l+1}} = Y_{T_l} - 1) = (Y_{T_l} - 1)e^{-\mu(T-T_{l+1})},
\]
Equation (6.30) implies
\[
E(R_{T_l,T} \mid Y_{T_l}, T_l, T_{l+1}) = Y_{T_l}e^{-\mu(T-T_{l+1})} - e^{-\mu(T-T_{l+1})}\,p_-.
\]
Secondly, we compute the conditional second moment of $R_{T_l,T}$:

\[
\begin{aligned}
E(R_{T_l,T}^2 \mid Y_{T_l}, T_l, T_{l+1}) &= p_+\,E(R_{T_l,T}^2 \mid Y_{T_l}, T_l, T_{l+1}, Y_{T_{l+1}} = Y_{T_l} + 1) \\
&\quad + p_-\,E(R_{T_l,T}^2 \mid Y_{T_l}, T_l, T_{l+1}, Y_{T_{l+1}} = Y_{T_l} - 1). \qquad (6.32)
\end{aligned}
\]
We treat the summands separately again. For the case where a birth occurs at time $T_{l+1}$, we have
\[
\begin{aligned}
&E(R_{T_l,T}^2 \mid Y_{T_l}, T_l, T_{l+1}, Y_{T_{l+1}} = Y_{T_l} + 1) \\
&\quad= E\bigl((R_{T_{l+1},T} - 1_{A_{l+1}})^2 \bigm| Y_{T_l}, T_l, T_{l+1}, Y_{T_{l+1}} = Y_{T_l} + 1\bigr) \\
&\quad= E\bigl(R_{T_{l+1},T}^2 - 2R_{T_{l+1},T}\,1_{A_{l+1}} + 1_{A_{l+1}} \bigm| Y_{T_l}, T_l, T_{l+1}, Y_{T_{l+1}} = Y_{T_l} + 1\bigr) \qquad (6.33)
\end{aligned}
\]
and further
\[
\begin{aligned}
E(R_{T_{l+1},T}^2 \mid Y_{T_l}, T_l, T_{l+1}, Y_{T_{l+1}} = Y_{T_l} + 1) &= \mathrm{Var}(R_{T_{l+1},T} \mid Y_{T_l}, T_l, T_{l+1}, Y_{T_{l+1}} = Y_{T_l} + 1) \\
&\quad + \bigl(E(R_{T_{l+1},T} \mid Y_{T_l}, T_l, T_{l+1}, Y_{T_{l+1}} = Y_{T_l} + 1)\bigr)^2. \qquad (6.34)
\end{aligned}
\]

Given $Y_{T_{l+1}}$, let $L_{l+1}$ be the set of the $Y_{T_{l+1}}$ nodes living at time $T_{l+1}$. Then, by independence of the various death times, we obtain for the conditional variance
\[
\mathrm{Var}(R_{T_{l+1},T} \mid Y_{T_l}, T_l, T_{l+1}, Y_{T_{l+1}} = Y_{T_l} + 1)
= \mathrm{Var}\Bigl(\sum_{j \in L_{l+1}} 1_{\{\text{node } j \text{ survives up to } T\}} \Bigm| Y_{T_l}, T_l, T_{l+1}, Y_{T_{l+1}} = Y_{T_l} + 1\Bigr)
= (Y_{T_l} + 1)\bigl(e^{-\mu(T-T_{l+1})} - e^{-2\mu(T-T_{l+1})}\bigr).
\]
For the second summand of (6.34), we obtain
\[
E(R_{T_{l+1},T} \mid Y_{T_l}, T_l, T_{l+1}, Y_{T_{l+1}} = Y_{T_l} + 1) = (Y_{T_l} + 1)e^{-\mu(T-T_{l+1})}.
\]
Thus (6.34) is equal to
\[
(Y_{T_l} + 1)\bigl(e^{-\mu(T-T_{l+1})} + Y_{T_l}e^{-2\mu(T-T_{l+1})}\bigr).
\]
For the remaining parts of (6.33), we have

\[
\begin{aligned}
E(R_{T_{l+1},T}\,1_{A_{l+1}} \mid Y_{T_l}, T_l, T_{l+1}, Y_{T_{l+1}} = Y_{T_l} + 1)
&= P(A_{l+1} \mid Y_{T_l}, T_l, T_{l+1}, Y_{T_{l+1}} = Y_{T_l} + 1) \\
&\quad\cdot E(R_{T_{l+1},T} \mid A_{l+1}, Y_{T_l}, T_l, T_{l+1}, Y_{T_{l+1}} = Y_{T_l} + 1) \\
&= e^{-\mu(T-T_{l+1})}\bigl(1 + Y_{T_l}e^{-\mu(T-T_{l+1})}\bigr)
\end{aligned}
\]
and
\[
E(1_{A_{l+1}} \mid Y_{T_l}, T_l, T_{l+1}, Y_{T_{l+1}} = Y_{T_l} + 1) = e^{-\mu(T-T_{l+1})}.
\]
Thus (6.33) implies
\[
\begin{aligned}
E(R_{T_l,T}^2 \mid Y_{T_l}, T_l, T_{l+1}, Y_{T_{l+1}} = Y_{T_l} + 1)
&= (Y_{T_l} + 1)\bigl(e^{-\mu(T-T_{l+1})} + Y_{T_l}e^{-2\mu(T-T_{l+1})}\bigr) \\
&\quad - 2\bigl(e^{-\mu(T-T_{l+1})} + Y_{T_l}e^{-2\mu(T-T_{l+1})}\bigr) + e^{-\mu(T-T_{l+1})} \\
&= Y_{T_l}e^{-\mu(T-T_{l+1})} - Y_{T_l}e^{-2\mu(T-T_{l+1})} + Y_{T_l}^2 e^{-2\mu(T-T_{l+1})}.
\end{aligned}
\]
For the case where a death occurs at time $T_{l+1}$, we have

\[
\begin{aligned}
E(R_{T_l,T}^2 \mid Y_{T_l}, T_l, T_{l+1}, Y_{T_{l+1}} = Y_{T_l} - 1)
&= \mathrm{Var}(R_{T_{l+1},T} \mid Y_{T_l}, T_l, T_{l+1}, Y_{T_{l+1}} = Y_{T_l} - 1) + \bigl(E(R_{T_{l+1},T} \mid Y_{T_l}, T_l, T_{l+1}, Y_{T_{l+1}} = Y_{T_l} - 1)\bigr)^2 \\
&= (Y_{T_l} - 1)\bigl(e^{-\mu(T-T_{l+1})} - e^{-2\mu(T-T_{l+1})}\bigr) + (Y_{T_l} - 1)^2 e^{-2\mu(T-T_{l+1})} \\
&= (Y_{T_l} - 1)e^{-\mu(T-T_{l+1})} + \bigl(Y_{T_l}^2 - 3Y_{T_l} + 2\bigr)e^{-2\mu(T-T_{l+1})}.
\end{aligned}
\]
Thus from (6.32) it follows that
\[
E(R_{T_l,T}^2 \mid Y_{T_l}, T_l, T_{l+1}) = Y_{T_l}e^{-\mu(T-T_{l+1})} - Y_{T_l}e^{-2\mu(T-T_{l+1})} + Y_{T_l}^2 e^{-2\mu(T-T_{l+1})} + p_-\bigl(2(1 - Y_{T_l})e^{-2\mu(T-T_{l+1})} - e^{-\mu(T-T_{l+1})}\bigr). \qquad\Box
\]
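These moment formulas can be verified by simulating one event step followed by independent $\mathrm{Exp}(\mu)$ residual lifetimes, which is exactly the structure used in the proof (a sketch of ours with arbitrary values):

    import math, random

    lam, mu = 1.0, 0.5
    y, tau = 5, 1.0                 # Y_{T_l} = y, remaining time T - T_{l+1} = tau
    p_minus = mu / (lam + mu)       # probability that the event at T_{l+1} is a death
    e1, e2 = math.exp(-mu * tau), math.exp(-2 * mu * tau)

    def sample_R():
        """Nodes alive at T_l that survive to T, given one event at T_{l+1}."""
        pool = y if random.random() >= p_minus else y - 1   # a death removes one node
        return sum(random.expovariate(mu) > tau for _ in range(pool))

    n = 100000
    samples = [sample_R() for _ in range(n)]
    m1 = sum(samples) / n
    m2 = sum(r * r for r in samples) / n
    print(m1, y * e1 - p_minus * e1)                       # first moment, Lemma 6.3.18
    print(m2, y * e1 - y * e2 + y * y * e2 + p_minus * (2 * (1 - y) * e2 - e1))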

Knowing the conditional moments of $R_{T_l,T}$, we can find an upper bound for a more complex conditional expectation involving $R_{T_l,T}$ that appears in the proof of the main theorem below:

6.3.19 Lemma

Proof: By Jensen's inequality, we first bound the left-hand side of (6.35); by (6.36) and (6.37), we can then bound the right-hand side of (6.35) from above as asserted.

The following purely analytical lemma is also used in the proof of Theorem 6.3.4.

6.3.20 Lemma

Proof: The derivative of the left-hand side of (6.38) is smaller than or equal to the derivative of the right-hand side of (6.38). Hence the statement follows.

For the conditional expectation of the sum of the squared inter-event times since the birth of the randomly picked node, we have the following lemma.

6.3.21 Lemma

Proof: For the left-hand side of (6.39), we deduce the decomposition (6.40), using that, given $(Y_{T_k})_{k\in\mathbb N}$ and $K(T)$, the inter-event times are conditionally independent for $m > K(T)$. In order to derive an upper bound for the first conditional expectation on the right-hand side of (6.40), we introduce a sequence of random variables $(U_m)_{m\in\mathbb N}$ such that, given $(Y_{T_k})_{k\in\mathbb N}$ and $K(T)$, the $U_m \sim \mathrm{Exp}\bigl((\lambda+\mu)\min_{K(T) < k} Y_{T_k}\bigr)$ are independent and identically distributed. By Lemma 6.3.5, the second summand of the right-hand side of (6.41) is equal to
\[
\frac{T^2}{4}\,\frac{\mu}{\lambda}\,e^{-(\lambda-\mu)T}.
\]
The first summand of the right-hand side of (6.41) is bounded from above by an expression involving the maximum of the $U_m$, where the corresponding equality follows from the formula for the expectation of the maximum of independent and identically exponentially distributed random variables (see e.g. the introduction of [Eis08]). Using the well-known upper bound for the harmonic sum then yields (6.42). For the conditional expectation in (6.42), we obtain a further bound for $T \ge \frac{2\log(2)}{\lambda-\mu}$. Thus we can conclude that the first summand of the right-hand side of (6.41) is smaller than or equal to the asserted expression for sufficiently large $T$.

For the last line of (6.40), we can use the upper bounds from Lemma 6.3.12 and obtain the asserted bound for $T \ge \bigl(\frac{1}{\lambda-\mu}\log(2) \vee \frac{2(\log(4\lambda)-\log(\lambda-\mu))}{\lambda-\mu}\bigr)$. $\Box$
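The fact about maxima of exponentials used above is easy to check numerically: if $U_1,\dots,U_n$ are i.i.d. $\mathrm{Exp}(\theta)$, then $E(\max_m U_m) = \frac{1}{\theta}\sum_{m=1}^n \frac{1}{m} \le \frac{\log(n)+1}{\theta}$. A quick sketch (ours, with arbitrary values):

    import math, random

    theta, n, trials = 2.0, 50, 100000
    est = sum(max(random.expovariate(theta) for _ in range(n)) for _ in range(trials)) / trials
    harmonic = sum(1.0 / m for m in range(1, n + 1)) / theta
    print(est, harmonic, (math.log(n) + 1) / theta)   # estimate, exact value, harmonic bound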

Proof of the main theorem

In order to simplify notation, we introduce $\bar E(\,\cdot\,) = E(\,\cdot \mid Y_T > 0)$ and $\bar P(\,\cdot\,) = P(\,\cdot \mid Y_T > 0)$.

In the following we prove Theorem 6.3.4. As before, we use Theorem 3.4.1 to obtain
\[
d_{TV}\bigl(\mathrm{MixPo}(\bar\Lambda_T),\,\mathrm{MixPo}(\bar M)\bigr) \le \bar E|\Lambda_T - M|.
\]
In order to find an upper bound for $\bar E|\Lambda_T - M|$, we use the triangle inequality for the absolute value after plugging in the definitions of $\Lambda_T$ and $M$ given by (6.6) and (6.15). This yields a bound consisting of the five summands (6.45)-(6.49), where $M_T$ is the number of events up to time $T$ and $r^{-1}(l)$ is the number of births that occur not later than the $l$th event for all $l \in \{1,\dots,M_T\}$, i.e.
\[
r^{-1}\colon \{1,\dots,M_T\} \to \{1,\dots,B_T\},\qquad l \mapsto \sum_{i=1}^{B_T} 1_{\{r(i) \le l\}}.
\]
In the following, we deduce upper bounds for (6.45)-(6.49).
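In implementation terms, $r^{-1}$ is just a running count of births along the event sequence; a tiny sketch (ours, with a made-up event sequence):

    # events[l-1] is True if the l-th event is a birth, False if it is a death.
    events = [True, True, False, True, False, True]

    def r_inverse(l):
        """Number of births among the first l events."""
        return sum(events[:l])

    print([r_inverse(l) for l in range(1, len(events) + 1)])   # -> [1, 2, 2, 3, 3, 4]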

Upper bound for (6.45) and (6.46)

We treat (6.45) similarly to the corresponding expression in the pure birth case, but condition on $B_T$, $D_T$, $J_T$ and the information which nodes survive up to time $T$, and obtain the bound (6.50). Since we condition on $J_T$, the same expression is also obtained for (6.46).

Lemma 6.3.13 yields an upper bound for the conditional expectation on the right-hand side of (6.50) for $T \ge \frac{1}{\lambda-\mu}\log(2)$, and Lemma 6.3.14 gives us an upper bound for the second summand of the upper bound from Lemma 6.3.13. Plugging this expression into the right-hand side of (6.50) results in the desired upper bound for the sum of (6.45) and (6.46).

Upper bound for (6.47)

Recall that $R_{T_l,T}$ is the number of nodes that are alive at time $T_l$ and survive up to time $T$. We bound (6.47) by the expression (6.52); since all other summands are zero, the right-hand side of (6.52) is smaller than or equal to the sum in (6.54), whose second part we further decompose via (6.55). By Lemma 6.3.5, the second summand of the right-hand side of (6.55) equals
\[
\bar P(Y_\infty = 0) = \frac{\mu}{\lambda}\,e^{-(\lambda-\mu)T}.
\]
By combining Lemma 6.3.15, where $\delta = 1/2$ and $\gamma = 1/6$, and Lemma 6.3.16, we obtain for $T \ge \frac{2\log(2)}{\lambda-\mu}$ a bound for the first summand of the right-hand side of (6.55), cf. (6.56). Thus for $T \ge \frac{2\log(2)}{\lambda-\mu}$, the first summand of the right-hand side of (6.54) is smaller than or equal to (6.57). By Lemma 6.3.12(i), we obtain that for $T \ge \frac{1}{\lambda-\mu}\log(2)$, the second summand of the right-hand side of (6.54) is bounded from above by (6.58).

Thus for $T \ge \frac{2\log(2)}{\lambda-\mu}$, the expression (6.47) is bounded from above by the sum of (6.57) and (6.58).

Upper bound for (6.48)

For (6.48), we have the identity (6.59), since the second sum on the left-hand side of (6.59) has exactly $R_{T_l,T} - 1$ non-zero summands. The right-hand side of (6.59) is equal to (6.60), and a further elementary estimate shows that (6.60) is smaller than or equal to the three-summand expression (6.61).

Firstly, we consider the first summand of the right-hand side of (6.61). Since the social index $S_{J_T}$ is independent of all other random variables appearing in (6.61), this summand is equal to $2\alpha E(S)$ times the expectation in (6.62). For $T \ge \frac{1}{\lambda-\mu}\log(2)$, the second summand of the right-hand side of (6.62) is smaller than or equal to
\[
T\Bigl(e^{-\frac{1}{2}\lambda T} + \frac{2(\lambda-\mu)}{\lambda}\,e^{-(\lambda-\mu)T}\Bigr)
\]
by Lemma 6.3.12(i). The first summand of the right-hand side of (6.62) is equal to (6.63). For the inner expectation, we use that a jump at this time has probability zero. Thus, due to the Markov property of $(Y_t)_{t\ge0}$ and the formula for the extinction probability of a linear birth and death process with a general initial value given in Remark 2.2.2, the conditional probability $P(Y_T = 0 \mid K(T), Y_{T_{K(T)}})$ is equal to $p_0(T/2)^{Y_{T_{K(T)}}}$, which implies that (6.64) is equal to an expression involving $P(Y_{T_{K(T)}} > 0 \mid K(T))$. Thus the right-hand side of (6.63) is smaller than or equal to (6.66). By Lemma 6.3.19, summing over $l$ and taking the expectation yields the upper bound (6.67) for the first summand of the right-hand side of (6.62), and (6.67) can obviously be rewritten as (6.68). We may conclude that for $T \ge \frac{1}{\lambda-\mu}\log(2)$, the first summand of (6.61) is bounded from above accordingly.

For the first outer expectation in (6.68), Inequality (6.56) yields that for $T \ge \frac{2\log(2)}{\lambda-\mu}$ it is smaller than or equal to the corresponding expression with an additional factor $T$. For the second outer expectation in (6.68), the last equality follows from Lemma 6.3.5.

In conclusion, for $T \ge \frac{2\log(2)}{\lambda-\mu}$, we obtain an upper bound for the first summand of the right-hand side of (6.61).

Since the social index $S_{J_T}$ is independent of all other random variables appearing in (6.61), the second summand of the right-hand side of (6.61) is smaller than or equal to the expression (6.69). By Lemma 6.3.20, this expression is bounded from above by a quantity to which Lemma 6.3.21 applies, so that for $T \ge \bigl(\frac{2\log(2)}{\lambda-\mu} \vee \frac{2(\log(4\lambda)-\log(\lambda-\mu))}{\lambda-\mu}\bigr)$, the expression (6.69), and hence also the second summand of the right-hand side of (6.61), is smaller than or equal to an expression with leading factor $\frac{\alpha(\beta+\mu)E(S)}{\mu}$.

For the third summand of the right-hand side of (6.61), we obviously obtain the same upper bound, except that the factor $\beta+\mu$ is replaced by $\beta$.

We may conclude that for $T \ge \bigl(\frac{2\log(2)}{\lambda-\mu} \vee \frac{2(\log(4\lambda)-\log(\lambda-\mu))}{\lambda-\mu}\bigr)$, the expression (6.48) is bounded from above accordingly.

Upper bound for (6.49)

Recall that we arranged $S_{J_T} = S_J$. Since the sum in (6.49) telescopes, this implies the bound (6.70), where the last inequality holds since $S_J$ is independent of all other random variables appearing in (6.70).

By combining Lemma 6.3.13 and Lemma 6.3.14 as before, we obtain an upper bound for the first conditional expectation on the right-hand side of (6.70). For the second conditional expectation on the right-hand side of (6.70), Corollary 4.3.3 provides the corresponding identity. Altogether, we obtain that the right-hand side of (6.70) is smaller than or equal to $4\alpha$ times the resulting expression.

Combining the upper bounds obtained for (6.45)-(6.49), we arrive, for $T \ge \bigl(\frac{2\log(2)}{\lambda-\mu} \vee \frac{2(\log(4\lambda)-\log(\lambda-\mu))}{\lambda-\mu}\bigr)$, at the intermediate bound (6.71).

In the remainder of the proof, we find a simpler upper bound by elementary calculations. In order to do so, we assume $T \ge \frac{2\log\bigl(4\lambda(\lambda-\mu)^{-1}\bigr)}{\lambda-\mu}$, which implies in particular $2 \le (\lambda-\mu)T$. Thus the first three lines of the right-hand side of (6.71) are smaller than or equal to the expression (6.72). Using elementary inequalities for the logarithm, the last four lines of (6.71) are smaller than or equal to the expression (6.73).

We first consider those terms in (6.73) that depend on $E(S)$. Since $2 \le (\lambda-\mu)T$, grouping the terms where $\beta$ dominates in the denominator gives a bound with prefactor $8\alpha E(S)$; grouping the $\beta$-free terms in (6.73) yields a bound with prefactor $\alpha E(S)$; and considering the terms in (6.73) where $\beta$ dominates in the numerator, we obtain a bound with prefactor $\frac{\alpha\beta E(S)}{\mu}$. Combining (6.74)-(6.77), we obtain an upper bound with leading factor $\alpha\beta$ for the right-hand side of (6.73), cf. (6.78).

The total upper bound for the theorem is the sum of (6.72) and (6.78), and this sum is obviously of the desired order. $\Box$

Figure 6.1: Estimated $-\log\bigl(d_{TV}(\mathrm{MixPo}(\Lambda_T),\mathrm{MixPo}(M))\bigr)/(\lambda T)$ plotted against $T$, for $S = \alpha = \beta = \lambda = 1$ and $\mu = 0$, based on $10^6$ (crosses), 500,000 (circles), 5,000 (squares) and 500 (triangles) simulated realizations of a linear birth and death process, respectively.
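For readers who want to reproduce such estimates, the total variation distance between two mixed Poisson distributions can be approximated from samples of the mixing variables, since the MixPo($\Lambda$) probability mass function is $k \mapsto E(e^{-\Lambda}\Lambda^k/k!)$. A minimal sketch (ours; the two samplers are placeholders standing in for routines that draw $\Lambda_T$ from a simulated network realization and $M$ from (6.15)):

    import math, random

    def sample_Lambda_T():
        # Placeholder: the finite-time mixing variable Lambda_T from a simulation.
        return random.expovariate(1.0)

    def sample_M():
        # Placeholder: the asymptotic mixing variable M from (6.15).
        return random.expovariate(1.0)

    def mixpo_pmf(mix_samples, k_max):
        """Monte Carlo estimate of the MixPo pmf on {0, ..., k_max}."""
        pmf = [0.0] * (k_max + 1)
        for lam in mix_samples:
            term = math.exp(-lam)          # Poisson(lam) pmf at k = 0
            for k in range(k_max + 1):
                pmf[k] += term
                term *= lam / (k + 1)      # advance to the pmf at k + 1
        return [p / len(mix_samples) for p in pmf]

    n, k_max = 20000, 60
    p = mixpo_pmf([sample_Lambda_T() for _ in range(n)], k_max)
    q = mixpo_pmf([sample_M() for _ in range(n)], k_max)
    d_tv = 0.5 * sum(abs(a - b) for a, b in zip(p, q))   # truncated at k_max
    print(d_tv)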