Figure 4.4. A nondecreasing permutation of sites. The original labels of the sites, 1 ≤ i ≤ n, are at the top; below each site with label i, we have noted the corresponding k for which i_k = i.
individuals in the case that i∗ ∉ A. It is indeed a common pitfall to assume that Theorem 4.3 holds for arbitrary A. This is also implicit in [BB03]; see the corresponding erratum. ♦
4.4 Recursive solution of the selection-recombination equation
The first main result in this chapter will be a recursive solution of the SRE. The recursion will start at i∗ and work along the site indices in agreement with the partial order introduced in Definition 4.1. If the original indices are used, the recursion must be formulated individually for every choice of i∗; in particular, it looks quite different depending on whether i∗ is at one of the ends or in the interior of the sequence. To establish the recursion in a unified framework, we introduce a relabelling; let us fix a nondecreasing (in the sense of the partial order from Definition 4.1) permutation (i_k)_{0 ≤ k ≤ n−1} of S (compare Fig. 4.4) and denote the corresponding heads and tails by upper indices, that is, C^{(k)} := C_{i_k} and D^{(k)} := D_{i_k} (compare Figure 4.1).
Note that i_0 = i∗, D^{(0)} = S and C^{(0)} = ∅, and also that this choice of permutation implies that for all ℓ > k, one has either D^{(ℓ)} ⊆ D^{(k)} (if i_k and i_ℓ are comparable) or D^{(ℓ)} ⊆ C^{(k)} (if i_k and i_ℓ are incomparable). Furthermore, we define ̺^{(k)} := ̺_{i_k} and R^{(k)} := R_{i_k} for k > 0.
We now proceed as follows. First, we recapitulate the solution of the pure selection equation, that is, we solve (4.9) in the special case that all recombination rates vanish. Then, in accordance with the labelling given by (i_k)_{1 ≤ k ≤ n−1}, we will successively add sites at which we allow recombination. This can be formalised as follows.
Definition 4.4. For ̺^{(1)}, …, ̺^{(n−1)} as above and every k ∈ [0 : n−1], we set

Ψ_rec^{(k)} := Σ_{ℓ=1}^{k} ̺^{(ℓ)} (R^{(ℓ)} − id),   Ψ^{(k)} := Ψ_sel + Ψ_rec^{(k)}

(with the usual convention that the empty sum is 0). We then define the SRE truncated at k as the differential equation

ω̇_t^{(k)} = Ψ^{(k)}(ω_t^{(k)}).
Furthermore, we understand (ω^{(k)})_{0 ≤ k ≤ n−1} as the family of the corresponding solutions, all with the same initial condition ω_0. In particular, ω^{(0)} is the solution of the pure selection equation

ω̇_t^{(0)} = Ψ_sel(ω_t^{(0)}) = s f(ω_t^{(0)}) (b(ω_t^{(0)}) − ω_t^{(0)}).   (4.21)

We also define ψ^{(k)} = (ψ_t^{(k)})_{t ≥ 0} as the flow semigroup associated to the differential equation defined via Ψ^{(k)}. In line with (4.9), we have ω = ω^{(n−1)} (which is to say ω_t = ω_t^{(n−1)} for all t ≥ 0) and Ψ = Ψ^{(n−1)}, and we likewise set ψ = ψ^{(n−1)}. We will also write ϕ instead of ψ^{(0)} for the (pure) selection semigroup. ♦
Proposition 4.5. The solution of the pure selection equation (4.21) with initial condition ω_0 ∈ P(X) is given by

ω_t^{(0)} = ϕ_t(ω_0) = [e^{st} F(ω_0) + (1 − F)(ω_0)] / [e^{st} f(ω_0) + 1 − f(ω_0)],   t ≥ 0,   (4.22)

with f and F as given in (4.2) and (4.3). In particular,

f(ω_t^{(0)}) = e^{st} f(ω_0) / [e^{st} f(ω_0) + 1 − f(ω_0)]   (4.23)

is increasing over time, and ω_t^{(0)} = ϕ_t(ω_0) is a convex combination of the initial type distributions of the fit (that is, beneficial) and unfit (that is, deleterious) subpopulations introduced in Eqs. (4.5) and (4.6), namely,

ω_t^{(0)} = f(ω_t^{(0)}) b(ω_0) + (1 − f(ω_t^{(0)})) d(ω_0).
This in particular implies

b(ϕ_t(ω_0)) = b(ω_0)  and  d(ϕ_t(ω_0)) = d(ω_0).   (4.24)

Proof. By straightforward verification. To see Eq. (4.24), recall that the fitness operator F is a projection and b(ω) is in the image of F, while d(ω) is in the image of 1 − F for any ω ∈ P(X).
Remark 4.8. Eq. (4.23) generalises the well-known solution of the selection equation for a single site, which is simply a logistic equation; compare [Dur08, p. 198]. Eq. (4.24) reflects the plausible fact that, while the proportion of fit individuals increases at the cost of the unfit ones (as quantified in Eq. (4.22)), the type composition within the set of fit types remains
unchanged, and likewise for the set of unfit types. ♦
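The logistic character of Eq. (4.23) is easy to check numerically. The following sketch is our own illustration (not part of the source; the function names are ours): it verifies, for arbitrary test values of s and f(ω_0), that the fit proportion from Eq. (4.23) satisfies the logistic equation ḟ = s f (1 − f) and is indeed increasing.

```python
import math

def fit_proportion(t, s, f0):
    # Eq. (4.23): f(omega_t^(0)) = e^{st} f0 / (e^{st} f0 + 1 - f0)
    return math.exp(s * t) * f0 / (math.exp(s * t) * f0 + 1.0 - f0)

def logistic_residual(t, s=1.3, f0=0.2, h=1e-5):
    # Central-difference time derivative of the fit proportion ...
    dfdt = (fit_proportion(t + h, s, f0) - fit_proportion(t - h, s, f0)) / (2 * h)
    # ... compared with the logistic right-hand side s f (1 - f).
    f = fit_proportion(t, s, f0)
    return abs(dfdt - s * f * (1.0 - f))

# The residual vanishes up to discretisation error for any t >= 0,
# and the proportion of fit individuals increases, as stated.
assert all(logistic_residual(t) < 1e-8 for t in (0.0, 0.5, 1.0, 2.0))
assert fit_proportion(1.0, 1.3, 0.2) > fit_proportion(0.5, 1.3, 0.2) > 0.2
print("Eq. (4.23) solves the logistic equation")
```

For a single diallelic site this reduces to the textbook logistic growth of the beneficial allele frequency.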
The main result in this section is the following recursion formula for the family of solutions of the (truncated) SREs.
Theorem 4.6. The family of solutions (ω^{(k)})_{1 ≤ k ≤ n−1} of Definition 4.4 satisfies the recursion

ω_t^{(k)} = e^{−̺^{(k)} t} ω_t^{(k−1)} + π_{C^{(k)}}.ω_t^{(k−1)} ⊗ π_{D^{(k)}}.∫_0^t ̺^{(k)} e^{−̺^{(k)} τ} ω_τ^{(k−1)} dτ

for 1 ≤ k ≤ n−1 and t ≥ 0, where ω^{(0)} is the solution of the pure selection equation given in Proposition 4.5.
We will first give an analytic proof. Then, in the next section, we will give a genealogical proof of the recursion by means of the ancestral selection-recombination graph (ASRG), which will provide additional insight.
To deal with the nonlinearity of recombination and to exploit the underlying linear structure (see [BB16]) more efficiently, we now introduce a variant of the product of two measures that are defined on X_A and X_B, where A and B need not be disjoint. Namely, given a subset U of S, sets I, J ⊆ U, and signed measures ν_I, ν_J on X_I and X_J, respectively, we define
ν_I ⊠ ν_J := (π_{I\J}.ν_I) ⊗ ν_J,
which is a signed measure on X_{I∪J} (recall that π_∅.ν = ν(X_I) for all signed measures ν on X_I, I ⊆ S, in line with Remark 2.2). Note that we use ν_I here to mean any signed measure on X_I, whereas we abbreviate by ν^I the specific signed measure on X_I that is obtained from ν on X via ν^I = π_I.ν.
Proposition 4.7. Let U ⊆ S. For I, J, K ⊆ U and signed measures ν_I, ν_J, ν_K on X_I, X_J, and X_K, respectively, the operation ⊠ has the following properties.

(i) (ν_I ⊠ ν_J) ⊠ ν_K = ν_I ⊠ (ν_J ⊠ ν_K) (associativity).

(ii) If I ∩ J = ∅, we have ν_I ⊠ ν_J = ν_I ⊗ ν_J = ν_J ⊠ ν_I (reduction to the tensor product and commutativity).

(iii) If I ⊆ J, then ν_I ⊠ ν_J = ν_I(X_I) ν_J (cancellation property).
Proof. For associativity, note that

(ν_I ⊠ ν_J) ⊠ ν_K = ((π_{I\J}.ν_I) ⊗ ν_J) ⊠ ν_K = π_{(I∪J)\K}.((π_{I\J}.ν_I) ⊗ ν_J) ⊗ ν_K
= π_{I\(J∪K)}.ν_I ⊗ π_{J\K}.ν_J ⊗ ν_K = π_{I\(J∪K)}.ν_I ⊗ (ν_J ⊠ ν_K) = ν_I ⊠ (ν_J ⊠ ν_K),

where we have used in the third step that ((I∪J)\K) ∩ (I\J) = I\(J∪K).
When I ∩ J = ∅, one has

ν_I ⊠ ν_J = π_{I\J}.ν_I ⊗ ν_J = π_I.ν_I ⊗ ν_J = ν_I ⊗ ν_J = ν_J ⊗ ν_I,

which implies the claimed reduction to ⊗ and thus commutativity. Finally, for I ⊆ J,

ν_I ⊠ ν_J = (π_{I\J}.ν_I) ⊗ ν_J = (π_∅.ν_I) ⊗ ν_J = ν_I(X_I) ν_J

establishes the cancellation property.
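Since the proof of Proposition 4.7 is purely combinatorial, its three properties lend themselves to a machine check. The following Python sketch is our own illustration (not part of the source; all helper names are ours): taking the sites diallelic, X_i = {0, 1}, it encodes a signed measure on X_I as a table of weights, implements the marginalisation π and the product ⊠, and verifies (i)-(iii) on random signed measures.

```python
import itertools
import random

# A signed measure on X_I (binary sites) is a pair (sites, weights):
# sites   -- sorted tuple of site labels I
# weights -- dict mapping each assignment in {0,1}^I to a real weight

def random_measure(sites, rng):
    sites = tuple(sorted(sites))
    return sites, {v: rng.uniform(-1, 1)
                   for v in itertools.product((0, 1), repeat=len(sites))}

def marginal(m, J):
    """pi_J.m: sum out every site not in J."""
    sites, w = m
    keep = tuple(s for s in sites if s in J)
    out = {}
    for vals, wt in w.items():
        key = tuple(v for s, v in zip(sites, vals) if s in J)
        out[key] = out.get(key, 0.0) + wt
    return keep, out

def tensor(a, b):
    """Product measure; the two site sets must be disjoint."""
    sa, wa = a
    sb, wb = b
    sites = tuple(sorted(sa + sb))
    out = {}
    for va, x in wa.items():
        for vb, y in wb.items():
            d = dict(zip(sa, va)); d.update(zip(sb, vb))
            out[tuple(d[s] for s in sites)] = x * y
    return sites, out

def boxtimes(a, b):
    """a boxtimes b := (pi_{I\\J}.a) tensor b, a signed measure on X_{I u J}."""
    sa, _ = a
    sb, _ = b
    return tensor(marginal(a, set(sa) - set(sb)), b)

def close(a, b, tol=1e-12):
    return a[0] == b[0] and all(abs(a[1][k] - b[1][k]) < tol for k in a[1])

rng = random.Random(1)
nu_I = random_measure({0, 1}, rng)
nu_J = random_measure({1, 2}, rng)
nu_K = random_measure({2, 3}, rng)

# (i) associativity
assert close(boxtimes(boxtimes(nu_I, nu_J), nu_K),
             boxtimes(nu_I, boxtimes(nu_J, nu_K)))
# (ii) reduction to the tensor product and commutativity for disjoint sites
nu_D = random_measure({4, 5}, rng)
assert close(boxtimes(nu_I, nu_D), tensor(nu_I, nu_D))
assert close(boxtimes(nu_I, nu_D), boxtimes(nu_D, nu_I))
# (iii) cancellation: I subset of J gives nu_I boxtimes nu_J = nu_I(X_I) nu_J
nu_sub = random_measure({1}, rng)
mass = sum(nu_sub[1].values())
lhs = boxtimes(nu_sub, nu_J)
assert all(abs(lhs[1][k] - mass * nu_J[1][k]) < 1e-12 for k in lhs[1])
print("Proposition 4.7 (i)-(iii) verified on random signed measures")
```

The same representation extends directly to the formal sums ⊞ introduced next, by keeping a list of such (sites, weights) pairs.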
Under the conditions of Proposition 4.7, we now denote by ν_J ⊞ ν_K the formal sum of ν_J and ν_K (and use ⊟ for the corresponding formal difference). Note that the formal sum turns into a proper sum (and hence ⊞ reduces to +) when J = K. Furthermore, we define

ν_I ⊠ (ν_J ⊞ ν_K) := (ν_I ⊠ ν_J) ⊞ (ν_I ⊠ ν_K).   (4.25)

Clearly, the right-hand side reduces to a proper sum when I ∪ J = I ∪ K.
Generalising the formal sum above, we define A(X_U) to be the real vector space of formal sums

ν := λ_1 ν_{U_1} ⊞ … ⊞ λ_q ν_{U_q},

where q ∈ ℕ, λ_1, …, λ_q ∈ ℝ, U_1, …, U_q ⊆ U ⊆ S, and ν_{U_1}, …, ν_{U_q} are signed measures on X_{U_1}, …, X_{U_q}, respectively. We also write ν(X_U) := Σ_{i=1}^{q} λ_i ν_{U_i}(X_{U_i}).
Remark 4.9. If one extends the definition of ⊠ canonically to all of A(X_U) (recalling that the projections are linear), (A(X_U), ⊠) becomes an associative, unital algebra with neutral element 1, the measure with weight 1 on X_∅. Note that, when multiplying ν ∈ A(X_I) and µ ∈ A(X_J) for disjoint I and J, the multiplication introduced above agrees with the measure product. ♦
Now, we can rewrite Ψ_rec^{(k)} of Definition 4.4 as

Ψ_rec^{(k)}(ω_t^{(k)}) = ω_t^{(k)} ⊠ ⊞_{ℓ=1}^{k} ̺^{(ℓ)} (π_{D^{(ℓ)}}.ω_t^{(k)} ⊟ 1);   (4.26)

note that the right-hand side indeed reduces to a proper (rather than a formal) sum of measures via (4.25), because ω_t^{(k)} lives on X_S and D^{(ℓ)} ⊆ S for 1 ≤ ℓ ≤ k, so that each term is a measure on X_S.
We shall see later that, when combined with selection, this representation has an advantage over the use of recombinators because it nicely brings out the recursive structure; this will streamline calculations and connect to the graphical construction in a natural way. The fact that the head alone determines the fitness of an individual manifests itself in the right-multiplicativity of Ψ_sel and its associated flow ϕ (compare Definition 4.4) as follows.
Lemma 4.8. For all µ ∈ P(X) and all ν ∈ A(X_{S∗}),

F(µ ⊠ ν) = F(µ) ⊠ ν.

If, in addition, ν(X_{S∗}) = 1, one has

Ψ_sel(µ ⊠ ν) = Ψ_sel(µ) ⊠ ν  and therefore  ϕ_t(µ ⊠ ν) = ϕ_t(µ) ⊠ ν

for every t ≥ 0.
Proof. To keep the notation simple, we assume U_1, U_2 ⊆ S∗ and ν = ν_{U_1} ⊞ ν_{U_2} with signed measures ν_{U_1} and ν_{U_2} on X_{U_1} and X_{U_2}, respectively. By the tensor product representation of F from (4.4), we have

F(µ ⊠ ν_{U_1} + µ ⊠ ν_{U_2}) = F(µ ⊠ ν_{U_1}) + F(µ ⊠ ν_{U_2}) = F((π_{S\U_1}.µ) ⊗ ν_{U_1}) + F((π_{S\U_2}.µ) ⊗ ν_{U_2})
= (P_{i∗} ⊗ id_{(S\U_1)\i∗})(π_{S\U_1}.µ) ⊗ id_{U_1}(ν_{U_1}) + (P_{i∗} ⊗ id_{(S\U_2)\i∗})(π_{S\U_2}.µ) ⊗ id_{U_2}(ν_{U_2})
= π_{S\U_1}.((P_{i∗} ⊗ id_{S∗})(µ)) ⊗ id_{U_1}(ν_{U_1}) + π_{S\U_2}.((P_{i∗} ⊗ id_{S∗})(µ)) ⊗ id_{U_2}(ν_{U_2})
= F(µ) ⊠ ν_{U_1} + F(µ) ⊠ ν_{U_2},

which gives the first claim. Taking the first claim together with the fact that f(µ ⊠ ν) = f(µ) if ν(X_{S∗}) = 1, we get the second and the third claim.
Now, the proof of Theorem 4.6 is straightforward.
Proof of Theorem 4.6. Let Ψ^{(k)} be as in Definition 4.4. With the shorthand

ν_t^{(k−1)} := π_{D^{(k)}}.∫_0^t ̺^{(k)} e^{−̺^{(k)} τ} ω_τ^{(k−1)} dτ,

one has ν_t^{(k−1)}(X_{D^{(k)}}) = 1 − e^{−̺^{(k)} t}, and the right-hand side of the recursion formula from Theorem 4.6 can be expressed as

µ_t^{(k)} := ω_t^{(k−1)} ⊠ (e^{−̺^{(k)} t} 1 ⊞ ν_t^{(k−1)}).   (4.27)

First, we show that

µ_t^{(k)} ⊠ π_{D^{(ℓ)}}.µ_t^{(k)} = (ω_t^{(k−1)} ⊠ π_{D^{(ℓ)}}.ω_t^{(k−1)}) ⊠ (e^{−̺^{(k)} t} 1 ⊞ ν_t^{(k−1)})   (4.28)

for all 1 ≤ ℓ ≤ k. To see this, write the left-hand side as ω_t^{(k−1)} ⊠ A ⊠ B, where

A := e^{−̺^{(k)} t} 1 ⊞ ν_t^{(k−1)}  and  B := π_{D^{(ℓ)}}.(ω_t^{(k−1)} ⊠ (e^{−̺^{(k)} t} 1 ⊞ ν_t^{(k−1)})) = π_{D^{(ℓ)}}.µ_t^{(k)}.

Recall that, by our monotonicity assumption on the permutation of sites, we have either D^{(k)} ⊆ D^{(ℓ)} or D^{(k)} ∩ D^{(ℓ)} = ∅. In the first case, (4.28) follows by cancelling A using Proposition 4.7 (note that A(X_{D^{(k)}}) = 1). In the second case, B is just π_{D^{(ℓ)}}.ω_t^{(k−1)}, and so A ⊠ B = B ⊠ A, again by Proposition 4.7. Now we compute, using (4.26) and (4.27) in the first step, (4.28) and Lemma 4.8 in the second, Definition 4.4 in the third, and Proposition 4.7 in the last:
Ψ^{(k)}(µ_t^{(k)}) = Ψ_sel(ω_t^{(k−1)} ⊠ (e^{−̺^{(k)} t} 1 ⊞ ν_t^{(k−1)})) + Σ_{ℓ=1}^{k} ̺^{(ℓ)} µ_t^{(k)} ⊠ (π_{D^{(ℓ)}}.µ_t^{(k)} ⊟ 1)

= (Ψ_sel(ω_t^{(k−1)}) + Σ_{ℓ=1}^{k} ̺^{(ℓ)} ω_t^{(k−1)} ⊠ (π_{D^{(ℓ)}}.ω_t^{(k−1)} ⊟ 1)) ⊠ (e^{−̺^{(k)} t} 1 ⊞ ν_t^{(k−1)})

= (Ψ^{(k−1)}(ω_t^{(k−1)}) + ̺^{(k)} ω_t^{(k−1)} ⊠ (π_{D^{(k)}}.ω_t^{(k−1)} ⊟ 1)) ⊠ (e^{−̺^{(k)} t} 1 ⊞ ν_t^{(k−1)})

= ω̇_t^{(k−1)} ⊠ (e^{−̺^{(k)} t} 1 ⊞ ν_t^{(k−1)}) + ω_t^{(k−1)} ⊠ (̺^{(k)} e^{−̺^{(k)} t} π_{D^{(k)}}.ω_t^{(k−1)} ⊟ ̺^{(k)} e^{−̺^{(k)} t} 1).

Identifying ̺^{(k)} e^{−̺^{(k)} t} π_{D^{(k)}}.ω_t^{(k−1)} with ν̇_t^{(k−1)}, we see that the last line is just the time derivative of µ_t^{(k)} of (4.27).
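Theorem 4.6 can also be sanity-checked numerically in the smallest nontrivial case n = 2. The sketch below is our own test (not part of the source; all variable names and parameter values are ours): it takes S = {0, 1}, selection at i∗ = 0 with the types carrying 0 at site 0 taken as fit, and recombination at site 1, so that C^{(1)} = {0} and D^{(1)} = {1}; it integrates the full SRE directly and compares the result with the right-hand side of the recursion, built from the explicit ω^{(0)} of Proposition 4.5.

```python
import math

s, rho = 1.0, 0.7           # selection and recombination rates (arbitrary test values)
w0 = [0.1, 0.2, 0.3, 0.4]   # initial distribution on types (x0, x1), index 2*x0 + x1
f0 = w0[0] + w0[1]          # initial fit proportion (fit := x0 = 0)

def omega0(t):
    """Closed-form pure-selection solution from Eq. (4.22)."""
    den = math.exp(s * t) * f0 + 1.0 - f0
    return [math.exp(s * t) * w0[0] / den, math.exp(s * t) * w0[1] / den,
            w0[2] / den, w0[3] / den]

def rhs(w):
    """Right-hand side of the full SRE for n = 2: selection plus recombination."""
    f = w[0] + w[1]                               # fit proportion
    Fw = [w[0], w[1], 0.0, 0.0]                   # fitness projection F
    p0 = [w[0] + w[1], w[2] + w[3]]               # marginal at site 0 (head)
    p1 = [w[0] + w[2], w[1] + w[3]]               # marginal at site 1 (tail)
    Rw = [p0[0]*p1[0], p0[0]*p1[1], p0[1]*p1[0], p0[1]*p1[1]]
    return [s * (Fw[i] - f * w[i]) + rho * (Rw[i] - w[i]) for i in range(4)]

# Direct solution of the SRE by a fourth-order Runge-Kutta scheme.
T, N = 1.0, 1000
dt = T / N
w = list(w0)
for _ in range(N):
    k1 = rhs(w)
    k2 = rhs([w[i] + 0.5 * dt * k1[i] for i in range(4)])
    k3 = rhs([w[i] + 0.5 * dt * k2[i] for i in range(4)])
    k4 = rhs([w[i] + dt * k3[i] for i in range(4)])
    w = [w[i] + dt * (k1[i] + 2*k2[i] + 2*k3[i] + k4[i]) / 6 for i in range(4)]

# Right-hand side of the recursion; the integral via the trapezoidal rule.
integ = [0.0] * 4
for j in range(N):
    for tau in (j * dt, (j + 1) * dt):
        g = 0.5 * dt * rho * math.exp(-rho * tau)
        om = omega0(tau)
        for i in range(4):
            integ[i] += g * om[i]
om_T = omega0(T)
pC = om_T[0] + om_T[1]                            # pi_C marginal of omega^(0)_T at x0 = 0
pD = [integ[0] + integ[2], integ[1] + integ[3]]   # pi_D marginal of the integral term
rec = [math.exp(-rho * T) * om_T[0] + pC * pD[0],
       math.exp(-rho * T) * om_T[1] + pC * pD[1],
       math.exp(-rho * T) * om_T[2] + (1 - pC) * pD[0],
       math.exp(-rho * T) * om_T[3] + (1 - pC) * pD[1]]

err = max(abs(w[i] - rec[i]) for i in range(4))
assert err < 1e-5, err
print("recursion matches direct integration, max deviation", err)
```

Note that for n = 2 the recursion terminates after one step, since ω = ω^{(1)}; the remaining deviation is pure discretisation error.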
Remark 4.10. We could have proved Theorem 4.6 also without the help of formal sums and the new operations ⊞, ⊟, ⊠. However, we decided on the current presentation in order to familiarise the reader with this (admittedly somewhat abstract) formalism, as it is the key to stating the duality result in Section 4.7 in closed form. It will also allow us later to state the solution itself in closed form; see Corollary 4.26. ♦

Remark 4.11. Note that the only property of the selection operator that entered the proof of Theorem 4.6 is the second property in Lemma 4.8, namely, Ψ_sel(ω ⊠ ν) = Ψ_sel(ω) ⊠ ν for all ν ∈ A(X_{S∗}) with ν(X_{S∗}) = 1. Therefore, the result remains true if Ψ_sel is replaced by a more general operator with this property. In particular, Theorem 4.6 remains true when frequency-dependent selection and/or mutation at the selected site is included. ♦

Remark 4.12. Applying Theorem 4.3 to A = {i∗} shows that the marginal type frequency at the selected site is unaffected by recombination. More generally, consider the set

L^{(k)} := {i_0 = i∗, i_1, …, i_k}

and note that L^{(k)} \ {i∗} is exactly the set of recombination sites that are considered up to and including the k-th iteration. Obviously, marginalisation consistency holds for L^{(k)} for all 0 ≤ k ≤ n−1. Since ̺_i^{L^{(k)}} = ̺_i for i ∈ L^{(k)} \ {i∗}, Remark 4.6 and Eq. (4.18) together with Definition 4.4 give

π_{L^{(k)}}.ω̇_t = π_{L^{(k)}}.Σ_{i ∈ L^{(k)}\{i∗}} ̺_i (R_i ω_t − ω_t) = π_{L^{(k)}}.Ψ_rec^{(k)}(ω_t) = π_{L^{(k)}}.ω̇_t^{(k)},

and so π_{L^{(k)}}.ω_t^{(k)} = π_{L^{(k)}}.ω_t. This implies that if one is only interested in the marginal with respect to L^{(k)}, then one may stop the iteration after the k-th step. ♦

An important application of Theorem 4.6 is the following recursion for the first-order correlation functions ω_t^{(k)} − R^{(k)}ω_t^{(k)} between the type frequencies at the sites contained in C^{(k)} and