


Figure 5.3: Example of a sampled trajectory of the process (X, Θ). The red curve shows the abundance of gene product X over time. The blue curve shows the state of the filtering distribution mean Θ. Note that, as can be seen from (5.22), jumps in Θ occurring when Θ = 1 always lead to the same value Θ = c_2/(c_2 + c_3), indicated by the dotted line. Whether Θ increases or decreases after the jump depends on the value of X. Parameters were c_1 = 1, c_2 = 0.25, c_3 = 0.5 and c_4 = 2. The initial state was X(0) = 0 and Θ(0) = 1.

5.4 Finite-dimensional approximations of the filtering equation

For a function φ: X̂ → ℝ, we obtain

$$
\frac{\mathrm{d}}{\mathrm{d}t}\langle\varphi\rangle_t
= \sum_{j \notin J_X} \Big[ \big\langle h_j(x(t), \hat{x})\,\varphi(\hat{x} + \hat{\nu}_j) \big\rangle_t
- \big\langle h_j(x(t), \hat{x})\,\varphi(\hat{x}) \big\rangle_t \Big]
- \sum_{j \in J_X} \Big[ \big\langle h_j(x(t), \hat{x})\,\varphi(\hat{x}) \big\rangle_t
- \big\langle h_j(x(t), \hat{x}) \big\rangle_t \big\langle \varphi(\hat{x}) \big\rangle_t \Big]
\tag{5.25}
$$

in between jumps. Focusing on the marginal process Y, at a jump of Y via reaction j ∈ J_X we have

$$
\langle\varphi\rangle_{t+}
= \frac{\big\langle h_j(x(t-), \hat{x})\,\varphi(\hat{x} + \hat{\nu}_j) \big\rangle_{t-}}{\big\langle h_j(x(t-), \hat{x}) \big\rangle_{t-}}. \tag{5.26}
$$

The moment equations are, as is generally the case, not closed: Choosing, for instance, φ(x̂) = x̂_n for a reaction network to obtain first-order moments, the resulting equations will depend on moments of order higher than one. Just as for the CME, we require a way to close the system of equations. Note that for the filtering equations, the moment equations will not be closed even for a fully linear system.
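The non-closure can be made concrete with a small numerical check. The following sketch (the distribution values are chosen purely for illustration) shows that a second-order quantity such as ⟨x_1(1 − x_2)⟩, of the kind appearing in the moment equations, is not determined by the first-order moments alone:

```python
# A correlated distribution over two binary sites (values chosen for illustration)
p = {(0, 0): 0.4, (0, 1): 0.1, (1, 0): 0.1, (1, 1): 0.4}

mean = lambda f: sum(f(x) * w for x, w in p.items())

m1 = mean(lambda x: x[0])                   # <x1> = 0.5
m2 = mean(lambda x: x[1])                   # <x2> = 0.5
cross = mean(lambda x: x[0] * (1 - x[1]))   # <x1 (1 - x2)> = 0.1

# The second-order moment differs from the product of first-order moments,
# so an equation involving <x1 (1 - x2)> cannot be evaluated from <x1>, <x2>
# alone without a closure assumption:
print(cross, m1 * (1 - m2))  # 0.1 vs 0.25
```

Any closure scheme amounts to a rule replacing such second-order moments by functions of the first-order ones.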

Here, we focus on entropic matching, as presented in Section 3.1.2, to close the equations. The reason for considering entropic matching specifically will become clear in the following.

5.4.2 Entropic matching

The derivation of the entropic matching equations for the filtering equations is quite similar to the derivation for the CME, so we just summarize it briefly. Choose a parametric family of probability distributions p_θ(x̂) depending on parameters θ ranging in some open subset of ℝ^K. Assume that, at time t, we have an approximation p_{θ(t)}(x̂) of the filtering distribution π_t(x̂) available. As for the filtering distribution itself, we identify the approximation p_θ(x̂) on X̂ with p_θ(x, x̂) = δ_{x(t), x} p_θ(x̂) on X × X̂.

We first consider the continuous part of the filtering equation. A short time-step ∆t later, p_{θ(t)} will have evolved to

$$
p(x, \hat{x}) = p_{\theta(t)}(x, \hat{x}) + \Delta t\,[P_{x(t)} L p_{\theta(t)}](x, \hat{x})
- \Delta t\, p_{\theta(t)}(x, \hat{x})\,[S P_{x(t)} L p_{\theta(t)}].
$$

We will obtain an approximation to p(x, x̂) that lies in the parametric family p_θ by choosing parameters θ(t + ∆t) to minimize the relative entropy D[p ‖ p_{θ(t+∆t)}]. We then take the limit ∆t → 0 to obtain an ordinary differential equation (ODE) for the parameters θ. Write, for brevity, θ = θ(t) and θ̃ = θ(t + ∆t). Then we have, to first order in ∆t,

$$
\begin{aligned}
D[p \,\|\, p_{\tilde\theta}]
&= \left\langle \ln \frac{p_\theta + \Delta t\,\{P_{x(t)} L p_\theta - p_\theta\,[S P_{x(t)} L p_\theta]\}}{p_{\tilde\theta}} \right\rangle_{p} \\
&= \left\langle \ln \frac{p_\theta}{p_{\tilde\theta}} \right\rangle_{\theta}
+ \Delta t \left\langle \left( \frac{P_{x(t)} L p_\theta}{p_\theta} - [S P_{x(t)} L p_\theta] \right) \ln \frac{p_\theta}{p_{\tilde\theta}} \right\rangle_{\theta}
+ \text{const},
\end{aligned}
$$

where “const” denotes terms independent of θ̃, and ⟨ · ⟩_θ denotes an expectation taken with respect to the distribution p_θ. The first term is simply equal to D[p_θ ‖ p_{θ̃}], which to second order in θ̃ − θ is given by

$$
D[p_\theta \,\|\, p_{\tilde\theta}] = \tfrac{1}{2}(\tilde\theta - \theta)^\top G(\theta)(\tilde\theta - \theta),
$$

where G(θ) is the Fisher information matrix of the parametric family p_θ at parameter value θ, the components of which are given by

$$
G_{kl}(\theta) = \left\langle \frac{\partial \ln p_\theta}{\partial \theta_k}\,\frac{\partial \ln p_\theta}{\partial \theta_l} \right\rangle_{\theta},
\quad k, l = 1, \ldots, K. \tag{5.27}
$$

To minimize D[p ‖ p_{θ̃}], we take the derivative with respect to θ̃ and obtain

$$
\begin{aligned}
0 &= G(\theta)(\tilde\theta - \theta)
- \Delta t \left\langle \left( \frac{P_{x(t)} L p_\theta}{p_\theta} - [S P_{x(t)} L p_\theta] \right) \nabla_{\tilde\theta} \ln p_{\tilde\theta} \right\rangle_{\theta} \\
&= G(\theta)(\tilde\theta - \theta)
- \Delta t \left\{ \left\langle \frac{P_{x(t)} L p_\theta}{p_\theta}\,\nabla_{\tilde\theta} \ln p_{\tilde\theta} \right\rangle_{\theta}
- [S P_{x(t)} L p_\theta] \left\langle \nabla_{\tilde\theta} \ln p_{\tilde\theta} \right\rangle_{\theta} \right\}.
\end{aligned}
$$

Dividing by ∆t, taking the limit ∆t → 0 and using that ⟨∇_θ ln p_θ⟩_θ = 0, we get

$$
\frac{\mathrm{d}}{\mathrm{d}t}\theta
= G(\theta)^{-1} \left\langle \frac{P_{x(t)} L p_\theta}{p_\theta}\,\nabla_\theta \ln p_\theta \right\rangle_{\theta}. \tag{5.28}
$$

This is a closed equation for the parameters θ. Using the resulting approximate solution p_θ of the filtering equation, all necessary expectations, in particular the marginal transition rates, can be computed.
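When the latent state space is small, the quantities entering (5.28) can be evaluated by direct enumeration. As a concrete check, the following sketch (function names are ours) computes the Fisher information matrix (5.27) for a product-Bernoulli family, the family used for the TASEP example below; for this family G(θ) is diagonal with entries 1/(θ_k(1 − θ_k)):

```python
from itertools import product

def bernoulli_p(theta, x):
    # product-Bernoulli probability of the binary configuration x
    w = 1.0
    for t, xi in zip(theta, x):
        w *= t if xi else (1.0 - t)
    return w

def score(theta, x):
    # components of grad_theta ln p_theta(x): x_k/theta_k - (1-x_k)/(1-theta_k)
    return [1.0 / t if xi else -1.0 / (1.0 - t) for t, xi in zip(theta, x)]

def fisher(theta):
    # G_kl = < (d ln p / d theta_k)(d ln p / d theta_l) >_theta, cf. (5.27),
    # evaluated by enumerating all 2^K binary configurations
    K = len(theta)
    G = [[0.0] * K for _ in range(K)]
    for x in product((0, 1), repeat=K):
        w = bernoulli_p(theta, x)
        s = score(theta, x)
        for k in range(K):
            for l in range(K):
                G[k][l] += w * s[k] * s[l]
    return G

theta = [0.2, 0.5, 0.7]
G = fisher(theta)
# diagonal entries match 1/(theta_k (1 - theta_k)); off-diagonals vanish
```

The diagonal structure is what makes the inversion of G(θ) in (5.28) trivial for this family.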

When the process X jumps, the filtering distribution jumps according to (5.13), so that the approximation pθ(t−) immediately before the jump is updated to

$$
p = \frac{P_{x(t+)} L p_{\theta(t-)}}{[S P_{x(t+)} L p_{\theta(t-)}]}.
$$

Here too we can obtain an updated approximation within the parametric family by minimizing the relative entropy, i.e. choosing θ(t+) to minimize D[p ‖ p_{θ(t+)}]. In general, this will be impractical. However, usually one will choose p_θ to be an exponential family

$$
p_\theta(\hat{x}) = \frac{1}{Z(\theta)} \exp\left\{ \sum_{k=1}^{K} \theta_k \varphi_k(\hat{x}) \right\} q(\hat{x}). \tag{5.29}
$$

In this case, minimizing the relative entropy amounts to matching moments, i.e. choosing θ(t+) so that ⟨φ_k⟩_{θ(t+)} = ⟨φ_k⟩_p for k = 1, …, K, which is often practical.
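As an illustration of the moment-matching jump update, consider a product-Bernoulli family with φ_k(x) = x_k over three latent binary sites (the setting of the TASEP example below). The numbers and the chosen reaction in this sketch are purely illustrative: the observed jump removes a particle from the third site, so the update conditions on that site being occupied and then empties it.

```python
from itertools import product

theta = [0.3, 0.6, 0.8]          # hypothetical pre-jump Bernoulli parameters
states = list(product((0, 1), repeat=3))

def p_theta(x):
    # product-Bernoulli probability of the binary configuration x
    w = 1.0
    for t, xi in zip(theta, x):
        w *= t if xi else (1.0 - t)
    return w

# Observed jump with rate proportional to h(x) = x_3: it can only fire
# if site 3 is occupied, and afterwards site 3 is empty.
h = lambda x: x[2]

Z = sum(h(x) * p_theta(x) for x in states)
post = {}
for x in states:
    y = (x[0], x[1], 0)  # apply the jump's state change: site 3 is emptied
    post[y] = post.get(y, 0.0) + h(x) * p_theta(x) / Z

# Moment matching for phi_k(x) = x_k: the updated parameters are simply the
# post-jump means. Sites 1 and 2 are untouched; site 3 is now empty.
theta_plus = [sum(y[k] * w for y, w in post.items()) for k in range(3)]
print(theta_plus)  # approximately [0.3, 0.6, 0.0]
```

Because the prior factorizes and h depends on a single site, the remaining means are unchanged, mirroring the TASEP jump update derived below.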

The entropic matching equations (applied in the context of filtering for stochastic differential equations) were first proposed in [51] and derived using a projection argument employing the Fisher-Rao information metric. This geometrical approach to (5.28), which we describe in Section 5.7, is completely analogous to the projection leading to the filtering equation. In this way, entropic matching is seen to be a very natural way to produce a finite-dimensional approximation to the filtering equation, in addition to the justification provided above (which, in turn, has the advantage of allowing a unified treatment of the continuous and discrete parts of the filtering equation).

5.4.3 Example: The totally asymmetric exclusion process

In this section, we will apply the marginal process framework to the TASEP on the line with open boundaries. The TASEP [85] describes particles hopping on N sites X_1, …, X_N, where each site can be occupied by at most one particle. We take X_n = 1 when site X_n is occupied, and X_n = 0 otherwise. If the first site X_1 is empty, a particle can enter at rate α. If site X_{n+1} is empty and site X_n occupied, a particle can move from X_n to X_{n+1} with rate c. Finally, a particle at the last site X_N can leave the system with rate β.
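These dynamics are straightforward to simulate exactly. The following Gillespie-type sketch (function name, return convention and seeding are our choices, not from the text) simulates the full TASEP and records the times at which a particle enters the last site, which is the quantity of interest below:

```python
import random

def simulate_tasep(N, alpha, beta, c, t_end, seed=0):
    """Gillespie simulation of the TASEP with open boundaries.

    Returns the times at which a particle enters the last site."""
    rng = random.Random(seed)
    x = [0] * N          # empty lattice as initial condition (our choice)
    t = 0.0
    entries = []
    while True:
        # enumerate the currently possible transitions and their rates
        events = []
        if x[0] == 0:
            events.append((alpha, ('in',)))        # entry at site 1
        for n in range(N - 1):
            if x[n] == 1 and x[n + 1] == 0:
                events.append((c, ('hop', n)))     # hop n -> n+1
        if x[N - 1] == 1:
            events.append((beta, ('out',)))        # exit from site N
        total = sum(r for r, _ in events)
        t += rng.expovariate(total)
        if t > t_end:
            return entries
        # select one event proportionally to its rate
        u = rng.random() * total
        for r, ev in events:
            u -= r
            if u <= 0:
                break
        if ev[0] == 'in':
            x[0] = 1
        elif ev[0] == 'out':
            x[N - 1] = 0
        else:
            n = ev[1]
            x[n], x[n + 1] = 0, 1
            if n == N - 2:
                entries.append(t)  # a particle entered the last site
```

Simulations of this kind provide the “exact marginal” reference statistics against which the approximate marginal process is compared in Figure 5.4.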

We consider the situation where only the dynamics of the last site X_N is of interest to us, which might serve as a proxy for, say, the flux through the entire system. Thus, the only transitions that will be retained are the two transitions corresponding to a particle entering or leaving site X_N. The filtering moment equations for the mean occupancies read

$$
\begin{aligned}
\frac{\mathrm{d}}{\mathrm{d}t}\langle x_1 \rangle_t &= \alpha \langle 1 - x_1 \rangle_t - c \langle x_1 (1 - x_2) \rangle_t, \\
\frac{\mathrm{d}}{\mathrm{d}t}\langle x_n \rangle_t &= c \langle x_{n-1} (1 - x_n) \rangle_t - c \langle x_n (1 - x_{n+1}) \rangle_t, \quad n = 2, \ldots, N-2, \\
\frac{\mathrm{d}}{\mathrm{d}t}\langle x_{N-1} \rangle_t &= c \langle x_{N-2} (1 - x_{N-1}) \rangle_t - c (1 - x_N(t)) \langle x_{N-1} \rangle_t \langle 1 - x_{N-1} \rangle_t.
\end{aligned}
$$

As expected, and as is well known, these equations contain second-order moments. Here we are interested in obtaining the simplest possible approximate marginal process. Thus, we will obtain closed equations in terms of the first-order moments ⟨x_1⟩_t, …, ⟨x_{N−1}⟩_t only.

A very natural approach to obtain such a closure is to use entropic matching with a product-Bernoulli distribution ansatz:

$$
p_\theta(x) = \prod_{n=1}^{N-1} \theta_n^{x_n} (1 - \theta_n)^{1 - x_n}.
$$

We refer to the resulting approximate marginal process as the Bernoulli-marginal TASEP.

After application of product-Bernoulli entropic matching, the closed filtering moment equations, in between observations, are given by

$$
\begin{aligned}
\frac{\mathrm{d}}{\mathrm{d}t}\theta_1(t) &= \alpha (1 - \theta_1(t)) - c\,\theta_1(t)(1 - \theta_2(t)), \\
\frac{\mathrm{d}}{\mathrm{d}t}\theta_n(t) &= c\,\theta_{n-1}(t)(1 - \theta_n(t)) - c\,\theta_n(t)(1 - \theta_{n+1}(t)), \quad n = 2, \ldots, N-2, \\
\frac{\mathrm{d}}{\mathrm{d}t}\theta_{N-1}(t) &= c\,\theta_{N-2}(t)(1 - \theta_{N-1}(t)) - c (1 - x_N(t))\,\theta_{N-1}(t)(1 - \theta_{N-1}(t)).
\end{aligned}
$$

Unsurprisingly, the resulting equations are identical to a “naive” mean-field approximation. Note that when x_N(t) = 1, i.e. the last site is occupied, the observation term (the last term of the last line) vanishes because a particle cannot enter the last site.

When a particle leaves site X_N, no update to the filtering distribution moments is required. When a particle enters site X_N at time t, the update is simply given by

$$
\theta_n(t+) = \theta_n(t-), \quad n = 1, \ldots, N-2, \qquad \theta_{N-1}(t+) = 0.
$$

This is intuitively clear: Site XN−1 is necessarily empty immediately after a particle enters site XN. The means of the remaining sites are left unchanged because of the product-form closure employed.
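The closed equations and the jump update above can be combined into a complete simulation of the Bernoulli-marginal TASEP. The sketch below uses a simple Euler discretization; the step size, the empty-lattice initial condition and the function name are our choices, not from the text. At each step, the two retained transitions fire with their marginal rates c(1 − x_N(t))θ_{N−1}(t) and β x_N(t):

```python
import random

def bernoulli_marginal_tasep(N, alpha, beta, c, t_end, dt=1e-3, seed=0):
    """Euler-discretized sketch of the Bernoulli-marginal TASEP.

    theta[n] tracks the filtering mean of latent site n+1 (n = 0..N-2);
    xN is the explicitly simulated occupancy of the observed last site."""
    rng = random.Random(seed)
    theta = [0.0] * (N - 1)   # assumed initial condition: empty lattice
    xN = 0
    t = 0.0
    entry_times = []
    while t < t_end:
        # marginal rates of the two retained transitions at site N
        rate_in = c * (1 - xN) * theta[N - 2]   # particle enters site N
        rate_out = beta * xN                    # particle leaves site N
        u = rng.random()
        if u < rate_in * dt:
            xN = 1
            theta[N - 2] = 0.0      # jump update: site N-1 is now empty
            entry_times.append(t)
        elif u < (rate_in + rate_out) * dt:
            xN = 0                  # no update to theta is required
        else:
            # closed filtering moment equations in between observations
            d = [0.0] * (N - 1)
            d[0] = alpha * (1 - theta[0]) - c * theta[0] * (1 - theta[1])
            for n in range(1, N - 2):
                d[n] = (c * theta[n - 1] * (1 - theta[n])
                        - c * theta[n] * (1 - theta[n + 1]))
            d[N - 2] = (c * theta[N - 3] * (1 - theta[N - 2])
                        - c * (1 - xN) * theta[N - 2] * (1 - theta[N - 2]))
            theta = [th + dt * dd for th, dd in zip(theta, d)]
        t += dt
    return entry_times, theta
```

Waiting times between an exit from site X_N and the next entry, as evaluated in Figure 5.4, can then be read off from the recorded event times.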


Figure 5.4: Numerical evaluation of the accuracy of the Bernoulli-marginal TASEP approximation on N = 10 sites at stationarity. Shown are waiting-time distributions for a particle to enter site X_N after the previous particle left X_N. Parameters were α = β = 1 and c = 0.01 in (a), c = 0.1 in (b), c = 1 in (c) and c = 2 in (d). Distributions were estimated from 100,000 samples.

We performed Monte Carlo simulations of both the Bernoulli-marginal TASEP and of the full TASEP to compare their behavior. In Figure 5.4, we plot the distribution of waiting times between a particle leaving site X_N and the next particle entering X_N when the process is at stationarity. The waiting-time distribution varies depending on the parameters of the process. The Bernoulli-marginal process reproduces the exact results with high accuracy, despite the fact that we have used a very simple closure for the filtering equation.