Result: The CHnMM Forward Algorithm - Conversive Hidden non-Markovian models

Now that the state change probability can be computed, it can be used along with the known probability to be in each state at time t = 0 to compute the virtual probabilities to be in every possible successor state at the time of the first symbol emission and having emitted that first symbol. This approach can then be used iteratively for each observation from a trace to inductively compute the Forward probabilities for each time of symbol emission from the Forward probabilities of the previous symbol emission. The sum of these virtual Forward probabilities for the final observation is then the virtual probability of the model to have caused the whole trace of observations, and is thus the solution to the Evaluation problem.

It should be noted that the actual Evaluation probability for any CHnMM without deterministic activity durations is always infinitesimal, since it depends on point probabilities of continuous probability distributions (as shown in the derivation ofPchange). The introduction of virtual probabilities allows us to as-sign numerical values to these otherwise intractable probabilities. With these, Evaluation can be used to select the most likely model out of a set of models to explain a given trace of observations, since the virtual Evaluation probabilities of different models are in the same ratio as the actual unknown infinitesimal probabilities (cf. Equation 4.3). However, a single virtual Evaluation probabil-ity is only a stand-in and must not be interpreted as the actual infinitesimal probability of a model to have caused a given trace of observations.

So the HMM Forward algorithm provides a general algorithm structure for an algorithm to solve the CHnMM Evaluation task, the Proxel simulation model provides data structures that help adapt the HMM algorithm to non-Markovian systems, and the formulas derived allow for exact computations of the proba-bilities needed for such an algorithm. In the next section, all these individual parts will be assembled to form the CHnMM Forward algorithm

Algorithm 1: CHnMMForward Input: A,Π, O, T R, N, T, K

R0= [

i∈{1,...N|πi6=0}

{(i, ~0, πi)};

fort = 1 toT do

v=ot.v;

∆t=ot.e−ot−1.e;

R_t=∅;

foreachρ∈R_t−1do

i =ρ.q;

P_sojourn= Y

j∈{1,...,N|a_ij6=∅}

1−cdf(a_ij.dist)(ρ.τ_a_ij_.id+ ∆t) 1−cdf(aij.dist)(ρ.τa_ij.id) ;

foreachj∈ {1, . . . , N|aij 6=∅} do

Row_i={ai1, . . . , a_iN};

Row_j={aj1, . . . , a_jN};

~τ⁰:τ_k⁰ =











τ_k+ ∆t ifT R_k∈Row_i∧T R_k 6=a_ij∧ ¬isExp(T R_k.dist)∧ (T Rk∈Rowj∨T Rk.aging)

τ_k ifT R_k∈/Row_i∧T R_k.aging∧ ¬isExp(T R_k.dist)

0 otherwise

µ=hrf(a_ij.dist);

ρ⁰ = (j, ~τ⁰, ρ.α∗P_sojourn∗µ(ρ.τ_a_ij_.id+ ∆t)∗a_ij.b(v));

if ρ⁰.α= 0 thencontinue;

if ∃ρ⁰⁰∈Rt with(ρ⁰⁰.q=ρ⁰.q∧ρ⁰⁰.~τ =ρ⁰.~τ)then

ρ⁰⁰.α+ =ρ⁰.α;

else

Rt=Rt∪ {ρ⁰};

return RT 20

For each of these state changes the adjusted age vector for the new state is computed (line 12): The age is increased by the length of the current time step (the time between the previous and the current symbol emissions) for all those activities that were active for the whole time step and may still continue. This includes all those activities that did not cause the current state change and are either still active in the new discrete state or are of type RACE AGE and thus may be interrupted to be resumed in a later discrete state. For all activities that where not active in the current state and are of type RACE AGE, their age does not change from the age they had before the time step. And in all other cases, the age of the activity is reset to zero. This includes exponentially distributed activities, which are memoryless and whose behavior is thus age-independent, the activity that was just completed and caused the state change, and those activities that where active in the previous state but are not in the new one and thus are cancelled.

With the known age vector and discrete state after the current symbol emis-sion a Proxel is created for that new state and its probability is computed according to the formulas in Section 4.3 (line 14). If the set of Proxels for the successor time step does not already contain a Proxel with the same discrete state and age vector, then the newly created Proxel is added to that set (line 19). Otherwise, the two Proxels are merged by adding the new probability to the already existing Proxel (line 17).

The algorithm terminates once the set of Proxels for the time of the final observation from the trace have been created, and returns that set. Each of these Proxels then contains the (virtual) joint probability of the model to be in the state represented by the Proxel and having emitted the whole trace.

According to the law of total probability, summing up all those probabilities yields the desire virtual Evaluation probability of the model to have created the observation trace:

P(O) = X

ρ∈R_T

ρ.α

Thus, this algorithm solves the CHnMM Evaluation task.

4.4.1 Differences to the Proxel Simulation Algorithm

The CHnMM Forward algorithm is generally similar to the Proxel simulation method: all possible single model state changes are followed for each time step, and the resulting states are stored in Proxels. However, in detail there are major differences between the two algorithms:

Virtual Probabilities In the CHnMM Forward algorithm, the Proxels store virtual probabilities which have no meaningful interpretation as an actual prob-ability. In addition the CHnMM Forward algorithm stores joint probabilities of a given state and the emission of a given trace in those Proxels, whereas the Proxel simulation algorithm stores absolute state probabilities.

Accuracy The state change probabilities for the CHnMM Forward algorithm are not approximations of the solution to an ODE, but are computed through closed-form exact formulas.

On-3 On

t

On-4 On-2 On-1

ρ

₃

Figure 4.2: Schematic representation of the conditions under which Proxels have identical age vectors and can thus be merged.

Additionally, the Proxel method made the unwarranted assumption that only a single discrete state change may occur during each time step. This assumption was made deliberately[29] in order to make the approach compu-tationally feasible, but at the same time introduced a noticable approximation error. The CHnMM Forward algorithm also makes this assumption, but for CHnMMs this assumption is valid: since all discrete state changes emit an observable symbol, there can be no discrete state change in the time interval between two symbol emissions. This and the previous point together mean that the CHnMM Forward algorithm does not contain any of the approximation errors of the Proxel method and is thus accurate.

Time Step Size In the Proxel method, the time step size has to be chosen carefully by the user to select a tradeoff between computation time and result accuracy[43]. For the CHnMM Forward algorithm the variable time step sizes are selected automatically to conform to the time intervals between symbol emissions, and this way eliminate all approximation errors. Thus, the CHnMM Forward algorithm contains no user-tunable parameters and is thus easier to use by practitioners.

Proxel Merging Proxel simulation and the CHnMM Forward algorithm both employ Proxel merging: Whenever two Proxels for the same time step exist that present the same discrete state and age vector, they are merged to form a single Proxel with the combined probability of the two. This merging prevents both algorithms from otherwise suffering an exponential increase in the number of Proxels with each new time step, since each Proxel from one time step would always have multiple successors in the next time step. Through merging, several of those successors are combined to form a single Proxel whereby the number of Proxels per time step is bounded for many models. Yet, both algorithms differ in the circumstances under which mergable Proxels exist:

In the Proxel simulation, the age vector elements are discretized according to the time step size and so Proxels that would otherwise only have similar age vectors thus have identical ones and can be merged. In CHnMMs, age values

are not discretized and can take on potentially any positive real value, virtually eliminating chance mergings. This potentially reduces the probability of Proxel mergings and thus increases the required computational effort.

Besides those virtually impossible chance mergings, Proxels can only be merged in CHnMMs when the most recent symbol emission at which each ac-tivity was completed matches between the Proxels. Figure 4.2 visualizes this behavior: In a model with three activities (the sets of three parallel horizontal lines), three proxels ρ₁, ρ₂ and ρ₃ may have arbitrary age vectors at the time of theO_n−4th symbol emission and thus cannot be merged. However, their age vector values at timeO_n.t(length of the colored line segments) depend only on the time that has passed since each activity has been completed (yellow circles) most recently. Thus, in this example, irrespective of

• the prior age vector values of ρ1, ρ2 andρ3

• which activity has been completed atO_n−4.t

• whether the second or third activity has been completed atO_n−2.t, as long as theOn−3th observation is caused by the completion of the first ac-tivity, theO_n−1th observation by the second activity and theO_nth observation by the third activity, the Proxels will have identical age vectors at time O_n.t and – when their discrete states match as well – will be merged in the For-ward computation step forO_n. So, even though age vector elements can take on potentially any real number, the set of possible age values at the time of a particular observation only depends on the combinatorics of which activity may have cause what prior observation. Proxels for which these combinatorics result in identical age vectors can then be merged when their discrete states match as well. This condition for Proxel merging holds far more often than chance merg-ings and is likely to be a major contributing factor in the practical feasibility of the approach.

Furthermore, in CHnMMs successor Proxels are only created for state changes and not also for inactivity (as with the Proxel simulation). This re-duces the number of successors of each Proxel by one and thus rere-duces the factor by which the number of Proxels could multiply in each time step. And in CHnMMs, Proxels are only created for the times of symbol emissions, whereas the Proxel simulation has to create Proxels in discrete time steps which are usually far smaller than the time intervals between discrete state changes in order to reduce the approximation errors inherent in the method. So the num-ber of times at which the numnum-ber of Proxels may multiply is also far lower for CHnMMs.

Thus, even though the CHnMM Forward algorithm does not use discretized age vector values as the Proxel simulation method and is thus less likely to merge Proxels by chance, it may still be practically feasible. The actual Proxel merging behavior and the general behavior of the CHnMM Forward algorithm are tested experimentally in the next section.

Im Dokument Conversive Hidden non-Markovian models (Seite 51-55)