5 PSpace-hardness of EL −> -unification

In this section, we reduce the intersection emptiness problem for deterministic finite automata (DFA) to a unification problem in EL^−>. These DFA are a special case of nondeterministic finite automata, which in turn are special AFA.

An alternating finite automaton (AFA) A = (Q∃, Q∀,Σ, q0, δ, F) is an ε-AFA with a restricted transition function δ : Q ×Σ → P(Q) that does not allow ε-transitions. The semantics of these automata is the same as for ε-AFA, ex-cept that the relation `A is restricted to non-ε-transitions. The automaton is called nondeterministic finite automaton (NFA) if Q∀ = ∅ and is then written as (Q,Σ, q₀, δ, F). It is called deterministic finite automaton (DFA) if it is an NFA and for each q ∈ Q and α ∈ Σ, the set δ(q, α) has the cardinality 0 or 1. The transition function is then equivalently expressed as the partial function δ⁰ :Q×Σ→Qwhere δ⁰(q, α) =q⁰ iff δ(q, α) = {q⁰}. This definition implies that any DFA has at most one run on any given word.

First, we define a translation from a given DFA A = (Q,Σ, q₀, δ, F) to a set of subsumptions ΓA. In the following, we only consider automata that accept a nonempty language. For such DFAs we can assume without loss of generality that there is no state q ∈ Q that cannot be reached from q₀ or from which F cannot be reached. In fact, such states can be removed from Awithout changing the accepted language.

For every state q ∈ Q, we introduce a variable X_q. There is only one constant, A, and we define NR:= Σ. The set ΓA is defined as follows:

Γ_A :={L_q v^? X_q|q ∈Q\F} ∪ {AuL_q v^? X_q |q∈F}, where L_q := l

α∈Σ δ(q,α) is defined

∃α.X_δ(q,α).

Note that the left-hand sides of the subsumptions in ΓA are indeedEL^−>-concept terms, i.e., the conjunctions on the left-hand sides are nonempty. In fact, every

state q∈Qis either a final state or a final state is reachable by a nonempty path from q. In the first case, A occurs in the conjunction, and in the second, there must be an α∈Σ such that δ(q, α) is defined, in which case ∃α.Xδ(q,α) occurs in the conjunction.

Lemma 19. Let q ∈ Q, w ∈ Σ^∗ and γ be a ground EL^−>-unifier of ΓA with γ(X_q)v ∃w.A. Then w∈L(A_q), where A_q := (Q,Σ, q, δ, F) is obtained from A by making q the initial state.

Proof. We prove this by induction on the length ofw. If|w|= 0, thenγ(X_q)vA.

Thus, A must be a top-level conjunct of γ(X_q). Since γ is a unifier of ΓA, this can only be the case if q ∈F. Thus, w=ε is accepted by A_q.

Let now w=α⁰w⁰ with α⁰ ∈Σ,w⁰ ∈Σ^∗. Since γ is a unifier of ΓA, l

α∈Σ δ(q,α) is defined

∃α.γ(X_δ(q,α))v ∃α⁰w⁰.A .

Thus, we must have γ(X_δ(q,α⁰₎) v ∃w⁰.A by Lemma 1. By induction, we know that w⁰ is accepted by A_δ(q,α⁰₎. Thus, w=α⁰w⁰ is accepted by A_q.

Together with Lemma 3, this lemma implies that, for every ground EL^−>-unifier γ of ΓA, the language {w ∈ Σ^∗ | ∃w.A ∈ Part(γ(X_q₀))} is contained in L(A).

Conversely, we will show that for every word w accepted by A we can construct a unifier γ_w with ∃w.A∈Part(γ_w(X_q₀)).

For the construction of γ_w, we consider every q∈Q and try to find a word u_q of minimal length that is accepted by A_q. Such a word always exists since we have assumed that we can reach F from every state. Taking arbitrary such words is not sufficient, however. They need to be related in the following sense.

Lemma 20. There exists a mapping from the states q∈Q to words u_q ∈L(A_q) such that that either q ∈ F and u_q = ε or there is a symbol α ∈ Σ such that δ(q, α) is defined and uq =αu_δ(q,α).

Proof. We construct the words uq using induction on the lengthn of a shortest word accepted by A_q.

If n= 0, then q must be a final state. In this case, we setu_q :=ε.

Now, let q be a state such that a shortest word w_q accepted by A_q has length n > 0. Then w_q =αw⁰ for α ∈ Σ and w⁰ ∈ Σ^∗ and the transition δ(q, α) = q⁰ is defined. The length of a shortest word accepted by Aq⁰ must be smaller thann, since w⁰ is accepted by A_q⁰. By induction, u_q⁰ ∈L(A_q⁰) has already been defined and we have αu_q⁰ ∈L(A_q). Since αu_q⁰ cannot be shorter thanw_q =αw⁰, it must also be of length n. We now define uq:=αuq⁰.

We can now proceed with the definition of γ_w for a wordw∈Σ^∗ that is accepted byA. The unique successful run ofAonw=w₁. . . w_nyields a sequence of states q0, q1, . . . , qn with qn ∈ F and δ(qi, wi+1) =qi+1 for every i∈ {0, . . . , n−1}. We define the substitution γ_w as follows:

γw(Xq) :=∃uq.Au l

i∈Iq

∃wi+1. . . wn.A ,

where I_q :={i ∈ {0, . . . , n−1} | q_i =q}. For every q ∈ Q, we include at least the conjunct ∃u_q.A inγ_w(X_q) and thus, γ_w is in fact an EL^−>-substitution.

Lemma 21. If w ∈ L(A), then γ_w is an EL^−>-unifier of ΓA and γ_w(X_q₀) v

∃w.A.

Proof. Let the unique successful run of A on w = w₁. . . w_n be given by the sequence q₀q₁. . . q_n of states with q_n ∈ F and δ(q_i, w_i+1) = q_i+1 for every i ∈ {0, . . . , n−1}, and let γw be defined as above.

We have to show thatγ_w satisfies the subsumption constraint introduced for every state q ∈Q, i.e.,

F_qu l

α∈Σ δ(q,α) is defined

∃α.γ_w(X_δ(q,α))vγ_w(X_q) .

To do this, we consider every top-level atom ofγ_w(X_q) and show that it subsumes the left-hand side of the above subsumption.

• Consider the conjunct ∃uq.A. If uq = ε, then q ∈ F and Fq = A. In this case, the subsumption is satisfied. Otherwise, by construction there is a transition δ(q, α) = q⁰ with u_q = αu_q⁰. Since ∃u⁰_q.A is a top-level conjunct of γw(Xq⁰), we have γ(Xq⁰)v ∃uq⁰.A and thus, ∃α.γw(Xq⁰)v ∃uq.A.

• Let i∈ I_q, i.e., q_i =q, and consider the conjunct ∃w_i+1. . . w_n.A. Since we have δ(q_i, w_i+1) = q_i+1 and ∃w_i+2. . . w_n.A is a conjunct of γ_w(X_q_i+1),³ we know ∃w_i+1.γ_w(X_q_i+1)v ∃w_i+1. . . w_n.A.

This shows that γ_w is a ground EL^−>-unifier of ΓA. Furthermore, since 0 ∈ I_q₀, the particle ∃w₁. . . w_n.A = ∃w.A is a top-level conjunct of γ_w(X_q₀), i.e., γw(Xq0)v ∃w.A.

The intersection emptiness problem considers finitely many DFAs A₁, . . . ,A_k, and asks whether L(A₁)∩. . .∩L(A_k)6=∅. Since this problem is trivially solvable in polynomial time in case L(Ai) = ∅ for some i,1≤ i≤k, we can assume that the languages L(A_i) are all nonempty. Thus, we can also assume without loss of

3Ifi=n−1, then∃wi+2. . . wn.A=A.

generality that the automata A_i = (Q_i,Σ, q_0,i, δ_i, F_i) have pairwise disjoint sets of states Q_i and are reduced in the sense introduced above, i.e., there is no state that cannot be reached from the initial state or from which no final state can be reached.

The flat EL^−>-unification problem Γ is now defined as follows:

Γ := [

i∈{1,...,k}

Γ_A_i∪ {X_q_0,i v^?Y} ,

where Y is a new variable not contained in ΓA_i for i= 1, . . . , k.

Lemma 22. Γ is unifiable in EL^−> iff L(A₁)∩. . .∩L(A_k)6=∅.

Proof. If Γ is unifiable in EL^−>, then it has a ground EL^−>-unifier γ and there must be a particle∃w.Awithw∈Σ^∗ andγ(Y)v ∃w.A. Sinceγ(X_q_0,i)vγ(Y)v

∃w.A, Lemma 19 yieldsw∈L(A_i,q_0,i) = L(A_i) for eachi∈ {1, . . . , k}. Thus, the intersection of the languages L(A_i) is nonempty.

Conversely, let w∈Σ^∗ be a word with w∈L(A₁)∩. . .∩L(A_k). By Lemma 21, we have for each of the unification problems ΓAi an EL^−>-unifier γw,i such that γ_w,i(X_q_0,i) v ∃w.A. Since the automata have disjoint state sets, the unification problems ΓA_i do not share variables. Thus, we can combine the unifiers γ_w,i into

an EL^−>-substitution γ by defining γ(Y) := ∃w.A and γ(Xq) := γw,i(Xq) for

each i ∈ {1, . . . , k} and q ∈ Q_i. Obviously, this is an EL^−>-unifier of Γ since it satisfies the additional subsumptions X_q_0,i v^? Y.

Since the intersection emptiness problem for DFAs is PSpace-hard [14, 11], this lemma immediately yields our final theorem:

Theorem 23. The problem of deciding unifiability in EL^−> is PSpace-hard.

6 Conclusion

Unification in EL was introduced in [4] as an inference service that can sup-port the detection of redundancies in large biomedical ontologies, which are fre-quently written in this DL. Motivated by the fact that the large medical ontology SNOMED CT actually does not use the top concept available in EL, we have in this paper investigated unification inEL^−>, which is obtained fromELby remov-ing the top concept. More precisely, SNOMED CT is a so-called acyclic EL^−> -TBox,⁴ rather than a collection of EL^−>-concept terms. However, as shown in

4Note that the right-identity rules in SNOMED CT [18] are actually not expressed using complex role inclusion axioms, but through the SEP-triplet encoding [19]. Thus, complex role inclusion axioms are not relevant here.

[6], acyclic TBoxes can be easily handled by a unification algorithm for concept terms.

Surprisingly, it turned out that the complexity of unification inEL^−> (PSpace) is considerably higher than of unification in EL (NP). From a theoretical point of view, this result is interesting since it provides us with a natural example where reducing the expressiveness of a given DL (in a rather minor way) results in a drastic increase of the complexity of the unifiability problem. Regarding the complexity of unification in more expressive DLs, not much is known. If we add negation toEL, then we obtain the well-known DLALC, which corresponds to the basic (multi-)modal logicK[17]. Decidability of unification inKis a long-standing open problem. Recently, undecidability of unification in some extensions of K (for example, by the universal modality) was shown in [20]. These undecidability results also imply undecidability of unification in some expressive DLs (e.g., in SHIQ [12]).

Apart from its theoretical interest, the result of this paper also has practical implications. Whereas practically rather efficient unification algorithm for EL can readily be obtained by a translation into SAT [5], it is not so clear how to turn the PSpace algorithm for EL^−>-unification introduced in this paper into a practically useful algorithm. One possibility could be to use a SAT modulo theories (SMT) approach [15]. The idea is that the SAT solver is used to generate all possible subsumption mappings for Γ, and that the theory solver tests the system I_Γ,τ induced by τ for the existence of a finite, admissible solution. How well this works will mainly depend on whether we can develop such a theory solver that satisfies well all the requirements imposed by the SMT approach.

Another topic for future research is how to actually compute EL^−>-unifiers for a unifiable EL^−>-unification problem. In principle, our decision procedure is constructive in the sense that, from appropriate successful runs of the ε-AFA A(X, A), one can construct a finite, admissible solution of IΓ,τ, and from this an

EL^−>-unifier of Γ. However, this needs to be made more explicit, and we need

to investigate what kind of EL^−>-unifiers can be computed this way.

Appendices

A Locality

InEL, we have the interesting property that for every solvable unification problem there exists a local unifier γ, where γ(X) is a conjunction of atoms of the form γ(D) for D ∈ NV(Γ). However, simply extending this notion to EL^−>-unifiers does not give a similar result for EL^−>.

Example 24. Consider the flatEL-unification problem Γ that contains the three equations

X ≡^? Y uA, Y u ∃r.X ≡^? ∃r.X, Zu ∃r.X ≡^? ∃r.X.

Then the substitutions σ₀ := {X 7→ A, Y 7→ >, Z 7→ >} and σ₁ := {X 7→

A, Y 7→ >, Z 7→ ∃r.A} are the only local EL-unifiers of Γ. In fact, we have NV(Γ) ={A,∃r.X}, and thus the only possible image forX in a local unifierσ is A (since σ(∃r.X) = ∃r.σ(X) obviously cannot be a conjunct of σ(X)). Since the first equation implies that A=σ(X)vσ(Y), we know thatσ(Y) can only be >

or A. However, the second equation prevents the second possibility. Finally, the third equation ensures that σ(Z) is >or ∃r.A.

Note thatσ0 andσ1 both contain>, and thus are notEL^−>-unifiers. This shows that Γ does not have an EL^−>-unifier that is local in the sense defined above.

Nevertheless, Γ has EL^−>-unifiers. For example, the substitution γ₁ := {X 7→

Au ∃r.A, Y 7→ ∃r.A, Z 7→ ∃r.∃r.A}is such a unifier.

In this example, the top-level atoms of γ₁(X), γ₁(Y), γ₁(Z) that are not of the form γ(D) for some D ∈ NV(Γ) are all particles of γ(D) for some D ∈ NV(Γ).

This motivates the following definition.

Definition 25. The EL^−>-unifierγ of Γ is a local EL^−>-unifier of Γ if, for every variable X, each top-level atom of γ(X) is

• of the form γ(D) for someD∈NV(Γ) or

• a particle of γ(D) for some D∈NV(Γ).

There are always only finitely many localEL-unifiers for a given unification prob-lem [4]. In EL^−>, however, it is possible that there exist infinitely many local unifiers, as the next example demonstrates.

Example 26. Consider the unification problem Γ from Example 24 and the following EL^−>-substitutions γn:

γ_n(X) :=Au ∃r.Au · · · u ∃rⁿ.A γ_n(Y) :=∃r.Au · · · u ∃rⁿ.A γn(Z) :=∃rⁿ⁺¹.A

It is easy to verify that each γ_n is an EL^−>-unifier of Γ. Furthermore, every top-level atom of γ_n(X), γ_n(Y), and γ_n(Z) is either A or a particle of γ_n(∃r.X).

Note that both A and ∃r.X are non-variable atoms of Γ. Thus, Γ has infinitely many local EL^−>-unifiers.

Additionally, these unifiers are even incomparable w.r.t. the subsumption order on unifiers, i.e., for no two n, m ∈ N with n 6= m it holds that γ_n(X) v γ_m(X) for all variables X. This is the case since the concept termsγn(Z) = ∃rⁿ⁺¹.Aare incomparable in this sense.

We will show that checking for local unifiers suffices to decide unifiability inEL^−>

by demonstrating that the decision procedure described in Section 4 can be used to construct localEL^−>-unifiers. To be able to use the reductions to the problems of solvability of sets of language inclusions and emptiness ofε-AFA, we first define appropriate notions of locality for these formalisms.

Definition 27. LetI be a finite set of inclusions of the form

X ⊆L₀∪L₁X₁∪. . .∪L_nX_n, (1) as described in Section 4.2. A solution θ of I is called local if all words w ∈ θ(X)\ {ε} for X ∈ Var(I) occur on the right-hand side of some inclusion Y ⊆ L₀∪L₁X₁∪. . .∪L_nX_n of I under θ, i.e., either w∈ L₀ or w∈ (L_i \ {ε})θ(X_i) for some i∈ {1, . . . , n}.

The final definition is concerned with locality in alternating automata.

Definition 28. LetAbe anε-AFA. A successful run ofAis calledlocal if there is at least one leaf labeled by (q, ε) for some stateqofA. Since the run is successful, q is then either a final state or a universal state without possible successors. We denote by L^l(A) the set of all words accepted by A via local, successful runs.

In a successful runR ofA that is not local, all leafs are labeled by configurations (q, w) with w6=ε. In this case,q has to be a universal state without successors.

However, since such states accept any word, it is easy to change R into a local run. We simply identify the shortest word w that occurs in the label of a leaf.

Since R is a run, w is the shortest word occuring in it and all other words in R must have the suffix w. Thus, we can simply remove the suffix w from all configurations inR and obtain a successful run that accepts a shorter word. This new run is local since it must contain at least one leaf labeled by (q, ε) for some state q.

This construction also shows that runs accepting minimal words, i.e., words for which no prefix is accepted byA, are always local. This is an important property of locality in ε-AFA which will prove to be useful.

The following lemma proves a connection between local runs and local solutions by analyzing one direction of Lemma 16 in more detail.

Lemma 29. Let I be a finite set of inclusions of the form (1) and let the ε-AFA AX for a variable X ∈Var(I)be constructed as in Definition 14. If w∈L^l(AX), then there is a finite, local solutionθof Isuch thatw∈θ(X)and everyw⁰ ∈θ(Y) for some Y ∈Var(I) is a suffix of w.

Proof. LetR be a local, successful run ofA_X starting in ((X,0), w) and consider the solution θ_R that was constructed in the proof of Lemma 16:

θR(Y) :={u∈Σ^∗ | ∃v ∈V⁰ :l(v) = ((Y, . . .), u)}

for all variables Y ∈ Var(I). Since V⁰ is a subset of the finite set of nodes of R, θ_R is finite. By definition of the transition relation of A_X, the run R, and thus also θ_R, contains only suffixes ofw. Furthermore,w∈θ_R(X) since the root node of R is labeled by ((X,0), w) and contained in V⁰. It remains to show thatθ_R is local.

Since R is local, there is a leaf of R that is labeled by (q, ε) for some state q of A_X. We now consider the pathpleading from the root of R to this leaf. Its root is labeled by ((X,0), w), while its leaf is labeled by (q, ε). Thus, every suffix of w must occur along this path. To show locality, it thus suffices to show that every word occuring along p satisfies the conditions on locality. We will show this by backwards induction along p.

We begin the induction at the leaf of p, which is labeled by (q, ε). The word ε trivially fulfills the conditions for locality ofθ_R. Let nowv⁰ be a node ofplabeled by (q⁰, u⁰) for a stateq⁰ and a suffixu⁰ of wthat fulfills the conditions for locality of θ_R. If v⁰ is the root node, we are done. Otherwise, we show the same for the predecessor v of v⁰, which also lies on the path p. Let (q, u) be the label of v and consider the following cases:

• If u=u⁰, then u fulfills the condition for locality ofθ_R, since u⁰ does.

• Otherwise, u = αu⁰ for some α ∈ Σ and q must be of the form (i, λ) for some inequation i:Y ⊆L₀∪L₁X₁∪. . . L_nX_n in I. Then the label (q⁰, u⁰) of v⁰ can only have one of the following forms:

– Ifq⁰ =f₀, then α∈L₀. SinceR is successful, we then haveu⁰ =εand u=α ∈L₀.

– Otherwise, q⁰ = (X_i,0) for some i ∈ {1, . . . , n} and α ∈ L_i. But then u⁰ ∈ θ_R(X_i) by definition of θ_R and thus, u = αu⁰ ∈ {α}θ_R(X_i) ⊆ (L_i \ {ε})θ_R(X_i).

Thus, the wordufulfills the condition of locality since it is contained in the right-hand side of iunder θ_R.

In the following, let Γ be a flat EL^−>-unification problem, τ a subsumption mapping for Γ, and γ^τ, ∆Γ,τ, IΓ,τ, and A(X, A) be defined as in Section 4.

Using the previous lemma, under some conditions we can construct a finite, local, admissible solution of I_Γ,τ.

Lemma 30. If for every X ∈ Var(I) there is a constant A(X) such that the automatonA(X, A(X))accepts a wordw_X, then there is a finite, local, admissible solution of I_Γ,τ that contains only suffixes of the words w_X.

Proof. By Lemma 29, we find for every X a finite, local solutionθ_X of I_Γ,τ that contains only suffixes of w_X and satisfies w_X ∈ θ_X(X_A(X)). By Lemma 10, the union θ of all θX is still a solution of IΓ,τ. It is finite since it is a finite union of finite solutions. It is also admissible since for every X the set θ(X_A(X₎) is non-empty. Finally, it is local since all contained words satisfy the conditions on locality by locality of the component solutions θX.

The following lemma proves a connection between finite, local, admissible solu-tions of I_Γ,τ and local unifiers of Γ by analyzing one direction of Lemma 9 in more detail.

Lemma 31. Let θ be a finite, local, admissible solution of I_Γ,τ. Then there is a local EL^−>-unifier σ of Γ.

Proof. Consider the EL^−>-unifierσ of ∆_Γ,τ constructed in the proof of Lemma 9 which has the property that S^τ ≤ S^σ. It was defined by induction on the order

> on the variables as follows:

σ(X) := l

D∈S^τ(X)

σ(D)u l

A∈N_c

w∈θ(X_A)

∃w.A

for every variable X, where σ(Y) has already been defined for each variable Y with X > Y. In the proof of Lemma 8, it was shown that σ is also a unifier of Γ.

To show that σ is local, we consider all top-level atoms of σ(X) for each X ∈ Var(Γ). For those top-level atoms of the form σ(D) for D ∈S^τ(X), this follows immediately from the fact thatS^τ(X)⊆NV(Γ). Now consider a top-level particle

∃w.A of σ(X). If w = ε, then A is a non-variable atom of Γ since we assumed that all elements ofN_C occur in Γ. Otherwise,w∈θ(X_A)\{ε}and, by locality of θ, there is an inclusion inI_Γ,τ that containswin the substitution of its right-hand side under θ.

This inclusion must be of the form I_A(s), i.e., X_A ⊆ f_A(C₁)∪. . .∪f_A(C_n), for some subsumptions of the formC₁u. . .uC_nv^? X in ∆_Γ,τ. Locality ofθ yields an index i ∈ {1, . . . , n} with w∈ θ(f_A(C_i)), whereC_i is neither a variable nor a constant.⁵

Thus, C_i is of the form ∃r.C⁰, where C⁰ is either a variable or the constant A. Consequently, either w∈ {r} or w ∈ {r}θ(C_A⁰ ). In the former case, ∃w.A =

∃r.A =C_iis a ground atom of Γ. In the latter case,w=rw⁰ for somew⁰ ∈θ(C_A⁰ ).

This implies σ(C⁰) v ∃w⁰.A, which yields σ(Ci)v ∃w.A. By Lemma 3, ∃w.A is a particle of σ(C_i). Since C_i ∈NV(Γ), the particle ∃w.Afulfills the condition for locality of σ.

5Recall the definition offA(C) from Section 4.2.

Since we want to obtain a complexity result, we also have to consider the size of σ. In the following, size always means the number of symbols it takes to write something down and is denoted by | · |. For example, for a solution θ of IΓ,τ,

|θ| denotes the number of symbols it takes to write down all the sets θ(X_A) for X ∈Var(Γ) and A∈N_C.

Lemma 32. If θ is a finite, local, admissible solution of I_Γ,τ, then the size of the local EL^−>-unifier σ constructed in Lemma 31 is at most exponential in the size of Γ and polynomial in the size of θ.

Proof. For a variableX ∈Var(Γ), we consider all sequences X₁ <· · ·< X_n=X where X₁ is a minimal variable w.r.t. <. The length of such a sequence is the number of variables it contains, i.e.,n. Theheight ofX is defined as the maximal length of all such sequences. This means that the height of a minimal variable is 1 and the height is bounded by |Var(Γ)| since <is acyclic.

We prove the following claim by induction on the height n of the variables X ∈ Var(Γ): For every X ∈Var(Γ),

Let n = 1, i.e., X be a minimal variable w.r.t. <. Then all non-variable atoms in S^τ(X) are ground and the size of σ(X) is bounded by 5(|S^τ(X)|+ |θ|) ≤ 5(|Γ|+|θ|).

If n > 1, then we know that the height of all variables Y < X must be smaller than n. Since all the non-variable atoms D ∈ S^τ(X) contain only variables smaller than X, by induction we can bound the size of eachσ(D) forD ∈S^τ(X)

Since the height of any variable is bounded by the number of variables, and thus by |Γ|, this means that the overall size ofσ is bounded by

|Γ|5^|Γ|

6The constant 5 accounts for additional symbols likeuor∃that are added in the definition ofσ.

i.e., an expression that is exponential in |Γ|and polynomial in |θ|.

Im Dokument Unification in the Description Logic EL Without the Top Concept (Seite 23-37)