

6.2 Answer-Sentence Syntactic Alignment Problem

Let us assume that we have a tuple {sentence, answer}, in which the answer is a substring of the sentence. Consider S to be the sentence where the answer has already been replaced with a special string w0, and let γ = len(S) be the number of words in S. For example, S = “The First Helicopter was invented in Kyiv by w0 in 1909”, where the answer candidate is given by w0 = “Igor Sikorsky” and γ = 11. Consider also a function τ(S), which returns the position of w0 in the sentence S, and two further functions Pl(wi, ε) and Pr(wi, ε), which return the likelihood that a word wi occurs ε words to the left and to the right of w0, respectively². In the working example, τ(S) = 9, and Pr and Pl are given by an external model (1 ≤ i ≤ γ).
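As a minimal illustration (not the author's implementation), the quantities γ and τ(S) for the working example can be computed as:

```python
# Illustrative sketch only: compute gamma and tau(S) for the working
# example, where the answer has been replaced by the placeholder "w0".
S = "The First Helicopter was invented in Kyiv by w0 in 1909".split()
gamma = len(S)            # number of words in S
tau = S.index("w0") + 1   # 1-based position of the placeholder w0

print(gamma, tau)         # -> 11 9
```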

Consider that words from S are removed in such a way that the syntactic fitness of the answer candidate is maximized, that is, so that the remaining words maximize the likelihood that the answer candidate matches the expected answer type (EAT). To keep track of the words that remain in the aligned sentence S′, consider the following binary variable:

Y_i =
\begin{cases}
1 & \text{if the word } w_i \text{ is in the aligned sentence } S' \\
0 & \text{otherwise.}
\end{cases}

²These functions are described in detail in Section 5.2.1.

July 14, 2006

For instance, an aligned sentence S′ is “The * Helicopter was invented * * by w0 in 1909”³, where Y2 = Y6 = Y7 = 0 and Y1 = Y3 = Y4 = Y5 = Y8 = Y10 = Y11 = 1.
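The indicator variables can be read off mechanically from the aligned sentence; a small sketch, assuming removed words are rendered as “*”:

```python
# Hypothetical sketch: derive the indicator variables Y_i from an
# aligned sentence S', where removed words appear as "*".
S_aligned = "The * Helicopter was invented * * by w0 in 1909".split()
Y = [0 if w == "*" else 1 for w in S_aligned]

print(Y)   # -> [1, 0, 1, 1, 1, 0, 0, 1, 1, 1, 1]
```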

The number of remaining words (NRW) between the word wi and w0 is defined as follows:

NRW(i) =
\begin{cases}
\sum_{j=i+1}^{\tau(S)-1} Y_j & \text{if } i < \tau(S) \\
\sum_{j=\tau(S)+1}^{i-1} Y_j & \text{if } i > \tau(S)
\end{cases}

In our working example, NRW(11) = NRW(5) = 1. Since the goal is to find an alignment that maximizes the syntactic fitness of the answer candidate with respect to its context, the new fitness function takes the set of values for Yi into account. This function K equally favours query terms by giving them a weight of α(wsk). δl and δr are the left and right offsets, respectively. An offset is a translation of context terms. For instance, consider the following sentence S′ to be an alignment of S: “The * Helicopter was invented * * by + + + w0 + + in 1909”. In S′, the offsets are marked with a “+”; thus, δl = 3 and δr = 2.
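A sketch of NRW, under the assumption that it counts the kept words strictly between w_i and w0; this reading reproduces NRW(11) = NRW(5) = 1 from the working example:

```python
# Assumed reading of NRW(i): the number of words with Y_j = 1 strictly
# between position i and the placeholder position tau (all 1-based).
def nrw(i, tau, Y):
    lo, hi = min(i, tau), max(i, tau)
    return sum(Y[j - 1] for j in range(lo + 1, hi))

Y = [1, 0, 1, 1, 1, 0, 0, 1, 1, 1, 1]   # aligned example, tau(S) = 9
print(nrw(11, 9, Y), nrw(5, 9, Y))      # -> 1 1
```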

Sometimes constituents or words are inserted next to the answer candidate in sentences of the training data, which can distort the alignment. Offsets attempt to tackle this problem. In the instructive example, S′ can align with a sentence in the training tuples such as “The Helicopter was invented by a man named w0 in Kyiv in 1909”.

In short, removing words aims at improving the alignment when new sentences contain fewer words, while offsets handle new sentences with a larger number of words.

Clearly, not all possible alignments can be considered, since the number of possible alignments grows exponentially with the number of words in the sentence.

The number of possible alignments. Consider τ(S) to be the position where the answer candidate w0 occurs. Then Ll = τ(S) − 1 is the number of words to the left of the answer candidate, and Lr = γ − τ(S) is the number of words to the right.

All possible orderings of the Ll words are given by Ll!. At this point, every word is considered distinct from every other. This is a good approximation, because sentences are split into small pieces of text in which each word rarely occurs more than once.

Similarly, the number of orderings to the right of w0 is Lr!. Moreover, every combination of words to the left can occur simultaneously with any combination of words to the right.

Then, the total number of Possible Alignments (PA) is given by:

PA = L_l! \cdot L_r!

In addition, consider that words can be arbitrarily removed from both contexts.

Combinations over different context lengths must then be taken into account (Ll, Ll − 1, . . . , 0). Therefore, the number of possible alignments is defined as follows:

PA = \left(\sum_{i=0}^{L_l} i!\right) \cdot \left(\sum_{j=0}^{L_r} j!\right)

For our working example, the number of possible word alignments is 184936:

PA = \left(\sum_{i=0}^{8} i!\right) \cdot \left(\sum_{j=0}^{2} j!\right) = 46234 \cdot 4 = 184936
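These counts are easy to check numerically. A sketch, assuming the per-side count is the sum of i! over all context lengths (which matches the 184936 figure):

```python
from math import factorial

# Sketch: per-context count of alignments when word order may change,
# summed over all context lengths L, L-1, ..., 0 (as in the text).
def alignments(L):
    return sum(factorial(i) for i in range(L + 1))

# Working example: Ll = 8 words left of w0, Lr = 2 words right.
print(alignments(8) * alignments(2))   # -> 184936
```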

If word orderings are deliberately restricted to combinations that preserve their relative order, the number of possible combinations is:

PA = 2^{L_l} \cdot 2^{L_r}

In the example, the number of possible alignments is 1024:

PA = 2^{8} \cdot 2^{2} = 1024
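When relative order is preserved, each side's alignments are exactly the subsets of its words; a brute-force check of the 2^L count:

```python
from itertools import combinations

# Sketch: enumerate all order-preserving alignments of a context of
# L words, i.e. all of its subsets, and confirm there are 2**L of them.
def ordered_alignments(L):
    return sum(1 for size in range(L + 1)
               for _ in combinations(range(L), size))

# Working example: 2**8 * 2**2 = 1024 possible alignments.
print(ordered_alignments(8) * ordered_alignments(2))   # -> 1024
```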

The effects of offsets have not been considered yet; the current formula accounts only for the case in which both offset values are zero. Whenever the value of one of the offsets is greater than zero, the corresponding value for the word next to w0 is one.

In the case of the left context, the number of new possible combinations is:

\Delta_l \cdot 2^{L_l - 1}

Similarly, the number of new possible combinations due to the right context is:

\Delta_r \cdot 2^{L_r - 1}

Eventually, the number of possible alignments is defined as follows:

PA = \left(2^{L_l} + \Delta_l \cdot 2^{L_l - 1}\right) \cdot \left(2^{L_r} + \Delta_r \cdot 2^{L_r - 1}\right)

where Δl and Δr are upper bounds for their respective offsets. Regarding the working example, if the values Δl = 5 and Δr = 5 are considered, then the number of possible combinations is 12544:

PA = \left(2^{8} + 5 \cdot 2^{7}\right) \cdot \left(2^{2} + 5 \cdot 2^{1}\right) = 896 \cdot 14 = 12544
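One consistent reading of the offset extension, assumed here, is that each non-zero offset value fixes the word adjacent to w0, leaving 2^(L−1) free choices per value; this reproduces the 12544 figure used in the final computation of this section:

```python
# Sketch under an assumed reading: for each of the delta_max non-zero
# offset values, the word next to w0 is fixed, so 2**(L-1) choices remain.
def with_offsets(L, delta_max):
    return 2 ** L + delta_max * 2 ** (L - 1)

Ll, Lr = 8, 2   # working example
print(with_offsets(Ll, 5) * with_offsets(Lr, 5))   # -> 12544
```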

This result considers only one tuple {question, sentence, answer} with ten context words. However, the overall number increases dramatically due to the following two factors: (a) for each sentence, many answer candidates are feasible, and (b) the number of sentences is larger than one.

Using the result in Section 4.2, a reasonable estimate of the total number of Possible Alignments for a set of σ different sentences is:

PA = \sigma \cdot \frac{\bar{\Upsilon}(\bar{\Upsilon} - 1)}{2} \cdot 12544

Ῡ is the average number of words in a sentence. Thus, the factor Ῡ(Ῡ − 1)/2 represents the number of possible answer candidates of different lengths in a sentence. In our illustrative example,

PA = 25 \cdot \frac{14 \cdot (14 - 1)}{2} \cdot 12544 = 28537600

Then, the number of possible alignments for a set of 25 sentences (Ῡ = 14) is 28537600. Consequently, an efficient search algorithm is necessary to detect and test promising alignments early.
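The overall estimate is a straightforward product; a sketch using the figures from the text:

```python
# Sketch: total Possible Alignments for sigma sentences of average
# length upsilon, with ~12544 alignments per answer candidate.
sigma, upsilon, per_candidate = 25, 14, 12544
total = sigma * (upsilon * (upsilon - 1) // 2) * per_candidate

print(total)   # -> 28537600
```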

6.3 The GA for Answer-Sentence Syntactic Alignment