• Keine Ergebnisse gefunden

Answering Comparative Questions with Web-based Arguments

N/A
N/A
Protected

Academic year: 2022

Aktie "Answering Comparative Questions with Web-based Arguments"

Copied!
1
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Answering Comparative Questions with Web-based Arguments

Alexander Bondarenko

Martin-Luther-Universit¨at Halle-Wittenberg

Matthias Hagen

Martin-Luther-Universit¨at Halle-Wittenberg

Meriem Beloucif

Universit¨at Hamburg

Chris Biemann

Universit¨at Hamburg

Alexander Panchenko

Skolkovo Institute of Science and Technology

Abstract

Question answering platforms such as Yahoo! Answers or Quora contain many comparative questions. Daily questions about comparisons range from choosing what to wear or eat, to more important decisions like where to best study or what library to use for software projects. Surprisingly, search engines or other automatic systems are not yet good at supporting answers to such questions with arguments and explanations. We propose CAM (comparative argumenta- tive machine), an open-domain information retrieval system to argumentatively compare options using information ex- tracted from the Common Crawl. In our empirical study, the CAM users were 15% more accurate and 20% faster at answering comparative questions of the type “How does X compare to Y with respect to Z?” than when using stan- dard keyword-based search. In public platforms, a good (men-made) answer needs to be based on objective expert level argumentation / explanation why to favour one of the choice candidates. However, there currently is no “automatic”

retrieval-based model that would be able to satisfy compar- ative information needs in a general domain with sufficient coverage and explanations. Web search engines, while being able to directly answer many factoid questions, do not treat comparative questions any special but simply return “ten blue links”. In this extended abstract of our CHIIR 2019 publica- tion (Schildw¨achter et al. 2019), we present CAM1a system aimed at general domain comparisons with argumentative ex- planations.

To ensure a wide coverage, our CAM system compares two objects based on argumentative structures extracted from a web-scale Common Crawl index. The CAM system retrieves sentences stating that one of the compared objects is supe- rior to the other, that they are equal, or that they are not comparable. A comparison of two objects oand o0 in the CAM sense is formally defined as “o [> | < | = | 6=

] o0w.r.t.ai, . . . , aj ∈ A”, where A = {a1, . . . , ak} is the set of comparison aspects of o and o0; for example,

“Python (o) is better than (>) Matlab (o0) for web develop- ment (ai).” The five CAM system components are: sentence (answer) retrieval, sentence classification, sentence ranking, aspect extraction, and the user interface. For thesentence Copyright c2020, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.

1Demo, API & code: http://ltdemos.informatik.

uni-hamburg.de/cam/

retrieval, CAM uses an Elasticsearch index of the Com- mon Crawl-based DepCC containing 14.3 billion linguisti- cally pre-processed English sentences. On a user input, sen- tences matching the input objects and comparison aspect(s) are retrieved. In subsequentsentence classification, a classi- fier distinguishes between four classes of support sentences:

the first object from the user input is better / equal / worse than the second one (>,=, <) w.r.t. a comparison aspect, or no comparison is found (6=). The classifier exploits the text between the objects to identify the comparison polarity.

Thesentence rankingscores sentences by combining their classifier’s confidence and the Elasticsearch retrieval score;

the CAM output is selected by summing up the scores of all sentences supporting and attacking the statement. Inas- pect identification, CAM also generates up to ten supple- mentary aspects, even when no comparison aspect is provided by the user. We use three different methods for aspect min- ing: (1) searching for comparative adjectives and adverbs;

(2) searching for phrases with comparative adjectives, ad- verbs and prepositions; (3) searching for specific hand-crafted patterns to match expressions like “because of higher speed”

or “reason for this is the price”. Finally,the user interface consists of a question input form and an answer presentation component. The input form allows to enter two compared ob- jects and their comparison aspects. The answer presentation displays the sentences retrieved from the Common Crawl for the user input, summarizes the sentences in an overall de- cision support score, and gives aspect-specific score bars to show the distribution for the individual user-specified aspects.

We conducted two user studies, where we compared our new CAM system to a keyword-based search. The 23 par- ticipants were prompted to answer 34 different comparative questions extracted from Quora using CAM or the keyword- based search (random shuffle of which user was prompted to use CAM on what topic; no user did one topic with both systems). The results show that the participants’ CAM-based answers were 15% more accurate and 20% faster.

References

Schildw¨achter, M.; Bondarenko, A.; Zenker, J.; Hagen, M.;

Biemann, C.; and Panchenko, A. 2019. Answering compara- tive questions: Better than ten-blue-links? InProceedings of the 2019 Conference on Human Information Interaction and Retrieval, CHIIR 2019, Glasgow, Scotland, UK, 361–365.

Referenzen

ÄHNLICHE DOKUMENTE

In a user study, the participants obtained 15% more accurate answers using CAM compared to a “traditional” keyword-based search and were 20% faster in finding the answer to

or constructions like (1) and (3) involves disjunction of clauses with consequent ellipsis in the second clause, as well as movement of whether/Q associated with a disjunctive clause

30 Cassam grants that transcendental arguments may provide synthetic a priori knowledge but insists that they do not answer how it is possible (Cassam 2007, 56). As I will

Answer these questions. What kind of wash does Hector select on the washing machine?B. a) hot b)

B. Answer the questions. Well, I am going to the traffic lights. Emily might return. You can't say: ‘I have lost my winning Lottery ticket’. Why can't Hector say this to

a) An English person who knows Latin America b) A Latin American who speaks English. Why does Bridget want Hector to audition?. a) Because she wants to help Hector. c) Because

4. Why does Hector want Annie to be like Nick?.. a) Because Nick has very

a) She met the owner of Channel 9. What does Nick do when he takes the guinea pigs into the girls' flat?.. a) He gives them something to eat. What happens when Nick changes channels