• Keine Ergebnisse gefunden

Unsupervised Semantic Frame Induction using Triclustering:

N/A
N/A
Protected

Academic year: 2022

Aktie "Unsupervised Semantic Frame Induction using Triclustering:"

Copied!
3
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Unsupervised Semantic Frame Induction using Triclustering:

Supplementary Materials

Dmitry Ustalov,Alexander Panchenko,Andrei Kutuzov?, Chris Biemann, andSimone Paolo Ponzetto

University of Mannheim, Germany

{dmitry,simone}@informatik.uni-mannheim.de

?University of Oslo, Norway andreku@ifi.uio.no

University of Hamburg, Germany

{panchenko,biemann}@informatik.uni-hamburg.de

Abstract

We use dependency triples automatically extracted from a Web-scale corpus to per- form unsupervised semantic frame induc- tion. We cast the frame induction problem as a triclustering problem that is a gen- eralization of clustering for triadic data.

Our replicable benchmarks demonstrate that the proposed graph-based approach, Triframes, shows state-of-the art results on this task on a FrameNet-derived dataset and performing on par with competitive methods on a verb class clustering task.

This document contains supplementary materials to the main paper.

1 Triple Vector Representation

Figure 1 illustrates our approach for triple vector representation. In our representation, given a syntactic subject-verb-object (SVO) triple (people,make,money), we concatenate the word embeddings corresponding to these words into a single vector representing the whole triple. This explains the core assumption underlying in the Triframes approach: triples representing similar roles appear in similar contexts.

2 Implementation Details

We use a parallel implementation of the WAT-

SET1 algorithm in Java for graph clustering, the Gensim2 library for handling word embeddings, and the Faiss3 library for indexing of word em-

1https://github.com/dustalov/

watset-java

2https://radimrehurek.com/gensim/

3https://github.com/facebookresearch/

faiss

Figure 1: Concatenation of the vectors corre- sponding to the SVO triple elements expresses structural similarity of the triples.

Method # of clusters Triframes WATSET 37,535

HOSG 10,000

NOAC 46,984

Triadic Spectral 500

Triadick-means 500

Triframes CW 1862

LDA-Frames 109

Singletons 648,432

Whole 1

Table 1: Number of induced frames.

beddings and retrieval of nearest neighbors. The source code and the data presented in this paper are available online under a permissive license.4 3 Cluster Sizes

Table1shows the amount of clusters produced by clustering algorithms during the frame induction experiment. Note that the Singletons baseline pro- duced a distinct cluster for each triple and yet re- ceived low scores on each scale.

4https://github.com/uhh-lt/triframes

(2)

4 Examples of Induced Frames

Figures 2, 3 and 4 demonstrate examples of

“good” frames, i.e. those which are seman- tically plausible according to our human judg- ment during a post-hoc manual analysis of clus- tering results. Figures 5, 6 and 7 show ex- amples of “bad” frames according to the same criteria. All the frames are produced by the Triframes WATSET method ranked best as ac- cording to the Frame F1 in the frame induc- tion experiment. In particular, the number of nearest neighbors is n = 30, and the WAT-

SET[CWtop, CWtop] fuzzy clustering algorithm has been used. These frames are available in the file triw2v-watset-n30-top-top-triples.txt

available in the “Downloads” section of our GitHub repository (cf. Section2).

Frame # 848

Subjects: Company, firm, company Verbs: buy, supply, discharge, purchase,

expect

Objects: book, supply, house, land, share, company, grain, which, item, product, ticket, work, this, equipment, House, it, film, water, something, she, what, service, plant, time

Figure 2: An example of a “good” frame.

Frame # 849

Subjects: student, scientist, we, pupil, member, company, man, no- body, you, they, US, group, it, people, Man, user, he

Verbs: do, test, perform, execute, con- duct

Objects: experiment, test

Figure 3: An example of a “good” frame.

Frame # 3207 Subjects: people, we, they, you Verbs: feel, seek, look, search

Objects: housing, inspiration, gold, wit- ness, partner, accommodation, Partner

Figure 4: An example of a “good” frame.

Frame # 1

Subjects: you, she, he, return, they, we, themselves, road, help, who Verbs: govern, discourage, resem-

ble, encumber, urge, pummel, . . .911 more verbs . . . , demol- ish, swarm, anticipate, spew, derail, emit, snap

Objects: you, pass, she, he, it, product, change, solution, total, any, wall, they, something, people, classic, this, interest, itself, flat, place, part, controversy

Figure 5: An example of a “bad” frame.

Frame # 852

Subjects: Word, glue, pill, speed, drug, pot, they, those, mine, item, re- source, this, its, it, something, most, horse, material, chemical, plant, information, word

Verbs: use, attach, apply, follow Objects: we, they, you, it, report, he

Figure 6: An example of a “bad” frame.

Frame # 37535 Subjects: he

Verbs: phone, book Objects: you

Figure 7: An example of a “bad” frame.

(3)

Officer|chair|Committee

officer|head|team mayor|lead|city

officer|lead|company Mayor|lead|city

boss|lead|company chairman|lead|company

director|lead|department chief|lead|department president|lead|government

president|lead|state

director|lead|company president|lead|department

officer|chair|committee Chief|lead|department

chairman|lead|committee

Director|lead|Department Director|lead|department

Director|lead|agency

Director|lead|company minister|lead|team

Director|head|team director|head|team

Chairman|lead|company

Chairman|lead|Committee

President|lead|company

Director|chair|Committee

President|lead|party

President|head|team leader|head|team

Director|chair|committee director|chair|committee

Director|head|Department

president|head|team director|head|department

director|head|agency

director|head|committee Chairman|run|committee

Chairman|chair|Committee Chairman|chair|committee

President|chair|Committee President|chair|committee

Governor|lead|state

chairman|head|committee chairman|run|committee

president|chair|committee

president|head|committee president|chair|Committee

Minister|chair|committee

representative|chair|committee

representative|head|committee General|command|department General|command|Department

General|head|Department

General|head|department

officer|head|department

minister|head|department

leader|head|agency

leader|head|party

leader|head|committee leader|head|department

minister|head|committee

King|run|company

leader|head|government Minister|head|government

president|head|government

Officer|chair|Committee

officer|head|team mayor|lead|city

officer|lead|company Mayor|lead|city

boss|lead|company chairman|lead|company

director|lead|department chief|lead|department president|lead|government

president|lead|state

director|lead|company president|lead|department

officer|chair|committee Chief|lead|department

chairman|lead|committee

Director|lead|Department Director|lead|department

Director|lead|agency

Director|lead|company minister|lead|team

Director|head|team director|head|team

Chairman|lead|company

Chairman|lead|Committee

President|lead|company

Director|chair|Committee

President|lead|party

President|head|team leader|head|team

Director|chair|committee director|chair|committee

Director|head|Department

president|head|team director|head|department

director|head|agency

director|head|committee Chairman|run|committee

Chairman|chair|Committee Chairman|chair|committee

President|chair|Committee President|chair|committee

Governor|lead|state

chairman|head|committee chairman|run|committee

president|chair|committee

president|head|committee president|chair|Committee

Minister|chair|committee

representative|chair|committee

representative|head|committee General|command|department General|command|Department

General|head|Department

General|head|department

officer|head|department

minister|head|department

leader|head|agency

leader|head|party

leader|head|committee leader|head|department

minister|head|committee

King|run|company

leader|head|government Minister|head|government

president|head|government

Figure 8: Visualization of an SVO triple graph, where edges represent distributional relatedness of the triples estimated using word embeddings.

5 Visualization of Triple Graph

Figure 8 presents a densly connected part of the triple graph related to the concept of “leadership”.

A similar cluster of triples can represent a seman- tic frame induced automatically from text using our approach.

Referenzen

ÄHNLICHE DOKUMENTE

time-history plot of sway displacements, damped plastic system plastic limit load increased by 1,5.. phase portrait of sway displacements, damped plastic system plastic limit

Based on the evaluation results, we discussed the influence of each of the similarity dimensions, and demonstrated empirically that text reuse can be best detected if measures

As the method operates exclusively on pseudo glosses generated on the basis of the hypernymy relation, this type of information from definitions turns out to be the most important

Its form comprises the panels in horizontal format that form a sequence.. The other order

Prognostic framing contains the proposed solutions to the identified problems (from the diagnostic framing) and the indication of strategies, goals, etc. Motivational framing

Durch die Angabe von Vor- und Nachbedingungen sowie Frame Conditions steht eine vollst¨andige Beschreibung der Funktionalit¨at einer Operation auf Modellebene zur Verf¨ugung..

The methodology of the neighborhood development strategy guarantees the inclusion of all stakeholders and supports a prioritization in order to decide on future measures that can

Using Hebrew-to-Greek word alignments, cantillation marks are projected from each Hebrew word to its corresponding Greek chunk and subtree (Table 5).. Only the mark on the last word