
Enable semantic interoperability between resources in UBY:

• UBY contains:

• Cross-lingual and mono-lingual alignments

• More than 760,000 sense alignments

• Alignments modeled by SenseAxis class

• Generating methods:

• Import from existing alignments (created by experts, users, or automatically)

• Automatic alignments

• Created by Alignment Framework

• Based on semantic similarity

• Cross-lingual alignment via Machine Translation
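As a rough illustration of similarity-based alignment, the sketch below links two senses when the word overlap (Jaccard similarity) of their definitions reaches a threshold. This is a minimal sketch under stated assumptions, not the Alignment Framework itself: the poster does not say which similarity measure is used, and `Sense`, `Alignment`, `jaccard`, and `align` are hypothetical names. `Alignment` is only loosely analogous to a SenseAxis entry.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Illustrative sketch only: these types are hypothetical, not the UBY-API classes.
final class GlossAlignmentSketch {

    /** A sense reduced to an identifier and its definition text (gloss). */
    static final class Sense {
        final String id;
        final String gloss;

        Sense(String id, String gloss) {
            this.id = id;
            this.gloss = gloss;
        }
    }

    /** A scored link between two senses, loosely analogous to one SenseAxis entry. */
    static final class Alignment {
        final Sense first;
        final Sense second;
        final double score;

        Alignment(Sense first, Sense second, double score) {
            this.first = first;
            this.second = second;
            this.score = score;
        }
    }

    /** Lower-cases a gloss and splits it into a set of letter-only tokens. */
    static Set<String> tokens(String gloss) {
        String[] parts = gloss.toLowerCase(Locale.ROOT).split("[^\\p{L}]+");
        Set<String> result = new HashSet<>(Arrays.asList(parts));
        result.remove(""); // split yields an empty token if the gloss starts with a non-letter
        return result;
    }

    /** Jaccard similarity: number of shared tokens divided by number of distinct tokens. */
    static double jaccard(String a, String b) {
        Set<String> union = tokens(a);
        Set<String> shared = new HashSet<>(union);
        Set<String> other = tokens(b);
        union.addAll(other);
        shared.retainAll(other);
        return union.isEmpty() ? 0.0 : (double) shared.size() / union.size();
    }

    /** Links every pair of senses whose gloss similarity reaches the threshold. */
    static List<Alignment> align(List<Sense> left, List<Sense> right, double threshold) {
        List<Alignment> alignments = new ArrayList<>();
        for (Sense l : left) {
            for (Sense r : right) {
                double score = jaccard(l.gloss, r.gloss);
                if (score >= threshold) {
                    alignments.add(new Alignment(l, r, score));
                }
            }
        }
        return alignments;
    }
}
```

Applied to two definitions of "to sing" from this poster, "To produce harmonious sounds with one's voice." and "Produce tones with the voice", the two glosses share three of ten distinct tokens. Glosses that paraphrase each other without sharing words, such as "To confess under interrogation." and "divulge confidential information or secrets", score zero, which is one reason a real system would need a stronger similarity measure than plain word overlap.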

Iryna Gurevych, Judith Eckle-Kohler, Silvana Hartmann, Michael Matuschek, Christian M. Meyer, and Christian Wirth: UBY – A Large-Scale Unified Lexical-Semantic Resource Based on LMF, in: Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2012), pp. 580–590, April 2012, Avignon, France.

Judith Eckle-Kohler, Iryna Gurevych, Silvana Hartmann, Michael Matuschek, and Christian M. Meyer: UBY-LMF – A Uniform Model for Standardizing Heterogeneous Lexical-Semantic Resources in ISO-LMF, in: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC), pp. 275–282, May 2012, Istanbul, Turkey.

Judith Eckle-Kohler and Iryna Gurevych: Subcat-LMF – Fleshing out a standardized format for subcategorization frame interoperability, in: Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL), pp. 550–560, April 2012, Avignon, France.

See also Sense Alignments

Iryna Gurevych, Judith Eckle-Kohler, Silvana Hartmann, Michael Matuschek, Christian M. Meyer, Tri-Duc Nghiem

http://www.ukp.tu-darmstadt.de/uby/

Developing a large-scale unified lexical-semantic resource (LSR) in a standardized format for NLP

Motivation:

• Problem: Limited resource coverage in NLP

• Solution: Integration of LSRs

• Problem: Incompatible formats of (integrated) LSRs

• Solution: Standards for modeling LSRs

Vision: One-stop resource

Contributions:

1. UBY-LMF: Lexicon Model

• ISO Standard Lexical Markup Framework (LMF)

• Expert-built and collaboratively created LSRs

• Fine-grained modeling of information types

• Attributes and values refer to ISOCat data categories

2. UBY

• 11 LSRs in English and German

• 10 pairwise sense alignments between LSRs

3. UBY-API:

• Uniform access to LSRs in UBY via a Java API

4. UBY Web Interface

• Visual exploration of UBY

UBY-LMF

This work has been supported by the Emmy Noether Program of the German Research Foundation (DFG) under grant No. 798/3-1 and by the Volkswagen Foundation as part of the Lichtenberg-Professorship Program under grant No. I/82806.

UBY in a Nutshell

UBY

UBY Lexicons with sense alignments (dotted lines: planned)

UBY – A Large-Scale Unified Lexical-Semantic Resource

Example: senses of "to sing" (EN) and "singen" (DE) as defined in different lexicons

• to sing: 1. To produce musical or harmonious sounds with one's voice. 2. To express audibly by means of a harmonious vocalization. 3. To confess under interrogation.

• singen: 1. Mit der Stimme harmonische Töne erzeugen. (To produce harmonious tones with the voice.)

• to sing: 1. Produce tones with the voice. 2. Divulge confidential information or secrets.

• to sing: 1. To produce harmonious sounds with one's voice.

UBY-API

• Java with Hibernate

• NLP application:

• Easy swapping of resources

• Easy combination of resources
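The swapping and combination idea can be sketched as follows: an application written against one uniform lexicon interface does not change when a resource is exchanged or added. This is a minimal sketch, assuming a simple lemma-to-glosses view. `Lexicon`, `InMemoryLexicon`, and `lookupAll` are hypothetical names for illustration and are not taken from the UBY-API, which is backed by a database via Hibernate rather than in-memory maps.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Map;

// Illustrative sketch only: Lexicon and InMemoryLexicon are hypothetical types,
// not classes from the UBY-API.
final class UniformAccessSketch {

    /** Hypothetical uniform view of a lexicon: lemma in, glosses out. */
    interface Lexicon {
        String name();

        List<String> glosses(String lemma);
    }

    /** Toy in-memory lexicon standing in for a database-backed resource. */
    static final class InMemoryLexicon implements Lexicon {
        private final String name;
        private final Map<String, List<String>> entries;

        InMemoryLexicon(String name, Map<String, List<String>> entries) {
            this.name = name;
            this.entries = entries;
        }

        @Override
        public String name() {
            return name;
        }

        @Override
        public List<String> glosses(String lemma) {
            return entries.getOrDefault(lemma, Collections.emptyList());
        }
    }

    /** Queries each given lexicon in order and labels every gloss with its source. */
    static List<String> lookupAll(List<Lexicon> lexicons, String lemma) {
        List<String> result = new ArrayList<>();
        for (Lexicon lexicon : lexicons) {
            for (String gloss : lexicon.glosses(lemma)) {
                result.add(lexicon.name() + ": " + gloss);
            }
        }
        return result;
    }
}
```

Swapping a resource then means passing a different `Lexicon` to `lookupAll`, and combining resources means passing several. The calling code stays the same in both cases.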

UBY Web Interface

• UBY-API and Apache Wicket

• Visualization of sense alignments

• Supports exploration of UBY

Open licenses for data (www.ukp.tu-darmstadt.de/uby/) and software (http://code.google.com/p/uby/)

Using UBY

[Figure: UBY, NLP applications, and the integrated lexicons: VerbNet (EN), WordNet (EN), Wiktionary (EN), Wikipedia (EN), OmegaWiki (EN), Wiktionary (DE), OmegaWiki (DE), Wikipedia (DE), FrameNet (EN), GermaNet (DE), IMSLex (DE)]

UBY Lexicon      Lexical Entry       Sense    Sense Relation

FrameNet 1.5             9,702      11,942                 -
GermaNet 7.0            87,535      99,523           350,259
IMSLex                  15,342      32,396                 -
OmegaWiki-DE            30,967      34,691            23,106
OmegaWiki-EN            51,715      57,921            40,348
Wikipedia-DE           789,781     838,435           571,286
Wikipedia-EN         2,709,447   2,921,455         3,364,083
Wiktionary-DE           85,575      72,752           183,684
Wiktionary-EN          335,469     420,672            23,020
WordNet 3.0            156,584     206,978             8,559
VerbNet                  3,962      31,891                 -

UBY                  4,276,079   4,728,656         4,564,345
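As a quick consistency check, the UBY row can be recomputed from the per-lexicon counts. The sketch below copies the figures from the statistics above and treats each "-" entry as 0, which is an assumption about how missing sense relations were counted. The class and method names are illustrative.

```java
// Counts copied from the statistics table, in row order from FrameNet 1.5 to VerbNet.
// "-" entries are treated as 0 (an assumption, not stated on the poster).
final class UbyTotalsCheck {

    static final long[] LEXICAL_ENTRIES = {
        9_702, 87_535, 15_342, 30_967, 51_715, 789_781,
        2_709_447, 85_575, 335_469, 156_584, 3_962
    };

    static final long[] SENSES = {
        11_942, 99_523, 32_396, 34_691, 57_921, 838_435,
        2_921_455, 72_752, 420_672, 206_978, 31_891
    };

    static final long[] SENSE_RELATIONS = {
        0, 350_259, 0, 23_106, 40_348, 571_286,
        3_364_083, 183_684, 23_020, 8_559, 0
    };

    /** Adds up the counts of one table column. */
    static long sum(long[] counts) {
        long total = 0;
        for (long count : counts) {
            total += count;
        }
        return total;
    }
}
```

Under that assumption the three column sums match the reported UBY totals of 4,276,079 lexical entries, 4,728,656 senses, and 4,564,345 sense relations.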
