• Keine Ergebnisse gefunden

Experimental Evaluation of General Concept Inclusions Learned from Textual Data

N/A
N/A
Protected

Academic year: 2022

Aktie "Experimental Evaluation of General Concept Inclusions Learned from Textual Data"

Copied!
132
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Experimental Evaluation of General Concept Inclusions Learned from Textual Data.

Hybris B1: Automatic Generation of Description Logic-based Biomedical Ontologies

Daniel Borchmann, Anas Elghafari, and Yue Ma

TU Dresden

2015-06-08

(2)

Goal

Automatically construct biomedical ontologies from text:

Learn concept definitions from text Learn terminological knowledge from text Evalutation

Example (Terminological Knowledge)

Genes are not protein complexes, and vice versa. Gene[ProteinComplexĎK Proteins contain amino acids

ProteinDomain[ DhasPart.JĎDhasPart.AminoAcid

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 2 / 14

(3)

Goal

Automatically construct biomedical ontologies from text:

Learn concept definitions from text

Learn terminological knowledge from text Evalutation

Example (Terminological Knowledge)

Genes are not protein complexes, and vice versa. Gene[ProteinComplexĎK Proteins contain amino acids

ProteinDomain[ DhasPart.JĎDhasPart.AminoAcid

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 2 / 14

(4)

Goal

Automatically construct biomedical ontologies from text:

Learn concept definitions from text Learn terminological knowledge from text

Evalutation

Example (Terminological Knowledge)

Genes are not protein complexes, and vice versa. Gene[ProteinComplexĎK Proteins contain amino acids

ProteinDomain[ DhasPart.JĎDhasPart.AminoAcid

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 2 / 14

(5)

Goal

Automatically construct biomedical ontologies from text:

Learn concept definitions from text Learn terminological knowledge from text Evalutation

Example (Terminological Knowledge)

Genes are not protein complexes, and vice versa. Gene[ProteinComplexĎK Proteins contain amino acids

ProteinDomain[ DhasPart.JĎDhasPart.AminoAcid

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 2 / 14

(6)

Goal

Automatically construct biomedical ontologies from text:

Learn concept definitions from text Learn terminological knowledge from text Evalutation

Example (Terminological Knowledge)

Genes are not protein complexes, and vice versa.

Gene[ProteinComplexĎK Proteins contain amino acids

ProteinDomain[ DhasPart.JĎDhasPart.AminoAcid

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 2 / 14

(7)

Goal

Automatically construct biomedical ontologies from text:

Learn concept definitions from text Learn terminological knowledge from text Evalutation

Example (Terminological Knowledge)

Genes are not protein complexes, and vice versa.

Gene[ProteinComplexĎK

Proteins contain amino acids

ProteinDomain[ DhasPart.JĎDhasPart.AminoAcid

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 2 / 14

(8)

Goal

Automatically construct biomedical ontologies from text:

Learn concept definitions from text Learn terminological knowledge from text Evalutation

Example (Terminological Knowledge)

Genes are not protein complexes, and vice versa.

Gene[ProteinComplexĎK

Proteins contain amino acids

ProteinDomain[ DhasPart.JĎDhasPart.AminoAcid

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 2 / 14

(9)

Goal

Automatically construct biomedical ontologies from text:

Learn concept definitions from text Learn terminological knowledge from text Evalutation

Example (Terminological Knowledge)

Genes are not protein complexes, and vice versa.

Gene[ProteinComplexĎK Proteins contain amino acids

ProteinDomain[ DhasPart.JĎDhasPart.AminoAcid

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 2 / 14

(10)

Goal

Automatically construct biomedical ontologies from text:

Learn concept definitions from text Learn terminological knowledge from text Evalutation

Example (Terminological Knowledge)

Genes are not protein complexes, and vice versa.

Gene[ProteinComplexĎK Proteins contain amino acids

ProteinDomain[ DhasPart.JĎDhasPart.AminoAcid

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 2 / 14

(11)

Looking Back

Previous Approaches

Exploit approach of learning SNOMED definitions from text.

Generate GCIs and check for their occurrence in the text.

GCIs fromattribute exploration of certain basic concept description, with DL reasoner as expert

did not finish (ě2 weeks) GCIs produced mostly nonsense

Compute implicationsin instance-data generated from annotated text

obtained terminological knowledge

“good” quality, measured with precision and recall

only restricted form of concept descriptions (at most 2 conjuncts on the left-hand side, of pre-defined form)

Current Goal

Learn all GCIs that are valid in the text corpus Find a way to evaluate them

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 3 / 14

(12)

Looking Back

Previous Approaches

Exploit approach of learning SNOMED definitions from text.

Generate GCIs and check for their occurrence in the text.

GCIs fromattribute exploration of certain basic concept description, with DL reasoner as expert

did not finish (ě2 weeks) GCIs produced mostly nonsense

Compute implicationsin instance-data generated from annotated text

obtained terminological knowledge

“good” quality, measured with precision and recall

only restricted form of concept descriptions (at most 2 conjuncts on the left-hand side, of pre-defined form)

Current Goal

Learn all GCIs that are valid in the text corpus Find a way to evaluate them

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 3 / 14

(13)

Looking Back

Previous Approaches

Exploit approach of learning SNOMED definitions from text.

Generate GCIs and check for their occurrence in the text.

GCIs fromattribute exploration of certain basic concept description, with DL reasoner as expert

did not finish (ě2 weeks) GCIs produced mostly nonsense

Compute implicationsin instance-data generated from annotated text

obtained terminological knowledge

“good” quality, measured with precision and recall

only restricted form of concept descriptions (at most 2 conjuncts on the left-hand side, of pre-defined form)

Current Goal

Learn all GCIs that are valid in the text corpus Find a way to evaluate them

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 3 / 14

(14)

Looking Back

Previous Approaches

Exploit approach of learning SNOMED definitions from text.

Generate GCIs and check for their occurrence in the text.

GCIs fromattribute explorationof certain basic concept description, with DL reasoner as expert

did not finish (ě2 weeks) GCIs produced mostly nonsense

Compute implicationsin instance-data generated from annotated text

obtained terminological knowledge

“good” quality, measured with precision and recall

only restricted form of concept descriptions (at most 2 conjuncts on the left-hand side, of pre-defined form)

Current Goal

Learn all GCIs that are valid in the text corpus Find a way to evaluate them

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 3 / 14

(15)

Looking Back

Previous Approaches

Exploit approach of learning SNOMED definitions from text.

Generate GCIs and check for their occurrence in the text.

GCIs fromattribute explorationof certain basic concept description, with DL reasoner as expert

did not finish (ě2 weeks) GCIs produced mostly nonsense

Compute implicationsin instance-data generated from annotated text

obtained terminological knowledge

“good” quality, measured with precision and recall

only restricted form of concept descriptions (at most 2 conjuncts on the left-hand side, of pre-defined form)

Current Goal

Learn all GCIs that are valid in the text corpus Find a way to evaluate them

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 3 / 14

(16)

Looking Back

Previous Approaches

Exploit approach of learning SNOMED definitions from text.

Generate GCIs and check for their occurrence in the text.

GCIs fromattribute explorationof certain basic concept description, with DL reasoner as expert

did not finish (ě2 weeks)

GCIs produced mostly nonsense

Compute implicationsin instance-data generated from annotated text

obtained terminological knowledge

“good” quality, measured with precision and recall

only restricted form of concept descriptions (at most 2 conjuncts on the left-hand side, of pre-defined form)

Current Goal

Learn all GCIs that are valid in the text corpus Find a way to evaluate them

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 3 / 14

(17)

Looking Back

Previous Approaches

Exploit approach of learning SNOMED definitions from text.

Generate GCIs and check for their occurrence in the text.

GCIs fromattribute explorationof certain basic concept description, with DL reasoner as expert

did not finish (ě2 weeks) GCIs produced mostly nonsense

Compute implicationsin instance-data generated from annotated text

obtained terminological knowledge

“good” quality, measured with precision and recall

only restricted form of concept descriptions (at most 2 conjuncts on the left-hand side, of pre-defined form)

Current Goal

Learn all GCIs that are valid in the text corpus Find a way to evaluate them

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 3 / 14

(18)

Looking Back

Previous Approaches

Exploit approach of learning SNOMED definitions from text.

Generate GCIs and check for their occurrence in the text.

GCIs fromattribute explorationof certain basic concept description, with DL reasoner as expert

did not finish (ě2 weeks) GCIs produced mostly nonsense

Compute implicationsin instance-data generated from annotated text

obtained terminological knowledge

“good” quality, measured with precision and recall

only restricted form of concept descriptions (at most 2 conjuncts on the left-hand side, of pre-defined form)

Current Goal

Learn all GCIs that are valid in the text corpus Find a way to evaluate them

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 3 / 14

(19)

Looking Back

Previous Approaches

Exploit approach of learning SNOMED definitions from text.

Generate GCIs and check for their occurrence in the text.

GCIs fromattribute explorationof certain basic concept description, with DL reasoner as expert

did not finish (ě2 weeks) GCIs produced mostly nonsense

Compute implicationsin instance-data generated from annotated text obtained terminological knowledge

“good” quality, measured with precision and recall

only restricted form of concept descriptions (at most 2 conjuncts on the left-hand side, of pre-defined form)

Current Goal

Learn all GCIs that are valid in the text corpus Find a way to evaluate them

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 3 / 14

(20)

Looking Back

Previous Approaches

Exploit approach of learning SNOMED definitions from text.

Generate GCIs and check for their occurrence in the text.

GCIs fromattribute explorationof certain basic concept description, with DL reasoner as expert

did not finish (ě2 weeks) GCIs produced mostly nonsense

Compute implicationsin instance-data generated from annotated text obtained terminological knowledge

“good” quality, measured with precision and recall

only restricted form of concept descriptions (at most 2 conjuncts on the left-hand side, of pre-defined form)

Current Goal

Learn all GCIs that are valid in the text corpus Find a way to evaluate them

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 3 / 14

(21)

Looking Back

Previous Approaches

Exploit approach of learning SNOMED definitions from text.

Generate GCIs and check for their occurrence in the text.

GCIs fromattribute explorationof certain basic concept description, with DL reasoner as expert

did not finish (ě2 weeks) GCIs produced mostly nonsense

Compute implicationsin instance-data generated from annotated text obtained terminological knowledge

“good” quality, measured with precision and recall

only restricted form of concept descriptions (at most 2 conjuncts on the left-hand side, of pre-defined form)

Current Goal

Learn all GCIs that are valid in the text corpus Find a way to evaluate them

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 3 / 14

(22)

Looking Back

Previous Approaches

Exploit approach of learning SNOMED definitions from text.

Generate GCIs and check for their occurrence in the text.

GCIs fromattribute explorationof certain basic concept description, with DL reasoner as expert

did not finish (ě2 weeks) GCIs produced mostly nonsense

Compute implicationsin instance-data generated from annotated text obtained terminological knowledge

“good” quality, measured with precision and recall

only restricted form of concept descriptions (at most 2 conjuncts on the left-hand side, of pre-defined form)

Current Goal

Learn all GCIs that are valid in the text corpus Find a way to evaluate them

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 3 / 14

(23)

Looking Back

Previous Approaches

Exploit approach of learning SNOMED definitions from text.

Generate GCIs and check for their occurrence in the text.

GCIs fromattribute explorationof certain basic concept description, with DL reasoner as expert

did not finish (ě2 weeks) GCIs produced mostly nonsense

Compute implicationsin instance-data generated from annotated text obtained terminological knowledge

“good” quality, measured with precision and recall

only restricted form of concept descriptions (at most 2 conjuncts on the left-hand side, of pre-defined form)

Current Goal

Learn all GCIs that are valid in the text corpus

Find a way to evaluate them

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 3 / 14

(24)

Looking Back

Previous Approaches

Exploit approach of learning SNOMED definitions from text.

Generate GCIs and check for their occurrence in the text.

GCIs fromattribute explorationof certain basic concept description, with DL reasoner as expert

did not finish (ě2 weeks) GCIs produced mostly nonsense

Compute implicationsin instance-data generated from annotated text obtained terminological knowledge

“good” quality, measured with precision and recall

only restricted form of concept descriptions (at most 2 conjuncts on the left-hand side, of pre-defined form)

Current Goal

Learn all GCIs that are valid in the text corpus Find a way to evaluate them

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 3 / 14

(25)

Learning GCIs [Baader and Distel, 2007]

Allows to learnall valid ℰℒ-GCIs fromfinite interpretations

Person Artist

Person

Person Writer child

child

Dchild.WriterĎArtist, . . .

Computes abaseof all such GCIs

Can also compute base ofminimal cardinality

Can include role-depth bounds[Distel, 2012; Borchmann et.al., 2015] Implementations available (prototypes)

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 4 / 14

(26)

Learning GCIs [Baader and Distel, 2007]

Allows to learnall valid ℰℒ-GCIs fromfinite interpretations

Person Artist

Person

Person Writer child

child

Dchild.WriterĎArtist, . . .

Computes abaseof all such GCIs

Can also compute base ofminimal cardinality

Can include role-depth bounds[Distel, 2012; Borchmann et.al., 2015] Implementations available (prototypes)

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 4 / 14

(27)

Learning GCIs [Baader and Distel, 2007]

Allows to learnall valid ℰℒ-GCIs fromfinite interpretations

Person Artist

Person

Person Writer child

child

Dchild.WriterĎArtist, . . .

Computes abaseof all such GCIs

Can also compute base ofminimal cardinality

Can include role-depth bounds[Distel, 2012; Borchmann et.al., 2015] Implementations available (prototypes)

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 4 / 14

(28)

Learning GCIs [Baader and Distel, 2007]

Allows to learnall valid ℰℒ-GCIs fromfinite interpretations

Person Artist

Person

Person Writer child

child

Dchild.WriterĎArtist, . . .

Computes abaseof all such GCIs

Can also compute base ofminimal cardinality

Can include role-depth bounds[Distel, 2012; Borchmann et.al., 2015] Implementations available (prototypes)

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 4 / 14

(29)

Learning GCIs [Baader and Distel, 2007]

Allows to learnall valid ℰℒ-GCIs fromfinite interpretations

Person Artist

Person

Person Writer child

child

Dchild.WriterĎArtist, . . .

Computes abaseof all such GCIs

Can also compute base ofminimal cardinality

Can include role-depth bounds[Distel, 2012; Borchmann et.al., 2015] Implementations available (prototypes)

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 4 / 14

(30)

Learning GCIs [Baader and Distel, 2007]

Allows to learnall valid ℰℒ-GCIs fromfinite interpretations

Person Artist

Person

Person Writer child

child

Dchild.WriterĎArtist, . . .

Computes abaseof all such GCIs

Can also compute base ofminimal cardinality

Can include role-depth bounds[Distel, 2012; Borchmann et.al., 2015] Implementations available (prototypes)

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 4 / 14

(31)

Learning GCIs [Baader and Distel, 2007]

Allows to learnall valid ℰℒ-GCIs fromfinite interpretations

Person Artist

Person

Person Writer child

child

Dchild.WriterĎArtist, . . .

Computes abaseof all such GCIs

Can also compute base ofminimal cardinality

Can include role-depth bounds[Distel, 2012; Borchmann et.al., 2015]

Implementations available (prototypes)

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 4 / 14

(32)

Learning GCIs [Baader and Distel, 2007]

Allows to learnall valid ℰℒ-GCIs fromfinite interpretations

Person Artist

Person

Person Writer child

child

Dchild.WriterĎArtist, . . .

Computes abaseof all such GCIs

Can also compute base ofminimal cardinality

Can include role-depth bounds[Distel, 2012; Borchmann et.al., 2015]

Implementations available (prototypes)

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 4 / 14

(33)

Application

Experimental Setup

Take annotated text from the biomedical domain (GRO) Turn annotation into relational data

Learn valid GCIs of a particular role-depth Evaluate

Evaluation

How many GCIs learned follow from the GRO? (certainly true positives) How many GCIs causeinconsistency or unsatisfiable classesin the GRO? (certainly false positives)

How many GCIs of the GRO follow from the GCIs we learned? (“recall”)

“Small” Issue

Annotation uses open-world semantics Learning uses closed-world semantics

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 5 / 14

(34)

Application

Experimental Setup

Take annotated text from the biomedical domain (GRO) Turn annotation into relational data

Learn valid GCIs of a particular role-depth Evaluate

Evaluation

How many GCIs learned follow from the GRO? (certainly true positives) How many GCIs causeinconsistency or unsatisfiable classesin the GRO? (certainly false positives)

How many GCIs of the GRO follow from the GCIs we learned? (“recall”)

“Small” Issue

Annotation uses open-world semantics Learning uses closed-world semantics

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 5 / 14

(35)

Application

Experimental Setup

Take annotated text from the biomedical domain (GRO)

Turn annotation into relational data Learn valid GCIs of a particular role-depth Evaluate

Evaluation

How many GCIs learned follow from the GRO? (certainly true positives) How many GCIs causeinconsistency or unsatisfiable classesin the GRO? (certainly false positives)

How many GCIs of the GRO follow from the GCIs we learned? (“recall”)

“Small” Issue

Annotation uses open-world semantics Learning uses closed-world semantics

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 5 / 14

(36)

Application

Experimental Setup

Take annotated text from the biomedical domain (GRO) Turn annotation into relational data

Learn valid GCIs of a particular role-depth Evaluate

Evaluation

How many GCIs learned follow from the GRO? (certainly true positives) How many GCIs causeinconsistency or unsatisfiable classesin the GRO? (certainly false positives)

How many GCIs of the GRO follow from the GCIs we learned? (“recall”)

“Small” Issue

Annotation uses open-world semantics Learning uses closed-world semantics

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 5 / 14

(37)

Application

Experimental Setup

Take annotated text from the biomedical domain (GRO) Turn annotation into relational data

Learn valid GCIs of a particular role-depth

Evaluate Evaluation

How many GCIs learned follow from the GRO? (certainly true positives) How many GCIs causeinconsistency or unsatisfiable classesin the GRO? (certainly false positives)

How many GCIs of the GRO follow from the GCIs we learned? (“recall”)

“Small” Issue

Annotation uses open-world semantics Learning uses closed-world semantics

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 5 / 14

(38)

Application

Experimental Setup

Take annotated text from the biomedical domain (GRO) Turn annotation into relational data

Learn valid GCIs of a particular role-depth Evaluate

Evaluation

How many GCIs learned follow from the GRO? (certainly true positives) How many GCIs causeinconsistency or unsatisfiable classesin the GRO? (certainly false positives)

How many GCIs of the GRO follow from the GCIs we learned? (“recall”)

“Small” Issue

Annotation uses open-world semantics Learning uses closed-world semantics

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 5 / 14

(39)

Application

Experimental Setup

Take annotated text from the biomedical domain (GRO) Turn annotation into relational data

Learn valid GCIs of a particular role-depth Evaluate

Evaluation

How many GCIs learned follow from the GRO? (certainly true positives) How many GCIs causeinconsistency orunsatisfiable classes in the GRO? (certainly false positives)

How many GCIs of the GRO follow from the GCIs we learned? (“recall”)

“Small” Issue

Annotation uses open-world semantics Learning uses closed-world semantics

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 5 / 14

(40)

Application

Experimental Setup

Take annotated text from the biomedical domain (GRO) Turn annotation into relational data

Learn valid GCIs of a particular role-depth Evaluate

Evaluation

How many GCIs learned follow from the GRO? (certainly true positives)

How many GCIs causeinconsistency orunsatisfiable classes in the GRO? (certainly false positives)

How many GCIs of the GRO follow from the GCIs we learned? (“recall”)

“Small” Issue

Annotation uses open-world semantics Learning uses closed-world semantics

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 5 / 14

(41)

Application

Experimental Setup

Take annotated text from the biomedical domain (GRO) Turn annotation into relational data

Learn valid GCIs of a particular role-depth Evaluate

Evaluation

How many GCIs learned follow from the GRO? (certainly true positives) How many GCIs causeinconsistency orunsatisfiable classes in the GRO? (certainly false positives)

How many GCIs of the GRO follow from the GCIs we learned? (“recall”)

“Small” Issue

Annotation uses open-world semantics Learning uses closed-world semantics

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 5 / 14

(42)

Application

Experimental Setup

Take annotated text from the biomedical domain (GRO) Turn annotation into relational data

Learn valid GCIs of a particular role-depth Evaluate

Evaluation

How many GCIs learned follow from the GRO? (certainly true positives) How many GCIs causeinconsistency orunsatisfiable classes in the GRO? (certainly false positives)

How many GCIs of the GRO follow from the GCIs we learned? (“recall”)

“Small” Issue

Annotation uses open-world semantics Learning uses closed-world semantics

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 5 / 14

(43)

Application

Experimental Setup

Take annotated text from the biomedical domain (GRO) Turn annotation into relational data

Learn valid GCIs of a particular role-depth Evaluate

Evaluation

How many GCIs learned follow from the GRO? (certainly true positives) How many GCIs causeinconsistency orunsatisfiable classes in the GRO? (certainly false positives)

How many GCIs of the GRO follow from the GCIs we learned? (“recall”)

“Small” Issue

Annotation uses open-world semantics Learning uses closed-world semantics

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 5 / 14

(44)

Application

Experimental Setup

Take annotated text from the biomedical domain (GRO) Turn annotation into relational data

Learn valid GCIs of a particular role-depth Evaluate

Evaluation

How many GCIs learned follow from the GRO? (certainly true positives) How many GCIs causeinconsistency orunsatisfiable classes in the GRO? (certainly false positives)

How many GCIs of the GRO follow from the GCIs we learned? (“recall”)

“Small” Issue

Annotation uses open-world semantics

Learning uses closed-world semantics

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 5 / 14

(45)

Application

Experimental Setup

Take annotated text from the biomedical domain (GRO) Turn annotation into relational data

Learn valid GCIs of a particular role-depth Evaluate

Evaluation

How many GCIs learned follow from the GRO? (certainly true positives) How many GCIs causeinconsistency orunsatisfiable classes in the GRO? (certainly false positives)

How many GCIs of the GRO follow from the GCIs we learned? (“recall”)

“Small” Issue

Annotation uses open-world semantics Learning uses closed-world semantics

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 5 / 14

(46)

The Data-Set

Gene Regulation Ontology task at BioNLP Shared Task 2013 (http://2013.bionlp-st.org)

200 manually annotated PubMed abstracts on gene regulation processes Annotations from the Gene Regulation Ontology (GRO)

Entities (Cell, Protein, Tissue, . . . )

Events (Mutation, Localization, Experimental Intervention, . . . ) Relations (encodes, locatedIn, fromSpecies, . . . )

Example (Entities and Events)

Activin addition strongly promotes an interaction between these two proteins .

Protein Activation ProteinProteinInteraction Protein

hasAgent hasPatient hasPatient

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 6 / 14

(47)

The Data-Set

Gene Regulation Ontology task at BioNLP Shared Task 2013 (http://2013.bionlp-st.org)

200 manually annotated PubMed abstracts on gene regulation processes Annotations from the Gene Regulation Ontology (GRO)

Entities (Cell, Protein, Tissue, . . . )

Events (Mutation, Localization, Experimental Intervention, . . . ) Relations (encodes, locatedIn, fromSpecies, . . . )

Example (Entities and Events)

Activin addition strongly promotes an interaction between these two proteins .

Protein Activation ProteinProteinInteraction Protein

hasAgent hasPatient hasPatient

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 6 / 14

(48)

The Data-Set

Gene Regulation Ontology task at BioNLP Shared Task 2013 (http://2013.bionlp-st.org)

200 manually annotated PubMed abstracts on gene regulation processes

Annotations from the Gene Regulation Ontology (GRO)

Entities (Cell, Protein, Tissue, . . . )

Events (Mutation, Localization, Experimental Intervention, . . . ) Relations (encodes, locatedIn, fromSpecies, . . . )

Example (Entities and Events)

Activin addition strongly promotes an interaction between these two proteins .

Protein Activation ProteinProteinInteraction Protein

hasAgent hasPatient hasPatient

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 6 / 14

(49)

The Data-Set

Gene Regulation Ontology task at BioNLP Shared Task 2013 (http://2013.bionlp-st.org)

200 manually annotated PubMed abstracts on gene regulation processes Annotations from the Gene Regulation Ontology (GRO)

Entities (Cell, Protein, Tissue, . . . )

Events (Mutation, Localization, Experimental Intervention, . . . ) Relations (encodes, locatedIn, fromSpecies, . . . )

Example (Entities and Events)

Activin addition strongly promotes an interaction between these two proteins .

Protein Activation ProteinProteinInteraction Protein

hasAgent hasPatient hasPatient

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 6 / 14

(50)

The Data-Set

Gene Regulation Ontology task at BioNLP Shared Task 2013 (http://2013.bionlp-st.org)

200 manually annotated PubMed abstracts on gene regulation processes Annotations from the Gene Regulation Ontology (GRO)

Entities (Cell, Protein, Tissue, . . . )

Events (Mutation, Localization, Experimental Intervention, . . . ) Relations (encodes, locatedIn, fromSpecies, . . . )

Example (Entities and Events)

Activin addition strongly promotes an interaction between these two proteins .

Protein Activation ProteinProteinInteraction Protein

hasAgent hasPatient hasPatient

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 6 / 14

(51)

The Data-Set

Gene Regulation Ontology task at BioNLP Shared Task 2013 (http://2013.bionlp-st.org)

200 manually annotated PubMed abstracts on gene regulation processes Annotations from the Gene Regulation Ontology (GRO)

Entities (Cell, Protein, Tissue, . . . )

Events (Mutation, Localization, Experimental Intervention, . . . )

Relations (encodes, locatedIn, fromSpecies, . . . )

Example (Entities and Events)

Activin addition strongly promotes an interaction between these two proteins .

Protein Activation ProteinProteinInteraction Protein

hasAgent hasPatient hasPatient

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 6 / 14

(52)

The Data-Set

Gene Regulation Ontology task at BioNLP Shared Task 2013 (http://2013.bionlp-st.org)

200 manually annotated PubMed abstracts on gene regulation processes Annotations from the Gene Regulation Ontology (GRO)

Entities (Cell, Protein, Tissue, . . . )

Events (Mutation, Localization, Experimental Intervention, . . . ) Relations (encodes, locatedIn, fromSpecies, . . . )

Example (Entities and Events)

Activin addition strongly promotes an interaction between these two proteins .

Protein Activation ProteinProteinInteraction Protein

hasAgent hasPatient hasPatient

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 6 / 14

(53)

The Data-Set

Gene Regulation Ontology task at BioNLP Shared Task 2013 (http://2013.bionlp-st.org)

200 manually annotated PubMed abstracts on gene regulation processes Annotations from the Gene Regulation Ontology (GRO)

Entities (Cell, Protein, Tissue, . . . )

Events (Mutation, Localization, Experimental Intervention, . . . ) Relations (encodes, locatedIn, fromSpecies, . . . )

Example (Entities and Events)

Activin addition strongly promotes an interaction between these two proteins .

Protein Activation ProteinProteinInteraction Protein

hasAgent hasPatient hasPatient

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 6 / 14

(54)

The Data-Set

Gene Regulation Ontology task at BioNLP Shared Task 2013 (http://2013.bionlp-st.org)

200 manually annotated PubMed abstracts on gene regulation processes Annotations from the Gene Regulation Ontology (GRO)

Entities (Cell, Protein, Tissue, . . . )

Events (Mutation, Localization, Experimental Intervention, . . . ) Relations (encodes, locatedIn, fromSpecies, . . . )

Example (Entities and Events)

Activin addition strongly promotes an interaction between these two proteins .

Protein Activation ProteinProteinInteraction Protein

hasAgent hasPatient hasPatient

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 6 / 14

(55)

The Data-Set

Gene Regulation Ontology task at BioNLP Shared Task 2013 (http://2013.bionlp-st.org)

200 manually annotated PubMed abstracts on gene regulation processes Annotations from the Gene Regulation Ontology (GRO)

Entities (Cell, Protein, Tissue, . . . )

Events (Mutation, Localization, Experimental Intervention, . . . ) Relations (encodes, locatedIn, fromSpecies, . . . )

Example (Entities and Events)

Activin addition strongly promotes an interaction between these two proteins .

Protein Protein

Activation ProteinProteinInteraction

hasAgent hasPatient hasPatient

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 6 / 14

(56)

The Data-Set

Gene Regulation Ontology task at BioNLP Shared Task 2013 (http://2013.bionlp-st.org)

200 manually annotated PubMed abstracts on gene regulation processes Annotations from the Gene Regulation Ontology (GRO)

Entities (Cell, Protein, Tissue, . . . )

Events (Mutation, Localization, Experimental Intervention, . . . ) Relations (encodes, locatedIn, fromSpecies, . . . )

Example (Entities and Events)

Activin addition strongly promotes an interaction between these two proteins . Protein Activation ProteinProteinInteraction Protein

hasAgent hasPatient hasPatient

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 6 / 14

(57)

The Data-Set

Gene Regulation Ontology task at BioNLP Shared Task 2013 (http://2013.bionlp-st.org)

200 manually annotated PubMed abstracts on gene regulation processes Annotations from the Gene Regulation Ontology (GRO)

Entities (Cell, Protein, Tissue, . . . )

Events (Mutation, Localization, Experimental Intervention, . . . ) Relations (encodes, locatedIn, fromSpecies, . . . )

Example (Entities and Events)

Activin addition strongly promotes an interaction between these two proteins . Protein Activation ProteinProteinInteraction Protein

hasAgent hasPatient hasPatient

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 6 / 14

(58)

Evaluation

Experiment

considered only 30 most frequent concept-names (reason: performance) Resulting interpretation has 7399 elements, 30 concept-names, and 7 role-names

role-depth bound 1

Results

1552 GCIs extracted

GRO with these GCIs is still consistent has 321 unsatisfiable classes (out of 507)

49 GCIs (each on its own) cause unsatisfiable classes («3.2%) Removal of 56 GCIs results in no unsatisfiable classes («3.6%) 319 are entailed by the GRO («20.6%)

Recall not yet available

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 7 / 14

(59)

Evaluation

Experiment

considered only 30 most frequent concept-names (reason: performance) Resulting interpretation has 7399 elements, 30 concept-names, and 7 role-names

role-depth bound 1 Results

1552 GCIs extracted

GRO with these GCIs is still consistent has 321 unsatisfiable classes (out of 507)

49 GCIs (each on its own) cause unsatisfiable classes («3.2%) Removal of 56 GCIs results in no unsatisfiable classes («3.6%) 319 are entailed by the GRO («20.6%)

Recall not yet available

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 7 / 14

(60)

Evaluation

Experiment

considered only 30 most frequent concept-names (reason: performance)

Resulting interpretation has 7399 elements, 30 concept-names, and 7 role-names

role-depth bound 1 Results

1552 GCIs extracted

GRO with these GCIs is still consistent has 321 unsatisfiable classes (out of 507)

49 GCIs (each on its own) cause unsatisfiable classes («3.2%) Removal of 56 GCIs results in no unsatisfiable classes («3.6%) 319 are entailed by the GRO («20.6%)

Recall not yet available

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 7 / 14

(61)

Evaluation

Experiment

considered only 30 most frequent concept-names (reason: performance) Resulting interpretation has 7399 elements, 30 concept-names, and 7 role-names

role-depth bound 1 Results

1552 GCIs extracted

GRO with these GCIs is still consistent has 321 unsatisfiable classes (out of 507)

49 GCIs (each on its own) cause unsatisfiable classes («3.2%) Removal of 56 GCIs results in no unsatisfiable classes («3.6%) 319 are entailed by the GRO («20.6%)

Recall not yet available

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 7 / 14

(62)

Evaluation

Experiment

considered only 30 most frequent concept-names (reason: performance) Resulting interpretation has 7399 elements, 30 concept-names, and 7 role-names

role-depth bound 1

Results

1552 GCIs extracted

GRO with these GCIs is still consistent has 321 unsatisfiable classes (out of 507)

49 GCIs (each on its own) cause unsatisfiable classes («3.2%) Removal of 56 GCIs results in no unsatisfiable classes («3.6%) 319 are entailed by the GRO («20.6%)

Recall not yet available

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 7 / 14

(63)

Evaluation

Experiment

considered only 30 most frequent concept-names (reason: performance) Resulting interpretation has 7399 elements, 30 concept-names, and 7 role-names

role-depth bound 1 Results

1552 GCIs extracted

GRO with these GCIs is still consistent has 321 unsatisfiable classes (out of 507)

49 GCIs (each on its own) cause unsatisfiable classes («3.2%) Removal of 56 GCIs results in no unsatisfiable classes («3.6%) 319 are entailed by the GRO («20.6%)

Recall not yet available

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 7 / 14

(64)

Evaluation

Experiment

considered only 30 most frequent concept-names (reason: performance) Resulting interpretation has 7399 elements, 30 concept-names, and 7 role-names

role-depth bound 1 Results

1552 GCIs extracted

GRO with these GCIs is still consistent has 321 unsatisfiable classes (out of 507)

49 GCIs (each on its own) cause unsatisfiable classes («3.2%) Removal of 56 GCIs results in no unsatisfiable classes («3.6%) 319 are entailed by the GRO («20.6%)

Recall not yet available

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 7 / 14

(65)

Evaluation

Experiment

considered only 30 most frequent concept-names (reason: performance) Resulting interpretation has 7399 elements, 30 concept-names, and 7 role-names

role-depth bound 1 Results

1552 GCIs extracted

GRO with these GCIs is still consistent

has 321 unsatisfiable classes (out of 507)

49 GCIs (each on its own) cause unsatisfiable classes («3.2%) Removal of 56 GCIs results in no unsatisfiable classes («3.6%) 319 are entailed by the GRO («20.6%)

Recall not yet available

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 7 / 14

(66)

Evaluation

Experiment

considered only 30 most frequent concept-names (reason: performance) Resulting interpretation has 7399 elements, 30 concept-names, and 7 role-names

role-depth bound 1 Results

1552 GCIs extracted

GRO with these GCIs is still consistent has 321 unsatisfiable classes (out of 507)

49 GCIs (each on its own) cause unsatisfiable classes («3.2%) Removal of 56 GCIs results in no unsatisfiable classes («3.6%) 319 are entailed by the GRO («20.6%)

Recall not yet available

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 7 / 14

(67)

Evaluation

Experiment

considered only 30 most frequent concept-names (reason: performance) Resulting interpretation has 7399 elements, 30 concept-names, and 7 role-names

role-depth bound 1 Results

1552 GCIs extracted

GRO with these GCIs is still consistent has 321 unsatisfiable classes (out of 507)

49 GCIs (each on its own) cause unsatisfiable classes («3.2%)

Removal of 56 GCIs results in no unsatisfiable classes («3.6%) 319 are entailed by the GRO («20.6%)

Recall not yet available

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 7 / 14

(68)

Evaluation

Experiment

considered only 30 most frequent concept-names (reason: performance) Resulting interpretation has 7399 elements, 30 concept-names, and 7 role-names

role-depth bound 1 Results

1552 GCIs extracted

GRO with these GCIs is still consistent has 321 unsatisfiable classes (out of 507)

49 GCIs (each on its own) cause unsatisfiable classes («3.2%) Removal of 56 GCIs results in no unsatisfiable classes («3.6%)

319 are entailed by the GRO («20.6%) Recall not yet available

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 7 / 14

(69)

Evaluation

Experiment

considered only 30 most frequent concept-names (reason: performance) Resulting interpretation has 7399 elements, 30 concept-names, and 7 role-names

role-depth bound 1 Results

1552 GCIs extracted

GRO with these GCIs is still consistent has 321 unsatisfiable classes (out of 507)

49 GCIs (each on its own) cause unsatisfiable classes («3.2%) Removal of 56 GCIs results in no unsatisfiable classes («3.6%) 319 are entailed by the GRO («20.6%)

Recall not yet available

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 7 / 14

(70)

Evaluation

Experiment

considered only 30 most frequent concept-names (reason: performance) Resulting interpretation has 7399 elements, 30 concept-names, and 7 role-names

role-depth bound 1 Results

1552 GCIs extracted

GRO with these GCIs is still consistent has 321 unsatisfiable classes (out of 507)

49 GCIs (each on its own) cause unsatisfiable classes («3.2%) Removal of 56 GCIs results in no unsatisfiable classes («3.6%) 319 are entailed by the GRO («20.6%)

Recall not yet available

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 7 / 14

(71)

Certainly correct GCIs

Example

Gene[ProteinComplexĎK

Dencodes.J [ DhasPart.J [ChromosomeĎK DhasPart.J [CellĎDhasPart.CellComponent Dencodes.J [ProteinĎGene

DhasPart.J [ DlocatedIn.J [Gene[ProteinĎDfromSpecies.Eukaryote Dencodes.J [ DfromSpecies.Eukaryote[ DhasPart.Peptide[

DhasPart.ProteinDomain[Gene[ProteinĎDencodes.MessengerRNA

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 8 / 14

(72)

Certainly correct GCIs

Example

Gene[ProteinComplexĎK

Dencodes.J [ DhasPart.J [ChromosomeĎK DhasPart.J [CellĎDhasPart.CellComponent Dencodes.J [ProteinĎGene

DhasPart.J [ DlocatedIn.J [Gene[ProteinĎDfromSpecies.Eukaryote Dencodes.J [ DfromSpecies.Eukaryote[ DhasPart.Peptide[

DhasPart.ProteinDomain[Gene[ProteinĎDencodes.MessengerRNA

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 8 / 14

(73)

Certainly correct GCIs

Example

Gene[ProteinComplexĎK

Dencodes.J [ DhasPart.J [ChromosomeĎK

DhasPart.J [CellĎDhasPart.CellComponent Dencodes.J [ProteinĎGene

DhasPart.J [ DlocatedIn.J [Gene[ProteinĎDfromSpecies.Eukaryote Dencodes.J [ DfromSpecies.Eukaryote[ DhasPart.Peptide[

DhasPart.ProteinDomain[Gene[ProteinĎDencodes.MessengerRNA

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 8 / 14

(74)

Certainly correct GCIs

Example

Gene[ProteinComplexĎK

Dencodes.J [ DhasPart.J [ChromosomeĎK DhasPart.J [CellĎDhasPart.CellComponent

Dencodes.J [ProteinĎGene

DhasPart.J [ DlocatedIn.J [Gene[ProteinĎDfromSpecies.Eukaryote Dencodes.J [ DfromSpecies.Eukaryote[ DhasPart.Peptide[

DhasPart.ProteinDomain[Gene[ProteinĎDencodes.MessengerRNA

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 8 / 14

(75)

Certainly correct GCIs

Example

Gene[ProteinComplexĎK

Dencodes.J [ DhasPart.J [ChromosomeĎK DhasPart.J [CellĎDhasPart.CellComponent Dencodes.J [ProteinĎGene

DhasPart.J [ DlocatedIn.J [Gene[ProteinĎDfromSpecies.Eukaryote Dencodes.J [ DfromSpecies.Eukaryote[ DhasPart.Peptide[

DhasPart.ProteinDomain[Gene[ProteinĎDencodes.MessengerRNA

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 8 / 14

(76)

Certainly correct GCIs

Example

Gene[ProteinComplexĎK

Dencodes.J [ DhasPart.J [ChromosomeĎK DhasPart.J [CellĎDhasPart.CellComponent Dencodes.J [ProteinĎGene

DhasPart.J [ DlocatedIn.J [Gene[ProteinĎDfromSpecies.Eukaryote

Dencodes.J [ DfromSpecies.Eukaryote[ DhasPart.Peptide[

DhasPart.ProteinDomain[Gene[ProteinĎDencodes.MessengerRNA

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 8 / 14

(77)

Certainly correct GCIs

Example

Gene[ProteinComplexĎK

Dencodes.J [ DhasPart.J [ChromosomeĎK DhasPart.J [CellĎDhasPart.CellComponent Dencodes.J [ProteinĎGene

DhasPart.J [ DlocatedIn.J [Gene[ProteinĎDfromSpecies.Eukaryote Dencodes.J [ DfromSpecies.Eukaryote[ DhasPart.Peptide[

DhasPart.ProteinDomain[Gene[ProteinĎDencodes.MessengerRNA

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 8 / 14

(78)

Inconclusive GCIs

Example

Cell[EukaryoteĎK Dencodes.EukaryoteĎK Cell[VirusĎK

Eukaryote[SignalingPathwayĎK Observation

Two reasons (at least) for inconclusive GCIs

simply wrong GRO incomplete

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 9 / 14

(79)

Inconclusive GCIs

Example

Cell[EukaryoteĎK

Dencodes.EukaryoteĎK Cell[VirusĎK

Eukaryote[SignalingPathwayĎK Observation

Two reasons (at least) for inconclusive GCIs

simply wrong GRO incomplete

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 9 / 14

(80)

Inconclusive GCIs

Example

Cell[EukaryoteĎK Dencodes.EukaryoteĎK

Cell[VirusĎK

Eukaryote[SignalingPathwayĎK Observation

Two reasons (at least) for inconclusive GCIs

simply wrong GRO incomplete

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 9 / 14

(81)

Inconclusive GCIs

Example

Cell[EukaryoteĎK Dencodes.EukaryoteĎK Cell[VirusĎK

Eukaryote[SignalingPathwayĎK Observation

Two reasons (at least) for inconclusive GCIs

simply wrong GRO incomplete

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 9 / 14

(82)

Inconclusive GCIs

Example

Cell[EukaryoteĎK Dencodes.EukaryoteĎK Cell[VirusĎK

Eukaryote[SignalingPathwayĎK

Observation

Two reasons (at least) for inconclusive GCIs

simply wrong GRO incomplete

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 9 / 14

(83)

Inconclusive GCIs

Example

Cell[EukaryoteĎK Dencodes.EukaryoteĎK Cell[VirusĎK

Eukaryote[SignalingPathwayĎK Observation

Two reasons (at least) for inconclusive GCIs

simply wrong GRO incomplete

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 9 / 14

(84)

Inconclusive GCIs

Example

Cell[EukaryoteĎK Dencodes.EukaryoteĎK Cell[VirusĎK

Eukaryote[SignalingPathwayĎK Observation

Two reasons (at least) for inconclusive GCIs simply wrong

GRO incomplete

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 9 / 14

(85)

Inconclusive GCIs

Example

Cell[EukaryoteĎK Dencodes.EukaryoteĎK Cell[VirusĎK

Eukaryote[SignalingPathwayĎK Observation

Two reasons (at least) for inconclusive GCIs simply wrong

GRO incomplete

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 9 / 14

(86)

Unsatisfiable Classes

Question: Where do they come from? Example

CellComponent[NucleusĎK

Data-set did not contain any occurrence of an individual that is both CellComponent and Nucleus

In the GRO, CellComponent is a super-class of Nucleus So, the annotation isincomplete

Conclusion

unsatisfiable classes can arise through theclosed-world interpretation of the open-worlddata-set.

alldisjointness axioms containing only concept-names are caused by this

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 10 / 14

(87)

Unsatisfiable Classes

Question: Where do they come from?

Example

CellComponent[NucleusĎK

Data-set did not contain any occurrence of an individual that is both CellComponent and Nucleus

In the GRO, CellComponent is a super-class of Nucleus So, the annotation isincomplete

Conclusion

unsatisfiable classes can arise through theclosed-world interpretation of the open-worlddata-set.

alldisjointness axioms containing only concept-names are caused by this

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 10 / 14

(88)

Unsatisfiable Classes

Question: Where do they come from?

Example

CellComponent[NucleusĎK

Data-set did not contain any occurrence of an individual that is both CellComponent and Nucleus

In the GRO, CellComponent is a super-class of Nucleus So, the annotation isincomplete

Conclusion

unsatisfiable classes can arise through theclosed-world interpretation of the open-worlddata-set.

alldisjointness axioms containing only concept-names are caused by this

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 10 / 14

(89)

Unsatisfiable Classes

Question: Where do they come from?

Example

CellComponent[NucleusĎK

Data-set did not contain any occurrence of an individual that is both CellComponent and Nucleus

In the GRO, CellComponent is a super-class of Nucleus So, the annotation isincomplete

Conclusion

unsatisfiable classes can arise through theclosed-world interpretation of the open-worlddata-set.

alldisjointness axioms containing only concept-names are caused by this

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 10 / 14

(90)

Unsatisfiable Classes

Question: Where do they come from?

Example

CellComponent[NucleusĎK

Data-set did not contain any occurrence of an individual that is both CellComponent and Nucleus

In the GRO, CellComponent is a super-class of Nucleus So, the annotation isincomplete

Conclusion

unsatisfiable classes can arise through theclosed-world interpretation of the open-worlddata-set.

alldisjointness axioms containing only concept-names are caused by this

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 10 / 14

(91)

Unsatisfiable Classes

Question: Where do they come from?

Example

CellComponent[NucleusĎK

Data-set did not contain any occurrence of an individual that is both CellComponent and Nucleus

In the GRO, CellComponent is a super-class of Nucleus

So, the annotation isincomplete Conclusion

unsatisfiable classes can arise through theclosed-world interpretation of the open-worlddata-set.

alldisjointness axioms containing only concept-names are caused by this

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 10 / 14

(92)

Unsatisfiable Classes

Question: Where do they come from?

Example

CellComponent[NucleusĎK

Data-set did not contain any occurrence of an individual that is both CellComponent and Nucleus

In the GRO, CellComponent is a super-class of Nucleus So, the annotation isincomplete

Conclusion

unsatisfiable classes can arise through theclosed-world interpretation of the open-worlddata-set.

alldisjointness axioms containing only concept-names are caused by this

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 10 / 14

(93)

Unsatisfiable Classes

Question: Where do they come from?

Example

CellComponent[NucleusĎK

Data-set did not contain any occurrence of an individual that is both CellComponent and Nucleus

In the GRO, CellComponent is a super-class of Nucleus So, the annotation isincomplete

Conclusion

unsatisfiable classes can arise through theclosed-world interpretation of the open-worlddata-set.

alldisjointness axioms containing only concept-names are caused by this

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 10 / 14

(94)

Unsatisfiable Classes

Question: Where do they come from?

Example

CellComponent[NucleusĎK

Data-set did not contain any occurrence of an individual that is both CellComponent and Nucleus

In the GRO, CellComponent is a super-class of Nucleus So, the annotation isincomplete

Conclusion

unsatisfiable classes can arise through theclosed-world interpretation of the open-worlddata-set.

alldisjointness axioms containing only concept-names are caused by this

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 10 / 14

(95)

Unsatisfiable Classes

Question: Where do they come from?

Example

CellComponent[NucleusĎK

Data-set did not contain any occurrence of an individual that is both CellComponent and Nucleus

In the GRO, CellComponent is a super-class of Nucleus So, the annotation isincomplete

Conclusion

unsatisfiable classes can arise through theclosed-world interpretation of the open-worlddata-set.

alldisjointness axioms containing only concept-names are caused by this

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 10 / 14

(96)

Unsatisfiable Classes

Example

DlocatedIn.Cell[ DlocatedIn.NucleusĎProtein

Causes the class NuclearExportOfmRNA to become unsatisfiable GRO entails

NuclearExportOfmRNA[ProteinĎK NuclearExportOfmRNAĎDlocatedIn.Nucleus

NuclearExportOfmRNAĎProteinTargetingĎDlocatedIn.Cell

But data-set does not contain any reference to NuclearExportOfmRNA Approach could not learn this counterexample

Idea

Remove concept-names not occurring in the data-set before evaluation?

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 11 / 14

(97)

Unsatisfiable Classes

Example

DlocatedIn.Cell[ DlocatedIn.NucleusĎProtein

Causes the class NuclearExportOfmRNA to become unsatisfiable

GRO entails

NuclearExportOfmRNA[ProteinĎK NuclearExportOfmRNAĎDlocatedIn.Nucleus

NuclearExportOfmRNAĎProteinTargetingĎDlocatedIn.Cell

But data-set does not contain any reference to NuclearExportOfmRNA Approach could not learn this counterexample

Idea

Remove concept-names not occurring in the data-set before evaluation?

Experimental Evaluation of GCIs Learned from Textual Data 2015-06-08 11 / 14

Referenzen

ÄHNLICHE DOKUMENTE

First, we calculate the values and 1σ ranges of the running quark and lepton Yukawa couplings as well as of the quark mixing parameters at various energy scales to provide useful

compatibilities between manual qualitative data analysis methods and computational, rather quantitative approaches for large scale mixed method text analysis designs2. Table

Assuming that there are seven labeling styles used in business process models [LSM12], they designed an al- gorithm to recognise the label style by comparing words, their order

Beam telescope with 4 layers of MuPix7 and scintillating tiles One MuPix layer used as DUT.. Pixel Detector:

In the second stage of our calculations, the spherical Fiedeldey- Frahn optical potential was employed for the pure theoretical description or the above mentioned

In case we can decompose all the lexical information included in the rich and compact original representation of the headword and port it to different elements of lemon, we might

In a ranking task, each translation is assigned an integer (further called a rank), which indicates its quality as compared to the competing translations for the same source

Our task, Recognizing Textual Entailment (RTE), is motivated from both the demand for a more accurate semantic model of natural languages, and from active application needs like