• Keine Ergebnisse gefunden

Integrating Knowledge Discovery into Knowledge Management

N/A
N/A
Protected

Academic year: 2022

Aktie "Integrating Knowledge Discovery into Knowledge Management"

Copied!
19
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Integrating Knowledge Discovery into Knowledge Management

Katharina Morik, Christian Hüppe, Klaus Unterstein

Univ. Dortmund LS8

www-ai.cs.uni-dortmund.de

(2)

Overview

• Integrating given data into a knowledge management system (KMS)

• System architecture of EAMS

• Integrating given document collections by learning the right retrieval function

• Integrating given databases by knowledge discovery

(3)

Knowledge Management

Business Process

? ?

?

?

! !

(4)

Integrating Given Data into KMS 1

• Preparing documents for a KMS is an extra effort

• Structuring document collections according to an ontology is time-consuming, too

• Why not having the machine learn which

document a user wants as the answer to his query?

– Learning the retrieval function for each user – according to an ontology

!

(5)

Integrating Given Data into KMS 2

• The main data sources in organizations are databases.

• Why not using them?

– Knowledge discovery is a high-level query language.

– Meta-data about knowledge discovery cases can be organized according to an ontology.

!

(6)

System Architecture

Contract

Web Display

DB-Data Display

Person GUI

CONCEPTUAL DATA MODEL

ontology initializes

INTERNET STRIVER

interface

CONCEPTUAL CASE MODEL

DATABASE

www- Interaction-

module

DB- Interaction-

module

interacts interacts

displays displays

(7)

System Architecture

Contract

Web Display

DB-Data Display

Person GUI

CONCEPTUAL DATA MODEL

ontology initializes

INTERNET STRIVER interface

www- Interaction-

module

interacts displays

CONCEPTUAL CASE MODEL

DATABASE

DB- Interaction-

module

interacts displays

(8)

Striver: Learning a Retrieval Function

Thorsten Joachims KDD 2002 !

Query q ?

Ordering r D x D !

Documents D {d1, d2, ..., dn} Clickthrough

r‘ r

(q1, r‘1) ...,

(qm , r‘m)

(9)

Striver: Learning a Retrieval Function

Thorsten Joachims KDD 2002 !

Query q ?

Ordering r# D x D !

Documents D {d1, d2, ..., dn} (q1, r‘1)

r‘ r

l1 click l2

...

li click ...

lj

l1 > l2 ...

li > l2 Minimize distance between r‘ and learned ranking r#

(10)

Search String for a Web Query

(11)

Result of Web Query

(12)

Web document

(13)

Learning a Retrieval Function

• New version of support vector machine for ranking (Thorsten Joachims 2002).

• Optimizes given retrieval functions.

• Automatically adapts to users (tasks).

• Can be applied to the intranet without preparation.

• Inspection of the learned function shows that the weights of words make sense!

(14)

Language to Databases

• Ontological concepts:

– Person, – Contract

• Query types:

– Frequencies of attributes – Segmentation (subgroups) – Correlation of attributes – Classification

• Algorithms (operators):

– Statistical stored procedures – Data cube

– APRIORI – C4.5

– mySVM

• Preprocessing chain

!

(15)

KDD Query -- already executed job

(16)

KDD Result

(17)

KDD Result

sex age group profession quantity male 0-22 years profession group 1 67

male 0-22 years profession group 2 4373 male 0-22 years profession group 3 1967 male 0-22 years profession group 4 3

(18)

KDD Query -- creating a new job

(19)

Mining Mart for Knowledge Management

• Making existing sources (databases) available to users – a case answers a high-level question

• The conceptual model (ontology) eases the

integration with other services of a knowledge management system (e.g., web navigation).

• The conceptual model and the cases create the GUI for the EAMS user.

Referenzen

ÄHNLICHE DOKUMENTE

Other boundary thresholds (existing development, flood prone areas, steep slopes,

• Integrating given document collections by learning the right retrieval function.. • Integrating given databases by knowledge

With our feature modeling extension, based on the cardinality-based Czarnecki-Eisenecker notation, we were also addressing the objective of providing a basis for different

Diese Verfahren liefern Resultate, welche die Grundlage f¨ur die Realisierung von Softwarekomponenten zur Diagnose indivi- dueller Defizite wie auch zur tutoriellen Betreuung

dialogical context what has been said by whom dialogue model ontological context world/conceptual knowledge domain model situational context time, place, etc situation

Task The problem of assigning multiple terms to a document can be addressed by multi-label classification algorithms. More precisely, our task is to assign multiple index terms in

For the evaluation, we use two different test collec- tions in the German language: (i) GIRT [5] for the infor- mation retrieval task, and (ii) a collection of descriptions

The design of sustainable cities and buildings needs to include thoughts on circumstances influencing human satisfaction be it for thermal, visual, or other dimensions of