• Keine Ergebnisse gefunden

Chapter 3.2 Basic Concepts of the MDD-Model

N/A
N/A
Protected

Academic year: 2022

Aktie "Chapter 3.2 Basic Concepts of the MDD-Model"

Copied!
14
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Chapter 3.2 Basic Concepts of the MDD-Model

Def.: A Dimension is a data type (almost always finite), which is used as a component of a composite

(multidimensional) key.

Def.: Dimension-Members are elements of a dimension.

Examples: frequently enumeration types or intervals:

Month =( January, February, …, December)

(2)

Further examples:

BMW-engines = {1600, 1800, 2000, 2300, 2800, 3000, 3500, 4000, 5000, 2500D, 3000D}

BMW-bodies = {3er, 5er, 7er, 7erL, 3er Kombi, 3er Cabrio, 5er Kombi, 8er, Z3}

Note : not all combinations of engines and bodies are

built

(3)

Data Cube

Def: Data-Cube with m dimensions D1, D2, …, Dm and k fact –types F1, F2,…, Fk is

W = {( d1, d2,…, dm) ( f1, f2,…fk)}:

di  Di for i = 1, …, m  fj  Fj for j =1, …k  (d1, d2,…, dm) is key}

Def.: cell-address = (d

1

,…, d

m

)

cell-content = (f ,…,f )

(4)

Data Cube Example

Product

Trekking Bike

GB Region

Time

35

Cell Facts (Measures)

Dimension

June

Dimension- Members

Mountain Bike

D

30 25

(5)

Single cube vs. Multiple cubes

Note: model as 1 cube with fact (f

1

,…,f

k

)

or as k cubes W

i

with fact f

i

for i = 1, …, k and all W

i

have the same dimensions

Choice depends on practical considerations and

performance, e.g. (sales#, sales€)

(6)

Consequences for query formulation and execution:

1 cube : select key, sales#, sales€

from W

Multiple cubes: select key, sales#, sales€

from W

1

, W

2

where W

1

.key = W

2

.key

(7)

Def.: Dimension-Attributes: additional attributes for a detailed description of the dimension members.

Examples:

• Number of days per month

dimension Month = ((January,31), (February, 29), :

(April, 30),..., (December,31))

• Gasoline type and number of cylinders of an engine dimension BMW-engine = {(1600, Super,4),...,

(2500D, Diesel,5),..., (4000, Regular,8)…}

(8)

Density and Sparsity

Def.: Dense data-cube: all combinations of (d1, …, dm) occur.

Sparse data-cube is not dense

Def.:

Note:

• logical model assumes dense cubes

• physical storage model deals with dense and sparse cubes.

cells of

number total

cells occupied

of number 1 

sparsity

(9)

Example: BMW Sales

Months

BMW- BMW-engines

(10)

Systematic Sparsity of the MDD-Model:

1600 1800 2000 2300 2800 3000 3500 4000 5000 2500D

3er 3erKombi 3erCabrio 5er 5erKombi 7er 7erL 850 Z3 …

(11)

Relational Modeling

Dimension finite fixed relation Dimension-Element key

Dimension-Attribute other non-key attributes

Cube(Würfel) relation, on E/R level a relationship between dimensions

Cube-key key composed of foreign keys of dimensions

Measures non-key attributes of cube

(12)

Relational Model

cyl fuel B-E

BMW-engines Facts

B-E B-B M € # Name days

Bodies

B-B Month

(13)

E/R-Model: Star-Schema

BMW-Engines Facts BMW-bodies

Months

(14)

E/R-Model: Snowflake-Schema

The Snowflake-Schema arises from the Star-Schema by more details plus normalization:

engine

Basic body brakes

Drive-train

trans- mission

Facts

Months extras

BMW-bodies

Referenzen

ÄHNLICHE DOKUMENTE

Although the statistical model-based method is quite popular and has been successfully applied to different medical imaging fields such as segmenting 2-D anatomical structures [31],

Model-management operators can be used for solving schema evolution, data integration, and other scenarios using short programs, or scripts, which are executed by a model man-

For the experiments we implemented the sparse stress model, Algorithm 1, as well as different sampling techniques in Java using Oracle SDK 1.8 and the yFiles 2.9 graph

Motivated by the Examples II.5.2 and II.6.1 we introduce a notion of convergence that is weaker than convergence in mean and convergence almost surely.. In the sequel, X, X n

‘⇒’ holds trivially. Analogously to the construction of the Lebesgue measure; see ¨

In their model, …rms have the same three sources of information as in the standard information cascade model: a prior knowledge on some state of the world, private signals, and

Agriculture's Economic Research Service have been collaborating with the Food and Agriculture Program of the International Institute for Applied Systems Analysis (IIASA/FAP) on

If the number of patients that can be treated by doctors (PCTD) is greater than the number of RS in a given phase of a disease, the difference between these two figures can be