• Keine Ergebnisse gefunden

Exercises for DW & DM Sheet 3 (until 24.01.2017)

N/A
N/A
Protected

Academic year: 2021

Aktie "Exercises for DW & DM Sheet 3 (until 24.01.2017)"

Copied!
2
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Exercises for DW & DM

Institut für Informationssysteme – TU Braunschweig - http://www.ifis.cs.tu-bs.de

Technische Universität Braunschweig Institut für Informationssysteme http://www.ifis.cs.tu-bs.de Wolf-Tilo Balke, Kinda El Maarry

Exercises for DW & DM Sheet 3 (until 24.01.2017)

Please drop your solution in the silver homework box (second floor where the IfIS is locat- ed) until Thursday, before the lecture (date is also mentioned above). You may answer in either German or English. You are encouraged to work in teams of 2 students (not more than 2), and send your solution as a team. Please mention in your email the name of both students together with the corresponding inmatriculation numbers.

Exercise 1

Simulate the functionality of the Multiple Minimum Supports mining algorithm on the trans- actions provided in Annex 1, describing each step starting from k=1 up to the rules. Mini- mum support values are also provided in Annex 1. 𝜑 = 20% and minconf = 60%.

Exercise 2

Simulate the functionality of the GSP mining algorithm of the transactions provided in Annex 2, considering a minimum support of 2.

Hint for the generalization step:

Joining:

Two sequences, s1 and s2 can be joined if after dropping the first item from s1 and the last item from s2, we obtain the same sequence. E.g.: <bc> and <ca> can be joined since by dropping b from <bc> and a from <ca> we obtain <c> for both. The joined result is <bca>.

<ba> and <(ab)> can also be joined and we obtain <b(ab)>

Pruning:

Similar to the apriori algorithm <bca> passes pruning only if <bc>, <ba> and <ca> ∈ F2

<b(ab)> passes pruning only if <ba>, <bb> and <(ab)> ∈ F2.

Exercise 3

1. Calculate the moving average of order 4, MA(4), for the following dataset:

4.38, 4.19, 4.65, 6.40, 6.26, 13.51, 4.19, 8.41, 6.50, 8.43, 9.87, 9.56, 6.57, 9.03, 10.18

(2)

Exercises for DW & DM

Institut für Informationssysteme – TU Braunschweig - http://www.ifis.cs.tu-bs.de

Technische Universität Braunschweig Institut für Informationssysteme http://www.ifis.cs.tu-bs.de Wolf-Tilo Balke, Kinda El Maarry

And then represent the two lines (the dataset and the MA in a chart).

2. Shortly present the role of R-trees in similarity search for time series.

Annex 1:

Annex 2:

Item MIS %

1 70

2 17

3 15

4 30

5 30

6 35

Transactions 1,4,6

1 1,5,6

1,6 4,6 1,2,3,5 1,2,3,5

6 1 1,6

SID Sequence

1 <(dc)b(ac)>

2 <bc(bac)>

3 <(ab)a>

Referenzen

ÄHNLICHE DOKUMENTE

cost algorithm and as heuristics, the least enlargement cri- Tree according to the obtained graphical representation of the Graphically represent (as in the lecture) the

Consider a star schema with a fact table for sales, and 3 dimensions, the Geo, Time and Product dimension.. (Express all the intermediate results in MB, GB, or TB

Technische Universität Braunschweig Institut für Informationssysteme http://www.ifis.cs.tu-bs.de Wolf-Tilo Balke, Silviu Homoceanu.. Exercises for DW &amp; DM Sheet 8

a. Build a decision tree based on the training set data, using the algorithm pro- vided in the lecture, considering all attributes as possible classification attrib- utes, and

Technische Universität Braunschweig Institut für Informationssysteme http://www.ifis.cs.tu-bs.de Wolf-Tilo Balke, Silviu Homoceanu!. Exercises for DW &amp; DM Sheet 1

Technische Universität Braunschweig Institut für Informationssysteme http://www.ifis.cs.tu-bs.de Wolf-Tilo Balke, Silviu Homoceanu!. Exercises for DW &amp; DM Sheet 2

The Exchange Rates cube can be de- fined as follows: Exchange Rates((Day, Bank, Country),(Buy$_Opening, Buy$_Closing, Buy$_Average, Sell$_Opening, Sell$_Closing,

Technische Universität Braunschweig Institut für Informationssysteme http://www.ifis.cs.tu-bs.de Wolf-Tilo Balke, Silviu Homoceanu!. Exercises for DW &amp; DM Sheet 4