• Keine Ergebnisse gefunden

Exercises for DW & DM Sheet 8 (until 11.06.2008)

N/A
N/A
Protected

Academic year: 2021

Aktie "Exercises for DW & DM Sheet 8 (until 11.06.2008)"

Copied!
2
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Exercises for DW & DM

Institut für Informationssysteme – TU Braunschweig - http://www.ifis.cs.tu-bs.de

Technische Universität Braunschweig Institut für Informationssysteme http://www.ifis.cs.tu-bs.de Wolf-Tilo Balke, Silviu Homoceanu

Exercises for DW & DM Sheet 8 (until 11.06.2008)

Please note that you need 50% of all exercise points to be admitted for the final exams. Ex- ercises have to be turned in until Thursday before the next lecture and should be com- pleted in teams of two students each. Write both names and “Matrikelnummer” on each page. If you have multiple pages, staple them together! Please hand in your solutions on pa- per into the mailbox at the IFIS floor or to our secretary (Mühlenpfordtstraße 23, 2

nd

floor).

You may answer in either German or English.

Exercise 1 (4P)

1. What is a staging area? (1P)

2. Why are users not allowed to interact with the staging area? (1P) 3. When should we use flat files in the staging area? (1P)

4. What is ETL, and when should it be applied? (1P) Exercise 2 (9P)

1. Install Eobjects Data Cleaner (http://datacleaner.eobjects.org/downloads). Perform the following tasks, by using the sample database provided with the software (by choosing it from the drop down menu as observed in the Annex1)

a. Compose a regular expression which validates only strings which contain let- ters only (no spaces or other characters than letters), start with only one capi- tal letter, and continue with at least one, up to 20 small letters. See examples in Annex 2. (3P)

b. Use the regular expression from 2.a, and create a validation task, add as vali- dation rule a “regex validation”, choose as data selections the CUSTOMER ta- ble, and as data subset the CONTACTLASTNAME and CONTACTFIRSTNAME attributes. Write the lastname and firstname of the clients which did not pass the validation. (If there are too many you did something wrong!!!) (3P) c. Give three examples (of different patterns) of strings which pass the valida-

tion of the following regular expression, and one that doesn’t:

(\+\d{1,2})?((\(\d{1,4}\))|(\d){3,5}[-/]?)((\d){1,5}) (3P) Exercise 3 (6P)

1. Briefly describe the basic steps in schema integration? (2P)

(2)

Institut für Informationssysteme

2. Explain how schema mapping is performed in praxis.

3. When should we use bulk loading and why is it a good Annex 1

Annex 2

Exercises for DW & DM

Institut für Informationssysteme – TU Braunschweig - http://www.ifis.cs.tu-bs.de

Technische Universität Braunschweig Institut für Informationssysteme

http://www.ifis.cs Wolf-Tilo Balke,

Explain how schema mapping is performed in praxis. (2P) use bulk loading and why is it a good idea?

bs.de

Technische Universität Braunschweig Institut für Informationssysteme http://www.ifis.cs.tu-bs.de Tilo Balke, Silviu Homoceanu

(2P)

Referenzen

ÄHNLICHE DOKUMENTE

Imagine a conceptual model, and represent it in mE/R for the Lufthansa sales de- partment, knowing that the department wants to be able to investigate ticket sales.

cost algorithm and as heuristics, the least enlargement cri- Tree according to the obtained graphical representation of the Graphically represent (as in the lecture) the

Consider a star schema with a fact table for sales, and 3 dimensions, the Geo, Time and Product dimension.. (Express all the intermediate results in MB, GB, or TB

Technische Universität Braunschweig Institut für Informationssysteme http://www.ifis.cs.tu-bs.de Wolf-Tilo Balke, Silviu Homoceanu.. Exercises for DW & DM Sheet 8

a. Build a decision tree based on the training set data, using the algorithm pro- vided in the lecture, considering all attributes as possible classification attrib- utes, and

Technische Universität Braunschweig Institut für Informationssysteme http://www.ifis.cs.tu-bs.de Wolf-Tilo Balke, Silviu Homoceanu!. Exercises for DW & DM Sheet 1

Technische Universität Braunschweig Institut für Informationssysteme http://www.ifis.cs.tu-bs.de Wolf-Tilo Balke, Silviu Homoceanu!. Exercises for DW & DM Sheet 2

The Exchange Rates cube can be de- fined as follows: Exchange Rates((Day, Bank, Country),(Buy$_Opening, Buy$_Closing, Buy$_Average, Sell$_Opening, Sell$_Closing,