• Keine Ergebnisse gefunden

Computer¨ubungzurVorlesungModerneMethodenderDatenanalyseExercise8:DataMiningCup:LikelihoodRatio Institutf”urexperimentelleKernphysik Fakult¨atf¨urPhysik

N/A
N/A
Protected

Academic year: 2022

Aktie "Computer¨ubungzurVorlesungModerneMethodenderDatenanalyseExercise8:DataMiningCup:LikelihoodRatio Institutf”urexperimentelleKernphysik Fakult¨atf¨urPhysik"

Copied!
1
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Fakult¨ at f¨ ur Physik

parbox[t]12.cm

Institut f ”ur experimentelle Kernphysik

Prof. Dr. G. Quast, Prof. Dr. M. Feindt, Dr. A. Zupanc

“Ubungsgruppen: G. Sieber, B. Kronenbitter, A. Heller Ausgabe: 21.06.2012

Computer¨ ubung zur Vorlesung Moderne Methoden der Datenanalyse Exercise 8: Data Mining Cup: Likelihood Ratio

In this exercise we continue to work on the Data Mining Cup task introduced in the last exercise.

There, we made a classification based on cuts on multiple variables. But if the parameter space of variables has a high dimension, this is usually not the best approach. Therefore a possible improvement is to combine information from different variables into one quantity which is then used for the selection. One possibility to do this is to form a likelihood ratio.

• Exercise 8.1:

Take the variables which you used for the cut based approach in the last exercise and calculate the ratio of the probability density functions for good and bad customersPgood(~x)/Pbad(~x).

To calculate this use the training sample to obtainPgood(xi) and Pbad(xi) for each individual variable xi and assume all variables to be uncorrelated. As a first approximation take the histograms for the two classes of events and normalise them to obtain the corresponding probability density functions. How can numerical problems with small numbers be avoided?

Determine the likelihood ratio for each event and plot the likelihood ratio distribution se- parately for good and bad customers.

• Exercise 8.2:

Find a reasonable cut on the formed likelihood ratio to classify the events of the training sample. Calculate a score for this probability cut as you did before in the cut based approach.

• Exercise 8.3:

Try to improve your selection by using a parametrised function instead of a histogram for the probability density distribution of one or more variables. First identify distributions which could be reasonably parametrised. Then make up a parametrised function (TF1) and fit it to the histogram of that variable.

Again calculate the score on the training sample for a cut on the likelihood ratio constructed with a parametrised probability distribution.

• Exercise 8.4:

Classify the test dataset class.root with the selection which you think is your best and give it to a tutor, who will calculate a score for you.

Referenzen

ÄHNLICHE DOKUMENTE

Sie können es den Frauen auch er- schweren, sich auf die örtlichen Gegebenheiten einzulassen, wenn sie bei Geburtsbeginn feststellen müssen, dass sie unrealistische Vorstellungen

The other files needed for this exercise are provided there as well: A root file containing the training data where it is known whether the customer paid, a root file containing

One way how to check, that the network is not overtrained is to split the training sample into N subsamples, training a Neural Network N times with N − 1 subsamples and applying

Take the variables which you used for the cut based approach in the last exercise and calculate the ratio of the probability density functions for good and bad customers P good (~

Dalam kerangka ini motivasi inovasi keuangan bisa dipertimbangkan dengan melihat pada faktor penentu yang luas, terutama berkaitan dengan efek kekayaan (wealth effects), perilaku

We integrate radial and non- radial efficiency measurements in DEA using the hybrid measure while we extend the proposed model by considering inputs and good and bad outputs

The estimated trees for 1931 trees can be separated into their type of citrus by using their 1946 weights of trees, and thus estimate volume and value of citrus production in

Since 25 January 2011, Egypt has undergone a frantic succession of political and social changes, including: (1) the loss of fear that led the population to demand the overthrow of