• Keine Ergebnisse gefunden

Data analysis:

N/A
N/A
Protected

Academic year: 2022

Aktie "Data analysis:"

Copied!
16
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Data analysis:

Statistical principals and computational methods

Summary

Dmitrij Schlesinger, Carsten Rother

SS2014, 16.07.2014

(2)

I. Markov Chains – the probabilistic model

Random variables yiK for each iI

state sequence y= (y1, y2, . . . , yn) with yiK p(y) =p(y1, y2, . . . , yn) = p(yi)

n

Y

i=2

p(yi|yi−1)

HMM:p(x, y) =p(y)·p(x|y)...

(3)

I. Markov Chains – the probability of observation

p(x) = X

y

p(x, y) =

=X

y

"

p(y1)

n

Y

i=2

p(yi|yi−1)

n

Y

i=1

p(xi|yi)

#

+ other marginal probabilities ...

(4)

I. Markov Chains – SumProd algorithm

The Idea: propagate Bellmann functions Fi (aka messages) that represent partial solutions (sums)

for ( k = 1. . . K ) F1(k) =q1(k) for ( i= 2. . . n )

for ( k = 1. . . K ) Fi(k) = 0

for ( k0 = 1. . . K )

Fi(k) = Fi(k) +Fi−1(k0)gi(k0, k) Fi(k) = Fi(k)·qi(k)

Z =PkFn(k)

(5)

I. Most probable state sequence

p(x, y) = p(y1)

n

Y

i=2

p(yi|yi−1)

n

Y

i=1

p(xi|yi)

arg min

y

" n X

i=1

ψi(yi) +

n

X

i=2

ψi−1,i(yi−1, yi)

#

Dynamic Programming (Vitterbi, Dijkstra ...) – propagate Bellman Functions Fi by

Fi(k) =ψi(k) + min

k0 [Fi−1(k0) +ψi−1,i(k0, k)]

The functionsFi represent the quality of the the best extension into the already processed part

(6)

II. Energy Minimization

y = arg min

y

X

i

ψi(yi) +X

ij

ψij(yi, yj)

(7)

II. Iterated Conditional Modes

(8)

II. Equivalent transformations

Binary MinSum Problems – canonical forms E(y) = (. . .) +X

rr0

βij ·δ(yi6=yj)

(9)

II. Binary MinSum Problems ↔ MinCut

C = arg min

C

X

ij∈C

cij

– The relation MinSum ↔MinCut works always – MinCut is NP-complete in general

– MinCut is polynomially solvable if all edge costs are non-negative, i.e. a+db+cholds for all edges – Such problems are called submodular

(10)

II. Search techniques

General idea:

α-expansion,αβ-swap:

(11)

III. Bayesian Decision Theory

TheBayesian Risk of a strategy e is the expected loss:

R(e) =X

x

X

k

p(x, k)·Ce(x), k→min

e

It should be minimized with respect to the decision strategy Special cases:

C(k, k0) =δ(k6=k0) →Maximum A-posteriori decision – Additive loss C(k, k0) = Pici(ki, ki0)→ the strategy is

based on marginal probability distributions - Hamming lossC(k, k0) =Piδ(ki6=k0i)

→ Maximum Marginal decision - "Metric" lossC(k, k0) =Pi(kiki0)2

→ Minimum Marginal Square Error

(12)

IV. Gibbs Sampling

How to sample in general:

Sampling in MRF-s:

Markovian property:

p(yi|yV\i) =p(yi|yN(i)) It should be sampled from

p(yi=k|yN(i))∝

∝exp

−ψi(k)− X

j∈N(i)

ψij(k, yj)

(13)

IV. Maximum Likelihood for MRF-s (supervised)

From the Maximum Likelihood formulation F(θ) = lnp(L;θ) = X

l

h−E(yl;θ)−lnZ(θ)i=

=−X

l

E(yl;θ)− |L| ·lnZ(θ)→max

θ

... to the gradient

∂F(θ)

∂θ =−Edata

"

∂E(y;θ)

∂θ

#

+Emodel

"

∂E(y;θ)

∂θ

#

(14)

V. Discriminative Learning

A ”hierarchy of abstraction“:

Generative models →Discriminative models → Classifiers Linear classifiers, Perceptron Algorithm, Multi-class Perceptron

x1 x2

Feature spaces – mappingsφ(x)

(15)

V. Discriminative Learning

Energy Minimization is a linear classifier

Multi-class perceptron + Energy Minimization:

(16)

Exam

On 5.08 at 10:00 (the room not known jet – see www later) Questions, examples

– Which loss leads to MAP (explain, derive)?

– Write down the probability of a state sequence in a Markov Chain (explain the notations)

– How to obtain an auxiliary binary Energy minimization problem forα-expansion?

– An simple Energy Minimization Problem is given.

Perform the ICM (orα-expansion orαβ-swap), starting from a given labeling.

– A simple Energy Minimization Problem for a chain is given. Find the optimal labeling by Dynamic

Programming, compute all necessary Bellman functions – Transform a given pairwise term ψij(yi, yj) into the

canonical form

All (i.e. our part) should be manageable in 15-20 minutes

Referenzen

ÄHNLICHE DOKUMENTE

Based on the performed analysis of mathematical models for minimizing the power shortage, as used in domestic and foreign software packages to assess the adequacy of the EPS, we

For instance, higher consumer demand variability may lead to higher safety stocks for both the retailers and the distributor due to an increase of the variance of outgoing

The example presented in Figure 1 (encountered during our experiments) is challenging for an automated bound analysis: (C1) There are loops whose loop counter is modified by an

Utility Function for the Number o f Good Recreational Areas.. Utility Function for the Number o f Bad Recreational

Following the theorem three types of computational proce- dures could be

Let C 0 be an arbitrary small cube in + j and consider the 2 n rays from the above family which contain the centre of C 0.. We claim that at least one of these rays intersects

For most, the term 'piracy' conjures images of kidnapping and hijacking on the high seas, but this type of piracy is only one of a number of incident types catalogued as

The findings of the minimiza- tion problem as well as simulations with an active axon model lead to the conclusion that short duration, high amplitude pulses are favorable and that