
Extreme eigenvalues

In the document Clustering with Spectral Methods (pages 133–142)

The last approach is suitable for calculating a few extreme eigenvalues. It is called the Lanczos approach and was originally used to solve systems of linear equations; for details see [QSS02, sec. 5.11]. It gained its popularity because it preserves the matrix's sparsity and has acceptable runtime and space requirements. Some versions lack numerical stability.
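As a rough sketch of the idea (not the thesis's own implementation), the following pure-Python fragment runs the basic Lanczos iteration on a small symmetric matrix. The dense list-of-lists representation and all helper names are illustrative assumptions; only matrix–vector products touch A, which is why the method preserves sparsity.

```python
# Minimal Lanczos iteration sketch: builds an orthonormal basis q_1..q_m and
# tridiagonal coefficients (alphas, betas) for a symmetric matrix A.
# All names here are illustrative; a production implementation needs
# reorthogonalisation to avoid the numerical instability mentioned above.
import math

def matvec(A, v):
    # dense matrix-vector product; with a sparse matrix only the
    # nonzero entries of each row would be touched
    return [sum(a_ij * v_j for a_ij, v_j in zip(row, v)) for row in A]

def lanczos(A, q1, m):
    n = len(q1)
    norm = math.sqrt(sum(c * c for c in q1))
    q = [c / norm for c in q1]          # first Lanczos vector
    q_prev = [0.0] * n
    alphas, betas, basis = [], [], [q]
    beta = 0.0
    for _ in range(m):
        w = matvec(A, q)
        alpha = sum(wi * qi for wi, qi in zip(w, q))
        # orthogonalise against the two previous Lanczos vectors
        w = [wi - alpha * qi - beta * pi for wi, qi, pi in zip(w, q, q_prev)]
        beta = math.sqrt(sum(wi * wi for wi in w))
        alphas.append(alpha)
        if beta < 1e-12:                # invariant subspace found
            break
        betas.append(beta)
        q_prev, q = q, [wi / beta for wi in w]
        basis.append(q)
    return alphas, betas, basis
```

The eigenvalues of the small tridiagonal matrix defined by `alphas` and `betas` approximate the extreme eigenvalues of A; computing them is a much smaller problem than the original one.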

Literature note

During the study of the computation of eigenvalues and eigenvectors we also used [Wil72], [GvL96], and [TDB97]. In these references we found different introductions to the topic as well as various versions and different analysis tools.

Some basic calculations regarding the graph structure

In this part we have collected some standard calculations of linear algebra which can be expressed using graph notation.

Lemma C.1

Let H = (V, E) be a graph and s an edge weight function. Then the graph H′ :=_G A(H)² can be calculated in O(|V| · ∆(H)²) time.

Proof:

With the weighted analogue of lemma 3.25 it is sufficient to calculate all paths of length two between all pairs of vertices. Let us fix an arbitrary vertex v. Then we reach all vertices in N with a path of length two, where

N := ⋃_{w ∈ N(v)} N(w).

The set N contains at most ∆(H)² vertices, since deg(u) ≤ ∆(H) for all vertices u ∈ V.

To calculate H′ we overwrite the original input graph H and add an edge weight function t to H′. Only the edge set E′ of H′ differs from the edge set E of H; the vertex set is the same for both H and H′. Both H′ and t are calculated incrementally.

We have to keep track of newly inserted edges. Therefore we use an additional edge weight function r, initialised to

∀ e ∈ E: r(e) = 1.


Also t is initialised to zero for all elements of E. To update one vertex we use algorithm 4; we omit the proof of its correctness.

Algorithm 4: Update vertex v
Input: H′, s, t, r and v
Output: update of H′, s, t and r
begin
    forall w ∈ N(v) do
        let e_w be the edge connecting v with w in H′
        if r(e_w) ≠ 0 then
            forall u ∈ N(w) do
                let e_u be the edge connecting w with u in H′
                if r(e_u) ≠ 0 then
                    if (v, u) ∉ E′ then
                        insert edge e′ connecting v with u in H′
                        r(e′) ← 0
                        t(e′) ← 0
                    let e′ be the edge connecting v with u in H′
                    t(e′) ← t(e′) + s(e_w) · s(e_u)
end

Since every vertex needs to be updated only once, we call algorithm 4 once for each vertex. Since algorithm 4 considers each element of N once and uses constant time for the innermost loop, we have proved the postulated time complexity.
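A compact way to picture the whole procedure is the following sketch. It assumes the graph is stored as a dictionary mapping each vertex to its weighted neighbours, which is an illustrative choice, not the thesis's data structure; for simplicity it builds the new weights t in a fresh dictionary instead of overwriting H in place.

```python
# Sketch of lemma C.1: compute the weights t of H' =_G A(H)^2 by summing,
# for every pair (v, u), the products s(e_vw) * s(e_wu) over common
# neighbours w -- i.e. over all paths of length two from v to u.
# `graph` maps each vertex to a dict {neighbour: weight}; this
# adjacency-dict representation is an assumption for illustration.
def square_graph(graph):
    t = {}  # edge weights of H', keyed by vertex pair (v, u)
    for v, nbrs in graph.items():
        for w, s_vw in nbrs.items():          # first edge of the path
            for u, s_wu in graph[w].items():  # second edge of the path
                t[(v, u)] = t.get((v, u), 0) + s_vw * s_wu
    return t
```

For each vertex the two nested loops touch at most ∆(H)² vertex pairs, matching the O(|V| · ∆(H)²) bound of the lemma.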

Next we state some lemmas involving edge weight functions and vectors indexed by the vertex set. Therefore let T_x be the access time for one element of a vector and T_s the access time for one element of an edge weight function.

Lemma C.2

Let H = (V, E) be a graph, s an edge weight function and x a vector indexed by the vertices of V. Let A be the weighted adjacency matrix of H with respect to s, with the same vertex ordering as x. Then the vector A · x can be calculated in O(|E| · T_s · T_x + |V| · T_x) time and with O(|V|) space.

Proof:

Algorithm 5 does the computation. It requires O(|V|) space to save the result and runs in the desired time, since every edge is considered twice and we need its weight as well as the values of x at its source and its target.

Algorithm 5: Matrix–vector multiplication
Input: A, s and x
Output: A · x
begin
    allocate a new vector y
    forall v ∈ V do
        // fixing one entry of y
        [y]_v ← 0
        // calculating the addends of entry [y]_v
        forall w ∈ N(v) do
            let e_w be the edge connecting v with w
            [y]_v ← [y]_v + s(e_w) · [x]_w
    output y
end

Algorithm 5 is correct, because it is just a serial implementation of the definition of a matrix–vector multiplication.
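Algorithm 5 translates directly into the following sketch, again under the assumed adjacency-dict representation (each vertex maps to a dict of weighted neighbours, and vectors are dicts indexed by vertex):

```python
# Sketch of algorithm 5: multiply the weighted adjacency matrix A by a
# vector x using only the adjacency lists, so each edge is touched twice
# (once from each endpoint).
# `graph` maps each vertex to {neighbour: weight}; `x` maps vertex to value.
# The dictionary representation is an illustrative assumption.
def adjacency_matvec(graph, x):
    y = {}
    for v, nbrs in graph.items():
        y[v] = 0.0                    # fixing one entry of y
        for w, s_vw in nbrs.items():  # adding the addend s(e_vw) * [x]_w
            y[v] += s_vw * x[w]
    return y
```

With O(1) dictionary access this realises the O(|E| + |V|) bound of lemma C.2.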

If we assume T_x, T_s ∈ O(1), then we can replace A by L(H) or N(H) without changing the runtime of lemma C.2. This is due to the fact that these matrices respect the adjacency structure and only add a diagonal element or rescale each row.
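This remark can be made concrete for the Laplacian: since L(H) = D(H) − A(H), a Laplacian matrix–vector product is the adjacency product of algorithm 5 plus one diagonal term per vertex. The sketch below uses the same assumed dictionary representation as above; the helper name is hypothetical.

```python
# Sketch: L(H) * x = D(H) * x - A(H) * x. The weighted degree supplies the
# diagonal entry, so the adjacency structure is traversed exactly as in
# algorithm 5 and the O(|E| + |V|) bound is unchanged.
def laplacian_matvec(graph, x):
    y = {}
    for v, nbrs in graph.items():
        degree = sum(nbrs.values())   # weighted degree = diagonal entry
        y[v] = degree * x[v]          # the D(H) * x part
        for w, s_vw in nbrs.items():
            y[v] -= s_vw * x[w]       # minus the A(H) * x part
    return y
```

A quick sanity check: applying L(H) to the all-ones vector must give the zero vector, since every row of a Laplacian sums to zero.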

Lemma C.3

Let H = (V, E) be a graph and x, y two vectors. Then any weighted scalar product of these two vectors can be calculated in O(|V| · T_x³) time and constant space. If the weight is constant we require only O(|V| · T_x²) time. Scaling one vector can be done in O(|V| · T_x) time and requires O(|V|) space for saving the result.

Proof:

Simply use the definition in lemma 3.13 and the scalar–vector multiplication.

Lemma C.4

Let H = (V, E) be a graph, x_1, …, x_k orthogonal vectors and x an arbitrary vector not contained in the span of x_1, …, x_k. Let x_{k+1} be the orthogonal projection, see lemma 3.17. Then x_{k+1} can be calculated with O(|V|) space and in O(|V| · 2k · T_x² + |V| · (k + 1) · T_x) time.

Proof:

Algorithm 6 does the computation. It requires O(|V|) space and has the desired runtime, since we have to compute the sum of k + 1 vectors and 2k scalar products.

Algorithm 6: Vector orthogonalisation
Input: V, x_1, …, x_k and x
Output: x_{k+1}
begin
    allocate a new vector y
    forall v ∈ V do
        [y]_v ← [x]_v
    for i = 1, …, k do
        Z ← 0
        N ← 0
        // calculating the coefficients for the linear combination
        forall v ∈ V do
            Z ← Z + [x]_v · [x_i]_v
            N ← N + ([x_i]_v)²
        Z ← Z / N
        // calculating the next part of the linear combination
        forall v ∈ V do
            [y]_v ← [y]_v − Z · [x_i]_v
    output y
end

Algorithm 6 is correct, since it is an implementation of lemma 3.17.
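The projection of algorithm 6 can be sketched as follows, with vectors as plain Python lists (an illustrative representation; the thesis indexes vectors by the vertex set):

```python
# Sketch of algorithm 6: remove from x its components along the mutually
# orthogonal vectors x_1..x_k, yielding the orthogonal projection x_{k+1}.
# Note that the coefficient uses the original x, as in the algorithm; this
# is valid precisely because the x_i are orthogonal to each other.
def orthogonalise(x, basis):
    y = list(x)
    for xi in basis:
        Z = sum(a * b for a, b in zip(x, xi))   # <x, x_i>
        N = sum(b * b for b in xi)              # <x_i, x_i>
        coeff = Z / N
        y = [yv - coeff * bv for yv, bv in zip(y, xi)]
    return y
```

Each basis vector costs two passes over the |V| entries (one for the two sums, one for the update), matching the 2k scalar products counted in the proof.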

With these lemmas we can evaluate the power method, see appendix B, in the graph setting. Therefore we assume T_x, T_s ∈ O(1).

Proposition C.5

Let H = (V, E) be a graph, s an edge weight function and x a vector. If we calculate k iterations of the power method with A ∈ {A, N, L}, then we require O(k · (|E| + |V| + T)) time, where T is the runtime of the adjust step. In addition we need O(|V|) space.

Proof:

For each step we first calculate a matrix–vector multiplication, then call adjust, and finally calculate a matrix–vector multiplication and a scalar product. Lemma C.2 implies that we require O(|E| + |V|) time for a matrix–vector multiplication. By lemma C.3 we know that we need O(|V|) time to calculate a scalar product. Therefore we have the following runtime for each step:

O(2 · (|E| + |V|) + |V| + T) ⊆ O(|E| + |V| + T).

This completes the proof.
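Putting the lemmas together, the iteration of proposition C.5 might look like the following sketch. The `adjust` hook and the normalisation by the Euclidean norm are illustrative assumptions here; appendix B defines the actual adjust step.

```python
import math

# Sketch of proposition C.5: k power-method iterations using the
# adjacency-list matrix-vector product of algorithm 5. Each iteration
# costs O(|E| + |V|) plus the runtime T of the (assumed) adjust hook.
def power_method(graph, x, k, adjust=lambda v: v):
    vertices = list(graph)
    for _ in range(k):
        # matrix-vector multiplication (lemma C.2)
        y = {v: sum(s * x[w] for w, s in graph[v].items()) for v in vertices}
        y = adjust(y)
        # normalise, using a scalar product (lemma C.3)
        norm = math.sqrt(sum(val * val for val in y.values()))
        x = {v: y[v] / norm for v in vertices}
    return x
```

On a triangle with unit weights, for example, the iterates converge to the uniform dominant eigenvector of the adjacency matrix, since its largest eigenvalue 2 strictly dominates the others.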

List of symbols

D.1 Graph notation

[G] underlying undirected skeleton of G page 11

source(e) source of the edge e page 12

target(e) target of the edge e page 12

N(v) neighbourhood of v page 12

N⁻(v) all neighbours which are adjacent to v page 12

N⁺(v) all neighbours to which v is adjacent page 12

deg(v) degree of v page 13

deg⁻(v) in-degree of v page 13

deg⁺(v) out-degree of v page 13

δ(G) minimal degree inG page 13

∆(G) maximal degree in G page 13

u →_p v path p connecting u with v page 13

η(v_s, v_t) number of edge disjoint paths connecting v_s with v_t page 13

κ(v_s, v_t) number of vertex disjoint paths connecting v_s with v_t page 13

η(G) minimal number of edge disjoint paths connecting two (different) vertices in G page 14

κ(G) minimal number of vertex disjoint paths connecting two (different) vertices in G page 14


E|_V′ the subset of all edges of E such that both source and target are contained in V′ page 14

1 vector with all entries equal to one page 23

Eλ eigenspace of eigenvalue λ page 24

pA characteristic polynomial of A page 24

Λ (A) the spectrum of A page 24

%(A) the spectral radius of A page 24

A≥0 A is nonnegative page 25

kxk the (euclidian) length of x page 30

G(A) the digraph associated with A page 34

I(G) an incidence matrix of G page 34

L(G) the Laplacian matrix of G page 39

D(G) the degree matrix of G page 39

ΛL(G) the Laplacian spectrum of G page 43

LN(G) the normalised Laplacian matrix of G page 47

ΛLN(G) the normalised Laplacian spectrum of G page 47

N(G) the normal matrix of G page 48

ΛN(G) the normalised spectrum of G page 48
