
and w have in common when walking towards the root in the hierarchy.

The descendants(v) are all leaves reachable from a vertex v along the tree T.

2.2. Multidimensional Scaling for Graph Drawing

Let V = {1, . . . , n} be a set of n objects and let D ∈ R^{n×n} be a square matrix of dissimilarities (or distances) for each pair of objects i, j ∈ V. The goal of MDS is to find a matrix X = [x1, . . . , xn]^T ∈ R^{n×d} of d-dimensional coordinates x1, . . . , xn ∈ R^d such that

‖xi − xj‖ ≈ dij   (2.2)

is met as closely as possible for all i, j ∈ V. For two-dimensional layouts we consider d = 2.

The objects could be, e.g., cities, and the Euclidean distances dij between them would be the dissimilarities.¹ Then xi would be the 2d coordinates of city i. Since the Euclidean distance is invariant under translation, reflection, and rotation, one can only recover the coordinates up to these transformations. See Cox and Cox (2001) or Borg and Groenen (2005) for a general introduction to this topic.

¹ For simplicity, consider a flat 2d surface.

Application to Graph Drawing In the case of graph drawing, the distance dij reflects the graph-theoretic distance between two vertices i and j, i.e., the length of a shortest path connecting them; a small sketch of computing these distances follows the list below. The commonly used techniques are

• Classical Scaling and

• Distance Scaling.
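To make the graph-theoretic distances concrete, the following minimal sketch (in Python, which the thesis itself does not prescribe) computes all pairwise distances of an unweighted graph with one breadth-first search per vertex; for edge-weighted graphs, Dijkstra's algorithm would be used instead.

from collections import deque

def bfs_distances(adj, source):
    """Hop distances from `source` in an unweighted graph
    given as an adjacency list (dict of lists)."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist

def distance_matrix(adj):
    """All pairwise graph-theoretic distances d_ij, one BFS per vertex."""
    return {u: bfs_distances(adj, u) for u in adj}

# Small example: a path graph 1 - 2 - 3 - 4.
adj = {1: [2], 2: [1, 3], 3: [2, 4], 4: [3]}
D = distance_matrix(adj)
print(D[1][4])  # 3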

Classical scaling is based on spectral decomposition of the derived inner product matrix and yields an essentially unique solution (coordinates), while distance scaling iteratively improves the layout, given initial positions, by adapting the coordinates to fit given distances.

The experimental study of Brandes and Pich (2009) suggests that using classical scaling as initialization for the graph layout and then improving the layout by minimizing the stress with distance scaling using stress majorization (Gansner et al., 2005a) yields the best results in general. While the classical scaling of the first step creates layouts with a good representation of large distances, the second step improves local details of the layout. For the first step, the study suggests using PivotMDS (Brandes and Pich, 2007) as an approximation of classical scaling, which scales very well to large graphs.

The study suggests in particular that the combination of these techniques is superior to the often-used force-directed methods (Eades, 1984; Fruchterman and Reingold, 1991; Kamada and Kawai, 1989), since it is faster and yields better results with respect to representing the graph-theoretic distances as Euclidean distances in the final layout. Figure 2.1 shows an example.

Since this combination of techniques will be used throughout this thesis, we will explain them in the context of classical and distance scaling in the next two sections.

An extensive review of MDS methods in graph drawing is given by Pich (2009) and Klimenta (2012).

2.2.1. Classical Scaling

Originally proposed by Schoenberg (1935) and Young and Householder (1938), classical scaling aims to derive coordinates using only given pairwise Euclidean distances on a set of points. We briefly explain classical scaling and its approximation PivotMDS.

Let the coordinates xi and xj be points in a d-dimensional Euclidean space for i, j ∈ V, with squared Euclidean distance

d²ij = (xi − xj)^T (xi − xj).   (2.3)

The inner product matrix B is defined as

[B]ij = bij = xi^T xj.   (2.4)

After deriving the inner product matrix B from the squared distances d²ij, the coordinates can be recovered using the spectral decomposition of B into

B = Q Λ Q^T,   (2.5)

where Λ = diag(λ1, λ2, . . . , λn) is the diagonal matrix of eigenvalues {λi} of B, and Q = [q1, q2, . . . , qn] the matrix of normalized eigenvectors qi with qi^T qi = 1. Let the eigenvalues be sorted in decreasing order λ1 ≥ λ2 ≥ . . . ≥ λn. Since n − d eigenvalues are zero, B can be written as

B = Q̄ Λ̄ Q̄^T,   (2.6)

with Q̄ = [q1, . . . , qd] and Λ̄ = diag(λ1, . . . , λd). The sought coordinate matrix X = [x1, . . . , xn]^T is then given by

X = Q̄ Λ̄^{1/2}   (2.7)

with Λ̄^{1/2} = diag(λ1^{1/2}, . . . , λd^{1/2}).
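As an illustration of Eqs. (2.5)–(2.7), the following sketch (Python/NumPy, not taken from the thesis) recovers d-dimensional coordinates from a given inner product matrix B; np.linalg.eigh is used because B is symmetric.

import numpy as np

def coords_from_inner_products(B, d=2):
    """Recover d-dimensional coordinates from the inner product matrix B
    via its spectral decomposition, Eqs. (2.5)-(2.7)."""
    eigvals, eigvecs = np.linalg.eigh(B)        # B = Q Lambda Q^T
    order = np.argsort(eigvals)[::-1][:d]       # indices of the d largest eigenvalues
    lam = np.clip(eigvals[order], 0.0, None)    # clip tiny negatives caused by round-off
    return eigvecs[:, order] * np.sqrt(lam)     # X = Q_bar Lambda_bar^{1/2}

# Example: B built from known centered points; the coordinates are
# recovered up to rotation and reflection.
P = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 2.0], [3.0, 1.0]])
P -= P.mean(axis=0)
X = coords_from_inner_products(P @ P.T)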

Figure 2.1.: Drawing of a collaboration network (from Chapter 1, page 2) using a spring-embedder (a) and a stress-based approach (b). Both methods are initialized with PivotMDS. Pairwise graph-theoretic distances are represented more clearly by the stress-based approach (Brandes and Pich, 2009).

Using the fact that the distances do not change under a global translation, one can assume that the centroid of all coordinates is placed at the origin,

Σ_{i=1}^n xik = 0   (k = 1, . . . , d),   (2.8)

which allows reconstruction of the inner product matrix B (Cox and Cox, 2001, page 33) as

B = −(1/2) Jn D^(2) Jn,   (2.9)

with centering matrix Jn = In − (1/n) 1_{n×n}, where 1_{n×n} is the n-by-n matrix of ones, and D^(2) the distance matrix in which each entry is squared.
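Combining the double centering of Eq. (2.9) with the coordinate recovery of Eq. (2.7), a full classical scaling step can be sketched as follows (again Python/NumPy; a plain illustration under the assumption that the whole distance matrix D fits into memory):

import numpy as np

def classical_scaling(D, d=2):
    """Classical scaling: double-center the squared distances, Eq. (2.9),
    then extract coordinates from the d largest eigenpairs, Eq. (2.7)."""
    n = D.shape[0]
    J = np.eye(n) - np.ones((n, n)) / n           # centering matrix J_n
    B = -0.5 * J @ (D.astype(float) ** 2) @ J     # Eq. (2.9)
    eigvals, eigvecs = np.linalg.eigh(B)          # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1][:d]         # indices of the d largest
    lam = np.clip(eigvals[order], 0.0, None)      # guard against round-off
    return eigvecs[:, order] * np.sqrt(lam)       # X = Q_bar Lambda_bar^{1/2}

# Example: distances of four collinear points are reproduced exactly,
# up to rotation, reflection, and translation of the layout.
D = np.abs(np.subtract.outer(np.arange(4.0), np.arange(4.0)))
X = classical_scaling(D)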

PivotMDS: Approximating classical scaling Brandes and Pich (2007) propose to improve the scalability of classical scaling by approximating the distance matrix D using only k pivots from V. The resulting partial distance matrix DP,

DP = [ d1,1  . . .  d1,k
       d2,1  . . .  d2,k
       d3,1  . . .  d3,k
         ⋮            ⋮
       dn,1  . . .  dn,k ]  ∈ R^{n×k},   (2.10)

contains only a subset of the columns of D. The pivots can be chosen with various strategies (Brandes and Pich, 2007) such that a good approximation of the shortest-path distances in the graph is obtained. The eigenvectors of B are approximated without computing B, but only a smaller matrix C,

C = −(1/2) Jn DP^(2) Jk,   (2.11)

with Jn and Jk being the centering matrices as in Eq. (2.9). Since the eigenvectors of BB^T are the same as those of B, but with all eigenvalues squared, the eigenvectors of CC^T can be used to approximate the eigenvectors of BB^T and thus of B.
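A compact PivotMDS-style sketch might look as follows (Python/NumPy; the max-min pivot selection and the SVD-based projection are one common way to realize the approximation and are not necessarily the exact procedure of Brandes and Pich (2007)). For brevity, the pivots are selected from a precomputed distance matrix, whereas a scalable implementation would run only k single-source shortest-path searches.

import numpy as np

def pivot_mds(D, k=50, d=2):
    """Approximate classical scaling from k pivot columns of D (PivotMDS-style)."""
    n = D.shape[0]
    k = min(k, n)
    # Max-min pivot selection: repeatedly take the vertex farthest
    # from all pivots chosen so far.
    pivots = [0]
    min_dist = D[:, 0].astype(float).copy()
    for _ in range(k - 1):
        p = int(np.argmax(min_dist))
        pivots.append(p)
        min_dist = np.minimum(min_dist, D[:, p])
    DP = D[:, pivots].astype(float)          # partial distance matrix, Eq. (2.10)
    # Double centering from the left with J_n and from the right with J_k, Eq. (2.11).
    C = DP ** 2
    C = C - C.mean(axis=0, keepdims=True)
    C = C - C.mean(axis=1, keepdims=True)
    C *= -0.5
    # The left singular vectors of C approximate the eigenvectors of B;
    # project onto the d dominant directions to obtain the layout.
    U, s, _ = np.linalg.svd(C, full_matrices=False)
    return U[:, :d] * s[:d]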

As already stated, PivotMDS approximates the global distances very well. Using it as an initialization for distance scaling then allows the local details to be improved.

2.2.2. Distance Scaling by Stress Minimization

In distance scaling the vertices of a graph are arranged directly by optimizing a given objective function that incorporates the given distances. Unlike for classical scaling, no algebraic solution is known.

While there are different variants of distance scaling, often also referred to as force-directed or energy-based layout methods, we will concentrate on the so-called stress as the objective function. This objective function is minimized iteratively until a local minimum is reached. The main advantages of stress-based distance scaling compared to classical scaling are:

• Flexibility: Incorporation of various layout constraints using stress terms (Dwyer et al., 2005a, 2009; Dwyer and Marriott, 2008).


• Local Details: When initialized with classical scaling, local distances are represented very well (Brandes and Pich, 2009).

As in classical scaling, for each pair of vertices i, j ∈ V there is an ideal distance dij ∈ R+. The goal is to find a matrix X = [x1, . . . , xn]^T ∈ R^{n×d} of d-dimensional coordinates x1, . . . , xn ∈ R^d such that

‖xi − xj‖ ≈ dij   (2.12)

is met as closely as possible for all i, j ∈ V. A vertex i has position xi ∈ R^d, and the axes of the layout are given by X^(1), . . . , X^(d) ∈ R^n, with d = 2 for two-dimensional drawings.

The deviation of the layout distances from the ideal distances causes the so-called stress (Kruskal, 1964):

stress(X) = Σ_{i<j} wij (‖xi − xj‖ − dij)²,   (2.13)

where the weights are typically chosen as wij = 1/d²ij to better emphasize local distances. The stress is reduced in each iteration until convergence.
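For concreteness, the stress of Eq. (2.13) with the weighting wij = 1/d²ij can be evaluated as in the following small sketch (Python/NumPy, an illustration rather than the thesis' code); it assumes dij > 0 for all i ≠ j, i.e., a connected graph.

import numpy as np

def stress(X, D):
    """Stress of layout X (n x d) w.r.t. ideal distances D (n x n),
    with weights w_ij = 1/d_ij^2, cf. Eq. (2.13)."""
    diff = X[:, None, :] - X[None, :, :]          # pairwise coordinate differences
    layout_dist = np.linalg.norm(diff, axis=-1)   # ||x_i - x_j||
    i, j = np.triu_indices(len(X), k=1)           # every pair i < j once
    w = 1.0 / D[i, j] ** 2
    return float(np.sum(w * (layout_dist[i, j] - D[i, j]) ** 2))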

Stress Majorization Gansner et al. (2005a) show how to bound the stress from above using a quadratic majorant FX(Y),

stress(Y) ≤ FX(Y),   (2.14)

with equality for Y = X. Instead of the original function, this majorant can then be minimized more robustly using various methods. The localized method, as Gansner et al. (2005a) call it, iteratively moves each vertex i ∈ V to the weighted average of the positions voted for by all other vertices; thus, for each dimension a ∈ {1, 2},

Xi^(a) ← [ Σ_{j≠i} wij ( xj^(a) + dij (xi^(a) − xj^(a)) / ‖xi − xj‖ ) ] / [ Σ_{j≠i} wij ].   (2.15)
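One sweep of the localized update (2.15) could be sketched as follows (Python/NumPy, again only an illustration); a small constant guards against division by zero when two positions coincide.

import numpy as np

def localized_sweep(X, D, eps=1e-9):
    """One pass of the localized stress-majorization update, Eq. (2.15),
    using weights w_ij = 1/d_ij^2. X is modified in place and returned."""
    n = len(X)
    for i in range(n):
        num = np.zeros(X.shape[1])
        den = 0.0
        for j in range(n):
            if j == i:
                continue
            w = 1.0 / D[i, j] ** 2
            norm = max(np.linalg.norm(X[i] - X[j]), eps)
            num += w * (X[j] + D[i, j] * (X[i] - X[j]) / norm)
            den += w
        X[i] = num / den
    return X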

More formally, the minimum of FX(Y) can be derived by differentiating with respect to Y and solving the resulting system of linear equations for each dimension a separately,

Lw Y^(a) = LX X^(a),   (2.16)

which is a system of the form Ax = b, where Lw is the weighted Laplacian matrix, Y^(a) the vector of unknown coordinates for dimension a, and LX a matrix depending only on the current layout X. More precisely, for i, j ∈ V,

[Lw]ij = −wij                     if i ≠ j,
[Lw]ii = Σ_{k≠i} wik,                          (2.17)

[LX]ij = −wij dij / ‖Xi − Xj‖     if i ≠ j,
[LX]ii = −Σ_{k≠i} [LX]ik.                      (2.18)

In fact, the localized method corresponds to the first iteration of the Jacobi method for solving (2.16) (Klimenta, 2012, page 135).

Since (2.16) is of the form Ax = b, where A is a positive semi-definite matrix, one can use the Conjugate Gradient method (Hestenes and Stiefel, 1952; Shewchuk, 1994) to solve the system of linear equations iteratively. Due to the sparseness of A, one iteration needs only O(|V|) time using, e.g., a compressed row format for the matrix representation (Saad, 2003). Although the localized method will be sufficient for most cases (Brandes and Pich, 2009), we will see in Chapter 7 that in some special cases the choice of method can make a huge difference in terms of runtime.
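A hedged sketch of one majorization step via the linear system (2.16), solved per dimension with SciPy's Conjugate Gradient routine, is given below; note that with weights for all vertex pairs the matrices are dense here, whereas the O(|V|) bound assumes a sparse A.

import numpy as np
from scipy.sparse.linalg import cg

def majorization_step(X, D, eps=1e-9):
    """One stress-majorization step: solve L_w Y^(a) = L_X X^(a)
    for each dimension a, cf. Eqs. (2.16)-(2.18), with w_ij = 1/d_ij^2."""
    n = len(X)
    W = 1.0 / (D + np.eye(n)) ** 2              # w_ij = 1/d_ij^2; diagonal is a dummy
    np.fill_diagonal(W, 0.0)
    dist = np.linalg.norm(X[:, None] - X[None, :], axis=-1)
    dist = np.maximum(dist, eps)                # avoid division by zero
    Lw = -W
    np.fill_diagonal(Lw, W.sum(axis=1))         # Eq. (2.17)
    LX = -W * D / dist                          # off-diagonal entries of Eq. (2.18)
    np.fill_diagonal(LX, -LX.sum(axis=1))       # diagonal of Eq. (2.18)
    Y = np.empty_like(X, dtype=float)
    for a in range(X.shape[1]):                 # each dimension separately
        # L_w is positive semi-definite; the right-hand side is orthogonal to
        # its null space (the all-ones vector), so CG is applicable here.
        Y[:, a], _ = cg(Lw, LX @ X[:, a], x0=X[:, a])
    return Y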