• Keine Ergebnisse gefunden

Uniqueness of Ordinal Embedding

N/A
N/A
Protected

Academic year: 2022

Aktie "Uniqueness of Ordinal Embedding"

Copied!
28
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Uniqueness of Ordinal Embedding

Matth¨aus Kleindessner KLEINDESSNER@INFORMATIK.UNI-HAMBURG.DE

Ulrike von Luxburg LUXBURG@INFORMATIK.UNI-HAMBURG.DE Department of Computer Science, University of Hamburg, Germany

Abstract

Ordinal embedding refers to the following problem: all we know about an unknown set of points x1, . . . , xnRdare ordinal constraints of the formkxixjk<kxkxlk; the task is to construct a realizationy1, . . . , ynRdthat preserves these ordinal constraints. It has been conjectured since the 1960ies that upon knowledge of all ordinal constraints a large but finite set of points can be approximately reconstructed up to a similarity transformation. The main result of our paper is a formal proof of this conjecture.

Keywords:non-metric multidimensional scaling, monotone mapping, isotonic mapping

1. Introduction

We consider the problem of ordinal embedding, also called ordinal scaling, non-metric multidimen- sional scaling, monotonic embedding, or isotonic embedding. Consider a set x1, . . . , xnin some metric space(X,dist), but assume that the distances between these points are unknown. All we get to see are ordinal relationships, namely whetherdist(xi, xj)<dist(xk, xl)or vice versa. The goal of ordinal embedding is to constructy1, . . . , ynRdsuch that all ordinal constraints are preserved (throughout the paper,k · kdenotes the Euclidean norm):

dist(xi, xj)<dist(xk, xl)⇒ kyi−yjk<kyk−ylk.

The problem of ordinal embedding has first been studied in the psychometric community by Shepard(1962a,b) andKruskal(1964a,b), see also the monographBorg and Groenen(2005). Lately it has drawn quite some attention in the machine learning community (Quist and Yona,2004;Ros- ales and Fung,2006;Agarwal et al., 2007;Shaw and Jebara, 2009;McFee and Lanckriet, 2009;

Jamieson and Nowak, 2011a;McFee and Lanckriet,2011;Tamuz et al.,2011;Ailon,2012), also in its special case of ranking (Ouyang and Gray,2008;McFee and Lanckriet,2010;Jamieson and Nowak, 2011b; Lan et al., 2012; Wauthier et al., 2013). Even though ordinal embedding dates back to the 1960ies and is widely used in practice, surprisingly little is known about its theoreti- cal properties. Particularly striking, one of the most elementary properties, namely the uniqueness of ordinal embeddings, has never been established in a finite sample setting. It is widely believed that, upon knowledge of all ordinal relationships, a point setx1, . . . , xnRdcan be approximately reconstructed up to a similarity transformation if nis “large enough” (p. 294 ofShepard, 1966;

Section 2.2 ofBorg and Groenen,2005; Section 4.13.2 ofDattorro,2005). Numerous simulation experiments have been published as supporting evidence (Shepard,1966;Young,1970;Sherman, 1972). The main result of our paper is a formal proof that this uniqueness conjecture is indeed true:

Consider a sequence of points(xn)n∈N that are dense in some “nice” setK ⊆Rd. Letyn1, . . . , ynn

(2)

KLEINDESSNER VONLUXBURG

be any ordinal embedding ofx1, . . . , xn. Then, asn→ ∞, the set of embedded points always con- verges to the set of original points, up to similarity transformations such as rotations, translations, rescalings, or reflections. This even holds if we only know about “local ordinal relationships”, that is distance comparisons between points in small subregions.

Our proofs are elementary in the sense that we do not apply any heavy mathematical machinery.

However, details are delicate and require a careful treatment.

2. Setup, definitions and notation

We start this section with the definition of the two central notions in our paper, ordinal embeddings and isotonic functions. We will see below that these two notions are closely related.

Definition 1 (Ordinal embedding) Consider two sets Xn = {x1, . . . , xn} ⊆ Rd and Yn = {y1, . . . , yn} ⊆Rd.Ynis anordinal embeddingofXnif for all1≤i, j, k, l≤n,

kxi−xjk<kxk−xlk ⇒ kyi−yjk<kyk−ylk. (1) Ynis a weak ordinal embeddingofXnif (1) holds for all1 ≤ i, j, k, l ≤ nwith i = k. Yn is a strong ordinal embeddingofXnif (1)holds for all1≤i, j, k, l≤n, and additionallykxi−xjk= kxk−xlk ⇒ kyi−yjk=kyk−ylkfor all1≤i, j, k, l≤n.

Definition 2 (Isotonic functions) LetΩ ⊆ Rdandf : Ω → Rdbe an arbitrary function. f is a similarityif there isλ > 0 such that for allx, y ∈ Ωwe havekf(x)−f(y)k = λkx−yk. f is isotonicor anisotonyif for allx, y, z, w∈Ω,

kx−yk<kz−wk ⇒ kf(x)−f(y)k<kf(z)−f(w)k.

f isweakly isotonicif this property only holds forx, y, z, w∈Ωwithx=z.f isstrongly isotonic if it is isotonic and additionally satisfieskx−yk=kz−wk ⇒ kf(x)−f(y)k=kf(z)−f(w)kfor allx, y, z, w∈Ω. We say thatf islocallya similarity / (weakly / strongly) isotonic if for each point x ∈ Ωthere exists a neighborhoodU(x)inΩsuch thatf|U(x) has the corresponding property. If we want to emphasize that a functionf : Ω → Rdhas a property not only locally but on all ofΩ, we sometimes say thatf isgloballya similarity / (weakly / strongly) isotonic.

Let us mention some obvious but important observations. Similaritiesf :RdRdare nothing else than the well-known similarity transformations, given byf(x) =λOx+bfor some orthogonal matrixOand an offsetb∈Rd. For generalΩ, they are simply given by the restrictions of similarity transformations toΩ(see LemmaAin AppendixA). Obviously, we have

similarity⇒strongly isotonic⇒isotonic⇒weakly isotonic,

but for general Ωnone of the converses are true. Any weakly isotonic function is injective. Iff is a similarity or a strong isotony, so is f1, but this does not necessarily hold for isotonies. A composition of similarities / (weak / strong) isotonies is again a similarity / (weak / strong) isotony.

Obviously, y1, . . . , yn is a (weak / strong) ordinal embedding of x1, . . . , xnif and only if the mappingf :{x1, . . . , xn} → {y1, . . . , yn}given byf(xi) =yiis (weakly / strongly) isotonic. The uniqueness question for ordinal embedding can thus be formalized as follows: iff is a (weakly / strongly) isotonic mapping between two finite point sets, can it be approximated by a similarity? It

(3)

is well-known that any strongly isotonic functionf :RdRddefined on the full domainRdis a similarity transformation. One can see this by exploiting properties of sphere-preserving mappings in Euclidean geometry (see McKemie and V¨ais¨al¨a(1999) and also the argumentation inShepard, 1966), by an elegant argument related to positive definite functions (Schoenberg,1938), and also by the Beckman-Quarles theorem (Beckman and Quarles,1953). The key question of this paper is in what sense such a property already holds for functions defined on a finite set.

Let us conclude this section with some standard notation for the rest of the paper. For any subset A ⊆ Rd we denote its linear hull by[A] = {Pn

i=1λiai : n ∈ N, ai ∈ A, λiR}and its affine hull byH(A) ={Pn

i=1λiai : n∈N, ai ∈A, λiR,Pn

i=1λi = 1}. Forz ∈ Rdandr > 0the open ball with centerzand radiusr isUr(z) = {x ∈ Rd : kx−zk < r} and the closed ball is Ur(z) = {x ∈ Rd :kx−zk ≤ r}. For a vector-valued functionf :X → Rd andj = 1, . . . , d we write fj for the jth component of f. For two functions f : X1Rd and g : X2Rd and an arbitrary subsetX ⊆ X1 ∩X2 we denote the supremum norm betweenf andgon X by kf−gk∞(X)= supxXkf(x)−g(x)k. At some points we will speak of a cross-polytope. By this we mean the imageT(C)of thed-dimensional standard cross-polytopeC, which is given by the convex hull of all permutations of(±1/0/0/ . . . /0)∈Rd, under some similarity transformationT. 3. Main results

In this section we present our main results. The proofs of the theorems are deferred to Sections4 and5. Our key question is to what extent an isotonic functionf is uniquely determined, up to a similarity transformation. Our first result concerns the infinite case. We show that iff is defined on a dense subset of some “nice” setGandf is locally isotonic, then it is actually a similarity.

Theorem 3 (Isotonic on a dense set implies similarity) LetG ⊆ Rd be an open and connected domain andΩ⊆Ga dense subset. Letf : Ω→Rdbe a locally isotonic function. Then there exists a unique extension off to a similarity transformationF :RdRd.

The next theorem deals with the finite case and is the main result of this paper. We consider Xn = {x1, . . . , xn} ⊆ Rd and an isotonic mappingϕn : Xn → ϕn(Xn) — hence ϕn(Xn) = {ϕn(x1), . . . , ϕn(xn)}is an ordinal embedding ofXn. We prove thatϕncan be approximated by a similarity transformation, up to arbitrary precision asn→ ∞.

Theorem 4 (Isotonic on a finite set implies approximate similarity)

1. Global Version: LetK =Ur(z)⊆Rdbe a closed and bounded ball (for some arbitraryr >0, z∈Rd). Let(xn)nNbe a sequence of pointsxn∈Ksuch that{xn:n∈N}is dense inK. Let 0 < R < ∞and(ϕn)nN be a sequence of isotonic functionsϕn :{x1, . . . , xn} →UR(0)⊆ Rd. Then there exists a sequence(Sn)n∈Nof similarity transformationsSn:RdRdsuch that kSn−ϕnk({x1,...,xn})→0 as n→ ∞. (2) 2. Local Version: More generally, letK =Sk

i=1KiRdbe a finite union of closed and bounded balls such thatSk

i=1Ki is connected. Let(xn)nN be a sequence of pointsxn ∈ K such that

(4)

KLEINDESSNER VONLUXBURG

{xn : n ∈ N} is dense in K. Let 0 < R < ∞ and (ϕn)nN be a sequence of functions ϕn:{x1, . . . , xn} →UR(0)⊆Rdsuch that

∀i∈ {1, . . . , k}:ϕn|{x1,...,xn}∩Kiis isotonic.

Then there exists a sequence(Sn)n∈Nof similarity transformationsSn:RdRdwith(2). Our proofs show that we can replace the set Kin Part1of Theorem4by a cross-polytope or any convex set “between a cross-polytope and a ball”. Consequently, we can replaceKin Part2by any finite union of such sets if we additionally assume that all these sets satisfyKi⊆Ki. Note that the assumption that all functionsϕnmap to the same bounded ballUR(0)is necessary. Otherwise we could blow up the configuration of the image points by a larger and larger constant and prevent the approximation errorkSn−ϕnkfrom converging.

4. Proof of Theorem3(the infinite case)

The proof of Theorem3consists of a number of steps, which we formulate as separate lemmas.

Lemma 5 (Isotonic implies continuous) LetΩ ⊆Rdandf : Ω→Rdbe a locally isotonic func- tion. Thenf is continuous. If we additionally assumeΩto be a set with at least one limit point which is contained in it andf to be globally isotonic, thenf is even uniformly continuous.

Proof (sketch) Since continuity is a local property, it suffices to show that for any pointx ∈ Ω there is a neigborhoodU(x)inΩsuch thatf|U(x)is continuous. Hence, w.l.o.g. we may assumef to be globally isotonic. The key observation is that iff was discontinuous at one point, the distance between different points inf(Ω)would be bounded from below by a positive constant. In case that Ωis uncountable, this immediately contradicts the separability of Rd. In general, a compactness argument leads to the desired contradiction. In case Ωhas a limit point which is contained in it, denote one such point byx0and letε >0be arbitrary. We already know thatf is continuous, and hence there existsδ >0such thatkf(x)−f(x0)k< εfor allx∈Ωwithkx−x0k< δ. Letx0 ∈Ω with0<kx0−x0k =δ0 < δ(sincex0 is a limit point, there is such a pointx0). For allx, y ∈Ω withkx−yk< δ0we havekx−yk<kx0−x0kand hencekf(x)−f(y)k<kf(x0)−f(x0)k< ε.

The next lemma shows that ifΩ ⊆ Rd is a ball andf : Ω → Rdis weakly isotonic, then it is even strongly isotonic, at least on a slightly smaller ball.

Lemma 6 (Weakly isotonic implies strongly isotonic on balls) Let Ω = Uε(z) ⊆ Rd and f : Ω→Rdbe weakly isotonic. Thenf|Uε/4(z)is strongly isotonic.

Proof (sketch) In order to prove thatf|Uε/4(z)is isotonic, we have to show thatkf(x)−f(y)k<

kf(v)−f(w)k for all x, y, v, w ∈ Uε/4(z) with kx−yk < kv −wk. The idea is to use in- termediate points u1, . . . , un ∈ Ω such that kx −yk < ky −u1k < ku1 −u2k < . . . <

kun1 −unk < kun−vk < kv −wk. Since f is assumed to be weakly isotonic, it follows thatkf(x)−f(y)k < kf(y)−f(u1)k < kf(u1) −f(u2)k < . . . < kf(un−1)−f(un)k <

kf(un)−f(v)k< kf(v)−f(w)k. Using continuity we can show thatf|Uε/4(z) is even strongly isotonic.

(5)

The following proposition already shows that for functions defined on all points of an open and connected domain, all the properties we defined in Definition2are equivalent. The key ingredient in the proof is that the midpoint of a line segment between two points inΩis mapped by an isotony to the midpoint of the line segment between the corresponding image points.

Proposition 7 (Weakly isotonic implies similarity) Let Ω ⊆ Rd be an open and connected do- main andf : Ω→Rdbe a locally weakly isotonic function. Thenf is globally a similarity.

Proof (details can be found in AppendixB) First we consider a globally strongly isotonic function f : Ω = Ur(z) → Rd. This allows us to define a function µ : [0,diam Ω] → [0,diamf(Ω)] by µ(kx−yk) = kf(x)−f(y)kfor all x, y ∈ Ω. In order to show thatf is a similarity, we have to show thatµ is linear. By showing that the midpoint of a line segment between two points in Ωis mapped by f to the midpoint of the line segment between the corresponding image points, we iteratively obtainµ(2jidiam Ω) = 2jidiamf(Ω),i∈ N,j ∈ {0, . . . ,2i}(see AppendixBfor details). By Lemma5,f is continuous and so isµ, implying thatµ(t) =t·(diamf(Ω)/diam Ω).

Now assume thatΩis open and connected andf : Ω→Rdis a locally weakly isotonic function.

By Lemma6,f is locally strongly isotonic. Hence, givenx∈Ωwe can chooseε(x)>0such that Uε(x)(x)⊆Ωand thatf|Uε(x)(x):Uε(x)(x)→ Rdis globally strongly isotonic. It follows from the above thatf|U

ε(x)(x)is a similarity and hencef : Ω→Rdis locally a similarity. By LemmaB(see AppendixA),f is even globally a similarity.

Finally, a continuous extension of an isotonic mapping is isotonic too. The proof is elementary.

Lemma 8 (Continuous extension inherits isotony) LetΩ ⊆Rdsuch thatK = Ωis convex. Let f : Ω→Rdbe isotonic andF :K →Rdbe a continuous extension off. ThenF is isotonic.

Now, we have collected all ingredients to prove Theorem3.

Proof of Theorem3(sketch) In case thatfis globally isotonic andΩ =Gis convex, we consider the unique continuous extensionFeoff toΩ. This is possible sincef is uniformly continuous by Lemma5. By Lemma8, Feis isotonic. According to Proposition7,Fe|G is even a similarity. By Lemma A(see AppendixA),Fe|G can be uniquely extended to a similarity F : RdRd. For the general case we restrictf to several intersections ofΩand small balls. Considering one such a restriction, we are in the situation of the previous case and obtain a unique extension. We can show that all these extensions have to coincide similarly to the proof of LemmaBfrom AppendixA.

5. Proof of Theorem4(the finite case)

Cased = 1. The case d = 1 is particularly simple: it is easy to see that any weakly isotonic functionf : Ω→R(withΩ⊆R) is either strictly increasing or decreasing. The following lemma is the main step of the proof in the one-dimensional case. It considers points that approximate a grid, and proves that this property remains intact after an isotonic mapping. See Figure1 for an illustration.

Lemma 9 (Isotonic maps approximately preserve a grid) LetN ∈ N. For someε1 <1/22N+1 setεk12k1,2 ≤k ≤N, andδ = ε1/2. Fork∈ {1, . . . , N}andi ∈ {1,3, . . . ,2k−1}set

(6)

KLEINDESSNER VONLUXBURG

JMLR: Workshop and Conference Proceedings vol 35:1–2, 2014 27th Annual Conference on Learning Theory

Full Title of Article

author names withheld Editor:Under Review for COLT 2014

Abstract This is a great paper and it has a concise abstract.

Keywords:List of keywords 1. Introduction

0 12 1

yl1,1 y1,1r

0 1 12 1

4 3

4

yl1,1 y1,1r

yl2,1 y2,1r yl2,3 y2,3r

c 2014 .

(a)N = 1

JMLR: Workshop and Conference Proceedings vol 35:1–2, 2014 27th Annual Conference on Learning Theory

Full Title of Article

author names withheld Editor:Under Review for COLT 2014

Abstract This is a great paper and it has a concise abstract.

Keywords:List of keywords 1. Introduction

0 12 1

yl1,1 yr1,1

0 1 12 1

4 3

4

yl1,1 y1,1r

yl2,1 y2,1r yl2,3 y2,3r

c 2014 .

(b)N = 2

Figure 1: The idea of Lemma9is to place points in small intervals close to the grid pointsi/2N (yk,il on the left side,yrk,i on the right side) in such a way that the ordinal constraints between all these points are sufficient to determine the grid cells they belong to, independent of their exact location within the intervals.

xk,i=i/2kand letyk,il ,yrk,ibe arbitrary elements of(xk,i−εk−δ, xk,i−εk)and(xk,ik, xk,i+ εk+δ), respectively. Letϕ:{0,1} ∪ {ymk,i:m∈ {l, r}, k≤N, i∈ {1,3, . . . ,2k−1}} →[0,1]

be a weakly isotonic function withϕ(0) = 0andϕ(1) = 1. Then it holds that ymk,i−ϕ(yk,im)< 1

2N, m∈ {l, r}, k≤N, i∈ {1,3, . . . ,2k−1}. (3) Proof (details can be found in AppendixB) By induction overN we prove

ϕ(yk,il )∈

2N−ki−1

2N ,2N−ki 2N

, ϕ(yrk,i)∈

2N−ki

2N ,2N−ki+ 1 2N

,

for all k ≤ N, i ∈ {1,3, . . . ,2k−1}, which immediately implies (3). The basis is clear (see Figure 1(a)): Due to ϕ(0) = 0 and ϕ(1) = 1, ϕ is strictly increasing and hence0 = ϕ(0) <

ϕ(y1,1l ) < ϕ(y1,1r ) < ϕ(1) = 1. Since |y1,1l −0| < |y1,1l −1|and ϕ is weakly isotonic, we have |ϕ(y1,1l )−ϕ(0)| < |ϕ(yl1,1)−ϕ(1)|and thus can conclude thatϕ(yl1,1) ∈ (0,1/2). In the same way we obtainϕ(y1,1r ) ∈ (1/2,1). We demonstrate the inductive step by proving that the statement also holds forN = 2(see Figure1(b)): We already know thatϕ(y1,1l ) ∈ (0,1/2)and ϕ(y1,1r ) ∈ (1/2,1). Furthermore, due to ϕ being strictly increasing, we have 0 < ϕ(y2,1l ) <

ϕ(y2,1r ) < ϕ(y1,1l ) < ϕ(yr1,1) < ϕ(yl2,3) < ϕ(y2,3r ) < 1. The choice of (εk)1kN andδ guar- antees that |yl2,1 −0| < |y2,1l −y1,1l |and |y2,1r −yr1,1| < |y2,1r −0|leading to|ϕ(y2,1l )−0| <

|ϕ(yl2,1)−ϕ(yl1,1)|and|ϕ(yr2,1)−ϕ(yr1,1)|<|ϕ(yr2,1)−0|. This yields2ϕ(y2,1l )< ϕ(y1,1l )<1/2 and1/2 < ϕ(y1,1r ) < 2ϕ(yr2,1)and thusϕ(y2,1l ) ∈ (0,1/4)andϕ(yr2,1), ϕ(yl1,1) ∈(1/4,1/2). In the same way we can show thatϕ(y2,3r )∈(3/4,1)andϕ(yl2,3), ϕ(yr1,1)∈(1/2,3/4).

Now it is straightforward to prove Theorem4for the cased= 1(Proposition10implies Part1 of Theorem4; the proof of Part2is the same as for the cased≥2, which follows later on).

Proposition 10 (Statement ford= 1) Let I = [a, b] (for some−∞ < a < b < ∞) and let (xn)nNbe a sequence of pointsxn ∈ I such that{xn :n ∈ N}is dense inI. Let0 < R < ∞ and(ϕn)n∈Nbe a sequence of weakly isotonic functionsϕn:{x1, . . . , xn} →[−R, R]. Then there exists a sequence(Sn)nNof similarity transformationsSn:RRwith(2).

Proof (sketch) By appropriately rescaling the domain and the image ofϕnwe may assume that I = [0,1]and that ϕn maps to [0,1] withϕn(0) = 0, ϕn(1) = 1. We use Lemma 9 in order to show thatϕn for large values ofncan be approximated by the identity: Choose N ∈ Nsuch that 1/2N is sufficiently small. Since {xn : n ∈ N} is dense in I, there exists N0N such that in each of the intervals(xk,i−εk−δ, xk,i−εk)and(xk,ik, xk,ik+δ)as defined in

(7)

Lemma9(for the chosenN) there lies an element of{x1, . . . , xN0}. Ifn≥N0,y∈ {x1, . . . , xn}, andy is one of these elements, we immediately obtain|y−ϕn(y)| < 1/2N according to (3). If yis not one of these elements, we can use the monotonicity ofϕnto infer that|y−ϕn(y)|is small.

Case d ≥ 2. The case d ≥ 2 is harder to deal with. Our basic idea is to show that an isotonic mappingϕn : {x1, . . . , xn} → Rd, up to some rescaling, is anε(n)-nearisometry, that is ϕnsatisfies

kx−yk −ε(n)≤ kϕn(x)−ϕn(y)k ≤ kx−yk+ε(n), x, y∈ {x1, . . . , xn}. (4) Then, by a theorem ofAlestalo et al.(2001),ϕncan be approximated by an isometry up to an error depending (essentially) only onε(n)and going to zero asε(n)→0.

For proving thatϕnis anε(n)-nearisometry we observe the following: sinceϕn is isotonic, it is sufficient to prove (4) only for some pairsx, ysuch thatkx−ykis roughly uniformly distributed in[0,diam{x1, . . . , xn}]. Hence, we would like to consider points close to a straight line and argue in a way similar to Lemma9that their relative positions along the line are almost preserved by an isotonic mapping. Yet the problem is that, in general, there is no guarantee that the points are still close to a straight line after applying an isotony. However, assuming that there are points located close to the vertices of a cross-polytope and that these are “fixed” points (this is Assumption (?) in the following lemma), we can show that this is indeed the case and Lemma 9can be generalized in the following sense. Here we just provide a sketch of the lemma (see also Figure 2 for an explanation). A detailed version can be found in AppendixC.

Lemma 11 (Under Assumption(?)isotonic mappings preserve an approximately straight line) Letd≥2. LetN ∈Nsuch that

ω= 24 Γ(d2 + 1) πd2

!1d 1 2N −1

1d

< 1 2(d−1)

be fixed. Let Us+, Us,Ues+, Ues, s = 1, . . . , d, and Uk,ij , Uk,il ,Uk,ir , k ≤ N, i ∈ {1,3, . . . ,2k− 1}, j ∈ {2, . . . , d}, be open balls with some certain properties (see Appendix Cfor details). Let Xs+, XsRd,s= 1, . . . , d, be arbitrary elements ofUs+andUs, respectively,zjk,iRdbe an arbitrary element ofUk,ij , andyk,il ,yk,irRdbe arbitrary elements ofUk,il andUk,ir , respectively.

Letϕ:{X1+, X1, . . . , Xd+, Xd} ∪ {zk,ij :k≤N, i∈ {1,3, . . . ,2k−1}, j∈ {2, . . . , d}} ∪ {yk,im : m∈ {l, r}, k≤N, i∈ {1,3, . . . ,2k−1}} →Rdbe an isotonic function and assume that

ϕ(Xs+)∈Ues+, ϕ(Xs)∈Ues, s= 1, . . . , d. (?) Setγ(−1) =γ(1) = ˜α1 andγ(0) = ˜α1+d21(ω+ρ)(whereα˜1is the radius of the ballsUe1+,Ue1 andρa small number depending on size and location of the ballsUes+,Ues,s= 2, . . . , d), and define for2≤k≤N andi∈ {1,3, . . . ,2k−1}the positive expressionγ(−1 +i/2k1)recursively by

γ

−1 + i 2k1

= 1 2

γ

−1 +i−1 2k1

−1 +i+ 1 2k1

+ (d−1)(ω+ 2ρ)

.

(8)

KLEINDESSNER VONLUXBURG B

A

Z2 Z2+

Z3 Z+3

x12U(( 1 +1n/0/0)) e+3,(1/1/1)2U"

E3,(1/+ 1/1)

( 1/0)

1 2

1

X1 2U1 2 X1+2U1+ X2+2U2+

X2 2U2 z22,12U2,12 z21,12U1,12

yr1,12U1,1r

y2,1l 2U2,1l

2

(a)

SHORTTITLE

( 1/0)

3 +3

⇢ >

{p:kp '(X2+)k<kp '(X2)k}

'(X1)2Ue1

'(X1+)2Ue1+ '(X2+)2Ue2+

'(X2)2Ue2

3

(b)

Figure 2: Explanation of Lemma11(ford= 2)We consider an isotonic mapϕdefined on the following point set (see2(a)): (i)X1+, X1, X2+, X2are located in small balls around the vertices of a cross-polytope and assumed to be “fixed” underϕ(this is Assumption (?)). (ii) The pointsylk,i, yrk,iapproximate a grid as in Lemma9on the line segment betweenX1 andX1+and are closer toX2 than toX2+. (iii) The points zk,i2 are close to the pointsylk,i andyk,ir but are closer toX2+ than toX2. Sinceϕis isotonic, the points ϕ(ylk,i), ϕ(yk,ir )are closer toϕ(X2)than toϕ(X2+)and henceϕ2(yk,il ), ϕ2(yrk,i)< ρwhereas for the points ϕ(z2k,i)it is the other way round such thatϕ2(zk,i2 )>ρ(see2(b)). However,yk,im (m∈ {l, r}) andzk,i2 are close to each other and so areϕ(yk,im)andϕ(zk,i2 ). We can conclude that all pointsϕ(yk,im)are close to the first coordinate axis. This allows us to estimate the location ofϕ(ymk,i)along similar lines as in Lemma9.

LetN < N such thatN·2N< 5(d+1)(ω+ρ+ ˜1 α1). Then we have ymk,i−ϕ yk,im< γ(xk,i) +ω+ (d−1)(ω+ρ)<3d√

ω, m∈ {l, r}, (5) wherexk,i=−1 +2ki1, for all1≤k≤Nandi∈ {1,3, . . . ,2k−1}.

Proof (sketch) We prove that for all1≤k≤Nandi∈ {1,3, . . . ,2k−1}, ϕ(ylk,i)∈(xk,i−γ(xk,i)−ω, xk,i+γ(xk,i))×(−ρ−ω, ρ)d1,

ϕ(yrk,i)∈(xk,i−γ(xk,i), xk,i+γ(xk,i) +ω)×(−ρ−ω, ρ)d−1. (6) It is elementary to show thatγ(xk,i) < 12(d−1)√

3ω, k ≤ N, and because ofyk,il ∈ (xk,i− ω, xk,i)×(−ω,0)d−1, yrk,i∈(xk,i, xk,i+ω)×(−ω,0)d−1this immediately yields (5).

All pointsylk,i, yrk,i, zk,ij lie in the convex hull of the pointsX1+, X1, . . . , Xd+, Xd. Sinceϕis isotonic and satisfies Assumption (?), one can roughly estimate that

ϕ(ylk,i), ϕ(yk,ir ), ϕ(zk,ij )∈[−3,3]d. (7) The idea for provingϕj(yk,il ), ϕj(yk,ir ) ∈ (−ρ−ω, ρ), j ∈ {2, . . . , d}, is the following: Letj be fixed. Form∈ {l, r},k∈ {1, . . . , N},i∈ {1,3, . . . ,2k−1}we havekyk,im −Xjk<kyk,im −Xj+k and kzjk,i−Xj+k < kzk,ij −Xjk. Since ϕ is isotonic, it follows thatkϕ(ymk,i) −ϕ(Xj)k <

kϕ(yk,im)−ϕ(Xj+)kandkϕ(zk,ij )−ϕ(Xj+)k < kϕ(zk,ij )−ϕ(Xj)k. Because of (?) and (7), we can conclude thatϕj(ymk,i) < ρ andϕj(zjk,i) > −ρ(see Figure 2(b)). The distance between any

(9)

two points zjk

1,i1, zkj

2,i2 is larger than the distance between any two pointszk,ij , ylk,i (or zjk,i, yk,ir , respectively), that is form∈ {l, r},k∈ {1, . . . , N},i∈ {1,3, . . . ,2k−1}it holds that

kzk,ij −yk,imk<minn

ku−vk:u6=v∈n z˜j

k,˜i : ˜k≤N,˜i∈n

1,3, . . . ,2˜k−1ooo

. (8)

Letm∈ {l, r},k0≤N,i0∈ {1,3, . . . ,2k0−1}be arbitrary and writer=kϕ(zkj

0,i0)−ϕ(ymk

0,i0)k. Due to (8) andϕbeing isotonic, all pointsϕ(zk,ij )are located at distance larger thanrto each other which implies that the intersection of two balls (whether open or closed) with radiusr/2and centers ϕ(zjk

1,i1)andϕ(zkj

2,i2), respectively, is empty. Recall (7). Due to (?) and, again,ϕbeing isotonic, we clearly haver ≤3. Hence, with each pointϕ(zk,ij )at least a fraction of1/2dof the volume of the ballUr/2(ϕ(zk,ij ))is contained in[−3,3]dtoo. We can infer that

(2N −1)1 2d

πd2 Γ(d2 + 1)

r 2

d

≤6d, or equivalentlyr ≤ω. Hence, we have|ϕj(zkj

0,i0)−ϕj(ykm

0,i0)| ≤ kϕ(zkj

0,i0)−ϕ(ymk

0,i0)k ≤ωand finally obtainϕj(ylk0,i0), ϕj(yrk0,i0)∈(−ρ−ω, ρ). Similar to (8), we also have

kyk,il −yrk,ik<minn

ku−vk:u6=v∈n z˜j

k,˜i: ˜k≤N,˜i∈n

1,3, . . . ,2k˜−1ooo , and with the same argument as above obtain|ϕ1(yk,il )−ϕ1(yrk,i)| ≤ω,k≤N,i∈ {1,3, . . . ,2k− 1}. Now, (6) can be shown by induction overk.

The following lemma shows that the Assumption (?), which says that points close to the vertices of a cross-polytope are mapped approximately to themselves, can be taken as satisfied if the isotonic function acts on sufficiently many points. See Figure 3 for an explanation. Again, here we just provide a sketch of the lemma and the detailed version is in AppendixC.

Lemma 12 (Assumption(?)can be taken as satisfied) Letd≥2. LetN0Nsuch that

ω0 = 32 Γ(d2+ 1) πd2

!1d 1

d

N0

is sufficiently small andr <1andµ, δ, ε > 0be appropriately chosen real numbers (see Appendix Cfor details). Define pointsA, B ∈RdandZs, Zs+Rd,s∈ {2, . . . , d}, by

A= (−1/0/ . . . /0), B = (1/0/ . . . /0), Z2= (0/−r/0/0/ . . .),

Z2+= (0/r/0/0/ . . .), Z3= (0/0/−r/0/ . . .), Z3+= (0/0/r/0/ . . .), and so forth.

Fors∈ {2, . . . , d}andv∈ {−1,1}dsetEs,v =Zs+µv,Es,v+ =Zs++µvand letes,v, e+s,vRdbe arbitrary elements ofUε(Es,v )andUε(Es,v+ ), respectively. Fori∈ {1, . . . ,2N0−1}letxiRdbe an arbitrary element ofUδ((−1 +Ni0/0/ . . . /0)). Letϕ:{A, B} ∪ {es,v, e+s,v :s∈ {2, . . . , d}, v∈

(10)

KLEINDESSNER VONLUXBURG JMLR: Workshop and Conference Proceedings vol 35:1–1, 2014 27th Annual Conference on Learning Theory

Full Title of Article

author names withheld

Editor:Under Review for COLT 2014

Abstract

This is a great paper and it has a concise abstract.

Keywords:List of keywords

1. Introduction

B

A

Z2 Z2+

Z3 Z3+

x12U (( 1 +N10/0/0)) e+3,(1/ 1/1)2U"

E+3,(1/ 1/1)

c 2014 .

Figure 3: Explanation of Lemma 12We consider an isotonic mapping ϕdefined on the following point set: (i)AandB are op- posite vertices of a cross-polytope. (ii) The points es,v, e+s,v are located in small balls around the vertices of hypercubes placed around the remaining vertices of the cross- polytope. (iii) Numerous points xi are located in small balls which are placed equidistantly betweenAandB. This yields ordinal constraints sufficient to show that all points es,v, e+s,v are “fixed” under ϕ up to some similarity transformation. The figure shows the setting of Lemma12ford= 3.

{−1,1}d} ∪ {xi :i = 1, . . . ,2N0−1} →Rdbe an isotonic function withkϕ(A)−ϕ(B)k= 2.

Then there exist a constantC(depending only ond) and an isometryS :RdRdsuch that kA−S(ϕ(A))k ≤Cp

A(ω0), kB−S(ϕ(B))k ≤Cp A(ω0), kZsm−S(ϕ(ems,v))k ≤Cp

A(ω0), m∈ {−,+}, s∈ {2, . . . , d},

wherev= (1/1/1/ . . . /1)andA(ω0)only depends onω0anddand satisfiesA(ω0)→0asω0 →0.

Proof (sketch) With an argument similar to the one subsequent to (8) in the proof of Lemma 11 we can show that all pointsp1, p2 in the domain ofϕand withkp1−p2k<kA−x1kmust satisfy kϕ(p1)−ϕ(p2)k< ω0/2. The parametersµ, δandεare chosen in such a way thatkems,v1−ems,v2k<

kA−x1kfor anym ∈ {−,+},s ∈ {2, . . . , d}and for allv1, v2 ∈ {−1,1}d. Using this and the assumption ofϕbeing isotonic, we can show that

2−ω0<kp+s −psk ≤2, s= 1, . . . , d,

kp+s −ps0k −ω0 <kp+s −p+s0k<kp+s −ps0k+ω0, s6=s0 ∈ {1, . . . , d}, kps −ps0k −ω0 <kps −p+s0k<kps −ps0k+ω0, s6=s0 ∈ {1, . . . , d},

(9)

wherep+1 = ϕ(B),p1 = ϕ(A)andp+s = ϕ(e+s,v),ps = ϕ(es,v)fors = 2, . . . , d. For example, let us provekp1 −p2k −ω0 <kp1 −p+2k<kp1 −p2k+ω0: Elementary calculations show that kA−e2,vk<kA−e+2,vkandkA−e+2,vck<kA−e2,vckwithvc= (−1/−1/−1/ . . . /−1). We inferkp1 −p2k<kp1 −p+2kandkp1 −ϕ(e+2,vc)k<kp1 −ϕ(e2,vc)kand thus obtain

kp1 −p+2k ≤ kp1 −ϕ(e+2,vc)k+kϕ(e+2,vc)−p+2k<kp1 −ϕ(e2,vc)k+kϕ(e+2,vc)−p+2k

≤ kp1 −p2k+kp2 −ϕ(e2,vc)k+kϕ(e+2,vc)−p+2k<kp1 −p2k+ω0. From (9) we can infer that

|hp+s −ps, p+s0−ps0i|<10ω0, s6=s0 ∈ {1, . . . , d}. (10)

(11)

Furthermore, we can show thatk(p+s +ps)−(p+s0+ps0)k,s6=s0 ∈ {1, . . . , d}, is small (provided ω0is small), that is

k(p+s +ps)−(p+s0+ps0)k2 ≤d 20ω0+ 810dω4d−10(4d)d1 2−ω010dω4d10(4d)d−1

!2

, s6=s0∈ {1, . . . , d}. (11) This is done by first applying the Gram-Schmidt process to the vectors(p+s −ps),s = 1, . . . , d.

By doing so we obtain an orthonormal basis ofRd whose elements (appropriately rescaled) differ from the vectors(p+s −ps)only up to some small error (depending onω0). Considering the Fourier coefficients of(p+s +ps)−(p+s0+ps0)with respect to this orthonormal basis then leads to (11) .

Now, setting Z1 = A, Z1+ = B, we consider the mappingf : {Z1, Z1+, . . . , Zd, Zd+} → {p1, p+1, . . . , pd, p+d}given byf(Zsm) =pms form∈ {−,+},s∈ {1, . . . , d}. Using (10) and (11) it is straightforward to show thatf is a2p

A(ω0)-nearisometry, that is it holds that kx−yk −2p

A(ω0)≤ kf(x)−f(y)k ≤ kx−yk+ 2p

A(ω0), x, y∈ {Z1, Z1+, . . . , Zd, Zd+}. According toAlestalo et al.(2001), Theorem 3.3, there exists a constantC0 (depending only ond

— can be chosen independently of the parametersr, µ, δ, ε) and an isometryT :RdRdsuch that kT(x)−f(x)k ≤ 2C0p

A(ω0),x ∈ {Z1, Z1+, . . . , Zd, Zd+}. SettingS =T1andC = 2C0 the statement of Lemma12follows immediately.

Now we can prove Theorem4ford≥2.

Proof of Part 1of Theorem4 (sketch) By Lemma C(see AppendixA) it is sufficient to prove that for everyε0 > 0 there exists N(ε0) ∈ Nsuch that for all n ≥ N(ε0) there is a similarity transformationS(n, ε0) :RdRdwithkϕn−S(n, ε0)k∞({x1,...,xn})< ε0.

In a nutshell, the basic idea is the following: Assume K is a ball with diameter only slightly larger than two and containing all the balls of Lemma11. Ifn∈Nis sufficiently large, in each of these balls there is an element of{x1, . . . , xn}. Assume for the moment thatϕnsatisfies Assump- tion (?) of Lemma11. Then from (5) we obtain an estimate for the expressionkϕn(x)−ϕn(y)k for roughly uniformly distributed values ofkx−ykin[0,2]≈[0,diam{x1, . . . , xn}]. Sinceϕnis isotonic, this gives an estimate forkϕn(x)−ϕn(y)kfor allx, y∈ {x1, . . . , xn}which is sufficient to show thatϕnis anε-nearisometry for some smallε. Hence, we can uniformly approximateϕn by an isometry according toAlestalo et al.(2001). It remains to be argued why Assumption (?) of Lemma11indeed can be taken as satisfied. However, this is the statement of Lemma12.

A bit more precisely, the main steps of the proof can be summarized as follows:

1. Since {xn : n ∈ N} is dense in K, we can choose N0N so large that there are points xA, xB∈ {x1, . . . , xN0}and a similarity transformT :RdRdwith the following properties:

• T(xA) = (−1/0/0/ . . . /0),T(xB) = (1/0/0/ . . . /0)

• U1(0)⊆T(K),diamT(K)is “sufficiently small”

• ∀y ∈ T(K) :Ur0(y)∩ {T(x1), . . . , T(xN0)} 6=∅wherer0 >0is smaller than the minimal radius of the finitely many open balls considered in Step 3 and smaller thanδ0from Step 6.

In the following, we considerϕnfor a fixedn≥N0.

Referenzen

ÄHNLICHE DOKUMENTE

• A transformation definition is a set of transformation rules that together describe how a model in the source language can be transformed into a model in the target language. •

• A transformation definition is a set of transformation rules that together describe how a model in the source language can be transformed into a model in the target language.. •

tiresome, hard work at dusty archives and taken to the active virtual life via social media. Almost all photos got comments, disproving or confirming

The crea- tion of mixed-use and socially mixed areas—coupled with good access to public transport, housing diversity, and sufficient provision of vibrant public spac- es

The cointegration test, shown in Table 9, (see Engle &amp; Granger, 1987; Engle and Yoo, 1987, Table 2), shows that in the two cases with monthly data (models 5 and 6),

The main way jihadi groups get hold of these weapons is by stealing them from the regu- lar armed forces – like ISIL did in Iraq or Boko Haram in Nigeria.. In

Ceasefire/peace negotiations should therefore include representatives of the Assad regime, Syrian political opponents and armed rebels from the Free Syrian Army, and

This, in my opinion and the opinion of others that I’ll quote in just a second, will be a tragedy for Israel because they will either have to dominate the