Metric Embedding with Relaxed Guarantees Ofer Neiman Ittai Abraham Yair Bartal.

Slides:



Advertisements
Similar presentations
Routing Complexity of Faulty Networks Omer Angel Itai Benjamini Eran Ofek Udi Wieder The Weizmann Institute of Science.
Advertisements

Lower Bounds for Additive Spanners, Emulators, and More David P. Woodruff MIT and Tsinghua University To appear in FOCS, 2006.
Embedding Metric Spaces in Their Intrinsic Dimension Ittai Abraham, Yair Bartal*, Ofer Neiman The Hebrew University * also Caltech.
On Complexity, Sampling, and -Nets and -Samples. Range Spaces A range space is a pair, where is a ground set, it’s elements called points and is a family.
Trees and Markov convexity James R. Lee Institute for Advanced Study [ with Assaf Naor and Yuval Peres ] RdRd x y.
Searching on Multi-Dimensional Data
Metric Embeddings with Relaxed Guarantees Hubert Chan Joint work with Kedar Dhamdhere, Anupam Gupta, Jon Kleinberg, Aleksandrs Slivkins.
Metric embeddings, graph expansion, and high-dimensional convex geometry James R. Lee Institute for Advanced Study.
Geometric embeddings and graph expansion James R. Lee Institute for Advanced Study (Princeton) University of Washington (Seattle)
Embedding Metrics into Ultrametrics and Graphs into Spanning Trees with Constant Average Distortion Ittai Abraham, Yair Bartal, Ofer Neiman The Hebrew.
Interchanging distance and capacity in probabilistic mappings Uriel Feige Weizmann Institute.
A Nonlinear Approach to Dimension Reduction Lee-Ad Gottlieb Weizmann Institute of Science Joint work with Robert Krauthgamer TexPoint fonts used in EMF.
The Saitou&Nei Neighbor Joining Algorithm ©Shlomo Moran & Ilan Gronau.
Navigating Nets: Simple algorithms for proximity search Robert Krauthgamer (IBM Almaden) Joint work with James R. Lee (UC Berkeley)
The number of edge-disjoint transitive triples in a tournament.
June 3, 2015Windows Scheduling Problems for Broadcast System 1 Amotz Bar-Noy, and Richard E. Ladner Presented by Qiaosheng Shi.
Lecture 3: Polygon Triangulation Computational Geometry Prof. Dr. Th. Ottmann 1 Polygon Triangulation Motivation: Guarding art galleries Art gallery theorem.
Proximity algorithms for nearly-doubling spaces Lee-Ad Gottlieb Robert Krauthgamer Weizmann Institute TexPoint fonts used in EMF. Read the TexPoint manual.
Polynomial Time Approximation Scheme for Euclidian Traveling Salesman
On the Union of Cylinders in Esther Ezra Duke University On the Union of Cylinders in  3 Esther Ezra Duke University.
Network Design Adam Meyerson Carnegie-Mellon University.
Advances in Metric Embedding Theory Ofer Neiman Ittai Abraham Yair Bartal Hebrew University.
Probabilistic Methods in Coding Theory: Asymmetric Covering Codes Joshua N. Cooper UCSD Dept. of Mathematics Robert B. Ellis Texas A&M Dept. of Mathematics.
Testing Metric Properties Michal Parnas and Dana Ron.
Preference Analysis Joachim Giesen and Eva Schuberth May 24, 2006.
Almost tight bound for the union of fat tetrahedra in R 3 Esther Ezra Micha Sharir Duke University Tel-Aviv University.
Chapter 4: Straight Line Drawing Ronald Kieft. Contents Introduction Algorithm 1: Shift Method Algorithm 2: Realizer Method Other parts of chapter 4 Questions?
1 Separator Theorems for Planar Graphs Presented by Shira Zucker.
Doubling Dimension in Real-World Graphs Melitta Lorraine Geistdoerfer Andersen.
A General Approach to Online Network Optimization Problems Seffi Naor Computer Science Dept. Technion Haifa, Israel Joint work: Noga Alon, Yossi Azar,
Distance scales, embeddings, and efficient relaxations of the cut cone James R. Lee University of California, Berkeley.
Algorithms on negatively curved spaces James R. Lee University of Washington Robert Krauthgamer IBM Research (Almaden) TexPoint fonts used in EMF. Read.
Design Techniques for Approximation Algorithms and Approximation Classes.
On the union of cylinders in 3-space Esther Ezra Duke University.
1 The number of orientations having no fixed tournament Noga Alon Raphael Yuster.
1/19 Minimizing weighted completion time with precedence constraints Nikhil Bansal (IBM) Subhash Khot (NYU)
1 Embedding and Similarity Search for Point Sets under Translation Minkyoung Cho and David M. Mount University of Maryland SoCG 2008.
Topics in Algorithms 2007 Ramesh Hariharan. Tree Embeddings.
Doubling Dimension: a short survey Anupam Gupta Carnegie Mellon University Barriers in Computational Complexity II, CCI, Princeton.
Graph Partitioning using Single Commodity Flows
Dense graphs with a large triangle cover have a large triangle packing Raphael Yuster SIAM DM’10.
Lower Bounds for Embedding Edit Distance into Normed Spaces A. Andoni, M. Deza, A. Gupta, P. Indyk, S. Raskhodnikova.
What is a metric embedding?Embedding ultrametrics into R d An embedding of an input metric space into a host metric space is a mapping that sends each.
Approximating Buy-at-Bulk and Shallow-Light k-Steiner Trees Mohammad T. Hajiaghayi (CMU) Guy Kortsarz (Rutgers) Mohammad R. Salavatipour (U. Alberta) Presented.
On the Impossibility of Dimension Reduction for Doubling Subsets of L p Yair Bartal Lee-Ad Gottlieb Ofer Neiman.
Advances in Metric Embedding Theory Yair Bartal Hebrew University &Caltech UCLA IPAM 07.
Approximation Algorithms based on linear programming.
Oct 23, 2005FOCS Metric Embeddings with Relaxed Guarantees Alex Slivkins Cornell University Joint work with Ittai Abraham, Yair Bartal, Hubert Chan,
Theory of Computational Complexity Probability and Computing Chapter Hikaru Inada Iwama and Ito lab M1.
Coarse Differentiation and Planar Multiflows
Algorithms for Big Data: Streaming and Sublinear Time Algorithms
Approximating Graphs by Trees Marcin Bieńkowski University of Wrocław
Dimension reduction for finite trees in L1
Generalized Sparsest Cut and Embeddings of Negative-Type Metrics
Minimum Spanning Tree 8/7/2018 4:26 AM
Advances in Metric Embedding Theory
Chapter 5. Optimal Matchings
Sublinear Algorithmic Tools 3
Lecture 16: Earth-Mover Distance
Near-Optimal (Euclidean) Metric Compression
Yair Bartal Lee-Ad Gottlieb Hebrew U. Ariel University
Random Geometric Graph Diameter in the Unit Disk
András Sebő and Anke van Zuylen
Metric Methods and Approximation Algorithms
Dimension versus Distortion a.k.a. Euclidean Dimension Reduction
Embedding Metrics into Geometric Spaces
Lecture 15: Least Square Regression Metric Embeddings
Clustering.
Switching Lemmas and Proof Complexity
Routing in Networks with Low Doubling Dimension
Presentation transcript:

Metric Embedding with Relaxed Guarantees Ofer Neiman Ittai Abraham Yair Bartal

Embedding metric spaces Representation of the metric in a simple and structured space. Representation of the metric in a simple and structured space. Common target spaces: l p, trees (ultra-metrics). Common target spaces: l p, trees (ultra-metrics). The price of simplicity: distortion, which is the multiplicative amount by which distances can change. The price of simplicity: distortion, which is the multiplicative amount by which distances can change. Goal: find low distortion embeddings. Goal: find low distortion embeddings. A tool for approximation algorithms A tool for approximation algorithms Useful for many practical applications. Useful for many practical applications.

Metric embedding Let X,Y be metric spaces with metrics d x, d y respectively. Let X,Y be metric spaces with metrics d x, d y respectively. f : X→Y is an embedding of X into Y. f : X→Y is an embedding of X into Y. The distortion of f is the minimal α such that for some c: The distortion of f is the minimal α such that for some c:

Every metric space on n points can be embedded into Euclidean space with distortion O(log n) and dimension O(log 2 n). [Bourgain/LLR] Every metric space on n points can be embedded into Euclidean space with distortion O(log n) and dimension O(log 2 n). [Bourgain/LLR] Basic results Every metric space on n points can be embedded into a tree metric with distortion Every metric space on n points can be embedded into a tree metric with distortion [Bartal/BLMN/RR]. [Bartal/BLMN/RR].

problem The lower bounds on the distortion and the dimension are high, and grow with n. The lower bounds on the distortion and the dimension are high, and grow with n. In some cases, weaker guarantees are acceptable.. In some cases, weaker guarantees are acceptable..

Some Alternative Schemes Probabilistic embedding: considering the expected distortion. [Bartal, FRT] Probabilistic embedding: considering the expected distortion. [Bartal, FRT] Ramsey theorems: embedding a large subspace of the original metric. [BFM, BLMN] Ramsey theorems: embedding a large subspace of the original metric. [BFM, BLMN] Partial embedding: embedding all but a fraction of the distances. [KSW, ABCDGKNS] Partial embedding: embedding all but a fraction of the distances. [KSW, ABCDGKNS]

Motivation Estimating latencies (round-trip time) in the internet. Estimating latencies (round-trip time) in the internet. - the distance matrix is almost a metric. - the distance matrix is almost a metric. - embedding heuristics yield surprisingly good results... - embedding heuristics yield surprisingly good results... [Ng+Zhang ’ 02, ST ’ 03, DCKM ’ 04] [Ng+Zhang ’ 02, ST ’ 03, DCKM ’ 04] Practical network embedding requires: Practical network embedding requires: - Small number of dimensions. - Small number of dimensions. - No centralized co-ordination. - No centralized co-ordination. - Linear number of distances measurement. - Linear number of distances measurement. Finding nearest: copy of a file, service from some server, ect. Finding nearest: copy of a file, service from some server, ect.

(1-ε) partial embedding X, Y are metric spaces. X, Y are metric spaces. f : X→Y has (1-ε) partial distortion at f : X→Y has (1-ε) partial distortion at most α if there exists a set of pairs G ε such that: most α if there exists a set of pairs G ε such that: For all pairs (u,v)єG ε.

Scaling Embedding A stronger requirement is a map that will be good for all ε simultaneously. A stronger requirement is a map that will be good for all ε simultaneously. Definition: an embedding f has scaling distortion D(ε) if for any ε>0, it is an (1-ε) partial embedding with distortion D(ε). Definition: an embedding f has scaling distortion D(ε) if for any ε>0, it is an (1-ε) partial embedding with distortion D(ε).

Scaling & Average Distortion Thm: every metric space has scaling probabilistic embedding with distortion O(log(1/ε)) into trees. Thm: every metric space has scaling probabilistic embedding with distortion O(log(1/ε)) into trees. Thm: every metric space has scaling embedding with distortion O(log(1/ε)) and dimension O(log n) into Euclidean space. Thm: every metric space has scaling embedding with distortion O(log(1/ε)) and dimension O(log n) into Euclidean space. implies constant average distortion! implies constant average distortion! Applications: weighted average problems Applications: weighted average problems sparsest cut, quadratic assignment, linear arrangement, ect. sparsest cut, quadratic assignment, linear arrangement, ect.

Any λ-doubling metric space X can be embedded into l 2 with (1-ε) partial distortion [KSW]. Any λ-doubling metric space X can be embedded into l 2 with (1-ε) partial distortion [KSW]. Previous work Definition: a metric space X is called λ-doubling if for any r>0, any ball of radius r can be covered by λ balls of radius r/2. Definition: a metric space X is called λ-doubling if for any r>0, any ball of radius r can be covered by λ balls of radius r/2.

Partial embedding into trees with distortion. Partial embedding into trees with distortion. Distortion & Dimension don ’ t depend on the size of X! Distortion & Dimension don ’ t depend on the size of X! Our Results Partial embedding into l 2 with distortion and dimension O(log(1/ε)). Partial embedding into l 2 with distortion and dimension O(log(1/ε)). General theorem converting classical l p embeddings into the partial model. General theorem converting classical l p embeddings into the partial model. Tight lower bounds. Tight lower bounds. Appeared in FOCS05 together with CDGKS

Thm: Any subset-closed family of metric spaces X, that has Thm: Any subset-closed family of metric spaces X, that has for any Xє X on n points, an embedding φ:X→ l p with for any Xє X on n points, an embedding φ:X→ l p with - distortion α(n). - distortion α(n). - dimension β(n). - dimension β(n). φ can be converted into (1-ε) partial embedding of X with φ can be converted into (1-ε) partial embedding of X with - distortion - distortion - dimension - dimension Embedding into l p In practice..

(1-ε) partial embedding of any metric space into l p with distortion (1-ε) partial embedding of any metric space into l p with distortion and dimension [ Bourgain,Matousek,Bartal] and dimension [ Bourgain,Matousek,Bartal] (1-ε) partial embedding of any negative type metric (l 1 metrics) into l 2 with distortion and dimension (1-ε) partial embedding of any negative type metric (l 1 metrics) into l 2 with distortion and dimension [ARV, ALN] [ARV, ALN] (1-ε) partial embedding of any doubling metric into l p with distortion and dimension [KLMN] (1-ε) partial embedding of any doubling metric into l p with distortion and dimension [KLMN] (1-ε) partial embedding of any tree metric into l 2 with distortion (1-ε) partial embedding of any tree metric into l 2 with distortion and dimension [Matousek] and dimension [Matousek] Main Results

Definitions Let r ε (u) be the minimal radius such that |B(u,r ε (u))| ≥ εn. Let r ε (u) be the minimal radius such that |B(u,r ε (u))| ≥ εn. A pair (u,v), w.l.o.g r ε (u) ≥ r ε (v): has short distance if d(u,v) < r ε (u) has short distance if d(u,v) < r ε (u) has medium distance if r ε (u) ≤ d(u,v) < 4∙r ε (u). has medium distance if r ε (u) ≤ d(u,v) < 4∙r ε (u). has long distance if 4·r ε (u) ≤ d(u,v). has long distance if 4·r ε (u) ≤ d(u,v).

Close Distances (u,v) is a short pair. (u,v) is a short pair. Short pairs are ignored - at most εn 2. Short pairs are ignored - at most εn 2. u r ε (u) v r ε (v) r ε (w) w

Beacon Based Embedding Randomly choose beacons = B. Randomly choose beacons = B. Each point attached to nearest beacon. Each point attached to nearest beacon.

Some More Bad Points If d(u,B) > r ε (u) then If d(u,B) > r ε (u) then is bad. is bad. For each uєX : For each uєX : With probability ½ at most 2εn 2 bad pairs. With probability ½ at most 2εn 2 bad pairs. u r ε (u) v r ε (v) r ε (w) w

Partial Embedding Use the embedding φ:B→l p. Use the embedding φ:B→l p. φ has distortion guarantee of. φ has distortion guarantee of. The partial embedding is: The partial embedding is: φ(b)φ(b)φ(b)φ(b) h(u)h(u)h(u)h(u) f (u) u attached to beacon b

bvbv bubu Upper Bound We assume for the pair (u,v): - Each point has a beacon in its ball. - Each point has a beacon in its ball. - Both u,v are outside each other ’ s ball. - Both u,v are outside each other ’ s ball. - The mapping φ is a contraction. - The mapping φ is a contraction. u r ε (u) r ε (v) v

bvbv d(u,v) ≥ 4·max{r ε (u), r ε (v)} Lower Bound - Long Distances r ε (u) u r ε (v) v bubu d(b u,b v ) ≥ d(u,v)/2

u v Medium Distances?? There is a problem in this case: There is a problem in this case: r ε (u) r ε (v) u,v are attached to the same beacon!! The additional coordinates h will guarantee enough contribution.. The additional coordinates h will guarantee enough contribution..

Medium Distances With probability < ε the pair (u,v) will be smaller than half its expectation. Pairs satisfying: r ε (u) ≤d(u,v) ≤ 4r ε (u) r ε (u) ≤ d(u,v) ≤ 4r ε (u) [w.l.o.g r ε (u) ≥ r ε (v) ] [w.l.o.g r ε (u) ≥ r ε (v) ] r ε (u),0 r ε (v),0 rε(u)rε(u)rε(u)rε(u)0 rε(u)rε(u)rε(u)rε(u) In expectation ¼ of the coordinates will be r ε (u). With probability ¼ we get r ε (u) h(u)-h(v)

Medium Distances With probability ½, 2εn 2 medium pairs failed, but for the others: With probability ½, 2εn 2 medium pairs failed, but for the others: End of proof! End of proof!

Coarse Partial Embedding Another version: ignoring only the short distances (i.e., from each point to its nearest εn neighbors). the dimension increases to O(log(n)·β(1/ε)). Another version: ignoring only the short distances (i.e., from each point to its nearest εn neighbors). the dimension increases to O(log(n)·β(1/ε)).

Partial Embedding into Trees Thm: every metric space has (1-ε) partial embedding with distortion into a tree (ultra-metric). Thm: every metric space has (1-ε) partial embedding with distortion into a tree (ultra-metric).

Ultra-metrics Metric on leaves of rooted labeled tree. Metric on leaves of rooted labeled tree. 0 ≤ Δ(D) ≤ Δ(B) ≤ Δ(A). 0 ≤ Δ(D) ≤ Δ(B) ≤ Δ(A). d(x,y) = Δ(lca(x,y)). d(x,y) = Δ(lca(x,y)). d(x,y) = Δ(D). d(x,w) = Δ(B). d(w,z) = Δ(A). xyz Δ(A)Δ(A) Δ(B)Δ(B) Δ(C)Δ(C) Δ(D)Δ(D) w

Embedding into Ultra-metric Partition X into 2 sets X 1, X 2 Partition X into 2 sets X 1, X 2 Create a root labeled Δ = diam(X). Create a root labeled Δ = diam(X). The children of the root are created recursively on X 1, X 2 The children of the root are created recursively on X 1, X 2 Using induction the number of distances we ignore is Using induction the number of distances we ignore is B – bad distances for current level. B – bad distances for current level. X X1X1X1X1 X2X2X2X2 Δ X1X1X1X1 X2X2X2X2 |B|≤ ε|X 1 ||X 2 |

Take a point u such that |B(u,Δ/2)| ≤ n/2. Take a point u such that |B(u,Δ/2)| ≤ n/2. Let Let i=1,…,1/ε i=1,…,1/ε Let S i =A i+1 -A i Let S i =A i+1 -A i We need a “slim” shell… We need a “slim” shell… only distances inside the shell are distorted by more than only distances inside the shell are distorted by more than Where to Cut? A1A1A1A1 AiAiAiAi u A i+1

Case 1: |A 1 |< εn. X 1 = u, X 2 = X\{u} X 1 = u, X 2 = X\{u} Where to Cut? A1A1A1A1 AiAiAiAi Δ X\u u

Case 2: Case 2: Choose an i such that: Choose an i such that: Let X 1 =A i+½, X 2 = X\X 1 Let X 1 =A i+½, X 2 = X\X 1 A1A1A1A1 AiAiAiAi u A i+1 X1X1X1X1 X2X2X2X2 Δ

Assume by contradiction for all Assume by contradiction for all |S i | 2 > εn|A i | |S i | 2 > εn|A i | Then by induction |A i | ≥ εn·i 2. which implies |A t | ≥ n. Then by induction |A i | ≥ εn·i 2. which implies |A t | ≥ n. End of proof! End of proof! Finding Shell S i

Lower Bounds General method to obtain partial lower bounds from known classical ones. General method to obtain partial lower bounds from known classical ones. Thm: given a lower bound α for embedding a family X into a family Y : i.e. for any n there is Xє X on n points and any embedding of X requires distortion at least α(n). Thm: given a lower bound α for embedding a family X into a family Y : i.e. for any n there is Xє X on n points and any embedding of X requires distortion at least α(n). Then there is X’є X for which any (1-ε) partial embedding requires distortion Then there is X’є X for which any (1-ε) partial embedding requires distortion The family X must be nearly closed under composition!

distortion for partial embedding into trees. [Bartal/BLMN/RR]. distortion for partial embedding of doubling or l 1 metrics into l 2. [NR] distortion for probabilistic partial embedding into trees. [Bartal] Main corollaries distortion for partial embedding into l p. [LLR, Mat]

Choose Xє X such that Choose Xє X such that For each xєX create a metric C x such that For each xєX create a metric C x such that - C x є X. - C x є X. - X’ contain many “copies” of X. X’ contain many “copies” of X. Let f be a (1-ε) partial embedding that ignores the set of edges I. By definition. Let f be a (1-ε) partial embedding that ignores the set of edges I. By definition. General idea d δ X X’X’X’X’

T: vertices intersecting less than edges in I. T: vertices intersecting less than edges in I. For each xєX, choose some For each xєX, choose some v x єC x ∩T. v x єC x ∩T. For each pair (v x,v y ) find t єC y such that: For each pair (v x,v y ) find t єC y such that: Finding a copy of X in T vxvx vyvy t CxCx CyCy

Distortion of the Copy vxvxvxvx vyvyvyvy t f has distortion guarantees for both these distances Its distortion must be at least d(t,v y ) is negligible