Institute for Advanced Study, April 16 2012 Sushant Sachdeva Princeton University Joint work with Lorenzo Orecchia, Nisheeth K. Vishnoi Linear Time Graph.

Institute for Advanced Study, April 16 2012 Sushant Sachdeva Princeton University Joint work with Lorenzo Orecchia, Nisheeth K. Vishnoi Linear Time Graph Partitioning via Fast Simulation of Random walks

INTRODUCTION Theory of near-linear time algorithms

Why linear-time algorithms? Input graphs and data sets are massive. e.g. Web graph (10 11 nodes), Social networks (10 9 nodes) Super-linear algorithms are impractical. © 2004–2012 Michael K. Bergman. Need time algorithms for fundamental problems: e.g. M AX -M ATCHING, s-t M IN C UT /M AX F LOW, Balanced Graph Partitioning Approximate solutions maybe? (1- ² ) approx., time [Duan and Pettie] (1- ² ) approx., time [Duan and Pettie] (1- ² ) approx., time [Christiano et al.] (1- ² ) approx., time [Christiano et al.] Spectral approx., time [This talk] Spectral approx., time [This talk]

GRAPH PARTITIONING

Conductance Given a undirected, unweighted graph S Conductance of [Cheeger, Alon-Milman] [Spielman-Teng] allows us to approximate ¸ 2 (L) and find such a cut in time Assume G is d-regular Spectral approximation Conductance of G Fundamental quantity in Markov Chains, Riemannian Manifolds

Balanced Separator Given b, does G have a b-balanced cut of conductance < ° ? S G NP-Hard Applications to clustering, image segmentation, community detection, primitive for divide-and-conquer AlgorithmTechniqueDistinguishes ≤ ° and Running Time Recursive EigenvectorSpectral Use Cheeger to find a cut and remove it. Rinse and repeat. Can take time. Too slow. Use Cheeger to find a cut and remove it. Rinse and repeat. Can take time. Too slow.

Balanced Separator Given b, does G have a b-balanced cut of conductance < ° ? S G NP-Hard AlgorithmTechniqueDistinguishes ≤ ° and Running Time Recursive EigenvectorSpectral [Leighton-Rao]Flow [AK,OSVV,CKM + ] [Arora-Rao-Vazirani]SDP[AK,She,CKM + ] [Madry]

Balanced Separator Given b, does G have a b-balanced cut of conductance < ° ? S G NP-Hard AlgorithmTechniqueDistinguishes ≤ ° and Running Time Recursive EigenvectorSpectral [Spielman-Teng]Local Random walks [Andersen-Peres]Evolving Sets [Orecchia-Vishnoi]SDP

Balanced Separator Given b, does G have a b-balanced cut of conductance < ° ? S G NP-Hard AlgorithmTechniqueDistinguishes ≤ ° and Running Time Recursive EigenvectorSpectral [Spielman-Teng]Local Random walks [Andersen-Peres]Evolving Sets [Orecchia-Vishnoi]SDP [This talk]Simulating Random walks

Balanced Separator Given b, does G have a b-balanced cut of conductance < ° ? S G NP-Hard AlgorithmTechniqueDistinguish ≤ ° and Running Time Recursive EigenvectorSpectral [Leighton-Rao]Flow [AK,OSVV,CKM + ] [Arora-Rao-Vazirani]SDP[AK,She,CKM + ] [Madry] [This talk] Simulating random walks

Overview 1.Use continuous-time random walks to give an algorithm 2.Show how to simulate continuous-time random walks fast

Graph Cuts and Random Walks Intimate connection between Graph cuts and Random walks. [Alon-Milman][Mihail][Lovasz-Simonovits][Spielman-Teng] [Anderson-Chung-Lang] [Andersen-Peres] [Orecchia-Vishnoi][this talk]… [Mihail] Given the distribution of a random walk that has mixed, can find a cut S with in time. Usual Random Walks on a graph Page Rank Random walks Page Rank Random walks Evolving Sets Random Walks Continuous Time Random Walks

Bird’s eye view 1. Simulate random walks (one or several) 3. Else, modify the graph Regular walk / Page Rank walk / Evolving Sets / Continuous-time walks 2. Try to cut the graph using the random walks If there’s no low conductance cut, we get a certificate. If you find a balanced low conductance cut, we’re done. To remove the unbalanced low conductance cuts Remove the setAdd edges across the cut Using matchings / flows / stars Threshold cuts/ SDP rounding

Continuous Time Random Walks Thm. We can reduce Balanced Separator to simulating heat-kernel random walks with Thm. We can reduce Balanced Separator to simulating heat-kernel random walks with Given initial distribution, Random walk of length l ~ Poisson(t) Heat-Kernel random walk Why heat-kernel random walk? e -x is a nice function. Track progress using Golden Thompson inequality [AK] Don’t know such a result for other random walks Techniques from SDPs[OV], exponential updates aka Matrix Mult. Weights [AK] Previous results had a poly(1/ ° ) dependence

Algorithm We j An embedding in Random walk from vertex i Case 1: All vectors are small Long vectors are random walks that haven’t mixed We i All random walks have mixed well. No small cuts. Ensures mixing across cuts with conductance more than °

Algorithm Case 2: Walks have not mixed We j We i Random Projection + Threshold Cut “Cheeger Rounding” If we find a balanced cut of small conductance, we’re done. ELSE

Algorithm Case 2b: Walks have not mixed (and we didn’t find a balanced cut) We j We i We can find a ball cut S S S is the union of “all unbalanced low conductance cuts”

Algorithm Case 2b: Walks have not mixed (and we didn’t find a balanced cut) We j We i We can find a ball cut S S Add edges from all vertices outside the ball to all inside Soft removal of unbalanced cuts O(log n) rounds For efficiency, we use JL lemma and work with O(log n) dimensional embeddings. [AK] For efficiency, we use JL lemma and work with O(log n) dimensional embeddings. [AK] Goal: Given L, ¿,v. Compute Goal: Given L, ¿,v. Approximate

SIMULATING RANDOM WALKS

Approximating Heat-Kernel Goal: Given ¿, L, v, ±, find u s.t. Goal: Given ¿, L, v, ±, find u s.t. Taylor Series Approx.? Need Too Slow Heat-Kernel random walk required by algorithm Small error suffices for the algorithm Used in previous algorithms

Approximating Heat-Kernel Better polynomials? YES!! [Thm] There exists a polynomial p of degree such that [Thm] There exists a polynomial p of degree such that It suffices to approximate e -x on [Thm] Given G that has a balanced cut of conductance, we can find one of conductance in time Better guarantee, same running time as [Andersen-Peres] Still not near-linear! Goal: Given ¿, L, v, ±, find u s.t. Goal: Given ¿, L, v, ±, find u s.t.

Approximating Heat-Kernel [Thm] Any polynomial such that requires degree [Thm] Any polynomial such that requires degree Even better polynomials? Not Really! Simple proof using Markov’s theorem Simple proof using Markov’s theorem Goal: Given ¿, L, v, ±, find u s.t. Goal: Given ¿, L, v, ±, find u s.t.

Beyond Polynomials Rational Functions! [Saff-Schönhage-Varga] There is a degree k polynomial p k such that Infinite Interval! Geometric Decay! Need degree Geometric Decay! Need degree Assume: 1. We knew p k explicitly 2. ST computation was exact Assume: 1. We knew p k explicitly 2. ST computation was exact We’re DONE! Two Issues: 1. [SSV] result is existential 2. ST computation is approximate Two Issues: 1. [SSV] result is existential 2. ST computation is approximate Approximating up to error ± Approximating up to error ± Computing vectors Can use [Spielman-Teng]! [Thm] Given ¿, L, v, ±, we can compute u such that in time [Thm] Given ¿, L, v, ±, we can compute u such that in time [Spielman-Teng] (Informal) We can approximate (cI+L) -1 y in time. [Spielman-Teng] (Informal) We can approximate (cI+L) -1 y in time.

LANCZOS’ METHOD “But we don’t know the polynomial.”

Lanczos’ Method Let Define f so that Goal: Approximate f(B)v as well as any degree k poly. Lanczos Method Existence of a good approximate poly suffices Small degree implies efficiency From real approximation to matrices

Lanczos’ Method Goal: Approximate f(B)v as well as any degree k poly. Observe p – any degree k polynomial Krylov Subspace · of order k Let be an orthonormal basis for · Let T be the operator B restricted to · (k+1) x (k+1) matrix For any degree k polynomial p, For any degree k polynomial p, Let Projection on · Our guess: T is a much smaller matrix!

Error Guarantee Goal: Bound the norm of the error Let the error be Equal! Since T is B restricted to ·, Spectrum(T) is bounded by Spectrum(B) Since T is B restricted to ·, Spectrum(T) is bounded by Spectrum(B) * error of best degree k polynomial approximation * error of best degree k polynomial approximation Cost = k mult. by B + Construct basis + Diagonalize T for our setting Cost = k mult. by B + Construct basis + Diagonalize T for our setting Suppose p(x) is a good degree k approximation to f Bounded by r on Spectrum(B) Bounded by r on Spectrum(T) Don’t even need to know the polynomial!

HANDLING APPROXIMATE COMPUTATION What about the error?

We bound the error Why Lanczos is insufficient Multiplication with is only approximate Multiplication with is only approximate Subspace · is approximate. & Operator T is approximate Subspace · is approximate. & Operator T is approximate T is not even symmetric! We lose nice spectral properties [ST] computation is only approximate We symmetrize. Compute approximation with We symmetrize. Compute approximation with We needed to compute an orthonormal basis for

Sketch of Error Analysis Bound Polynomial part Error part Rational approx. result Bound Bound sum of coefficients of p k Bound spectrum of Bound spectrum of Error Analysis + Lanczos method = Fast random walks Fast random walks + Algorithm = Fast Balanced Separator

Conclusions Reduced balanced separator to simulating heat-kernel random walks Approximated the heat kernel random walk in time. Can be used as primitives for designing near- linear time algorithms.

Open Questions Other applications of fast matrix exponentiation Linear-time Graph decomposition? Linear time algorithms for small set expansion? Linear time algorithm for approximation for Balanced Separator

Thank you!

An Important Tool [Christiano et al.] s-t M AX F LOW s-t M IN C UT s-t M AX F LOW s-t M IN C UT Electrical Flows Solve Linear Systems Lx=y L is a graph Laplacian Solving Laplacian Linear Systems Direct Methods Gaussian Elimination / Cholesky Decomposition Gaussian Elimination / Cholesky Decomposition Iterative methods Too slow. O(n 3 ) time Hopeless Too slow. O(n 3 ) time Hopeless Conjugate Gradient Approximate method. time Approximate method. time [Spielman-Teng] (Informal) We can approximate L -1 y in time. [Spielman-Teng] (Informal) We can approximate L -1 y in time. Other Applications – Approximating second eigenvector [Spielman-Teng], Cover times [Ding-Lee-Peres], Generating random spanning trees[Madry-Kelner], Graph Sparsification [Spielman-Srivastava] Balanced graph partitioning, Simulating continuous-time random walks [This talk] Balanced graph partitioning, Simulating continuous-time random walks [This talk]

Beyond Polynomials Rational Functions! [Saff-Schönhage-Varga] There is a degree k polynomial p k such that Infinite Interval! Geometric Decay! Small required degree. Geometric Decay! Small required degree. L 1 bound on [0, 1 ) L 1 bound on [-1,1] L 2 bound on [-1,1] Change IntervalWrite function as an integralCauchy Schwarz Assume: 1. We knew p k explicitly 2. ST Solver was exact Assume: 1. We knew p k explicitly 2. ST Solver was exact We’re DONE! Two Issues: 1. We don’t know p k explicitly 2. ST Solver is approximate Two Issues: 1. We don’t know p k explicitly 2. ST Solver is approximate

Faster Simulation Polynomials don’t suffice, then, how do we simulate the random walk fast?

Lanczos Method Goal: Given symmetric B, v, k, and function f; approximate f(B)v as well as any degree k poly Goal: Given symmetric B, v, k, and function f; approximate f(B)v as well as any degree k poly Observe p – any degree k polynomial Krylov Subspace · of order k Let be an orthonormal basis for · k-Identity Projection on · Let For any degree k polynomial p, For any degree k polynomial p, Our guess: T is a much smaller matrix!

Error Guarantee Goal: Bound the norm of the error Write Any degree k polynomial Equal! Since and the columns of V are orthnormal, The columns of V are orthnormal True for any p!! As good as the best degree k polynomial! Cost = k mult. by B + Construct basis + Diagonalize T if B is a Laplacian Cost = k mult. by B + Construct basis + Diagonalize T if B is a Laplacian

Using Lanczos for our Algorithm Goal: Given A, wanted to approximate as Goal: Given A, wanted to approximate as Use Lanczos with suffices. Still need to address the issue of approximate multiplication with

Institute for Advanced Study, April 16 2012 Sushant Sachdeva Princeton University Joint work with Lorenzo Orecchia, Nisheeth K. Vishnoi Linear Time Graph.

Similar presentations

Presentation on theme: "Institute for Advanced Study, April 16 2012 Sushant Sachdeva Princeton University Joint work with Lorenzo Orecchia, Nisheeth K. Vishnoi Linear Time Graph."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Institute for Advanced Study, April 16 2012 Sushant Sachdeva Princeton University Joint work with Lorenzo Orecchia, Nisheeth K. Vishnoi Linear Time Graph.

Similar presentations

Presentation on theme: "Institute for Advanced Study, April 16 2012 Sushant Sachdeva Princeton University Joint work with Lorenzo Orecchia, Nisheeth K. Vishnoi Linear Time Graph."— Presentation transcript:

Similar presentations

About project

Feedback