Sampling in Graphs Alexandr Andoni (Microsoft Research)

Slides:



Advertisements
Similar presentations
Please correct your homework as efficiently as possible so that we have plenty of time to get through the lesson.
Advertisements

Numerical Linear Algebra in the Streaming Model Ken Clarkson - IBM David Woodruff - IBM.
Approximate Max-integral-flow/min-cut Theorems Kenji Obata UC Berkeley June 15, 2004.
Spectral Approaches to Nearest Neighbor Search arXiv: Robert Krauthgamer (Weizmann Institute) Joint with: Amirali Abdullah, Alexandr Andoni, Ravi.
Overcoming the L 1 Non- Embeddability Barrier Robert Krauthgamer (Weizmann Institute) Joint work with Alexandr Andoni and Piotr Indyk (MIT)
Value Iteration & Q-learning CS 5368 Song Cui. Outline Recap Value Iteration Q-learning.
Computing Classic Closeness Centrality, at Scale Edith Cohen Joint with: Thomas Pajor, Daniel Delling, Renato Werneck Microsoft Research.
Multicut Lower Bounds via Network Coding Anna Blasiak Cornell University.
Spectral Approaches to Nearest Neighbor Search Alex Andoni Joint work with:Amirali Abdullah Ravi Kannan Robi Krauthgamer.
Graph Sparsifiers by Edge-Connectivity and Random Spanning Trees Nick Harvey U. Waterloo Department of Combinatorics and Optimization Joint work with Isaac.
Graph Sparsifiers: A Survey Nick Harvey Based on work by: Batson, Benczur, de Carli Silva, Fung, Hariharan, Harvey, Karger, Panigrahi, Sato, Spielman,
Graph Sparsifiers: A Survey Nick Harvey UBC Based on work by: Batson, Benczur, de Carli Silva, Fung, Hariharan, Harvey, Karger, Panigrahi, Sato, Spielman,
Graph Sparsifiers by Edge-Connectivity and Random Spanning Trees Nick Harvey University of Waterloo Department of Combinatorics and Optimization Joint.
Graph Sparsifiers by Edge-Connectivity and Random Spanning Trees Nick Harvey U. Waterloo C&O Joint work with Isaac Fung TexPoint fonts used in EMF. Read.
On Sketching Quadratic Forms Robert Krauthgamer, Weizmann Institute of Science Joint with: Alex Andoni, Jiecao Chen, Bo Qin, David Woodruff and Qin Zhang.
Probably Approximately Correct Learning Yongsub Lim Applied Algorithm Laboratory KAIST.
1 Def: Let and be random variables of the discrete type with the joint p.m.f. on the space S. (1) is called the mean of (2) is called the variance of (3)
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 8 May 4, 2005
1 BSMOR: Block Structure-preserving Model Order Reduction http//:eda.ee.ucla.edu Hao Yu, Lei He Electrical Engineering Dept., UCLA Sheldon S.D. Tan Electrical.
1 University of Freiburg Computer Networks and Telematics Prof. Christian Schindelhauer Mobile Ad Hoc Networks Theory of Data Flow and Random Placement.
4. Convergence of random variables  Convergence in probability  Convergence in distribution  Convergence in quadratic mean  Properties  The law of.
Parametric Inference.
Presented by Ozgur D. Sahin. Outline Introduction Neighborhood Functions ANF Algorithm Modifications Experimental Results Data Mining using ANF Conclusions.
Right Protection via Watermarking with Provable Preservation of Distance-based Mining Spyros Zoumpoulis Joint work with Michalis Vlachos, Nick Freris and.
Approximating the MST Weight in Sublinear Time Bernard Chazelle (Princeton) Ronitt Rubinfeld (NEC) Luca Trevisan (U.C. Berkeley)
Primal-Dual Meets Local Search: Approximating MST’s with Non-uniform Degree Bounds Author: Jochen Könemann R. Ravi From CMU CS 3150 Presentation by Dan.
Graph Sparsifiers Nick Harvey University of British Columbia Based on joint work with Isaac Fung, and independent work of Ramesh Hariharan & Debmalya Panigrahi.
Geometric Problems in High Dimensions: Sketching Piotr Indyk.
July The Mathematical Challenge of Large Networks László Lovász Eötvös Loránd University, Budapest
Streaming Algorithms Piotr Indyk MIT. Data Streams A data stream is a sequence of data that is too large to be stored in available memory Examples: –Network.
Graph Sparsifiers Nick Harvey Joint work with Isaac Fung TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A A.
The minimum cost flow problem. Simplifying assumptions Network is connected (as an undirected graph). – We can consider each connected component separately.
Spanning and Sparsifying Rajmohan Rajaraman Northeastern University, Boston May 2012 Chennai Network Optimization WorkshopSpanning and Sparsifying1.
Spectrally Thin Trees Nick Harvey University of British Columbia Joint work with Neil Olver (MIT  Vrije Universiteit) TexPoint fonts used in EMF. Read.
Computing Classic Closeness Centrality, at Scale Edith Cohen Joint with: Thomas Pajor, Daniel Delling, Renato Werneck.
MAT 1236 Calculus III Section 15.3 Part II Double Integrals Over General Regions
Estimating PageRank on Graph Streams Atish Das Sarma (Georgia Tech) Sreenivas Gollapudi, Rina Panigrahy (Microsoft Research)
Nondeterministic property testing László Lovász Katalin Vesztergombi.
Sketching complexity of graph cuts Alexandr Andoni joint work with: Robi Krauthgamer, David Woodruff.
Energy Models for Graph Clustering Bo-Young Kim Applied Algorithm Lab, KAIST.
DOULION: Counting Triangles in Massive Graphs with a Coin
Alexander Shekhovtsov and Václav Hlaváč
Resparsification of Graphs
Approximating the MST Weight in Sublinear Time
Lecture 18: Uniformity Testing Monotonicity Testing
Sublinear Algorithmic Tools 3
Density Independent Algorithms for Sparsifying
MST in Log-Star Rounds of Congested Clique
COMS E F15 Lecture 2: Median trick + Chernoff, Distinct Count, Impossibility Results Left to the title, a presenter can insert his/her own image.
CIS 700: “algorithms for Big Data”
Transformations of graphs
15.082J & 6.855J & ESD.78J Visualizations
Lecture 6 Shortest Path Problem.
Dijkstra’s Algorithm for the Shortest Path Problem
Design of Hierarchical Classifiers for Efficient and Accurate Pattern Classification M N S S K Pavan Kumar Advisor : Dr. C. V. Jawahar.
Transformations of curves
Partitioning and decomposing graphs László Lovász
CSCI B609: “Foundations of Data Science”
Sampling in Graphs: node sparsifiers
Around the Regularity Lemma
Cryptographic Hash Functions Part I
Lecture 6: Counting triangles Dynamic graphs & sampling
and 6.855J Dijkstra’s Algorithm
Visualizations Dijkstra’s Algorithm
Introduction to Maximum Flows
15.082J & 6.855J & ESD.78J Visualizations
The Minimum Cost Spanning Tree Problem
President’s Day Lecture: Advanced Nearest Neighbor Search
Introduction to Maximum Flows
15.082J & 6.855J & ESD.78J Visualizations
Presentation transcript:

Sampling in Graphs Alexandr Andoni (Microsoft Research)

Graph compression Why smaller graphs? use less storage space faster algorithms easier visualization

Preserve some structure Cuts approximately Other properties: Distances, (multi-commodity) flows, effective resistances…

Plan 1) Cut sparsifiers 2) More efficient cut sparsifiers 3) Node sparsifiers

Cut sparsifiers

Approach? [Karger’94,’96]:

Concentration

Applying Chernoff bound

Enough?

Smaller size?

Non-uniform sampling [Benczur-Karger’96]

Strong connectivity Connectivity: 5 Strong conn.: 2

Proof of theorem

ii) Cut values are approximated

Iterative sampling

Comments

BREAK

Smaller relaxed cut sparsifiers [A-Krauthgamer-Woodruff’14]:

Motivating example

Proof of theorem

i) Sketch description

ii) Sketch size ???

iii) Estimation

Estimation illustration dense components

iii) Correctness of estimation

Variance

Dense component estimate

Concluding remarks

Open questions