Clustering In Large Graphs And Matrices Petros Drineas, Alan Frieze, Ravi Kannan, Santosh Vempala, V. Vinay Presented by Eric Anderson

Outline Clustering: discrete vs. continuous Singular Value Decomposition (SVD) Applying SVD to clustering Algorithm Analysis and results

Clustering Group m similar points in ℝ^n, or equivalently, group similar rows of an m x n matrix A. Here m and n are considered variable, while k (the number of clusters) is fixed. Many options for the clustering objective.

Discrete Clustering (DCP) Minimizes the sum of squared distances to k cluster centers: choose a set B of k centers minimizing Σ_i d(A_(i), B)², where d(A_(i), B) is the distance from row A_(i) to its nearest center. The optimal cluster centers are the centroids of their clusters' points. Each point belongs to one, and only one, cluster. An exact but slow algorithm based on Voronoi partitions is supplied; it is polynomial in m only when k and the dimension are fixed.
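
A minimal numpy sketch (my own illustration, not from the slides) of evaluating the DCP objective when the k centers are given as the rows of a matrix centers:

import numpy as np

def dcp_cost(A, centers):
    """Sum of squared distances from each row of A to its nearest center."""
    # Pairwise squared distances between the m rows of A and the k centers.
    d2 = ((A[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return d2.min(axis=1).sum()

# Tiny example: two well-separated groups of points in R^2.
rng = np.random.default_rng(0)
A = np.vstack([rng.normal(0, 0.1, (5, 2)), rng.normal(5, 0.1, (5, 2))])
centers = np.array([[0.0, 0.0], [5.0, 5.0]])
print(dcp_cost(A, centers))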

Continuous Clustering (CCP) Minimizes the sum of squared distances to some k-dimensional subspace V of ℝ^n: choose V with dim(V) ≤ k minimizing Σ_i d(A_(i), V)². Gives a lower bound on the optimal value of DCP, since e.g. V = span(B) for the optimal DCP centers B is itself a subspace of dimension at most k. Result: each point belongs to each cluster with some intensity. Overlap is allowed, but the intensity vectors of the clusters must be orthogonal.
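
A small numpy sketch (my own example) of the CCP objective, assuming the subspace is given by an n x k matrix V with orthonormal columns:

import numpy as np

def ccp_cost(A, V):
    """Sum of squared distances from the rows of A to the subspace spanned
    by the orthonormal columns of V (an n x k matrix)."""
    proj = A @ V @ V.T          # projection of each row onto span(V)
    return ((A - proj) ** 2).sum()

rng = np.random.default_rng(1)
A = rng.normal(size=(100, 10))
V, _ = np.linalg.qr(rng.normal(size=(10, 3)))   # a random 3-dimensional subspace
print(ccp_cost(A, V))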

CCP Continued " i, let x i be the ith cluster, an m-vector of intensities The weight of x is Require Optimal clustering of A is a set of orthonormal x 1, …, x k where x i is a maximum weight cluster of A subject to being orthogonal to x 1, …, x i-1 Orthogonality: x i T x j =0 for i≠j

More CCP Why orthogonality is needed: let v = λu + w, where u and w are orthogonal and u is the maximum-weight cluster. Since u is a top singular vector of A, the weight splits as ‖v^T A‖² = λ²‖u^T A‖² + ‖w^T A‖², so λ should be 0 for v to be of maximum weight once u has been removed; otherwise v would simply reuse part of u.

Approximating DCP with CCP Compute V from CCP, project A onto V, and solve DCP in k dimensions. The result is shown to be a 2-approximation for the full DCP (the value obtained is off from the optimum by a factor of no more than 2).
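
A hedged sketch of this pipeline in numpy: project onto the top-k singular subspace, then cluster in k dimensions. A few plain Lloyd iterations stand in for the paper's exact low-dimensional DCP solver, so the 2-approximation guarantee does not strictly carry over to this sketch.

import numpy as np

def approximate_dcp(A, k, n_iter=50, seed=0):
    """Project A onto its top-k right singular subspace, run a simple
    k-means (Lloyd) heuristic in k dimensions, and lift the centers back."""
    rng = np.random.default_rng(seed)
    _, _, Vt = np.linalg.svd(A, full_matrices=False)
    P = A @ Vt[:k].T                       # m x k projected points
    centers = P[rng.choice(len(P), k, replace=False)]
    for _ in range(n_iter):
        d2 = ((P[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = P[labels == j].mean(axis=0)
    return labels, centers @ Vt[:k]        # centers expressed back in R^n

rng = np.random.default_rng(2)
A = np.vstack([rng.normal(0, 1, (50, 20)), rng.normal(8, 1, (50, 20))])
labels, centers = approximate_dcp(A, k=2)
print(np.bincount(labels))                 # sizes of the two recovered clusters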

Frobenius Norm Definition: ‖M‖_F = sqrt(Σ_{i,j} M_{ij}²). Similar to the 2-norm for vectors; not the same as the matrix 2-norm (the largest singular value).

Singular Value Decomposition (SVD) The SVD of a matrix A is A = UΣV^T = Σ_i σ_i u_i v_i^T. The σ_i ≥ 0 (in decreasing order) are the singular values; the orthonormal columns u_i of U and v_i of V are the left and right singular vectors. Frobenius norm: ‖A‖_F² = Σ_i σ_i².
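
A quick numpy check (my own example) of the decomposition and the Frobenius-norm identity above:

import numpy as np

rng = np.random.default_rng(3)
A = rng.normal(size=(6, 4))
U, s, Vt = np.linalg.svd(A, full_matrices=False)

print(np.allclose(A, U @ np.diag(s) @ Vt))                          # A = U Σ V^T
print(np.allclose(np.linalg.norm(A, 'fro') ** 2, (s ** 2).sum()))   # ‖A‖_F² = Σ σ_i²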

Use of SVD The truncated SVD D_k = Σ_{i=1..k} σ_i u_i v_i^T minimizes the error ‖A − D‖_F over all matrices D of rank at most k. This solves CCP: if Ā is the projection of A onto a k-dimensional subspace V, then the CCP objective ‖A − Ā‖_F² is minimized by taking Ā = D_k, since Ā is of rank at most k.
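
A small numpy illustration (not from the slides) that the rank-k truncation coincides with projecting the rows of A onto the span of the top-k right singular vectors, with residual Σ_{i>k} σ_i²:

import numpy as np

rng = np.random.default_rng(4)
A = rng.normal(size=(50, 30))
k = 5
U, s, Vt = np.linalg.svd(A, full_matrices=False)

D_k = U[:, :k] @ np.diag(s[:k]) @ Vt[:k]       # best rank-k approximation
A_bar = A @ Vt[:k].T @ Vt[:k]                  # rows of A projected onto span of top-k v_i

print(np.allclose(D_k, A_bar))                                   # the two coincide
print(np.allclose(((A - D_k) ** 2).sum(), (s[k:] ** 2).sum()))   # error = Σ_{i>k} σ_i²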

Algorithm The full SVD is rather slow, especially for large matrices. Instead, choose random columns of A, forming a sampled matrix A*. Want to choose the columns so that, with D* obtained by projecting A onto the span of the first k left singular vectors of A*, ‖A − D*‖_F² ≤ ‖A − D_k‖_F² + ε‖A‖_F² for some ε > 0.

Algorithm Continued Steps: 1. Choose c > 0, ε > 0, δ < 1. Let s = 4k/(εcδ). For each i, include column A^(i) in the sampled matrix S (the A* above) with probability proportional to its squared length ‖A^(i)‖²/‖A‖_F² (length-squared sampling). 2. Form S^T S. 3. Find the top k eigenvectors p_i of S^T S, and for each i return the corresponding normalized vector S p_i (a left singular vector of S) as a cluster.
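
A hedged numpy sketch of these steps, assuming length-squared sampling of s columns with replacement and rescaling (a common variant; the exact inclusion probabilities and constants on the slide are not fully reproduced in this transcript):

import numpy as np

def sampled_cluster_vectors(A, k, s, seed=0):
    """Length-squared sample s columns of A into S, then return the top-k
    left singular vectors of S as approximate cluster (intensity) vectors."""
    rng = np.random.default_rng(seed)
    col_norms2 = (A ** 2).sum(axis=0)
    p = col_norms2 / col_norms2.sum()              # length-squared probabilities
    idx = rng.choice(A.shape[1], size=s, p=p)
    S = A[:, idx] / np.sqrt(s * p[idx])            # rescale the sampled columns
    # Top-k eigenvectors of S^T S are the top-k right singular vectors of S;
    # S @ p_i, normalized, gives the corresponding left singular vectors.
    w, P = np.linalg.eigh(S.T @ S)
    top = P[:, np.argsort(w)[::-1][:k]]
    X = S @ top
    return X / np.linalg.norm(X, axis=0)           # m-vectors, one per cluster

rng = np.random.default_rng(5)
A = rng.normal(size=(1000, 1000))
X = sampled_cluster_vectors(A, k=10, s=200)
print(X.shape)                                     # (1000, 10)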

Analysis of Algorithm It is shown that with probability at least 1 − δ, ‖A − D*‖_F² ≤ ‖A − D_k‖_F² + ε‖A‖_F². In practice, one can pick fewer columns than the bound requires. Actual method: check the error by randomly sampling elements of A − D*, and repeat with more columns if the estimate is not satisfactory. Running time: O(k³/ε⁶ + k²m/ε⁴).
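
A minimal sketch of that error check (my own construction, using uniform entry sampling as one simple choice): estimate ‖A − D*‖_F² from randomly sampled entries of the residual rather than forming it in full, given the orthonormal cluster vectors X from the previous sketch.

import numpy as np

def estimate_residual_fro2(A, X, n_samples=10000, seed=0):
    """Monte-Carlo estimate of ‖A - D*‖_F², where D* = X X^T A projects A
    onto the span of the orthonormal columns of X (the cluster vectors)."""
    rng = np.random.default_rng(seed)
    m, n = A.shape
    B = X.T @ A                                    # k x n, computed once
    rows = rng.integers(0, m, n_samples)
    cols = rng.integers(0, n, n_samples)
    # Entry (i, j) of the residual A - D*, one sampled entry at a time.
    resid = A[rows, cols] - np.einsum('sk,ks->s', X[rows], B[:, cols])
    return (resid ** 2).mean() * m * n             # scale up to the full matrix

# Usage with the sampled cluster vectors X from the previous sketch:
# err = estimate_residual_fro2(A, X)
# If err is too large relative to ε‖A‖_F², repeat with more sampled columns.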

Preliminary Results Generated 1000 x 1000 random matrices with certain singular value distributions. Distributions defined by q, the fraction of the Frobenius norm contained in the first k singular values. Checked the number of columns of A necessary to get a 3% error bound (ε = 0.03).
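
A sketch (my own assumptions, since the slide does not spell out the construction) of generating a 1000 x 1000 test matrix whose top-k singular values carry a prescribed fraction q of the squared Frobenius norm:

import numpy as np

def test_matrix(m=1000, n=1000, k=10, q=0.9, seed=0):
    """Random m x n matrix whose top-k singular values carry a fraction q
    of the squared Frobenius norm; the remaining mass is spread evenly."""
    rng = np.random.default_rng(seed)
    r = min(m, n)
    s = np.ones(r)
    # Choose the top-k value t so that k*t^2 / (k*t^2 + (r - k)) = q.
    s[:k] = np.sqrt(q * (r - k) / ((1.0 - q) * k))
    U, _ = np.linalg.qr(rng.normal(size=(m, r)))
    V, _ = np.linalg.qr(rng.normal(size=(n, r)))
    return U @ np.diag(s) @ V.T

A = test_matrix(q=0.9)
sv = np.linalg.svd(A, compute_uv=False)
print((sv[:10] ** 2).sum() / (sv ** 2).sum())      # ≈ 0.9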

Preliminary Results (results chart shown on the slide)

Conclusion A useful new definition of clusters. Good (linear in m) running time to approximate CCP. Gives a 2-approximation for DCP. A new use for the SVD.