Download presentation
Presentation is loading. Please wait.
Published byMoris Paul Modified over 9 years ago
1
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 1 Graph Analytics Workshop: Tools Christos Faloutsos CMU
2
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 2 Welcome ! TueWeThu 9:00-10:30ToolsLaplaciansParallelism 11:00-12:30NELLRich graphsCommunities 1:30-3:00ExercisesPanelScalability 3:30-5:00Graph. modelsPostersGraph ‘Laws’ Reception
3
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 3 Roadmap Introduction – Motivation Task 1: Node importance Task 2: Community detection Task 3: Mining graphs over time – Tensors Task 4: Theory – intro to Laplacians Conclusions
4
CMU SCS C. Faloutsos (CMU) 4 Graphs - why should we care? Internet Map [lumeta.com] Food Web [Martinez ’91] >$10B revenue >0.5B users Graph Analytics wkshp
5
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 5 Graphs - why should we care? IR: bi-partite graphs (doc-terms) ‘NELL’: ‘ merkel ’ ‘ chancellor ’ ‘ germany ’ - facts -> tensors web: hyper-text graph... and more: D1D1 DNDN T1T1 TMTM...
6
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 6 Graphs - why should we care? ‘viral’ marketing web-log (‘blog’) news propagation computer network security: email/IP traffic and anomaly detection.... Any M:N relationship -> Graph Any subject-verb-object construct: -> Graph/Tensor
7
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 7 Graphs and matrices Closely related Powerful tools from matrix algebra, for graph mining
8
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 8 Examples of Matrices: Graph - social network John PeterMaryNick... John Peter Mary Nick... 0112255... 5067
9
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 9 Examples of Matrices: Market basket market basket as in Association Rules milkbreadchoc.wine... John Peter Mary Nick...
10
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 10 Examples of Matrices: Documents and terms Paper#1 Paper#2 Paper#3 Paper#4 dataminingclassif.tree...
11
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 11 Examples of Matrices: Authors and terms dataminingclassif.tree... John Peter Mary Nick...
12
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 12 Roadmap Introduction – Motivation Task 1: Node importance Task 2: Community detection Task 3: Mining graphs over time – Tensors Task 4: Theory – intro to Laplacians Conclusions
13
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 13 Node importance - Motivation: Given a graph (eg., web pages containing the desirable query word) Q: Which node is the most important?
14
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 14 Node importance - Motivation: Given a graph (eg., web pages containing the desirable query word) Q: Which node is the most important? A1: HITS (SVD = Singular Value Decomposition) A2: eigenvector (PageRank) ‘I am important, if my friends are important’ -> Fixed point / eigenvector
15
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 15 Node importance - motivation SVD and eigenvector analysis: very closely related
16
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 16 Roadmap Introduction – Motivation Task 1: Node importance Task 2: Community detection Task 3: Mining graphs over time – Tensors Task 4: Theory – intro to Laplacians Conclusions
17
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 17 Task 1 - SVD - Detailed outline Motivation Definition - properties Interpretation Complexity Case Studies –HITS –PageRank
18
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 18 SVD - Motivation problem #1: text - LSI: find ‘concepts’ problem #2: compression / dim. reduction
19
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 19 SVD - Motivation problem #1: text - LSI: find ‘concepts’
20
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 20 SVD - Motivation Customer-product, for recommendation system: bread lettuce beef vegetarians meat eaters tomatos chicken
21
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 21 SVD - Motivation problem #2: compress / reduce dimensionality
22
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 22 Problem - specs Visualize customers
23
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 23 SVD - Motivation
24
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 24 SVD - Motivation
25
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 25 Task 1 - SVD - Detailed outline Motivation Definition - properties Interpretation Complexity Case Studies –HITS –PageRank
26
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 26 SVD - Definition A = U V T - example:
27
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 27 SVD - Definition A [n x m] = U [n x r] r x r] (V [m x r] ) T A: n x m matrix (eg., n documents, m terms) U: n x r matrix (n documents, r concepts) : r x r diagonal matrix (strength of each ‘concept’) (r : rank of the matrix) V: m x r matrix (m terms, r concepts)
28
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 28 SVD - Properties THEOREM [Press+92]: always possible to decompose matrix A into A = U V T, where U, V: unique (*) U, V: column orthonormal (ie., columns are unit vectors, orthogonal to each other) –U T U = I; V T V = I (I: identity matrix) : singular are positive, and sorted in decreasing order
29
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 29 SVD - Example A = U V T - example: data inf. retrieval brain lung = CS MD xx
30
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 30 SVD - Example A = U V T - example: data inf. retrieval brain lung = CS MD xx CS-concept MD-concept
31
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 31 SVD - Example A = U V T - example: data inf. retrieval brain lung = CS MD xx CS-concept MD-concept doc-to-concept similarity matrix
32
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 32 SVD - Example A = U V T - example: data inf. retrieval brain lung = CS MD xx ‘strength’ of CS-concept
33
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 33 SVD - Example A = U V T - example: data inf. retrieval brain lung = CS MD xx term-to-concept similarity matrix CS-concept
34
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 34 SVD - Example A = U V T - example: data inf. retrieval brain lung = CS MD xx term-to-concept similarity matrix CS-concept
35
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 35 Task 1 - SVD - Detailed outline Motivation Definition - properties Interpretation Complexity Case studies Additional properties
36
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 36 SVD - Interpretation #1 ‘documents’, ‘terms’ and ‘concepts’: U: document-to-concept similarity matrix V: term-to-concept sim. matrix : its diagonal elements: ‘strength’ of each concept
37
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 37 SVD – Interpretation #1 ‘documents’, ‘terms’ and ‘concepts’: Q: if A is the document-to-term matrix, what is A T A? A: Q: A A T ? A:
38
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 38 Copyright: Faloutsos, Tong (2009) 2-38 SVD – Interpretation #1 ‘documents’, ‘terms’ and ‘concepts’: Q: if A is the document-to-term matrix, what is A T A? A: term-to-term ([m x m]) similarity matrix Q: A A T ? A: document-to-document ([n x n]) similarity matrix ICDE’09
39
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 39 Copyright: Faloutsos, Tong (2009) 2-39 SVD properties V are the eigenvectors of the covariance matrix A T A U are the eigenvectors of the Gram (inner- product) matrix AA T Further reading: 1. Ian T. Jolliffe, Principal Component Analysis (2 nd ed), Springer, 2002. 2. Gilbert Strang, Linear Algebra and Its Applications (4 th ed), Brooks Cole, 2005.
40
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 40 SVD - Interpretation #2 best axis to project on: (‘best’ = min sum of squares of projection errors)
41
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 41 SVD - Motivation
42
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 42 SVD - interpretation #2 minimum RMS error SVD: gives best axis to project v1 first singular vector
43
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 43 SVD - Interpretation #2
44
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 44 SVD - Interpretation #2 A = U V T - example: = xx v1
45
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 45 SVD - Interpretation #2 A = U V T - example: = xx variance (‘spread’) on the v1 axis
46
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 46 SVD - Interpretation #2 A = U V T - example: –U gives the coordinates of the points in the projection axis = xx
47
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 47 SVD - Interpretation #2 More details Q: how exactly is dim. reduction done? = xx
48
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 48 SVD - Interpretation #2 More details Q: how exactly is dim. reduction done? A: set the smallest singular values to zero: = xx
49
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 49 SVD - Interpretation #2 ~ xx
50
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 50 SVD - Interpretation #2 ~ xx
51
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 51 SVD - Interpretation #2 ~ xx
52
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 52 SVD - Interpretation #2 ~
53
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 53 SVD - Interpretation #2 Exactly equivalent: ‘spectral decomposition’ of the matrix: = xx
54
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 54 SVD - Interpretation #2 Exactly equivalent: ‘spectral decomposition’ of the matrix: = xx u1u1 u2u2 1 2 v1v1 v2v2
55
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 55 SVD - Interpretation #2 Exactly equivalent: ‘spectral decomposition’ of the matrix: =u1u1 1 vT1vT1 u2u2 2 vT2vT2 + +... n m
56
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 56 SVD - Interpretation #2 Exactly equivalent: ‘spectral decomposition’ of the matrix: =u1u1 1 vT1vT1 u2u2 2 vT2vT2 + +... n m n x 1 1 x m r terms
57
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 57 SVD - Interpretation #2 approximation / dim. reduction: by keeping the first few terms (Q: how many?) =u1u1 1 vT1vT1 u2u2 2 vT2vT2 + +... n m assume: 1 >= 2 >=...
58
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 58 SVD - Interpretation #2 A (heuristic - [Fukunaga]): keep 80-90% of ‘energy’ (= sum of squares of i ’s) =u1u1 1 vT1vT1 u2u2 2 vT2vT2 + +... n m assume: 1 >= 2 >=...
59
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 59 Pictorially: matrix form of SVD –Best rank-k approximation in L2 A m n m n U VTVT
60
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 60 Pictorially: Spectral form of SVD –Best rank-k approximation in L2 A m n + 1u1v11u1v1 2u2v22u2v2
61
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 61 Task 1 - SVD - Detailed outline Motivation Definition - properties Interpretation –#1: documents/terms/concepts –#2: dim. reduction –#3: picking non-zero, rectangular ‘blobs’ Complexity Case studies Additional properties
62
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 62 SVD - Interpretation #3 finds non-zero ‘blobs’ in a data matrix = xx
63
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 63 SVD - Interpretation #3 finds non-zero ‘blobs’ in a data matrix = xx
64
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 64 SVD - Interpretation #3 finds non-zero ‘blobs’ in a data matrix = ‘communities’ (bi-partite cores, here) Row 1 Row 4 Col 1 Col 3 Col 4 Row 5 Row 7
65
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 65 Task 1 - SVD - Detailed outline Motivation Definition - properties Interpretation Complexity Case Studies –HITS –PageRank
66
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 66 SVD - Complexity O( n * m * m) or O( n * n * m) (whichever is less) less work, if we just want singular values or if we want first k singular vectors or if the matrix is sparse [Berry] Implemented: in any linear algebra package (LINPACK, matlab, Splus, mathematica...)
67
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 67 SVD - conclusions so far SVD: A= U V T : unique (*) U: document-to-concept similarities V: term-to-concept similarities : strength of each concept dim. reduction: keep the first few strongest singular values (80-90% of ‘energy’) –SVD: picks up linear correlations SVD: picks up non-zero ‘blobs’
68
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 68 Task 1 - SVD - Detailed outline Motivation Definition - properties Interpretation Complexity Case Studies –HITS –PageRank
69
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 69 Kleinberg’s algo (HITS) Kleinberg, Jon (1998). Authoritative sources in a hyperlinked environment. Proc. 9th ACM-SIAM Symposium on Discrete Algorithms.
70
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 70 Recall: problem dfn Given a graph (eg., web pages containing the desirable query word) Q: Which node is the most important?
71
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 71 Kleinberg’s algorithm Problem dfn: given the web and a query find the most ‘authoritative’ web pages for this query Step 0: find all pages containing the query terms Step 1: expand by one move forward and backward
72
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 72 Kleinberg’s algorithm Step 1: expand by one move forward and backward
73
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 73 Kleinberg’s algorithm on the resulting graph, give high score (= ‘authorities’) to nodes that many important nodes point to give high importance score (‘hubs’) to nodes that point to good ‘authorities’) hubsauthorities
74
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 74 Kleinberg’s algorithm observations recursive definition! each node (say, ‘i’-th node) has both an authoritativeness score a i and a hubness score h i
75
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 75 Kleinberg’s algorithm Let E be the set of edges and A be the adjacency matrix: the (i,j) is 1 if the edge from i to j exists Let h and a be [n x 1] vectors with the ‘hubness’ and ‘authoritativiness’ scores. Then:
76
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 76 Kleinberg’s algorithm Then: a i = h k + h l + h m that is a i = Sum (h j ) over all j that (j,i) edge exists or a = A T h k l m i
77
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 77 Kleinberg’s algorithm symmetrically, for the ‘hubness’: h i = a n + a p + a q that is h i = Sum (q j ) over all j that (i,j) edge exists or h = A a p n q i
78
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 78 Kleinberg’s algorithm In conclusion, we want vectors h and a such that: h = A a a = A T h SVD properties: A [n x m] v 1 [m x 1] = 1 u 1 [n x 1] u 1 T A = 1 v 1 T =
79
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 79 Kleinberg’s algorithm In short, the solutions to h = A a a = A T h are the left- and right- singular-vectors of the adjacency matrix A. Starting from random a’ and iterating, we’ll eventually converge (Q: to which of all the singular-vectors? why?)
80
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 80 Kleinberg’s algorithm (Q: to which of all the singular-vectors? why?) A: to the ones of the strongest singular-value: (A T A ) k v’ ~ (constant) v 1
81
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 81 Kleinberg’s algorithm - results Eg., for the query ‘java’: 0.328 www.gamelan.com 0.251 java.sun.com 0.190 www.digitalfocus.com (“the java developer”)
82
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 82 Kleinberg’s algorithm - discussion ‘authority’ score can be used to find ‘similar pages’ (how?)
83
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 83 Task 1 - SVD - Detailed outline Motivation Definition - properties Interpretation Complexity Case Studies –HITS –PageRank
84
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 84 PageRank (google) Brin, Sergey and Lawrence Page (1998). Anatomy of a Large-Scale Hypertextual Web Search Engine. 7th Intl World Wide Web Conf. Larry Page Sergey Brin
85
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 85 Problem: PageRank Given a directed graph, find its most interesting/central node A node is important, if it is connected with important nodes (recursive, but OK!)
86
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 86 Problem: PageRank - solution Given a directed graph, find its most interesting/central node Proposed solution: Random walk; spot most ‘popular’ node (-> steady state prob. (ssp)) A node has high ssp, if it is connected with high ssp nodes (recursive, but OK!)
87
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 87 (Simplified) PageRank algorithm Let A be the adjacency matrix; let B be the transition matrix: transpose, column-normalized - then 1 2 3 4 5 = To From B
88
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 88 (Simplified) PageRank algorithm B p = p = 1 2 3 4 5
89
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 89 Definitions AAdjacency matrix (from-to) DDegree matrix = (diag ( d1, d2, …, dn) ) BTransition matrix: to-from, column normalized B = A T D -1
90
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 90 (Simplified) PageRank algorithm B p = 1 * p thus, p is the eigenvector that corresponds to the highest eigenvalue (=1, since the matrix is column-normalized ) Why does such a p exist? –p exists if B is nxn, nonnegative, irreducible [Perron–Frobenius theorem]
91
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 91 (Simplified) PageRank algorithm In short: imagine a particle randomly moving along the edges compute its steady-state probabilities (ssp) Full version of algo: with occasional random jumps Why? To make the matrix irreducible
92
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 92 Full Algorithm With probability 1-c, fly-out to a random node Then, we have p = c B p + (1-c)/n 1 => p = (1-c)/n [I - c B] -1 1
93
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 93 Alternative notation MModified transition matrix M = c B + (1-c)/n 1 1 T Then p = M p That is: the steady state probabilities = PageRank scores form the first eigenvector of the ‘modified transition matrix’
94
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 94 Parenthesis: intuition behind eigenvectors
95
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 95 Formal definition If A is a (n x n) square matrix , x) is an eigenvalue/eigenvector pair of A if A x = x CLOSELY related to singular values:
96
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 96 Property #1: Eigen- vs singular-values if B [n x m] = U [n x r] r x r] (V [m x r] ) T then A = ( B T B ) is symmetric and C(4): B T B v i = i 2 v i ie, v 1, v 2,...: eigenvectors of A = (B T B)
97
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 97 Property #2 If A [nxn] is a real, symmetric matrix Then it has n real eigenvalues (if A is not symmetric, some eigenvalues may be complex)
98
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 98 Property #3 If A [nxn] is a real, symmetric matrix Then it has n real eigenvalues And they agree with its n singular values, except possibly for the sign
99
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 99 Intuition A as vector transformation Axx’ = x 2 1 1 3
100
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 100 Intuition By defn., eigenvectors remain parallel to themselves (‘fixed points’) Av1v1 v1v1 = 3.62 * 1
101
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 101 Convergence Usually, fast:
102
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 102 Convergence Usually, fast:
103
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 103 Convergence Usually, fast: depends on ratio 1 : 2 1 2
104
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 104 Kleinberg/google - conclusions SVD helps in graph analysis: hub/authority scores: strongest left- and right- singular-vectors of the adjacency matrix random walk on a graph: steady state probabilities are given by the strongest eigenvector of the (modified) transition matrix
105
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 105 Conclusions SVD: a valuable tool given a document-term matrix, it finds ‘concepts’ (LSI)... and can find fixed-points or steady-state probabilities (google/ Kleinberg/ Markov Chains)
106
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 106 Conclusions cont’d (We didn’t discuss/elaborate, but, SVD... can reduce dimensionality (KL)... and can find rules (PCA; RatioRules)... and can solve optimally over- and under- constraint linear systems (least squares / query feedbacks)
107
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 107 References Berry, Michael: http://www.cs.utk.edu/~lsi/http://www.cs.utk.edu/~lsi/ Brin, S. and L. Page (1998). Anatomy of a Large-Scale Hypertextual Web Search Engine. 7th Intl World Wide Web Conf.
108
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 108 References Christos Faloutsos, Searching Multimedia Databases by Content, Springer, 1996. (App. D)Searching Multimedia Databases by Content Fukunaga, K. (1990). Introduction to Statistical Pattern Recognition, Academic Press. I.T. Jolliffe Principal Component Analysis Springer, 2002 (2 nd ed.)
109
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 109 References cont’d Kleinberg, J. (1998). Authoritative sources in a hyperlinked environment. Proc. 9th ACM-SIAM Symposium on Discrete Algorithms. Press, W. H., S. A. Teukolsky, et al. (1992). Numerical Recipes in C, Cambridge University Press. www.nr.com
110
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 110 PART 2: Communities
111
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 111 Roadmap Introduction – Motivation Task 1: Node importance Task 2: Community detection Task 3: Mining graphs over time – Tensors Task 4: Theory – intro to Laplacians Conclusions
112
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 112 Task 2 – Communities - Detailed outline Motivation Hard clustering – k pieces Hard clustering – optimal # pieces Observations
113
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 113 Problem Given a graph, and k Break it into k (disjoint) communities
114
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 114 Problem Given a graph, and k Break it into k (disjoint) communities k = 2
115
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 115 Solution #1: METIS Arguably, the best algorithm Open source, at –http://www.cs.umn.edu/~metishttp://www.cs.umn.edu/~metis and *many* related papers, at same url Main idea: –coarsen the graph; –partition; –un-coarsen
116
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 116 Solution #1: METIS G. Karypis and V. Kumar. METIS 4.0: Unstructured graph partitioning and sparse matrix ordering system. TR, Dept. of CS, Univ. of Minnesota, 1998.
117
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 117 Solution #2 (problem: hard clustering, k pieces) Spectral partitioning: Consider the 2 nd smallest eigenvector of the (normalized) Laplacian See details in ‘Task 7’, later
118
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 118 Solutions #3, … Many more ideas: Clustering on the A 2 (square of adjacency matrix) [Zhou, Woodruff, PODS’04] Minimum cut / maximum flow [Flake+, KDD’00] …
119
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 119 Task 2 – Communities - Detailed outline Motivation Hard clustering – k pieces Hard clustering – optimal # pieces Observations
120
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 120 Cross-association Desiderata: Simultaneously discover row and column groups Fully Automatic: No “magic numbers” Scalable to large matrices Reference: 1.Chakrabarti et al. Fully Automatic Cross-Associations, KDD’04
121
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 121 What makes a cross-association “good”? versus Column groups Row groups Why is this better?
122
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 122 What makes a cross-association “good”? versus Column groups Row groups Why is this better? simpler; easier to describe easier to compress!
123
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 123 What makes a cross-association “good”? Problem definition: given an encoding scheme decide on the # of col. and row groups k and l and reorder rows and columns, to achieve best compression
124
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 124 Main Idea size i * H(x i ) + Cost of describing cross-associations Code Cost Description Cost ΣiΣi Total Encoding Cost = Good Compression Better Clustering Minimize the total cost (# bits) for lossless compression
125
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 125 Algorithm k = 5 row groups k=1, l=2 k=2, l=2 k=2, l=3 k=3, l=3 k=3, l=4 k=4, l=4 k=4, l=5 l = 5 col groups
126
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 126 Experiments “CLASSIC” 3,893 documents 4,303 words 176,347 “dots” Combination of 3 sources: MEDLINE (medical) CISI (info. retrieval) CRANFIELD (aerodynamics) Documents Words
127
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 127 Experiments “CLASSIC” graph of documents & words: k=15, l=19 Documents Words
128
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 128 Experiments “CLASSIC” graph of documents & words: k=15, l=19 MEDLINE (medical) insipidus, alveolar, aortic, death, prognosis, intravenous blood, disease, clinical, cell, tissue, patient
129
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 129 Experiments “CLASSIC” graph of documents & words: k=15, l=19 CISI (Information Retrieval) providing, studying, records, development, students, rules abstract, notation, works, construct, bibliographies MEDLINE (medical)
130
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 130 Experiments “CLASSIC” graph of documents & words: k=15, l=19 CRANFIELD (aerodynamics) shape, nasa, leading, assumed, thin CISI (Information Retrieval) MEDLINE (medical)
131
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 131 Experiments “CLASSIC” graph of documents & words: k=15, l=19 paint, examination, fall, raise, leave, based CRANFIELD (aerodynamics) CISI (Information Retrieval) MEDLINE (medical)
132
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 132 Algorithm Code for cross-associations (matlab): www.cs.cmu.edu/~deepay/mywww/software/CrossAssociations -01-27-2005.tgz Variations and extensions: ‘Autopart’ [Chakrabarti, PKDD’04] www.cs.cmu.edu/~deepay
133
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 133 Algorithm Hadoop implementation [ICDM’08] Spiros Papadimitriou, Jimeng Sun: DisCo: Distributed Co-clustering with Map- Reduce: A Case Study towards Petabyte-Scale End-to-End Mining. ICDM 2008: 512-521
134
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 134 Task 2 – Communities - Detailed outline Motivation Hard clustering – k pieces Hard clustering – optimal # pieces Observations
135
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 135 Observation #1 Skewed degree distributions – there are nodes with huge degree (>O(10^4), in facebook/linkedIn popularity contests!)
136
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 136 Observation #2 Maybe there are no good cuts: ``jellyfish’’ shape [Tauro+’01], [Siganos+,’06], strange behavior of cuts [Chakrabarti+’04], [Leskovec+,’08]
137
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 137 Observation #2 Maybe there are no good cuts: ``jellyfish’’ shape [Tauro+’01], [Siganos+,’06], strange behavior of cuts [Chakrabarti+,’04], [Leskovec+,’08] ? ?
138
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 138 Jellyfish model [Tauro+] … A Simple Conceptual Model for the Internet Topology, L. Tauro, C. Palmer, G. Siganos, M. Faloutsos, Global Internet, November 25-29, 2001 Jellyfish: A Conceptual Model for the AS Internet Topology G. Siganos, Sudhir L Tauro, M. Faloutsos, J. of Communications and Networks, Vol. 8, No. 3, pp 339- 350, Sept. 2006.
139
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 139 Strange behavior of min cuts ‘negative dimensionality’ (!) NetMine: New Mining Tools for Large Graphs, by D. Chakrabarti, Y. Zhan, D. Blandford, C. Faloutsos and G. Blelloch, in the SDM 2004 Workshop on Link Analysis, Counter-terrorism and Privacy Statistical Properties of Community Structure in Large Social and Information Networks, J. Leskovec, K. Lang, A. Dasgupta, M. Mahoney. WWW 2008.
140
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 140 “Min-cut” plot Do min-cuts recursively. log (# edges) log (mincut-size / #edges) N nodes Mincut size = sqrt(N)
141
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 141 “Min-cut” plot Do min-cuts recursively. log (# edges) log (mincut-size / #edges) N nodes New min-cut
142
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 142 “Min-cut” plot Do min-cuts recursively. log (# edges) log (mincut-size / #edges) N nodes New min-cut Slope = -0.5 For a d-dimensional grid, the slope is -1/d
143
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 143 “Min-cut” plot log (# edges) log (mincut-size / #edges) Slope = -1/d For a d-dimensional grid, the slope is -1/d log (# edges) log (mincut-size / #edges) For a random graph, the slope is 0
144
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 144 “Min-cut” plot What does it look like for a real-world graph? log (# edges) log (mincut-size / #edges) ?
145
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 145 Experiments Datasets: –Google Web Graph: 916,428 nodes and 5,105,039 edges –Lucent Router Graph: Undirected graph of network routers from www.isi.edu/scan/mercator/maps.html; 112,969 nodes and 181,639 edges www.isi.edu/scan/mercator/maps.html –User Website Clickstream Graph: 222,704 nodes and 952,580 edges NetMine: New Mining Tools for Large Graphs, by D. Chakrabarti, Y. Zhan, D. Blandford, C. Faloutsos and G. Blelloch, in the SDM 2004 Workshop on Link Analysis, Counter-terrorism and Privacy
146
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 146 Experiments Used the METIS algorithm [ Karypis, Kumar, 1995] log (# edges) log (mincut-size / #edges) Google Web graph Values along the y- axis are averaged “lip” for large edges Slope of -0.4, corresponds to a 2.5- dimensional grid! Slope~ -0.4
147
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 147 Experiments Same results for other graphs too… log (# edges) log (mincut-size / #edges) Lucent Router graphClickstream graph Slope~ -0.57 Slope~ -0.45
148
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 148 Task 2 – Communities Conclusions – Practitioner’s guide Hard clustering – k pieces Hard clustering – optimal # pieces Observations METIS Cross-associations ‘jellyfish’: no good cuts
149
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 149 PART 3: Tensors
150
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 150 Roadmap Introduction – Motivation Task 1: Node importance Task 2: Community detection Task 3: Mining graphs over time – Tensors Task 4: Theory – intro to Laplacians Conclusions
151
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 151 Task 3 – Tensors - Detailed roadmap Motivation Definitions: PARAFAC Case study: web mining
152
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 152 Examples of Matrices: Authors and terms dataminingclassif.tree... John Peter Mary Nick...
153
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 153 But: if it changes over time?? A: treat it as ‘tensor’ dataminingclassif.tree... John Peter Mary Nick... KDD’08 KDD’07 KDD’09
154
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 154 Motivation: Why tensors? Q: what is a tensor?
155
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 155 Motivation: Why tensors? A: N-D generalization of matrix: dataminingclassif.tree... John Peter Mary Nick... KDD’09
156
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 156 Motivation: Why tensors? A: N-D generalization of matrix: dataminingclassif.tree... John Peter Mary Nick... KDD’08 KDD’07 KDD’09
157
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 157 Tensors are useful for 3 or more modes Terminology: ‘mode’ (or ‘aspect’): dataminingclassif.tree... Mode (== aspect) #1 Mode#2 Mode#3
158
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 158 Notice 3 rd mode does not need to be time we can have more than 3 modes... IP destination Dest. port IP source 80 125
159
CMU SCS Background: Tensors Tensors (=multi-dimensional arrays) are everywhere –Sensor stream (time, location, type) –Predicates (subject, verb, object) in knowledge base “Barrack Obama is the president of U.S.” “Eric Clapton plays guitar” (26M) (48M) NELL (Never Ending Language Learner) data Nonzeros =144M Graph Analytics wkshp 159 C. Faloutsos (CMU)
160
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 160 Task 3 – Tensors - Detailed roadmap Motivation Definitions: PARAFAC Case study: web mining
161
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 161 Tensor basics Multi-mode extensions of SVD – recall that:
162
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 162 Reminder: SVD –Best rank-k approximation in L2 A m n m n U VTVT
163
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 163 Reminder: SVD –Best rank-k approximation in L2 A m n + 1u1v11u1v1 2u2v22u2v2
164
CMU SCS Extension to (>=)3 modes Graph Analytics wkshp 164 C. Faloutsos (CMU)
165
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 165 Main points: 2 major types of tensor decompositions: PARAFAC and Tucker (not examined here) both can be solved with ``alternating least squares’’ (ALS)
166
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 166 Task 3 – Tensors - Detailed outline Motivation Definitions: PARAFAC Case study: web mining
167
CMU SCS Discoveries: Problem Definition Most important concepts and synonyms? (26M) (48M) NELL (Never Ending Language Learner) data Nonzeros =144M Graph Analytics wkshp 167 C. Faloutsos (CMU)
168
CMU SCS A1: Concept Discovery Concept Discovery in Knowledge Base Graph Analytics wkshp 168 C. Faloutsos (CMU)
169
CMU SCS A2.1: Concept Discovery Graph Analytics wkshp 169 C. Faloutsos (CMU)
170
CMU SCS A2: Synonym Discovery Synonym Discovery in Knowledge Base a1a1 a2a2 aRaR … (Given) noun phrase (Discovered) synonym 1 (Discovered) synonym 2 Graph Analytics wkshp 170 C. Faloutsos (CMU)
171
CMU SCS 171 C. Faloutsos (CMU) A2: Synonym Discovery Graph Analytics wkshp
172
CMU SCS GigaTensor: Scaling Tensor Analysis Up By 100 Times – Algorithms and Discoveries U Kang Christos Faloutsos KDD 2012 Evangelos Papalexakis Abhay Harpale Graph Analytics wkshp 172 C. Faloutsos (CMU)
173
CMU SCS Experiments GigaTensor solves 100x larger problem Number of nonzero = I / 50 (J) (I) (K) GigaTensor Tensor Toolbox Out of Memory 100x Graph Analytics wkshp 173 C. Faloutsos (CMU)
174
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 174 Conclusions Real data may have multiple aspects (modes) Tensors provide elegant theory and algorithms –PARAFAC (and Tucker): discover groups GigaTensor: scales up (hadoop/PEGASUS) –www.cs.cmu.edu/~pegasus
175
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 175 References T. G. Kolda, B. W. Bader and J. P. Kenny. Higher-Order Web Link Analysis Using Multilinear Algebra. In: ICDM 2005, Pages 242- 249, November 2005. Jimeng Sun, Spiros Papadimitriou, Philip Yu. Window-based Tensor Analysis on High- dimensional and Multi-aspect Streams, Proc. of the Int. Conf. on Data Mining (ICDM), Hong Kong, China, Dec 2006
176
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 176 Resources See tutorial on tensors, KDD’07 (w/ Tamara Kolda and Jimeng Sun): www.cs.cmu.edu/~christos/TALKS/KDD-07-tutorial
177
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 177 Tensor tools - resources Toolbox: from Tamara Kolda: csmr.ca.sandia.gov/~tgkolda/TensorToolbox 2-177 Copyright: Faloutsos, Tong (2009) 2-177 ICDE’09 T. G. Kolda and B. W. Bader. Tensor Decompositions and Applications. SIAM Review, Volume 51, Number 3, September 2009 csmr.ca.sandia.gov/~tgkolda/pubs/bibtgkfiles/TensorReview-preprint.pdf T. Kolda and J. Sun: Scalable Tensor Decomposition for Multi-Aspect Data Mining (ICDM 2008)
178
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 178 PART 4: Theory
179
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 179 Roadmap Introduction – Motivation Task 1: Node importance Task 2: Community detection Task 3: Mining graphs over time – Tensors Task 4: Theory – intro to Laplacians Conclusions
180
CMU SCS Task 4 – Theory - Detailed roadmap Adjacency matrix Laplacian –Connected Components –Intuition: 2 nd smallest eigenvalue -> ‘good cut’ 180 Graph Analytics wkshp 180 C. Faloutsos (CMU)
181
CMU SCS Adjacency matrix Graph Analytics wkshpC. Faloutsos (CMU) 181 A= 1 23 4
182
CMU SCS Adjacency matrix Graph Analytics wkshpC. Faloutsos (CMU) 182 A= 1 23 4 1-step-away paths
183
CMU SCS Adjacency matrix Graph Analytics wkshpC. Faloutsos (CMU) 183 1 23 4 Obvious extensions, for directed and/or weighted cases
184
CMU SCS Task 4 – Theory - Detailed roadmap Adjacency matrix Laplacian –Connected Components –Intuition: 2 nd smallest eigenvalue -> ‘good cut’ 184 Graph Analytics wkshp 184 C. Faloutsos (CMU)
185
CMU SCS Main upcoming result the second smallest eigenvector of the Laplacian (u 2 ) gives a good cut: Nodes with positive scores should go to one group And the rest to the other Graph Analytics wkshp 185 C. Faloutsos (CMU)
186
CMU SCS Laplacian Graph Analytics wkshpC. Faloutsos (CMU) 186 L= D-A= 1 23 4 Diagonal matrix, d ii =d i
187
CMU SCS Task 4 – Theory - Detailed roadmap Adjacency matrix Laplacian –Connected Components –Intuition: 2 nd smallest eigenvalue -> ‘good cut’ 187 Graph Analytics wkshp 187 C. Faloutsos (CMU)
188
CMU SCS Connected Components Lemma: Let G be a graph with n vertices and c connected components. If L is the Laplacian of G, then rank(L)= n-c. Proof: see p.279, Godsil-Royle Graph Analytics wkshpC. Faloutsos (CMU) 188
189
CMU SCS Connected Components Graph Analytics wkshpC. Faloutsos (CMU) 189 G(V,E) L= eig(L)= 123 6 75 4
190
CMU SCS Connected Components Graph Analytics wkshpC. Faloutsos (CMU) 190 G(V,E) L= eig(L)= #zeros = #components 123 6 75 4
191
CMU SCS Connected Components Graph Analytics wkshpC. Faloutsos (CMU) 191 G(V,E) L= eig(L)= 123 6 75 4 0.01
192
CMU SCS Connected Components Graph Analytics wkshpC. Faloutsos (CMU) 192 G(V,E) L= eig(L)= #zeros = #components 123 6 75 4 0.01
193
CMU SCS Connected Components Graph Analytics wkshpC. Faloutsos (CMU) 193 G(V,E) L= eig(L)= 123 6 75 4 0.01 Indicates a “good cut”
194
CMU SCS Task 4 – Theory - Detailed roadmap Reminders Adjacency matrix Laplacian –Connected Components –Intuition: 2 nd smallest eigenvalue -> ‘good cut’ 194 Graph Analytics wkshp 194 C. Faloutsos (CMU)
195
CMU SCS Example: Spectral Partitioning Graph Analytics wkshpC. Faloutsos (CMU) 195 K 500 dumbbell graph ?Montagues Capulets Romeo Juliet http://en.wikipedia.org/wiki/File:Romeo_and_juliet_brown.jpg
196
CMU SCS Example: Spectral Partitioning This is how adjacency matrix of B looks Graph Analytics wkshpC. Faloutsos (CMU) 196 spy(B)
197
CMU SCS Example: Spectral Partitioning 2 nd eigenvector u 2 of B: B u 2 = u 2 Graph Analytics wkshpC. Faloutsos (CMU) 197 L = diag(sum(B))-B; [u v] = eigs(L,2,'SM'); plot(u(:,1),’x’) Not so much information yet… Node-id ‘i’ u 2,i score
198
CMU SCS Example: Spectral Partitioning 2 nd eigenvector after sorting on x 2,i score Graph Analytics wkshpC. Faloutsos (CMU) 198 [ign ind] = sort(u(:,1)); plot(u(ind),'x') x 2,i score Node-id ‘i’
199
CMU SCS Example: Spectral Partitioning 2 nd eigenvector after sorting on x 2,i score Graph Analytics wkshpC. Faloutsos (CMU) 199 [ign ind] = sort(u(:,1)); plot(u(ind),'x') But now we see the two communities! x 2,i score Node-id ‘i’
200
CMU SCS Example: Spectral Partitioning This is how adjacency matrix of B looks now Graph Analytics wkshpC. Faloutsos (CMU) 200 spy(B(ind,ind))
201
CMU SCS Why λ 2 ? 201 Each ball 1 unit of mass x1x1 xnxn OSCILLATE Dfn of eigenvector Matrix viewpoint: Graph Analytics wkshp 201 C. Faloutsos (CMU)
202
CMU SCS Why λ 2 ? 202 Each ball 1 unit of mass x1x1 xnxn OSCILLATE Force due to neighbors displacement Hooke’s constant Physics viewpoint: Graph Analytics wkshp 202 C. Faloutsos (CMU)
203
CMU SCS Why λ 2 ? Graph Analytics wkshpC. Faloutsos (CMU) 203 Each ball 1 unit of mass Eigenvector value Node id x1x1 xnxn OSCILLATE For the first eigenvector: All nodes: same displacement (= value)
204
CMU SCS Why λ 2 ? 204 Each ball 1 unit of mass Eigenvector value Node id x1x1 xnxn OSCILLATE Graph Analytics wkshp 204 C. Faloutsos (CMU)
205
CMU SCS Conclusions Spectrum tells us a lot about the graph: Adjacency: #Paths Laplacian: Sparse Cut Graph Analytics wkshpC. Faloutsos (CMU) 205
206
CMU SCS References Fan R. K. Chung: Spectral Graph Theory (AMS) Chris Godsil and Gordon Royle: Algebraic Graph Theory (Springer) Bojan Mohar and Svatopluk Poljak: Eigenvalues in Combinatorial Optimization, IMA Preprint Series #939 Gilbert Strang: Introduction to Applied Mathematics (Wellesley-Cambridge Press) Graph Analytics wkshpC. Faloutsos (CMU) 206
207
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 207 PART 5: Conclusions
208
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) P9-208 Summary Task 1: Node importance Task 2: Community detection Task 3: Mining graphs over time Task 4: Spectral graph theory
209
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) P9-209 Summary Task 1: Node importance Task 2: Community detection Task 3: Mining graphs over time Task 4: Spectral graph theory ->SVD, PageRank, HITS -> METIS; ‘no good cuts’ -> Tensors -> Laplacians
210
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) P9-210 Acknowledgements Funding: IIS-0705359, IIS-0534205, DBI-0640543, CNS-0721736
211
Graph Analytics wkshpC. Faloutsos (CMU) P9-211 THANK YOU! Christos Faloutsos www.cs.cmu.edu/~christos www.cs.cmu.edu/~pegasus http://www.cs.cmu.edu/~epapalex/gmc/
Similar presentations
© 2024 SlidePlayer.com Inc.
All rights reserved.