CMU SCS 15-826: Multimedia Databases and Data Mining Lecture #21: Tensor decompositions C. Faloutsos.

Slides:



Advertisements
Similar presentations
CMU SCS : Multimedia Databases and Data Mining Lecture #17: Text - part IV (LSI) C. Faloutsos.
Advertisements

Beyond Streams and Graphs: Dynamic Tensor Analysis
Multilinear Algebra for Analyzing Data with Multiple Linkages
CMU SCS : Multimedia Databases and Data Mining Lecture #20: SVD - part III (more case studies) C. Faloutsos.
CMU SCS : Multimedia Databases and Data Mining Lecture #19: SVD - part II (case studies) C. Faloutsos.
CMU SCS : Multimedia Databases and Data Mining Lecture #11: Fractals: M-trees and dim. curse (case studies – Part II) C. Faloutsos.
Multilinear Algebra for Analyzing Data with Multiple Linkages Tamara G. Kolda plus: Brett Bader, Danny Dunlavy, Philip Kegelmeyer Sandia National Labs.
CMU SCS : Multimedia Databases and Data Mining Lecture#5: Multi-key and Spatial Access Methods - II C. Faloutsos.
Dimensionality Reduction PCA -- SVD
15-826: Multimedia Databases and Data Mining
CMU SCS Large Graph Mining - Patterns, Tools and Cascade Analysis Christos Faloutsos CMU.
SCS CMU Joint Work by Hanghang Tong, Spiros Papadimitriou, Jimeng Sun, Philip S. Yu, Christos Faloutsos Speaker: Hanghang Tong Aug , 2008, Las Vegas.
10-603/15-826A: Multimedia Databases and Data Mining SVD - part II (more case studies) C. Faloutsos.
Principal Component Analysis
Multimedia Databases SVD II. Optimality of SVD Def: The Frobenius norm of a n x m matrix M is (reminder) The rank of a matrix M is the number of independent.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 4 March 30, 2005
Multimedia Databases SVD II. SVD - Detailed outline Motivation Definition - properties Interpretation Complexity Case studies SVD properties More case.
SCS CMU Proximity Tracking on Time- Evolving Bipartite Graphs Speaker: Hanghang Tong Joint Work with Spiros Papadimitriou, Philip S. Yu, Christos Faloutsos.
IR Models: Latent Semantic Analysis. IR Model Taxonomy Non-Overlapping Lists Proximal Nodes Structured Models U s e r T a s k Set Theoretic Fuzzy Extended.
The Terms that You Have to Know! Basis, Linear independent, Orthogonal Column space, Row space, Rank Linear combination Linear transformation Inner product.
Clustering In Large Graphs And Matrices Petros Drineas, Alan Frieze, Ravi Kannan, Santosh Vempala, V. Vinay Presented by Eric Anderson.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 6 May 7, 2006
10-603/15-826A: Multimedia Databases and Data Mining SVD - part I (definitions) C. Faloutsos.
3D Geometry for Computer Graphics
Kathryn Linehan Advisor: Dr. Dianne O’Leary
Multimedia Databases LSI and SVD. Text - Detailed outline text problem full text scanning inversion signature files clustering information filtering and.
Singular Value Decomposition and Data Management
E.G.M. PetrakisDimensionality Reduction1  Given N vectors in n dims, find the k most important axes to project them  k is user defined (k < n)  Applications:
CMU SCS : Multimedia Databases and Data Mining Lecture #30: Conclusions C. Faloutsos.
Matrices CS485/685 Computer Vision Dr. George Bebis.
Introduction The central problems of Linear Algebra are to study the properties of matrices and to investigate the solutions of systems of linear equations.
The PageRank Citation Ranking: Bringing Order to the Web Presented by Aishwarya Rengamannan Instructor: Dr. Gautam Das.
Presented By Wanchen Lu 2/25/2013
Linear Algebra Review 1 CS479/679 Pattern Recognition Dr. George Bebis.
CMU SCS Graph Analytics wkshpC. Faloutsos (CMU) 1 Graph Analytics Workshop: Tools Christos Faloutsos CMU.
1 Information Retrieval through Various Approximate Matrix Decompositions Kathryn Linehan Advisor: Dr. Dianne O’Leary.
CMU SCS KDD'09Faloutsos, Miller, Tsourakakis P0-1 Large Graph Mining: Power Tools and a Practitioner’s guide Christos Faloutsos Gary Miller Charalampos.
Introduction to tensor, tensor factorization and its applications
SVD: Singular Value Decomposition
CMU SCS U Kang (CMU) 1KDD 2012 GigaTensor: Scaling Tensor Analysis Up By 100 Times – Algorithms and Discoveries U Kang Christos Faloutsos School of Computer.
CMU SCS Big (graph) data analytics Christos Faloutsos CMU.
Latent Semantic Indexing: A probabilistic Analysis Christos Papadimitriou Prabhakar Raghavan, Hisao Tamaki, Santosh Vempala.
CMU SCS KDD '09Faloutsos, Miller, Tsourakakis P5-1 Large Graph Mining: Power Tools and a Practitioner’s guide Task 5: Graphs over time & tensors Faloutsos,
1 Latent Concepts and the Number Orthogonal Factors in Latent Semantic Analysis Georges Dupret
ParCube: Sparse Parallelizable Tensor Decompositions
1 1 COMP5331: Knowledge Discovery and Data Mining Acknowledgement: Slides modified based on the slides provided by Lawrence Page, Sergey Brin, Rajeev Motwani.
Review of Linear Algebra Optimization 1/16/08 Recitation Joseph Bradley.
Streaming Pattern Discovery in Multiple Time-Series Jimeng Sun Spiros Papadimitrou Christos Faloutsos PARALLEL DATA LABORATORY Carnegie Mellon University.
CMU SCS : Multimedia Databases and Data Mining Lecture #18: SVD - part I (definitions) C. Faloutsos.
Instructor: Mircea Nicolescu Lecture 8 CS 485 / 685 Computer Vision.
Facets: Fast Comprehensive Mining of Coevolving High-order Time Series Hanghang TongPing JiYongjie CaiWei FanQing He Joint Work by Presenter:Wei Fan.
SCS CMU Speaker Hanghang Tong Colibri: Fast Mining of Large Static and Dynamic Graphs Speaking Skill Requirement.
MTH108 Business Math I Lecture 20.
Large Graph Mining: Power Tools and a Practitioner’s guide
Introduction The central problems of Linear Algebra are to study the properties of matrices and to investigate the solutions of systems of linear equations.
Introduction The central problems of Linear Algebra are to study the properties of matrices and to investigate the solutions of systems of linear equations.
Estimation Techniques for High Resolution and Multi-Dimensional Array Signal Processing EMS Group – Fh IIS and TU IL Electronic Measurements and Signal.
15-826: Multimedia Databases and Data Mining
Zhu Han University of Houston Thanks for Dr. Hung Nguyen’s Slides
Jure Leskovec and Christos Faloutsos Machine Learning Department
LSI, SVD and Data Management
Large Graph Mining: Power Tools and a Practitioner’s guide
15-826: Multimedia Databases and Data Mining
15-826: Multimedia Databases and Data Mining
15-826: Multimedia Databases and Data Mining
15-826: Multimedia Databases and Data Mining
Jimeng Sun · Charalampos (Babis) E
15-826: Multimedia Databases and Data Mining
Lecture 13: Singular Value Decomposition (SVD)
Junghoo “John” Cho UCLA
Presentation transcript:

CMU SCS : Multimedia Databases and Data Mining Lecture #21: Tensor decompositions C. Faloutsos

CMU SCS 2 Must-read Material Tamara G. Kolda and Brett W. Bader. Tensor decompositions and applications. Technical Report SAND , Sandia National Laboratories, Albuquerque, NM and Livermore, CA, November 2007 Tensor decompositions and applications.

CMU SCS 3 Outline Goal: ‘Find similar / interesting things’ Intro to DB Indexing - similarity search Data Mining

CMU SCS 4 Indexing - Detailed outline primary key indexing secondary key / multi-key indexing spatial access methods fractals text Singular Value Decomposition (SVD) -… -Tensors multimedia...

CMU SCS 5 Most of foils by Dr. Tamara Kolda (Sandia N.L.) csmr.ca.sandia.gov/~tgkolda Dr. Jimeng Sun (CMU -> IBM) 3h tutorial:

CMU SCS 6 Outline Motivation - Definitions Tensor tools Case studies

CMU SCS 7 Motivation 0: Why “matrix”? Why matrices are important?

CMU SCS 8 Examples of Matrices: Graph - social network John PeterMaryNick... John Peter Mary Nick

CMU SCS 9 Examples of Matrices: cloud of n-d points chol#blood# age..... John Peter Mary Nick...

CMU SCS 10 Examples of Matrices: Market basket market basket as in Association Rules milkbreadchoc.wine... John Peter Mary Nick...

CMU SCS 11 Examples of Matrices: Documents and terms Paper#1 Paper#2 Paper#3 Paper#4 dataminingclassif.tree...

CMU SCS 12 Examples of Matrices: Authors and terms dataminingclassif.tree... John Peter Mary Nick...

CMU SCS 13 Examples of Matrices: sensor-ids and time-ticks t1 t2 t3 t4 temp1temp2humid.pressure...

CMU SCS 14 Motivation: Why tensors? Q: what is a tensor?

CMU SCS 15 Motivation 2: Why tensor? A: N-D generalization of matrix: dataminingclassif.tree... John Peter Mary Nick... KDD’07

CMU SCS 16 Motivation 2: Why tensor? A: N-D generalization of matrix: dataminingclassif.tree... John Peter Mary Nick... KDD’06 KDD’05 KDD’07

CMU SCS 17 Tensors are useful for 3 or more modes Terminology: ‘mode’ (or ‘aspect’): dataminingclassif.tree... Mode (== aspect) #1 Mode#2 Mode#3

CMU SCS 18 Motivating Applications Why matrices are important? Why tensors are useful? –P1: social networks –P2: web mining

CMU SCS 19 P1: Social network analysis Traditionally, people focus on static networks and find community structures We plan to monitor the change of the community structure over time

CMU SCS 20 P2: Web graph mining How to order the importance of web pages? –Kleinberg’s algorithm HITS –PageRank –Tensor extension on HITS (TOPHITS) context-sensitive hypergraph analysis

CMU SCS 21 Outline Motivation – Definitions Tensor tools Case studies Tensor Basics Tucker PARAFAC

CMU SCS Tensor Basics

CMU SCS 23 Reminder: SVD –Best rank-k approximation in L2 A m n  m n U VTVT 

CMU SCS 24 Reminder: SVD –Best rank-k approximation in L2 A m n  + 1u1v11u1v1 2u2v22u2v2

CMU SCS 25 Goal: extension to >=3 modes ~ I x R K x R A B J x R C R x R x R I x J x K +…+=

CMU SCS 26 Main points: 2 major types of tensor decompositions: PARAFAC and Tucker both can be solved with ``alternating least squares’’ (ALS) Details follow

CMU SCS Specially Structured Tensors

CMU SCS 28 = U I x R V J x R W K x R R x R x R Specially Structured Tensors Tucker TensorKruskal Tensor I x J x K = U I x R V J x S W K x T R x S x T I x J x K Our Notation +…+= u1u1 uRuR v1v1 w1w1 vRvR wRwR “core”

CMU SCS 29 Specially Structured Tensors Tucker TensorKruskal Tensor In matrix form: details

CMU SCS Tensor Decompositions

CMU SCS 31 Tucker Decomposition - intuition I x J x K ~ A I x R B J x S C K x T R x S x T author x keyword x conference A: author x author-group B: keyword x keyword-group C: conf. x conf-group : how groups relate to each other Needs elaboration!

CMU SCS 32 Intuition behind core tensor 2-d case: co-clustering [Dhillon et al. Information-Theoretic Co- clustering, KDD’03]

CMU SCS 33 m m n nl k k l eg, terms x documents

CMU SCS 34 term x term-group doc x doc group term group x doc. group med. terms cs terms common terms med. doc cs doc

CMU SCS 35

CMU SCS 36 Tucker Decomposition Proposed by Tucker (1966) AKA: Three-mode factor analysis, three-mode PCA, orthogonal array decomposition A, B, and C generally assumed to be orthonormal (generally assume they have full column rank) is not diagonal Not unique Recall the equations for converting a tensor to a matrix I x J x K ~ A I x R B J x S C K x T R x S x T Given A, B, C, the optimal core is:

CMU SCS 37 Outline Motivation – Definitions Tensor tools Case studies Tensor Basics Tucker PARAFAC

CMU SCS 38 CANDECOMP/PARAFAC Decomposition CANDECOMP = Canonical Decomposition (Carroll & Chang, 1970) PARAFAC = Parallel Factors (Harshman, 1970) Core is diagonal (specified by the vector ) Columns of A, B, and C are not orthonormal If R is minimal, then R is called the rank of the tensor (Kruskal 1977) Can have rank( ) > min{I,J,K} ¼ I x R K x R A B J x R C R x R x R I x J x K +…+=

CMU SCS 39 Tucker vs. PARAFAC Decompositions Tucker –Variable transformation in each mode –Core G may be dense –A, B, C generally orthonormal –Not unique PARAFAC –Sum of rank-1 components –No core, i.e., superdiagonal core –A, B, C may have linearly dependent columns –Generally unique I x J x K ~ A I x R B J x S C K x T R x S x T I x J x K +…+ ~ a1a1 aRaR b1b1 c1c1 bRbR cRcR IMPORTANT

CMU SCS 40 Tensor tools - summary Two main tools –PARAFAC –Tucker Both find row-, column-, tube-groups –but in PARAFAC the three groups are identical To solve: Alternating Least Squares Toolbox: from Tamara Kolda:

CMU SCS 41 Outline Motivation - Definitions Tensor tools Case studies

CMU SCS 42 P1: Web graph mining How to order the importance of web pages? –Kleinberg’s algorithm HITS –PageRank –Tensor extension on HITS (TOPHITS)

CMU SCS 43 Kleinberg’s Hubs and Authorities (the HITS method) Sparse adjacency matrix and its SVD: authority scores for 1 st topic hub scores for 1 st topic hub scores for 2 nd topic authority scores for 2 nd topic from to Kleinberg, JACM, 1999

CMU SCS 44 authority scores for 1 st topic hub scores for 1 st topic hub scores for 2 nd topic authority scores for 2 nd topic from to HITS Authorities on Sample Data We started our crawl from and crawled 4700 pages, resulting in 560 cross-linked hosts.

CMU SCS 45 Three-Dimensional View of the Web Observe that this tensor is very sparse! Kolda, Bader, Kenny, ICDM05

CMU SCS 46 Topical HITS (TOPHITS) Main Idea: Extend the idea behind the HITS model to incorporate term (i.e., topical) information. authority scores for 1 st topic hub scores for 1 st topic hub scores for 2 nd topic authority scores for 2 nd topic from to term term scores for 1 st topic term scores for 2 nd topic

CMU SCS 47 Topical HITS (TOPHITS) Main Idea: Extend the idea behind the HITS model to incorporate term (i.e., topical) information. authority scores for 1 st topic hub scores for 1 st topic hub scores for 2 nd topic authority scores for 2 nd topic from to term term scores for 1 st topic term scores for 2 nd topic

CMU SCS 48 TOPHITS Terms & Authorities on Sample Data TOPHITS uses 3D analysis to find the dominant groupings of web pages and terms. authority scores for 1 st topic hub scores for 1 st topic hub scores for 2 nd topic authority scores for 2 nd topic from to term term scores for 1 st topic term scores for 2 nd topic Tensor PARAFAC w k = # unique links using term k

CMU SCS GigaTensor: Scaling Tensor Analysis Up By 100 Times – Algorithms and Discoveries U Kang Christos Faloutsos School of Computer Science Carnegie Mellon University Evangelos Papalexakis Abhay Harpale

CMU SCS P2: N.E.L.L. analysis NELL: Never Ending Language Learner –Q1: dominant concepts / topics? –Q2: synonyms for a given new phrase? 50 “Barrack Obama is the president of U.S.” “Eric Clapton plays guitar” (26M) (48M) NELL (Never Ending Language Learner) Nonzeros =144M (26M)

CMU SCS A1: Concept Discovery Concept Discovery in Knowledge Base

CMU SCS A1: Concept Discovery

CMU SCS A2: Synonym Discovery Synonym Discovery in Knowledge Base a1a1 a2a2 aRaR … (Given) subject (Discovered) synonym 1 (Discovered) synonym 2

CMU SCS A2: Synonym Discovery

CMU SCS 55 Conclusions Real data are often in high dimensions with multiple aspects (modes) Matrices and tensors provide elegant theory and algorithms I x J x K +…+ ~ a1a1 aRaR b1b1 c1c1 bRbR cRcR

CMU SCS 56 References Inderjit S. Dhillon, Subramanyam Mallela, Dharmendra S. Modha: Information-theoretic co-clustering. KDD 2003: T. G. Kolda, B. W. Bader and J. P. Kenny. Higher- Order Web Link Analysis Using Multilinear Algebra. In: ICDM 2005, Pages , November Jimeng Sun, Spiros Papadimitriou, Philip Yu. Window- based Tensor Analysis on High-dimensional and Multi- aspect Streams, Proc. of the Int. Conf. on Data Mining (ICDM), Hong Kong, China, Dec 2006