Dept. of Computer Science Rutgers Node and Graph Similarity : Theory and Applications Danai Koutra (CMU) Tina Eliassi-Rad (Rutgers) Christos Faloutsos.

Slides:



Advertisements
Similar presentations
Weiren Yu 1, Jiajin Le 2, Xuemin Lin 1, Wenjie Zhang 1 On the Efficiency of Estimating Penetrating Rank on Large Graphs 1 University of New South Wales.
Advertisements

BiG-Align: Fast Bipartite Graph Alignment
CMU SCS I2.2 Large Scale Information Network Processing INARC 1 Overview Goal: scalable algorithms to find patterns and anomalies on graphs 1. Mining Large.
School of Computer Science Carnegie Mellon University Duke University DeltaCon: A Principled Massive- Graph Similarity Function Danai Koutra Joshua T.
School of Computer Science Carnegie Mellon University National Taiwan University of Science & Technology Unifying Guilt-by-Association Approaches: Theorems.
Multi-label Relational Neighbor Classification using Social Context Features Xi Wang and Gita Sukthankar Department of EECS University of Central Florida.
Dept. of Computer Science Rutgers Node Similarity, Graph Similarity and Matching: Theory and Applications Danai Koutra (CMU) Tina Eliassi-Rad (Rutgers)
Xiaowei Ying, Xintao Wu, Daniel Barbara Spectrum based Fraud Detection in Social Networks 1.
DATA MINING LECTURE 12 Link Analysis Ranking Random walks.
Web Search – Summer Term 2006 VI. Web Search - Ranking (cont.) (c) Wolfgang Hürst, Albert-Ludwigs-University.
CMU SCS C. Faloutsos (CMU)#1 Large Graph Algorithms Christos Faloutsos CMU McGlohon, Mary Prakash, Aditya Tong, Hanghang Tsourakakis, Babis Akoglu, Leman.
ON LINK-BASED SIMILARITY JOIN A joint work with: Liwen Sun, Xiang Li, David Cheung (University of Hong Kong) Jiawei Han (University of Illinois Urbana.
Fast Query Execution for Retrieval Models based on Path Constrained Random Walks Ni Lao, William W. Cohen Carnegie Mellon University
N EIGHBORHOOD F ORMATION AND A NOMALY D ETECTION IN B IPARTITE G RAPHS Jimeng Sun, Huiming Qu, Deepayan Chakrabarti & Christos Faloutsos Jimeng Sun, Huiming.
Graph Based Semi- Supervised Learning Fei Wang Department of Statistical Science Cornell University.
Neighborhood Formation and Anomaly Detection in Bipartite Graphs Jimeng Sun Huiming Qu Deepayan Chakrabarti Christos Faloutsos Speaker: Jimeng Sun.
On Community Outliers and their Efficient Detection in Information Networks Jing Gao 1, Feng Liang 1, Wei Fan 2, Chi Wang 1, Yizhou Sun 1, Jiawei Han 1.
Detecting Fraudulent Personalities in Networks of Online Auctioneers Duen Horng (“Polo”) Chau Shashank Pandit Christos Faloutsos School of Computer Science.
SCS CMU Proximity Tracking on Time- Evolving Bipartite Graphs Speaker: Hanghang Tong Joint Work with Spiros Papadimitriou, Philip S. Yu, Christos Faloutsos.
Measuring and Extracting Proximity in Networks By - Yehuda Koren, Stephen C.North and Chris Volinsky - Rahul Sehgal.
1 Fast Dynamic Reranking in Large Graphs Purnamrita Sarkar Andrew Moore.
Measure Proximity on Graphs with Side Information Joint Work by Hanghang Tong, Huiming Qu, Hani Jamjoom Speaker: Mary McGlohon 1 ICDM 2008, Pisa, Italy15-19.
Fast Random Walk with Restart and Its Applications
CMU SCS KDD'09Faloutsos, Miller, Tsourakakis P3-1 Large Graph Mining: Power Tools and a Practitioner’s guide Task 3: Recommendations & proximity Faloutsos,
CMU SCS Big (graph) data analytics Christos Faloutsos CMU.
Faculty: Dr. Chengcui Zhang Students: Wei-Bang Chen Song Gao Richa Tiwari.
Extracting Key Terms From Noisy and Multi-theme Documents Maria Grineva, Maxim Grinev and Dmitry Lizorkin Institute for System Programming of RAS.
CS523 INFORMATION RETRIEVAL COURSE INTRODUCTION YÜCEL SAYGIN SABANCI UNIVERSITY.
School of Computer Science Carnegie Mellon University National Taiwan University of Science & Technology Unifying Guilt-by-Association Approaches: Theorems.
DATA MINING LECTURE 13 Absorbing Random walks Coverage.
CSM06 Information Retrieval Lecture 4: Web IR part 1 Dr Andrew Salway
2015/10/111 DBconnect: Mining Research Community on DBLP Data Osmar R. Zaïane, Jiyang Chen, Randy Goebel Web Mining and Social Network Analysis Workshop.
Web Mining Class Nam Hoai Nguyen Hiep Tuan Nguyen Tri Survey on Web Structure Mining
Overview of Web Ranking Algorithms: HITS and PageRank
Fast Random Walk with Restart and Its Applications Hanghang Tong, Christos Faloutsos and Jia-Yu (Tim) Pan ICDM 2006 Dec , HongKong.
Lecture #10 PageRank CS492 Special Topics in Computer Science: Distributed Algorithms and Systems.
Graph-based Text Classification: Learn from Your Neighbors Ralitsa Angelova , Gerhard Weikum : Max Planck Institute for Informatics Stuhlsatzenhausweg.
Information Network Analysis and Discovery Cuiping Li Guoming He Information School, Renmin University of China.
CMU SCS KDD '09Faloutsos, Miller, Tsourakakis P5-1 Large Graph Mining: Power Tools and a Practitioner’s guide Task 5: Graphs over time & tensors Faloutsos,
Hanghang Tong, Brian Gallagher, Christos Faloutsos, Tina Eliassi-Rad
1 1 COMP5331: Knowledge Discovery and Data Mining Acknowledgement: Slides modified based on the slides provided by Lawrence Page, Sergey Brin, Rajeev Motwani.
CoNMF: Exploiting User Comments for Clustering Web2.0 Items Presenter: He Xiangnan 28 June School of Computing National.
Panther: Fast Top-k Similarity Search in Large Networks JING ZHANG, JIE TANG, CONG MA, HANGHANG TONG, YU JING, AND JUANZI LI Presented by Moumita Chanda.
Single-Pass Belief Propagation
1 Friends and Neighbors on the Web Presentation for Web Information Retrieval Bruno Lepri.
Kijung Shin Jinhong Jung Lee Sael U Kang
CSCE 5073 Section 001: Data Mining Spring Overview Class hour 12:30 – 1:45pm, Tuesday & Thur, JBHT 239 Office hour 2:00 – 4:00pm, Tuesday & Thur,
CS 540 Database Management Systems Web Data Management some slides are due to Kevin Chang 1.
Extrapolation to Speed-up Query- dependent Link Analysis Ranking Algorithms Muhammad Ali Norozi Department of Computer Science Norwegian University of.
Term Project Proposal By J. H. Wang Apr. 7, 2017.
A Peta-Scale Graph Mining System
Large Graph Mining: Power Tools and a Practitioner’s guide
Cross-lingual Knowledge Linking Across Wiki Knowledge Bases
PEGASUS: A PETA-SCALE GRAPH MINING SYSTEM
Link-Based Ranking Seminar Social Media Mining University UC3M
Hanghang Tong, Brian Gallagher, Christos Faloutsos, Tina Eliassi-Rad
CS7280: Special Topics in Data Mining Information/Social Networks
Large Graph Mining: Power Tools and a Practitioner’s guide
Author: Kazunari Sugiyama, etc. (WWW2004)
Speaker: Hanghang Tong Carnegie Mellon University
Jinhong Jung, Woojung Jin, Lee Sael, U Kang, ICDM ‘16
Binghui Wang, Le Zhang, Neil Zhenqiang Gong
Large Graph Mining: Power Tools and a Practitioner’s guide
GANG: Detecting Fraudulent Users in OSNs
Learning to Rank Typed Graph Walks: Local and Global Approaches
CSCE 4143 Section 001: Data Mining Spring 2019.
Proximity in Graphs by Using Random Walks
PRSim: Sublinear Time SimRank Computation on Large Power-Law Graphs.
PRSim: Sublinear Time SimRank Computation on Large Power-Law Graphs
Presentation transcript:

Dept. of Computer Science Rutgers Node and Graph Similarity : Theory and Applications Danai Koutra (CMU) Tina Eliassi-Rad (Rutgers) Christos Faloutsos (CMU) ICDM 2014, Monday December 15 th 2014, Shenzhen, China Copyright for the tutorial materials is held by the authors. The authors grant IEEE ICDM permission to distribute the materials through its website.

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Part 1b Node Similarity: Proximity 2

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos What to remember Node Roles and Proximity are complementary Node Proximity: –Building block for many applications! –Many (wrong) ways to define similarities. –Guilt-by-association techniques and effective conductance are similar (and recommended). 3

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Real-world Applications 4

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Movies recommendations 5

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Search Engines (IR) Topical Sessions 6 “popular music videos” Queries URLs “music” “yahoo” similar

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Node proximity measures: Information exchange [Minkov+ ’06] Latency/speed of info exchange [Bunke] Likelihood of future links [Liben-Nowell+ ‘03], [Tong+] Propagation of a product/idea/disease [Prakash+] Relevance: ranking [Haveliwala], [Chakrabarti+] 7

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Intuitively, a good measure… … of node-proximity should reward –many –short paths –heavy 8

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Roadmap Node Roles Node Proximity –Graph-theoretic Approaches –Effective Conductance –Guilt-by-association techniques –Summary 9

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Graph-theoretic Approaches Idea: Simple Metrics: –Number of hops –Sum of weights of hops 10 more similar than [Koren+ ’07]

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Graph-theoretic Approaches… … do not always capture meaningful relationships 11 linked via only one path (s,t) probably unrelated t s t s t s dist(s,t) = 2 [Koren+ ’07]

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Max-flow Min-cut Approach Idea: - assign a limited capacity to each edge - compute maximal # of units delivered from s to t 12 Same maximal flow Although red “closer” than green 1/1 1/2 1/1 1/2 s s t t [Koren+ ’07]

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos SimRank Idea: two objects are similar if they are referenced by similar objects 13 [Jeh, Widom ’07] Structural context G(V,E) G 2 (V 2,E 2 )

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos SimRank 14 Structural context G(V,E) α b similarity of in-neighbors decay factor ε[0,1] total # of in-neighbors pairs Avg similarity between in-neighbors of α and in-neighbors of b [Jeh, Widom ’07]

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos SimRank for Bipartite Graphs 15 G(V,E) Avg similarity between out-neighbors of A and out-neighbors of B c d Avg similarity between in-neighbors of c and in-neighbors of d A,BA,B c,dc,d [Jeh, Widom ’07; Improvements: Antonellis+‘08 SimRank++, C. Li, Han+’10, Y. Zhang ’13, P. Li+’14 … ] “music” “music videos”

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Roadmap Node Roles Node Proximity –Graph-theoretic Approaches –Effective Conductance –Guilt-by-association techniques –Summary 16

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Effective Conductance Edges: resistors Weights: conductance 17 [Doyle & Snell ‘84] (s,t)-proximity = current from s to t s t Vs = 1 Solve system of linear equations

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Effective Conductance Favors short paths (graph-theoretic) and more paths (max-flow) BUT: ‘pizza delivery boy’ issues 18 s t Vs = 1 [Doyle & Snell ‘84]

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Cycle-free effective conductance ~ random walks EC(s,t) = Effective Conductance(s,t) = = deg(s) * P(s -> t) = = deg(t) * P(t -> s) = = Expected number of successful escapes from s -> t for deg(s) attempts 19 s t Vs = 1 [Doyle & Snell ‘84]

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Community Detection Proximity Graph 20 RSA cryptosystem Proximity graph [Koren+ ’07]

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Community around a single node 21 [Koren+ ’07]

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Roadmap Node Roles Node Proximity –Graph-theoretic Approaches –Effective Conductance –Unification: Guilt-by-association techniques –Summary 22

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos ? ? ? ? ? Guilt-by-Association Techniques 23 Given: graph and few labeled nodes Find: class (red/green) for rest nodes Assuming: network effect (homophily/ heterophily) red green F raudster H onest A ccomplice red green red

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Guilt-by-Association Techniques Random Walk with Restarts (RWR) Google Semi-supervised Learning (SSL) Belief Propagation (BP) Bayesian 24 ? ? ? ? ?

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Personalized Random Walk with Restarts (RWR) 25 [Brin+ ’98; Haveliwala ’03; Tong+ ’06; Minkov, Cohen ’07] measure relevance

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Personalized RWR 26 relevance vector restart prob starting vector 0 ½ ½ ½ 0 ½ ½ ½ ½ 0 ½ ? ? graph structure

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Guilt-by-Association Techniques Random Walk with Restarts (RWR) Google Semi-supervised Learning (SSL) Belief Propagation (BP) Bayesian 27 ? ? ? ? ?

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Semi-Supervised Learning (SSL) graph-based few labeled nodes edges: similarity between nodes Inference: exploit neighborhood information 28 STEP1STEP1 STEP1STEP1 STEP2STEP2 STEP2STEP ? ? [Zhou ’06; Ji, Han ’10]…

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos SSL Equation 29 homophily strength of neighbors ~”stiffness of spring” final labels known labels 1 ? ? d1 d2 d3 d1 d2 d graph structure ? ?

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Guilt-by-Association Techniques Random Walk with Restarts (RWR) Google Semi-supervised Learning (SSL) Belief Propagation (BP) Bayesian 30 ? ? ? ? ?

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Belief Propagation Iterative message-based method st round 2 nd round... until stop criterion fulfilled “Propagation matrix”:  Homophily PL AI class of sender class of receiver [Pearl ’82; Yedidia+ ’02; … ; Gonzalez+ ’09; Chechetka+ ‘10]

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Belief Propagation Iterative message-based method “Propagation matrix”:  Homophily  Heterophily class of sender class of receiver [Pearl ’82; Yedidia+ ’02; … ; Gonzalez+ ’09; Chechetka+ ‘10]

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos 33 Belief Propagation Equations i j … … message(i −> j) ≈ belief(i)  homophily strength

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos 34 Belief Propagation Equations i j … belief of i prior belief messages from neighbors

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Fast Belief Propagation 35 BP is approximated by Linearized BP ? ? d1 d2 d3 d1 d2 d3 linearnon-linear Belief Propagation FaBP [Koutra+]:Original [Yedidia+]: [Koutra+ PKDD’11: Unifying Guilt-by-Association Approaches: Theorems and Fast Algorithms] prior beliefs

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Qualitative Comparison 36 GBA Method HeterophilyScalabilityConvergence RWR ✗✓✓ SSL ✗✓✓ BP ✓✓ ? F A BP ✓✓✓ [Koutra+ PKDD’11]

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Correspondence of Methods RWR ≈ SSL ≈ BP Random Walk Semi-supervised Belief with Restarts Learning Propagation 37 MethodMatrixunknownknown RWR[I – c AD -1 ]×x=(1-c)y SSL [I + a (D - A)] ×x=y FABP [I + a D - c ’ A] ×bhbh =φhφh ? ? [Koutra+ PKDD’11]

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Extension to multiple classes 38 [Gatterbauer+ VLDB’15] MethodMatrixunknownknown RWR [I – c AD -1 ]×x=(1-c)y SSL [I + a (D - A) ] ×x=y FABP [I + a D - c ’ A ] ×bhbh =φhφh LinBP [I + H 2 D - H A ]×vec(F)=vec(X)

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Extension to multiple classes 39 [Gatterbauer+ VLDB’15] MethodMatrixunknownknown RWR [I – c AD -1 ]×x=(1-c)y SSL [I + a (D - A) ] ×x=y FABP [I + a D - c ’ A ] ×bhbh =φhφh LinBP [I + H 2 D - H A ]×vec(F)=vec(X) LinBP* [I - H A ]×vec(F)=vec(X)

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Applications of Guilt-by-Association Approaches 40

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Center-Piece Subgraph(CePS) 41 Original Graph CePS Q: How to find hub for the black nodes? A: Proximity! [Tong+ KDD’06] CePS guy Input Output Red: Max (Prox(A, Red) x Prox(B, Red) x Prox(C, Red))

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos CePS Example 42 [Tong+ KDD’06]

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos 43 Related Work Recommendation [APOLO, Chau+ ’11] initial interest Relevance indicated by color saturation.

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Fraud detection scams16.6% Non-delivery merchandise11.9% Fee fraud 9.8% Identity theft 8.2% Overpayment fraud 7.3% Misc Fraud 6.3% Spam 6.2% Auction fraud 5.7% … 44 of internet crime complaints in 2009 [Pandit, Chau+ ‘07]

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos 45 Fraud detection [Pandit, Chau+ ‘07] F raudster H onest A ccomplice F A H Near-bipartite core 66,130 users 795,320 trans. Found bipartite cores with confirmed fraudsters! Belief Propagation Fraudster Accomplice Honest

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Even More Applications Clustering [Ding+ KDD 2007] management [Minkov+ CEAS 06] Business Process Management [Qu+ 2008] ProSIN –Listen to clients’ comments [Tong+ 2008] TANGENT –Broaden Users’ Horizon [Oonuma & Tong ] Ghost Edge Within Network Classification [Gallagher & Tong+ KDD08 b] … 46

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Roadmap Node Roles Node Proximity –Graph-theoretic Approaches –Effective Conductance –Guilt-by-association techniques –Summary 47

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos Summary Node Roles and Proximity are complementary Node Proximity: –Building block for many applications! –Many (wrong) ways to define similarities. –Guilt-by-association techniques and effective conductance are similar (and recommended). 48

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos What we will cover next 49

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos References Guilt-by-association techniques Wolfgang Gatterbauer, Stephan Guennemann, Danai Koutra, Christos Faloutsos. Linearized and Single-Pass Belief Propagation. Proceedings of the VLDB Endowment, Volume 8(4) (VLDB'15), August [code]Linearized and Single-Pass Belief Propagationcode Danai Koutra, Tai-You Ke, U. Kang, Duen Horng Chau, Hsing-Kuo Kenneth Pao, and Christos Faloutsos Unifying guilt-by-associationUnifying guilt-by-association approaches: theorems and fast algorithms approaches: theorems and fast algorithms. ECML PKDD'11. [code]code Duen Horng Chau, Aniket Kittur, Jason I Hong, Christos Faloutsos. Apolo: making sense of large network data by combining rich user interaction and machine learning. In SIGCHI, , W. Cohen. (2007) Graph Walks and Graphical Models. Draft. H. Tong, S. Papadimitriou, P.S. Yu & C. Faloutsos. (2008) Proximity Tracking on Time-Evolving Bipartite Graphs. to appear in SDM B. Gallagher, H. Tong, T. Eliassi-Rad, C. Faloutsos. Using Ghost Edges for Classification in Sparsely Labeled Networks. KDD (In reverse chronological order)

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos References H. Tong, H. Qu, and H. Jamjoom. Measuring Proximity on Graphs with Side Information. Proceedings of ICDM S Pandit, DH Chau, S Wang, C Faloutsos. Netprobe: a fast and scalable system for fraud detection in online auction networks. In WWW, , H. Tong, Y. Koren, & C. Faloutsos. (2007) Fast direction-aware proximity for graph mining. In KDD, , S. Chakrabarti. (2007) Dynamic personalized pagerank in entity- relation graphs. In WWW, , F. Fouss, A. Pirotte, J.-M. Renders, & M. Saerens. (2007) Random-Walk Computation of Similarities between Nodes of a Graph with Application to Collaborative Recommendation. IEEE Trans. Knowl. Data Eng. 19(3), H. Tong, Y. Koren, & C. Faloutsos. (2007) Fast direction-aware proximity for graph mining. In KDD, ,

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos References H. Tong, B. Gallagher, C. Faloutsos, & T. Eliassi-Rad. (2007) Fast best- effort pattern matching in large attributed graphs. In KDD, , Einat Minkov, William W. Cohen, and Andrew Y. Ng Contextual search and name disambiguation in using graphs. ACM SIGIR '06. H. Tong & C. Faloutsos. (2006) Center-piece subgraphs: problem definition and fast solutions. In KDD, , H. Tong, C. Faloutsos, & J.Y. Pan. (2006) Fast Random Walk with Restart and Its Applications. In ICDM, , A. Agarwal, S. Chakrabarti & S. Aggarwal. (2006) Learning to rank networked entities. In KDD, 14-23, J. Sun, H. Qu, D. Chakrabarti & C. Faloutsos. (2005) Neighborhood Formation and Anomaly Detection in Bipartite Graphs. In ICDM, , J.Y. Pan, H.J. Yang, C. Faloutsos & P. Duygulu. (2004) Automatic multimedia cross-modal correlation discovery. In KDD, ,

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos References C. Faloutsos, K. S. McCurley, and A. Tomkins. Fast discovery of connection subgraphs. In Proc. 10th ACM SIGKDD conference, pages 118– 127, David Liben-Nowell and Jon Kleinberg The link prediction problem for social networks. CIKM '03. Jonathan S. Yedidia, William T. Freeman, and Yair Weiss Understanding belief propagation and its generalizations. In Exploring artificial intelligence in the new millennium, Gerhard Lakemeyer and Bernhard Nebel (Eds.). T.H. Haveliwala (2002) Topic-Sensitive Pagerank. In WWW, , Sergey Brin and Lawrence Page The anatomy of a large-scale hypertextual Web search engine. In World Wide Web 7 (WWW7). Peter G. Doyle and J. Laurie Snell. Random walks and electric networks, The Mathematical Association of America,

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos References Graph Theoretic Techniques Zhang, Yinglong and Li, Cuiping and Chen, Hong and Sheng, Likun. Fast SimRank Computation over Disk-Resident Graphs. Lecture Notes in Computer Science, Database Systems for Advanced Applications Cuiping Li, Jiawei Han, Guoming He, Xin Jin, Yizhou Sun, Yintao Yu, and Tianyi Wu Fast computation of SimRank for static and dynamic information networks. In Proceedings of the 13th International Conference on Extending Database Technology (EDBT '10). ACM, Pei Li and Hongyan Liu and Jeffrey Xu and Yu Jun and He Xiaoyong Du Fast single-pair simrank computation. In Proc. of the SIAM Intl. Conf. on Data Mining (SDM 2010). Ryan N. Lichtenwalter, Jake T. Lussier, and Nitesh V. Chawla New perspectives and methods in link prediction. KDD '10. Dmitry Lizorkin, Pavel Velikhov, Maxim Grinev, and Denis Turdakov Accuracy estimate and optimization techniques for SimRank computation. Weiren Yu; Xuemin Lin; Jiajin Le, "A Space and Time Efficient Algorithm for SimRank Computation”, th International Asia-Pacific Web Conference (APWEB), pp

ICDM’14 Tutorial D. Koutra & T. Eliassi-Rad & C. Faloutsos References Ioannis Antonellis, Hector Garcia Molina, and Chi Chao Chang Simrank++: query rewriting through link analysis of the click graph. Proc. VLDB Endow. 1, 1 (August 2008), Yehuda Koren, Stephen C. North, and Chris Volinsky Measuring a nd extracting proximity graphs in networks. ACM TKDD 1, 3, Article 12 (Dec 2007) Dániel Fogaras and Balázs Rácz Scaling link-based similarity search. WWW '05. Vincent D. Blondel, Anahí Gajardo, Maureen Heymans, Pierre Senellart, and Paul Van Dooren A Measure of Similarity between Graph Vertices: Applications to Synonym Extraction and Web Searching. SIAM Rev. 46, 4 (April 2004), Glen Jeh and Jennifer Widom SimRank: a measure of structural-context similarity. ACM SIGKDD