Large Graph Mining: Power Tools and a Practitioner’s guide

Slides:



Advertisements
Similar presentations
CMU SCS I2.2 Large Scale Information Network Processing INARC 1 Overview Goal: scalable algorithms to find patterns and anomalies on graphs 1. Mining Large.
Advertisements

School of Computer Science Carnegie Mellon University Duke University DeltaCon: A Principled Massive- Graph Similarity Function Danai Koutra Joshua T.
Correlation Search in Graph Databases Yiping Ke James Cheng Wilfred Ng Presented By Phani Yarlagadda.
Mauro Sozio and Aristides Gionis Presented By:
The Connectivity and Fault-Tolerance of the Internet Topology
15-826: Multimedia Databases and Data Mining
Uncertainty Representation. Gaussian Distribution variance Standard deviation.
Object Detection by Matching Longin Jan Latecki. Contour-based object detection Database shapes: …..
CMU SCS Mining Billion-node Graphs Christos Faloutsos CMU.
Fast Direction-Aware Proximity for Graph Mining KDD 2007, San Jose Hanghang Tong, Yehuda Koren, Christos Faloutsos.
Neighborhood Formation and Anomaly Detection in Bipartite Graphs Jimeng Sun Huiming Qu Deepayan Chakrabarti Christos Faloutsos Speaker: Jimeng Sun.
CMU SCS Large Graph Mining Christos Faloutsos CMU.
© 2010 IBM Corporation Diversified Ranking on Large Graphs: An Optimization Viewpoint Hanghang Tong, Jingrui He, Zhen Wen, Ching-Yung Lin, Ravi Konuru.
SCS CMU Proximity Tracking on Time- Evolving Bipartite Graphs Speaker: Hanghang Tong Joint Work with Spiros Papadimitriou, Philip S. Yu, Christos Faloutsos.
Keyword Proximity Search on XML Graphs Vagelis Hristidis Yannis Papakonstatinou Andrey Presenter: Feng Shao.
Presented by Ozgur D. Sahin. Outline Introduction Neighborhood Functions ANF Algorithm Modifications Experimental Results Data Mining using ANF Conclusions.
Measure Proximity on Graphs with Side Information Joint Work by Hanghang Tong, Huiming Qu, Hani Jamjoom Speaker: Mary McGlohon 1 ICDM 2008, Pisa, Italy15-19.
The community-search problem and how to plan a successful cocktail party Mauro SozioAris Gionis Max Planck Institute, Germany Yahoo! Research, Barcelona.
Given Connections Solution
Fast Random Walk with Restart and Its Applications
CMU SCS KDD'09Faloutsos, Miller, Tsourakakis P3-1 Large Graph Mining: Power Tools and a Practitioner’s guide Task 3: Recommendations & proximity Faloutsos,
Introduction Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Introduction Facebook How does Facebook use your data? Where do you think.
Mining Large Graphs Part 3: Case studies Jure Leskovec and Christos Faloutsos Machine Learning Department Joint work with: Lada Adamic, Deepay Chakrabarti,
1 Data Mining over the Deep Web Tantan Liu, Gagan Agrawal Ohio State University April 12, 2011.
School of Computer Science Carnegie Mellon LLNL, Feb. '07C. Faloutsos1 Mining static and time-evolving graphs Christos Faloutsos Carnegie Mellon University.
Limits of Local Algorithms in Random Graphs
CMU SCS KDD'09Faloutsos, Miller, Tsourakakis P0-1 Large Graph Mining: Power Tools and a Practitioner’s guide Christos Faloutsos Gary Miller Charalampos.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 TANGENT: A Novel, “Surprise-me”, Recommendation Algorithm.
RESOURCES, TRADE-OFFS, AND LIMITATIONS Group 5 8/27/2014.
Keyword Searching and Browsing in Databases using BANKS Seoyoung Ahn Mar 3, 2005 The University of Texas at Arlington.
ValuePick : Towards a Value-Oriented Dual-Goal Recommender System Leman Akoglu Christos Faloutsos OEDM in conjunction with ICDM 2010 Sydney, Australia.
KDD 2007, San Jose Fast Direction-Aware Proximity for Graph Mining Speaker: Hanghang Tong Joint work w/ Yehuda Koren, Christos Faloutsos.
Fast Random Walk with Restart and Its Applications Hanghang Tong, Christos Faloutsos and Jia-Yu (Tim) Pan ICDM 2006 Dec , HongKong.
CMU SCS KDD '09Faloutsos, Miller, Tsourakakis P5-1 Large Graph Mining: Power Tools and a Practitioner’s guide Task 5: Graphs over time & tensors Faloutsos,
Hanghang Tong, Brian Gallagher, Christos Faloutsos, Tina Eliassi-Rad
KDD 2007, San Jose Fast Direction-Aware Proximity for Graph Mining Speaker: Hanghang Tong Joint work w/ Yehuda Koren, Christos Faloutsos.
Kijung Shin Jinhong Jung Lee Sael U Kang
Private Release of Graph Statistics using Ladder Functions J.ZHANG, G.CORMODE, M.PROCOPIUC, D.SRIVASTAVA, X.XIAO.
Center-Piece Subgraphs: Problem definition and Fast Solutions Hanghang Tong Christos Faloutsos Carnegie Mellon University.
Graph Indexing From managing and mining graph data.
CMU SCS KDD'09Faloutsos, Miller, Tsourakakis P9-1 Large Graph Mining: Power Tools and a Practitioner’s guide Christos Faloutsos Gary Miller Charalampos.
Of 17 Limits of Local Algorithms in Random Graphs Madhu Sudan MSR Joint work with David Gamarnik (MIT) 7/11/2013Local Algorithms on Random Graphs1.
CMU SCS Panel: Social Networks Christos Faloutsos CMU.
CMU SCS KDD '09Faloutsos, Miller, Tsourakakis P8-1 Large Graph Mining: Power Tools and a Practitioner’s guide Task 8: hadoop and Tera/Peta byte graphs.
Queensland University of Technology
Finding Dense and Connected Subgraphs in Dual Networks
Large Graph Mining: Power Tools and a Practitioner’s guide
DOULION: Counting Triangles in Massive Graphs with a Coin
T.W. Scholten, C. de Persis, P. Tesi
Hanghang Tong, Brian Gallagher, Christos Faloutsos, Tina Eliassi-Rad
NetMine: Mining Tools for Large Graphs
Kijung Shin1 Mohammad Hammoud1
Summarizing Entities: A Survey Report
Hanghang Tong, Brian Gallagher, Christos Faloutsos, Tina Eliassi-Rad
Large Graph Mining: Power Tools and a Practitioner’s guide
Distributed Representations of Subgraphs
Graph Database Mining and Its Applications
KDD Reviews 周天烁 2018年5月9日.
Graph and Tensor Mining for fun and profit
Graph and Tensor Mining for fun and profit
Jinhong Jung, Woojung Jin, Lee Sael, U Kang, ICDM ‘16
Graph and Tensor Mining for fun and profit
Christos Faloutsos CMU
Classification in Complex Systems
Graph and Tensor Mining for fun and profit
Graph and Tensor Mining for fun and profit
Large Graph Mining: Power Tools and a Practitioner’s guide
Learning to Rank Typed Graph Walks: Local and Global Approaches
Proximity in Graphs by Using Random Walks
Presentation transcript:

Large Graph Mining: Power Tools and a Practitioner’s guide Task 4: Center-piece Subgraphs Faloutsos, Miller and Tsourakakis CMU KDD'09 Faloutsos, Miller, Tsourakakis

Faloutsos, Miller, Tsourakakis Outline Introduction – Motivation Task 1: Node importance Task 2: Community detection Task 3: Recommendations Task 4: Connection sub-graphs Task 5: Mining graphs over time … Conclusions KDD'09 Faloutsos, Miller, Tsourakakis

Faloutsos, Miller, Tsourakakis Detailed outline Problem definition Solution Results H. Tong & C. Faloutsos Center-piece subgraphs: problem definition and fast solutions. In KDD, 404-413, 2006. KDD'09 Faloutsos, Miller, Tsourakakis

Center-Piece Subgraph(Ceps) Given Q query nodes Find Center-piece ( ) Input of Ceps Q Query nodes Budget b k softAnd number App. Social Network Law Inforcement Gene Network … KDD'09 Faloutsos, Miller, Tsourakakis

Faloutsos, Miller, Tsourakakis Challenges in Ceps Q1: How to measure importance? (Q2: How to extract connection subgraph? Q3: How to do it efficiently?) KDD'09 Faloutsos, Miller, Tsourakakis

Faloutsos, Miller, Tsourakakis Challenges in Ceps Q1: How to measure importance? A: “proximity” – but how to combine scores? (Q2: How to extract connection subgraph? Q3: How to do it efficiently?) KDD'09 Faloutsos, Miller, Tsourakakis

Faloutsos, Miller, Tsourakakis AND: Combine Scores Q: How to combine scores? KDD'09 Faloutsos, Miller, Tsourakakis

Faloutsos, Miller, Tsourakakis AND: Combine Scores Q: How to combine scores? A: Multiply …= prob. 3 random particles coincide on node j KDD'09 Faloutsos, Miller, Tsourakakis

K_SoftAnd: Relaxation of AND Disconnected Communities Noise What if AND query  No Answer? KDD'09 Faloutsos, Miller, Tsourakakis

K_SoftAnd: Combine Scores Generalization – SoftAND: We want nodes close to k of Q (k<Q) query nodes. Q: How to do that? KDD'09 Faloutsos, Miller, Tsourakakis

K_SoftAnd: Combine Scores Generalization – softAND: We want nodes close to k of Q (k<Q) query nodes. Q: How to do that? A: Prob(at least k-out-of-Q will meet each other at j) KDD'09 Faloutsos, Miller, Tsourakakis

AND query vs. K_SoftAnd query x 1e-4 And Query 2_SoftAnd Query KDD'09 Faloutsos, Miller, Tsourakakis

1_SoftAnd query = OR query KDD'09 Faloutsos, Miller, Tsourakakis

Faloutsos, Miller, Tsourakakis Detailed outline Problem definition Solution Results KDD'09 Faloutsos, Miller, Tsourakakis

Faloutsos, Miller, Tsourakakis Case Study: AND query KDD'09 Faloutsos, Miller, Tsourakakis

Faloutsos, Miller, Tsourakakis Case Study: AND query KDD'09 Faloutsos, Miller, Tsourakakis

Faloutsos, Miller, Tsourakakis database Statistic KDD'09 Faloutsos, Miller, Tsourakakis 2_SoftAnd query

Faloutsos, Miller, Tsourakakis Conclusions Proximity (e.g., w/ RWR) helps answer ‘AND’ and ‘k_softAnd’ queries KDD'09 Faloutsos, Miller, Tsourakakis