Bo Zong, Yinghui Wu, Ambuj K. Singh, Xifeng Yan 1 Inferring the Underlying Structure of Information Cascades

Slides:



Advertisements
Similar presentations
Algorithmic Mechanism Design: an Introduction VCG-mechanisms for some basic network optimization problems: The Minimum Spanning Tree problem Guido Proietti.
Advertisements

Inconsistent Heuristics
Instructor Neelima Gupta Table of Contents Approximation Algorithms.
BiG-Align: Fast Bipartite Graph Alignment
O(N 1.5 ) divide-and-conquer technique for Minimum Spanning Tree problem Step 1: Divide the graph into  N sub-graph by clustering. Step 2: Solve each.
Distance-Constraint Reachability Computation in Uncertain Graphs Ruoming Jin, Lin Liu Kent State University Bolin Ding UIUC Haixun Wang MSRA.
DIJKSTRA’s Algorithm. Definition fwd search Find the shortest paths from a given SOURCE node to ALL other nodes, by developing the paths in order of increasing.
Minimizing Seed Set for Viral Marketing Cheng Long & Raymond Chi-Wing Wong Presented by: Cheng Long 20-August-2011.
Minimum Spanning Tree Sarah Brubaker Tuesday 4/22/8.
1 Discrete Structures & Algorithms Graphs and Trees: III EECE 320.
CS171 Introduction to Computer Science II Graphs Strike Back.
S. J. Shyu Chap. 1 Introduction 1 The Design and Analysis of Algorithms Chapter 1 Introduction S. J. Shyu.
Combinatorial Algorithms
Lectures on Network Flows
Cloud Service Placement via Subgraph matching
PCPs and Inapproximability Introduction. My T. Thai 2 Why Approximation Algorithms  Problems that we cannot find an optimal solution.
The Complexity of the Network Design Problem Networks, 1978 Classic Paper Reading
Graph Algorithms: Minimum Spanning Tree We are given a weighted, undirected graph G = (V, E), with weight function w:
1 Internet Networking Spring 2006 Tutorial 6 Network Cost of Minimum Spanning Tree.
A general approximation technique for constrained forest problems Michael X. Goemans & David P. Williamson Presented by: Yonatan Elhanani & Yuval Cohen.
Robust Network Design with Exponential Scenarios By: Rohit Khandekar Guy Kortsarz Vahab Mirrokni Mohammad Salavatipour.
INFERRING NETWORKS OF DIFFUSION AND INFLUENCE Presented by Alicia Frame Paper by Manuel Gomez-Rodriguez, Jure Leskovec, and Andreas Kraus.
Streaming Models and Algorithms for Communication and Information Networks Brian Thompson (joint work with James Abello)
[1][1][1][1] Lecture 2-3: Coping with NP-Hardness of Optimization Problems in Practice May 26 + June 1, Introduction to Algorithmic Wireless.
1 Internet Networking Spring 2004 Tutorial 6 Network Cost of Minimum Spanning Tree.
1 Internet Networking Spring 2002 Tutorial 6 Network Cost of Minimum Spanning Tree.
CISC220 Fall 2009 James Atlas Nov 13: Graphs, Line Intersections.
Chapter 11 Limitations of Algorithm Power Copyright © 2007 Pearson Addison-Wesley. All rights reserved.
Steiner trees Algorithms and Networks. Steiner Trees2 Today Steiner trees: what and why? NP-completeness Approximation algorithms Preprocessing.
Towards Scalable Critical Alert Mining Bo Zong 1 with Yinghui Wu 1, Jie Song 2, Ambuj K. Singh 1, Hasan Cam 3, Jiawei Han 4, and Xifeng Yan 1 1 UCSB, 2.
A Framework For Community Identification in Dynamic Social Networks Chayant Tantipathananandh Tanya Berger-Wolf David Kempe Presented by Victor Lee.
Priority Models Sashka Davis University of California, San Diego June 1, 2003.
1 Introduction to Approximation Algorithms. 2 NP-completeness Do your best then.
1 Quantum query complexity of some graph problems C. DürrUniv. Paris-Sud M. HeiligmanNational Security Agency P. HøyerUniv. of Calgary M. MhallaInstitut.
On the Construction of Data Aggregation Tree with Minimum Energy Cost in Wireless Sensor Networks: NP-Completeness and Approximation Algorithms National.
Mehdi Kargar Aijun An York University, Toronto, Canada Discovering Top-k Teams of Experts with/without a Leader in Social Networks.
1 1 Stanford University 2 MPI for Biological Cybernetics 3 California Institute of Technology Inferring Networks of Diffusion and Influence Manuel Gomez.
COSC 2007 Data Structures II Chapter 14 Graphs III.
Influence Maximization in Dynamic Social Networks Honglei Zhuang, Yihan Sun, Jie Tang, Jialin Zhang, Xiaoming Sun.
Diversified Top-k Graph Pattern Matching 1 Yinghui Wu UC Santa Barbara Wenfei Fan University of Edinburgh Southwest Jiaotong University Xin Wang.
Spanning Trees Introduction to Spanning Trees AQR MRS. BANKS Original Source: Prof. Roger Crawfis from Ohio State University.
A Survey Based Seminar: Data Cleaning & Uncertain Data Management Speaker: Shawn Yang Supervisor: Dr. Reynold Cheng Prof. David Cheung
On Graph Query Optimization in Large Networks Alice Leung ICS 624 4/14/2011.
Approximation Algorithms
1 Lower Bounds Lower bound: an estimate on a minimum amount of work needed to solve a given problem Examples: b number of comparisons needed to find the.
Optimization of Wavelength Assignment for QoS Multicast in WDM Networks Xiao-Hua Jia, Ding-Zhu Du, Xiao-Dong Hu, Man-Kei Lee, and Jun Gu, IEEE TRANSACTIONS.
Zibin Zheng DR 2 : Dynamic Request Routing for Tolerating Latency Variability in Cloud Applications CLOUD 2013 Jieming Zhu, Zibin.
Answering pattern queries using views Yinghui Wu UC Santa Barbara Wenfei Fan University of EdinburghSouthwest Jiaotong University Xin Wang.
Efficient Computing k-Coverage Paths in Multihop Wireless Sensor Networks XuFei Mao, ShaoJie Tang, and Xiang-Yang Li Dept. of Computer Science, Illinois.
EXCURSIONS IN MODERN MATHEMATICS SIXTH EDITION Peter Tannenbaum 1.
COSC 5341 High-Performance Computer Networks Presentation for By Linghai Zhang ID:
A randomized linear time algorithm for graph spanners Surender Baswana Postdoctoral Researcher Max Planck Institute for Computer Science Saarbruecken,
Example Apply hierarchical clustering with d min to below data where c=3. Nearest neighbor clustering d min d max will form elongated clusters!
Introduction to Multiple-multicast Routing Chu-Fu Wang.
ETH Zurich – Distributed Computing Group Stephan HolzerSODA Stephan Holzer Silvio Frischknecht Roger Wattenhofer Networks Cannot Compute Their Diameter.
Models of Greedy Algorithms for Graph Problems Sashka Davis, UCSD Russell Impagliazzo, UCSD SIAM SODA 2004.
Yinghui Wu, SIGMOD Incremental Graph Pattern Matching Wenfei Fan Xin Wang Yinghui Wu University of Edinburgh Jianzhong Li Jizhou Luo Harbin Institute.
1 1 Stanford University 2 MPI for Biological Cybernetics 3 California Institute of Technology Inferring Networks of Diffusion and Influence Manuel Gomez.
Construction of Optimal Data Aggregation Trees for Wireless Sensor Networks Deying Li, Jiannong Cao, Ming Liu, and Yuan Zheng Computer Communications and.
Instructor Neelima Gupta Table of Contents Introduction to Approximation Algorithms Factor 2 approximation algorithm for TSP Factor.
Errol Lloyd Design and Analysis of Algorithms Approximation Algorithms for NP-complete Problems Bin Packing Networks.
An Algorithm to Learn the Structure of a Bayesian Network Çiğdem Gündüz Olcay Taner Yıldız Ethem Alpaydın Computer Engineering Taner Bilgiç Industrial.
Inferring Networks of Diffusion and Influence
Lectures on Network Flows
CS120 Graphs.
Minimum Spanning Tree.
An Algorithm for Bayesian Network Construction from Data
Solving the Minimum Labeling Spanning Tree Problem
Distance-Constraint Reachability Computation in Uncertain Graphs
Deployment Optimization of IoT Devices through Attack Graph Analysis
Presentation transcript:

Bo Zong, Yinghui Wu, Ambuj K. Singh, Xifeng Yan 1 Inferring the Underlying Structure of Information Cascades

Information cascades in social networks 2  Information propagation in social networks  Cascade: a subgraph tracking propagation trace (a) Facebook shared link (b) Twitter retweet But missing structure: a)Privacy policies b) Crawling noise. (a) A social graph Importance of Cascades a)Viral marketing b)Recommendation c)Information Security (b) A cascade

An example: cascade structure inference 3  We can fully observe a social graph, but the cascade structure is partially observed. #iPhone5 Alice, 1pmBob, 1:50pm David Cooper, 3pmEd Frank Alice, 1pm Bob, 1:50pm Cooper, 3pm David Frank Alice, 1pm Bob, 1:50pm Cooper, 3pm David 1. Which one? 2. How difficult? 3. Algorithms? 4. Accuracy?

Roadmap 4  Introduction  Preliminary  Problem definition  Our solutions  Experimental results  Conclusions

Preliminary 5  Diffusion graph, G = (V, E)  Independent Cascade Model  Diffusion function, f : E  [0,1]  Partial observation, X X={ (A, 0), (B, 1), (C, 3) } AB D CE F G

Problem definition 6 Minimum Perfect Consistent Tree (PCT)  Input  diffusion graph G  source s  partial observation X  Output  cascade T: a tree in G   (v, t)  X, s reaches v by exactly t edges  minimize:  Variants  Weighted, PCT w  Non-weighted, PCT min A 0 B 1 C 3 D F T A 0B 1 D C 3Ed F G X={ (A, 0), (B, 1), (C, 2) } 0.8

Problem definition (cont.) 7 Minimum Bounded Consistent Tree (BCT)  Input  diffusion graph G  source s  partial observation X  Output  cascade T: a tree in G   (v, t)  X, s reaches v by  t edges  minimize:  Variants  Weighted, BCT w  Non-weighted, BCT min A 0 B 1 C 3 David T A 0B 1 D C 3Ed F G X={ (A, 0), (B, 1), (C, 2) } 0.8

Perfect consistent tree (PCT) 8  Complexity of PCT w and PCT min  NP-complete  APX-hard  Heuristic: bottom-up search  Perfect consistent SP tree  Reduce to Steiner tree  Approximable in for PCT w  Approximable in for PCT min …… …… titi … t max t1t1 S t i+1 … … … … Shortest Path

Bounded consistent tree (BCT) 9  Complexity of BCT w and BCT min  NP-complete  Approximable in for BCT w  Approximable in for BCT min  Approximating BCT w and BCT min  (v, t)  X, connect v to s with the shortest path

Datasets 10  Enron  G: 86,808 nodes and 660,642 edges  260 cascades (depth  3, #nodes  8)  Twitter hashtag retweets  G:  17M nodes,  1,400M edges,  70M Retweets  321 cascades (depth  4, #nodes  10)  Synthetic cascades on Facebook  G:  3M nodes,  28M edges  Diffusion functions are learned.

Metrics 11  Partial observation  define uncertainty  randomly remove nodes from T and add them into X until  is reached.  Performance metric  precision: rate of #correct estimates to # estimates  recall: rate of # correct estimates to # missing nodes

Effectiveness on Perfect Consistent Tree 12

Effectiveness on Bounded Consistent Tree 13

Scalability on PCT and BCT 14

Conclusions 15  Defined the cascade structure inference problem  perfect and bounded consistent trees;  metrics for quality of consistent trees  Investigated the cascade structure inference problem  Analyzed the problem complexity  Proposed approximations and heuristics  Experimental results verify the effectiveness and efficiency of our solutions

Thanks 16  Questions?