Finding Skyline Nodes in Large Networks. Evaluation Metrics:  Distance from the query node. (John)  Coverage of the Query Topics. (Big Data, Cloud Computing,

Slides:



Advertisements
Similar presentations
Towards Data Mining Without Information on Knowledge Structure
Advertisements

TWO STEP EQUATIONS 1. SOLVE FOR X 2. DO THE ADDITION STEP FIRST
Online Max-Margin Weight Learning with Markov Logic Networks Tuyen N. Huynh and Raymond J. Mooney Machine Learning Group Department of Computer Science.
1 Copyright © 2010, Elsevier Inc. All rights Reserved Fig 2.1 Chapter 2.
1 Copyright © 2013 Elsevier Inc. All rights reserved. Chapter 38.
1 Chapter 40 - Physiology and Pathophysiology of Diuretic Action Copyright © 2013 Elsevier Inc. All rights reserved.
By D. Fisher Geometric Transformations. Reflection, Rotation, or Translation 1.
Business Transaction Management Software for Application Coordination 1 Business Processes and Coordination.
and 6.855J Cycle Canceling Algorithm. 2 A minimum cost flow problem , $4 20, $1 20, $2 25, $2 25, $5 20, $6 30, $
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
0 - 0.
ALGEBRAIC EXPRESSIONS
DIVIDING INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
MULTIPLYING MONOMIALS TIMES POLYNOMIALS (DISTRIBUTIVE PROPERTY)
ADDING INTEGERS 1. POS. + POS. = POS. 2. NEG. + NEG. = NEG. 3. POS. + NEG. OR NEG. + POS. SUBTRACT TAKE SIGN OF BIGGER ABSOLUTE VALUE.
MULTIPLICATION EQUATIONS 1. SOLVE FOR X 3. WHAT EVER YOU DO TO ONE SIDE YOU HAVE TO DO TO THE OTHER 2. DIVIDE BY THE NUMBER IN FRONT OF THE VARIABLE.
SUBTRACTING INTEGERS 1. CHANGE THE SUBTRACTION SIGN TO ADDITION
MULT. INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
FACTORING Think Distributive property backwards Work down, Show all steps ax + ay = a(x + y)
Addition Facts
ALGEBRAIC EXPRESSIONS
Query optimisation.
ZMQS ZMQS
Improved Shortest Path Algorithms for Nearly Acyclic Directed Graphs L. Tian and T. Takaoka University of Canterbury New Zealand 2007.
Algorithms for Geometric Covering and Piercing Problems Robert Fraser PhD defence Nov. 23, 2012.
Ken C. K. Lee, Baihua Zheng, Huajing Li, Wang-Chien Lee VLDB 07 Approaching the Skyline in Z Order 1.
BT Wholesale October Creating your own telephone network WHOLESALE CALLS LINE ASSOCIATED.
ABC Technology Project
Compressing Forwarding Tables Ori Rottenstreich (Technion, Israel) Joint work with Marat Radan, Yuval Cassuto, Isaac Keslassy (Technion, Israel) Carmi.
Clustered Pivot Tables for I/O-optimized Similarity Search Juraj Moško, Jakub Lokoč, Tomáš Skopal Department of Software Engineering Faculty of Mathematics.
© 2009 IBM Corporation IBM Research Xianglong Liu 1, Junfeng He 2,3, and Bo Lang 1 1 Beihang University, Beijing, China 2 Columbia University, New York,
1 Comnet 2010 Communication Networks Recitation 4 Scheduling & Drop Policies.
1 Undirected Breadth First Search F A BCG DE H 2 F A BCG DE H Queue: A get Undiscovered Fringe Finished Active 0 distance from A visit(A)
© S Haughton more than 3?
Association Rule Mining
1 Breadth First Search s s Undiscovered Discovered Finished Queue: s Top of queue 2 1 Shortest path from s.
1 Directed Depth First Search Adjacency Lists A: F G B: A H C: A D D: C F E: C D G F: E: G: : H: B: I: H: F A B C G D E H I.
Twenty Questions Subject: Twenty Questions
Linking Verb? Action Verb or. Question 1 Define the term: action verb.
Energy & Green Urbanism Markku Lappalainen Aalto University.
Computer Science and Engineering Diversified Spatial Keyword Search On Road Networks Chengyuan Zhang 1,Ying Zhang 2,1,Wenjie Zhang 1, Xuemin Lin 3,1, Muhammad.
EN U1 3 6 EN U2 3 6 Pin 5 Pin 1 +3v3 100n RJ XFEL C-C Test Daughter Card-A, Schem-1 : OUTPUTS MP-UCL, DS90LV001 1k +3v3 100R.
YO-YO Leader Election Lijie Wang
Rahnuma Islam Nishat Debajyoti Mondal Md. Saidur Rahman Graph Drawing and Information Visualization Laboratory Department of Computer Science and Engineering.
1 Weiren Yu 1,2, Xuemin Lin 1, Wenjie Zhang 1 1 University of New South Wales 2 NICTA, Australia Towards Efficient SimRank Computation over Large Networks.
AN ARITHMETIC and A GEOMETRY SERIES AN ARITHMETIC and A GEOMETRY SERIES.
Past Tense Probe. Past Tense Probe Past Tense Probe – Practice 1.
Faster Query Answering in Probabilistic Databases using Read-Once Functions Sudeepa Roy Joint work with Vittorio Perduca Val Tannen University of Pennsylvania.
This, that, these, those Number your paper from 1-10.
Chapter 5 Test Review Sections 5-1 through 5-4.
Lecture 4 vector data analysis. 2014年10月11日 2014年10月11日 2014年10月11日 2 Introduction Based on the objects,such as point,line and polygon Based on the objects,such.
1 First EMRAS II Technical Meeting IAEA Headquarters, Vienna, 19–23 January 2009.
Addition 1’s to 20.
25 seconds left…...
Test B, 100 Subtraction Facts
U1A L1 Examples FACTORING REVIEW EXAMPLES.
1 Scheduling Crossbar Switches Who do we chose to traverse the switch in the next time slot? N N 11.
Week 1.
Vincent S. Tseng, Cheng-Wei Wu, Bai-En Shie, and Philip S. Yu SIG KDD 2010 UP-Growth: An Efficient Algorithm for High Utility Itemset Mining 2010/8/25.
We will resume in: 25 Minutes.
1 Unit 1 Kinematics Chapter 1 Day
Salvatore Ruggieri SIGKDD2010 Frequent Regular Itemset Mining 2010/9/2 1.
Choosing an Order for Joins
Sequential PAttern Mining using A Bitmap Representation
1/22 Worst and Best-Case Coverage in Sensor Networks Seapahn Meguerdichian, Farinaz Koushanfar, Miodrag Potkonjak, and Mani Srivastava IEEE TRANSACTIONS.
Chapter 30 Induction and Inductance In this chapter we will study the following topics: -Faraday’s law of induction -Lenz’s rule -Electric field induced.
Computer Science and Engineering Inverted Linear Quadtree: Efficient Top K Spatial Keyword Search Chengyuan Zhang 1,Ying Zhang 1,Wenjie Zhang 1, Xuemin.
Efficient Progressive Processing of Skyline Queries in Peer-to-Peer Systems INFOSCALE’06.
Efficient Processing of Top-k Spatial Preference Queries
Presentation transcript:

Finding Skyline Nodes in Large Networks

Evaluation Metrics:  Distance from the query node. (John)  Coverage of the Query Topics. (Big Data, Cloud Computing, Map Reduce) Motivation Finding Skyline Nodes in Large Networks 2

Homogeneous Approach ? Finding Skyline Nodes in Large Networks 3 Score = λ. Distance + (1- λ ). Coverage How to get λ ?

Weighted Set Cover ? Finding Skyline Nodes in Large Networks 4  Find nodes with smallest aggregate distance from the query node, such that they cover all query topics.  Ignore some interesting nodes.  Cannot rank the results. abc abcacd abcde Q = { a, b, c } u1u1 u2u2 u3u3 u4u4 u5u5 u6u6 u7u7 u8u8 u 0 = q

Graph Skyline Finding Skyline Nodes in Large Networks 5  Dominance on Coverage: u > c v Query topics covered by node u is a superset of the query topics covered by node v.  Dominance on Distance: u > d v Distance of u from q is less than that of v from q.  Dominance: u > v (1) u > c v and u ≥ d v ; or (2) u ≥ c v and u > d v. abc abcacd abcde Q = { a, b, c } u1u1 u2u2 u3u3 u4u4 u5u5 u6u6 u7u7 u8u8 u 0 = q

Ranking of Skyline Nodes Finding Skyline Nodes in Large Networks 6  Too many skyline nodes.  Rank them. abc abcacd abcde Q = { a, b, c } u1u1 u2u2 u3u3 u4u4 u5u5 u6u6 u7u7 u8u8 u 0 = q  Dominance Count: # nodes dominated by a skyline node. [Lin et. al., ICDE ‘07]  Higher Dominance Count => more pruning from candidate set.  1. DC(u 4 ) = {u 5, u 6, u 7 }, 2. DC(u 1 ) = {u 5 } 3. DC(u 2 ) = Φ; 4. DC(u 3 ) = Φ

Algorithm Finding Skyline Nodes in Large Networks 7  Construct a Query DAG.  Three variables associated with each DAG node: Count (C), Dominance (D), Traversal (T). abc abcacd abcde Q = { a, b, c } u1u1 u2u2 u3u3 u4u4 u5u5 u6u6 u7u7 u8u8 u 0 = q abc abacbc a bc Input NetworkQuery DAG  Naïve Complexity: O(n2 r )  Complexity with Preprocessing: O(nr 2 ) C = 0 D = - T = - C = 2 D = - T = - C = 0 D = - T = - C = 2 D = - T = - C = 0 D = - T = - C = 1 D = - T = - C = 2 D = - T = -

Query DAG Construction Finding Skyline Nodes in Large Networks 8 abc abcacd abcde Q = { a, b, c } u1u1 u2u2 u3u3 u4u4 u5u5 u6u6 u7u7 u8u8 u 0 = q ab a b c u4u4 u7u7 u1u1 u5u5 u2u2 u3u3 u4u4 u6u6 u7u7

Query DAG Construction (cont.) Finding Skyline Nodes in Large Networks 9 abc abcacd abcde Q = { a, b, c } u1u1 u2u2 u3u3 u4u4 u5u5 u6u6 u7u7 u8u8 u 0 = q ab a b c u1u1 u5u5 u2u2 u3u3 u4u4 u6u6 u7u7 abc

Query DAG Construction (cont.) Finding Skyline Nodes in Large Networks 10 abc abcacd abcde Q = { a, b, c } u1u1 u2u2 u3u3 u4u4 u5u5 u6u6 u7u7 u8u8 u 0 = q ab ab c u1u1 u5u5 u2u2 u3u3 u4u4 u6u6 u7u7 abc ac bc

Find Dominance Variable Finding Skyline Nodes in Large Networks 11  Perform a topological ordering of the DAG nodes to evaluate the Dominance variable (D) of each DAG node.  # Nodes dominated (or equal) by coverage. abc abcacd abcde Q = { a, b, c } u1u1 u2u2 u3u3 u4u4 u5u5 u6u6 u7u7 u8u8 u 0 = q abc abacbc a bc Input NetworkQuery DAG  Naïve Complexity: O(n2 r )  Complexity by Topological Ordering: O(3 r ) C = 0 D = 3 T = - C = 2 D = 2 T = - C = 0 D = 4 T = - C = 2 D = 7 T = - C = 0 D = 3 T = - C = 1 D = 1 T = - C = 2 D = 2 T = -

Find Traversal Variable Finding Skyline Nodes in Large Networks 12  Perform a Breadth First Search (BFS) starting from the query node.  # Nodes not dominated by distance. abc abcacd abcde Q = { a, b, c } u1u1 u2u2 u3u3 u4u4 u5u5 u6u6 u7u7 u8u8 u 0 = q abc abacbc a bc Input NetworkQuery DAG  Complexity by BFS: O(n+e) C = 0 D = 3 T = 0 C = 2 D = 2 T = 2 C = 0 D = 4 T = 0 C = 2 D = 7 T = 1 C = 0 D = 3 T = 0 C = 1 D = 1 T = 1 C = 2 D = 2 T = 2 h =2

Find Skyline Nodes Finding Skyline Nodes in Large Networks 13  Store DAG nodes into a Lookup Table. Skyline Bit for each DAG node.  Helps to prune non-skyline nodes directly. abc abcacd abcde Q = { a, b, c } u1u1 u2u2 u3u3 u4u4 u5u5 u6u6 u7u7 u8u8 u 0 = q abc abacbc a bc Input Network Query DAG h =1 abc0 ab0 ac0 bc0 a1 b1 c1 Lookup Table abc

Find Skyline Nodes (cont.) Finding Skyline Nodes in Large Networks 14 abc abcacd abcde Q = { a, b, c } u1u1 u2u2 u3u3 u4u4 u5u5 u6u6 u7u7 u8u8 u 0 = q abc abacbc a bc Input Network Query DAG h =2 abc1 ab1 ac1 bc1 a1 b1 c1 Lookup Table  Store DAG nodes into a Lookup Table. Skyline Bit for each DAG node.  Helps to prune non-skyline nodes directly.

Dominance Count of Skyline Nodes Finding Skyline Nodes in Large Networks 15 abc abcacd abcde Q = { a, b, c } u1u1 u2u2 u3u3 u4u4 u5u5 u6u6 u7u7 u8u8 u 0 = q abc abacbc a bc Input Network Query DAG h =2 abc1 ab1 ac1 bc1 a1 b1 c1 Lookup Table C = 0 D = 3 T = 0 C = 2 D = 2 T = 1 C = 0 D = 4 T = 0 C = 2 D = 7 T = 0 C = 0 D = 3 T = 0 C = 1 D = 1 T = 1 C = 2 D = 2 T = 1  DC(u 4 ) = D(abc)-T(abc)-T(ab)-T(ac)-T(bc)-T(a)-T(b)-T(c)-1 = 3  Top-k Buffer to store top-k skyline nodes.

Pruning and Early Termination Finding Skyline Nodes in Large Networks 16  DC(u 4 ) = D(abc)-T(abc)-T(ab)-T(ac)-T(bc)-T(a)-T(b)-T(c)-1 = 3  Top-k Buffer to store top-k skyline nodes.

Experimental Results Finding Skyline Nodes in Large Networks 17  DC(u 4 ) = D(abc)-T(abc)-T(ab)-T(ac)-T(bc)-T(a)-T(b)-T(c)-1 = 3  Top-k Buffer to store top-k skyline nodes.

Efficiency Finding Skyline Nodes in Large Networks 18  DC(u 4 ) = D(abc)-T(abc)-T(ab)-T(ac)-T(bc)-T(a)-T(b)-T(c)-1 = 3  Top-k Buffer to store top-k skyline nodes.

Conclusion and Future Works Finding Skyline Nodes in Large Networks 19  DC(u 4 ) = D(abc)-T(abc)-T(ab)-T(ac)-T(bc)-T(a)-T(b)-T(c)-1 = 3  Top-k Buffer to store top-k skyline nodes.  Efficient Algorithm to find top-k skyline nodes in large attributed network.  Required experimental evaluation in real and synthetic datasets.  Time Complexity is linear in the number of nodes and edges in the network. Distance based indexing might improve the efficiency.  Top-k Skyline set instead of Top-k Skyline nodes might be more effective.

Questions Finding Skyline Nodes in Large Networks 20  DC(u 4 ) = D(abc)-T(abc)-T(ab)-T(ac)-T(bc)-T(a)-T(b)-T(c)-1 = 3  Top-k Buffer to store top-k skyline nodes. Thank You ! ! !