Download presentation

Presentation is loading. Please wait.

Published byDana Douglas Modified over 2 years ago

1
SIGMOD 2006 PAKDD 2009 Finding k-Dominant Skylines in High Dimensional Space K-Dominant Skyline Computation by Using Sort-Filtering Method 1

2
Outline 2 Motivation Definition Analysis One-scan Two-scan Sorted Retrieval Sort-Filtering Method Experimental Result Conclusion

3
Motivation The Number of skyline point may be huge in high dimensional space. A new concept, called k-dominant skyline to alleviate the effect of dimensionality curse on skyline query in high dimensional spaces. 3

4
Definition D-dimensional space A set of points : is a data set on S if every is a d-dimensional data point on S. Total order relationship, denoted, we assume > here. 4

5
Definition 5 p2 dominate p5.

6
Definition 6 p5 is dominated by p2. SP(D,S)={p1,p2,p3,p4}

7
Definition 7 Assume k=5 p1 is better than p4 on s1,s2,s3,s5,and s6. p1 5-dominants p4

8
Definition 8 p1 can’t be 5-dominanted by the other points p1 is 5-dominant skyline point.

9
Analysis User want most choose :k is bigger p1 5-dominates p4,and p1 4-dominates p4 9

10
Analysis 10 k=5, 5-dominate skyline points:p1,p2,p3 k=6, 6-dominate skyline points:p1,p2,p3,p4

11
One-Scan 11

12
One-Scan 12 Skyline point:P1,P2,P3,and P4.<-free skyline points. k=3, P1 3-dominante P2, P1 is 3-dominanted by P4 P2 3-dominante P3. P4 3-dominante P3. 3-dominate skyline point:P4 belongs free skyline point. P2 is not 3-dominate skyline point. It is 3-dominated by P1, but P1 is not 3-dominant skyline point. S1S2S3S4 P14448 P28334 P37822 P46783

13
One-Scan Thus, based on Lemma 4.1, our algorithm computes k- dominant skyline points by actually computing the free skyline points in D and using them to eliminate non-k- dominate skyline points. 1.R stores the set of intermediate k-dominant skyline points in D. 2. T stores the set of intermediate skyline points in D that are not k-dominant (i.e., not in R). Together, R ∪ T gives the set of skyline points in D. 13

14
One-Scan 14 For each point p in D, p is first compared against points in T. 1. If a point ∈ T is dominated by p( is not skyline),then remove from T. 2. If a point ∈ T dominates p(p is not skyline) or p= (p is not unique),then p is ignored. Case 1: p is unique skyline point, compared against points in R to check k-dominante.

15
One-Scan 15 For each point in R, 1.If p k-dominates,then is moved from R to T. 2.If k-dominates p, then p is not k- dominant End of p compared against points in R. P is not dominated-> insert to R P is dominated-> insert to T.

16
One-Scan 16 K=5 p1: initial p1 insert to R T:{}, R:{p1}

17
One-Scan 17 p2: T:{}, R:{p1}->T is empty,check point in R p2 is not 5-dominated by p1 and p1 is not 5-dominated by p2 and->p2 insert to R T:{}, R:{p1,p2}

18
One-Scan 18 p3: T:{}, R:{p1,p2}->T is empty,check point in R p3 is not 5-dominated by p1 or p2 and p1 and p2 are not 5-dominated by p3 ->p3 insert to R T:{}, R:{p1,p2,p3}

19
One-Scan 19 p4: T:{}, R:{p1,p2,p3}->T is empty,check point in R p4 is 5-dominated by p1, p2, and p3 ->p4 insert to T T:{p4}, R:{p1,p2,p3}

20
One-Scan 20 p5: T:{p4}, R:{p1,p2,p3}->check point in T p5 don’t dominates p4 and p4 don’t dominates p5 -> check point in R p5 is 5-dominated by p2 and p3->p5 insert to T T:{p4,p5}, R:{p1,p2,p3}

21
One-Scan 21 p5 is dominated by p2. p5 is not skyline, but it is in T.

22
Two Scan 22 In the One-Scan algorithm, free skyline points (i.e., T ) need to be maintained to compute the k-dominant skyline points. Scanning D twice avoid need to maintain T. Fist scan of D, computed a set of candidate k-dominant R. Base on Lemma 4.1 p2, false positive can exist in R. Second scan D-R to determine whether a point is indeed k-dominate skyline

23
Two Scan 23 k=3 First Scan: Initinal p1:insert to R R={p1} S1S2S3S4 P112925 P211 48 P310 86 P4108662

24
Two Scan 24 k=3 First Scan: p2 compared against point in R={p1}. p2 3-dominates p1 p1 remove from R, p2 is inserted to R. R={p2} S1S2S3S4 P112925 P211 48 P310 86 P4108662

25
Two Scan 25 k=3 First Scan: p3 compared against point in R={p2}. p3 is 3-dominated by p2, R={p2} S1S2S3S4 P112925 P211 48 P310 56 P4108662

26
Two Scan 26 k=3 First Scan: p4 compared against point in R={p2}. p2 is inserted to R R={p2,p4} S1S2S3S4 P112925 P211 48 P310 86 P4108662

27
Two Scan 27 k=3 Second Scan: R={p2,p4},D-R={p1,p3} choose p1 compared against point in R={p2,p4} R ={p2,p4}, S1S2S3S4 P112925 P211 48 P310 86 P4108662

28
Two Scan 28 k=3 Second Scan: R={p2,p4},D-R={p1,p3} choose p3 compared against point in R={p2,p4} p3 3-dominates p4 (false positive) remove p4 from R,R={p3} 3-dominant skyline point: p3 S1S2S3S4 P112925 P211 48 P310 86 P4108662

29
Sorted Retrieval 29

30
Initial T=D 4-dominate p3,p4 Remove p3, p4 from T. Sorted Retrieval 30

31
Sorted Retrieval 31

32
Sorted Retrieval 32 3=d-k+1=6-4+1 p1 is 4-dominant skyline point Moved from T to R

33
Sort-Filtering Method K-Dominant Skyline Algorithm: (From k=d calculation) 1.Domination Power Calculation 2.k-Dominant Checking 33

34
Sort-Filtering Method Domination Power Calculation Example : p(9,1,2) and q(3,2,3) :in 3D space Domination Power p=2, q=1 sum(p)=12, sum(q)=8 sum(p)>sum(q), but Domination Power p>q p is 2-dominated q. 34

35
Sort-Filtering Method Domination Power Calculation Calculate Domination Power and sum. 35

36
Sort-Filtering Method Domination Power Calculation 36

37
Sort-Filtering Method k-Dominant Checking Consider 5-dominant N 5,N 3,N 8,N 1,N 6 are 5-dominated by the first object N 2, remove 5-dominated objects,output N 2 37

38
Experimental Result 38

39
Conclusion Use domination power to find k-domination skyline? Choose k to reduce the number of k-dominant skyline points. 39

Similar presentations

OK

Machine Learning Chapter 2. Concept Learning and The General-to-specific Ordering Tom M. Mitchell.

Machine Learning Chapter 2. Concept Learning and The General-to-specific Ordering Tom M. Mitchell.

© 2018 SlidePlayer.com Inc.

All rights reserved.

Ads by Google

Download ppt on corruption Ppt on area of parallelogram and triangles worksheet Oled flexible display ppt on tv Ppt on blood stain pattern analysis history Ppt on ministry of corporate affairs india Download ppt on radioactive waste management Ppt on solid dielectrics effect Free ppt on mobile number portability solutions Ppt on maggi product Ppt on 9-11 conspiracy theories attacks