Continuous k-dominant Skyline Query Processing. Presented by Prasad Sriram, Nilu Thakur.

Presentation transcript:

1 Continuous k-dominant Skyline Query Processing Presented by Prasad Sriram Nilu Thakur

2 Outline
- Introduction
- Problem definition
- Key concepts
- Validation
- Rewrite today

3 Example Skyline
Finding the skyline of hotels: a lower price and a shorter distance to the beach are better.
- Which one is better, e or b? (e, because its price and distance dominate those of b)
- c or f?
[Figure: hotels a to f plotted by distance against price]

4 Problem Definition
Input
- A set of points p_1, p_2, …, p_n
Output
- A set of points P (referred to as the skyline points), such that no point p_i ∈ P is dominated by any other point in the dataset
Objective
- Provide correct and complete results
- Minimize the query response time and memory consumption
- Continuous queries require continuous evaluation
- Scalability in terms of the number of queries
Constraints
- Minimize the number of dominance checks
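The problem definition above can be sketched as a brute-force skyline computation. This is a minimal illustration, not the presenters' method: the hotel values are hypothetical, the names `dominates` and `skyline` are made up for this sketch, and smaller values are assumed to be better on every dimension.

```python
from typing import List, Tuple

Point = Tuple[float, ...]


def dominates(p: Point, q: Point) -> bool:
    """p dominates q: no worse on every dimension, strictly better on one.

    Assumption: smaller is better (e.g. price, distance to the beach).
    """
    no_worse = all(a <= b for a, b in zip(p, q))
    strictly_better = any(a < b for a, b in zip(p, q))
    return no_worse and strictly_better


def skyline(points: List[Point]) -> List[Point]:
    """Keep every point that no other point dominates (O(n^2) dominance checks)."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


# Hypothetical hotels as (price, distance_to_beach); not taken from the slides.
hotels = [(50, 8), (90, 6), (70, 3), (120, 2), (80, 5), (150, 1)]
result = skyline(hotels)
print(result)
```

Each point is compared against every other point here, which is exactly the cost the "minimize the number of dominance checks" constraint is aimed at.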

5 Skyline Properties (1/2)
Meaningful for incomparable dimensions
- Browsing laptops: price, weight, size, memory, etc.
- Insensitive to scaling and shifting of the dimensions
Skyline: curse of dimensionality
- Movie rating: different users may have different rating preferences
- Movie p is better than q only if p is rated higher than or equal to q by all users
- One outlier opinion will invalidate the dominance

6 Skyline Properties (2/2)
Too many skyline points in high-dimensional spaces
- Example: NBA data set, player season statistics on 17 attributes
- Over 1000 skyline points in the full space
- Some average-skilled players are in the skyline if they are not bad on some attributes
Possible solutions
- Dimension reduction techniques: require domain knowledge
- Subspace skylines: many subspaces need to be explored
- Relax the notion of d-dominance: k-dominance

7 k-dominant Skyline
k-Dominate
- If A is not worse than B on k dimensions, and better on at least one of those k dimensions, we say A k-dominates B.
k-Dominant Skyline
- The k-dominant skyline contains all the points that are not k-dominated by any other point.
k-Dominant Skyline Query
- Given a data set, find the k-dominant skyline.
- When k = d, we have the conventional skyline.
- k-dominance can be cyclic, unlike d-dominance.
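The definitions above translate into a short check. The following is a sketch under stated assumptions: smaller values are better, the function names are hypothetical, and the three sample points are invented for illustration. A point p k-dominates q exactly when p is no worse than q on at least k dimensions and strictly better on at least one dimension, because any strictly better dimension can always be included in the chosen k.

```python
from typing import List, Tuple

Point = Tuple[float, ...]


def k_dominates(p: Point, q: Point, k: int) -> bool:
    """p k-dominates q (smaller is better, an assumption of this sketch)."""
    not_worse = sum(a <= b for a, b in zip(p, q))
    strictly_better = any(a < b for a, b in zip(p, q))
    return not_worse >= k and strictly_better


def k_dominant_skyline(points: List[Point], k: int) -> List[Point]:
    """Points that are not k-dominated by any point in the data set."""
    return [p for p in points
            if not any(k_dominates(q, p, k) for q in points)]


# Hypothetical 3-dimensional data, not from the slides.
pts = [(1, 1, 9), (2, 2, 2), (3, 3, 3)]
full = k_dominant_skyline(pts, 3)      # k = d: conventional skyline
relaxed = k_dominant_skyline(pts, 2)   # k < d: relaxed dominance
print(full, relaxed)
```

With k = 3 the first two points survive; with k = 2 only (1, 1, 9) does, which matches the observation that a smaller k yields a smaller k-dominant skyline.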

8 k-dominant Skyline - Example (slide courtesy [2])
[Table: points p over dimensions d1 to d6, with the conventional skyline, the 5-dominant skyline, and the 4-dominant skyline marked]
Smaller k, smaller k-dominant skyline

9 Cyclic Properties of k-dominance
k-dominance can be cyclic
A 3-dominates B

     d1  d2  d3  d4
A     5   5   5   5
B     1   6   6   6
C     2   1   7   7
D     3   2   1   8

10 Cyclic Properties of k-dominance
B 3-dominates C (same table as slide 9)

11 Cyclic Properties of k-dominance
C 3-dominates D (same table as slide 9)

12 Cyclic Properties of k-dominance
D 3-dominates A (same table as slide 9)
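The cycle on slides 9 to 12 can be checked mechanically. The sketch below uses the four points from the slides and assumes smaller values are better, which is the reading under which all four stated dominance relations hold; the function name `k_dominates` is a label chosen for this sketch.

```python
from typing import Dict, Tuple

Point = Tuple[int, ...]


def k_dominates(p: Point, q: Point, k: int) -> bool:
    """p is no worse than q on at least k dimensions and strictly better on one."""
    not_worse = sum(a <= b for a, b in zip(p, q))
    strictly_better = any(a < b for a, b in zip(p, q))
    return not_worse >= k and strictly_better


# Values from the table on slides 9 to 12 (d1..d4).
points: Dict[str, Point] = {
    "A": (5, 5, 5, 5),
    "B": (1, 6, 6, 6),
    "C": (2, 1, 7, 7),
    "D": (3, 2, 1, 8),
}

cycle = [("A", "B"), ("B", "C"), ("C", "D"), ("D", "A")]
cycle_holds = all(k_dominates(points[x], points[y], 3) for x, y in cycle)

# Names of the points not k-dominated by any other point, for k = 3 and k = 4.
skyline_3 = [n for n, p in points.items()
             if not any(k_dominates(q, p, 3) for q in points.values())]
skyline_4 = [n for n, p in points.items()
             if not any(k_dominates(q, p, 4) for q in points.values())]
print(cycle_holds, skyline_3, skyline_4)
```

Because every point is 3-dominated by its predecessor in the cycle, the 3-dominant skyline of these four points is empty, while all four belong to the conventional (k = 4) skyline.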

13 Skyline Evaluation Techniques – A Taxonomy
- Static vs. continuous
- Index-based vs. non-index-based
- Euclidean distance vs. road network distance
- Geometric properties
- Ranked skyline queries
- Constrained skyline queries
- Enumerating queries
- k-dominating queries
- k-dominant queries

14 A naïve approach
Case 1: A new point arrives
- It is k-dominated by some points
- It k-dominates some points
Case 2: A point expires
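The slide only names the two cases, so the following is one possible reading of a naive approach, not the presenters' procedure: keep every live point in a sliding window and recompute the k-dominant skyline from scratch whenever it is requested. The class name, the time-based window, and the sample points are assumptions made for this sketch; smaller values are assumed to be better.

```python
from collections import deque
from typing import Deque, List, Tuple

Point = Tuple[float, ...]


def k_dominates(p: Point, q: Point, k: int) -> bool:
    not_worse = sum(a <= b for a, b in zip(p, q))
    strictly_better = any(a < b for a, b in zip(p, q))
    return not_worse >= k and strictly_better


class NaiveKDominantWindow:
    """Hypothetical helper: sliding window with full recomputation."""

    def __init__(self, k: int, window: int) -> None:
        self.k = k
        self.window = window
        self.points: Deque[Tuple[int, Point]] = deque()  # (arrival_time, point)

    def expire(self, now: int) -> None:
        # Case 2: a point expires once arrival_time + window <= now.
        while self.points and self.points[0][0] + self.window <= now:
            self.points.popleft()

    def insert(self, t: int, p: Point) -> None:
        # Case 1: a new point arrives (arrival times must be non-decreasing).
        self.expire(t)
        self.points.append((t, p))

    def skyline(self) -> List[Point]:
        # Recompute against every live point: O(n^2) dominance checks.
        live = [p for _, p in self.points]
        return [p for p in live
                if not any(k_dominates(q, p, self.k) for q in live)]


w = NaiveKDominantWindow(k=2, window=3)
w.insert(1, (1, 1, 9))
w.insert(2, (2, 2, 2))
w.insert(3, (3, 3, 3))
before = w.skyline()
w.insert(4, (4, 4, 4))   # the point that arrived at t = 1 expires here
after = w.skyline()
print(before, after)
```

All live points are retained, including dominated ones, because an expiring point can promote a previously dominated point back into the result.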

15 An improved approach
Skyline heap: a(1), b(3), c(5), d(7), e(9), f(11), g(13)
Non-skyline heap: (no entries shown)

16 An improved approach
Skyline heap: a(1), b(3), c(5), d(7), e(9), f(11), g(13)
Non-skyline heap: h(15) → h(26)
Event list: a 16 DIS; b 18 DIS; c 20 DIS; d 22 DIS; e 24 DIS; f 26 DIS; g 28 DIS; h 26 RET

17 An improved approach (at t = 16)
Skyline heap: b(3), d(7), c(5), e(9), f(11), g(13)
Non-skyline heap: h(26)
Event list: b 18 DIS; c 20 DIS; d 22 DIS; e 24 DIS; f 26 DIS; g 28 DIS; h 26 RET

18 An improved approach
Skyline heap: b(3), d(7), c(5), e(9), f(11), g(13)
Non-skyline heap: h(26)
Event list: b 18 DIS; c 20 DIS; d 22 DIS; e 24 DIS; f 26 DIS; g 28 DIS; i 20 RET
New point: i(17) → i(20)

19 An improved approach (at t = 18)
Skyline heap: c(5), d(7), f(11), e(9), g(13)
Non-skyline heap: i(20)
Event list: c 20 DIS; d 22 DIS; e 24 DIS; f 26 DIS; g 28 DIS; i 20 RET

20 An improved approach
Skyline heap: c(5), d(7), f(11), e(9), g(13)
Non-skyline heap: i(20)
Event list: c 20 DIS; d 22 DIS; e 24 DIS; f 26 DIS; g 28 DIS; i 20 RET
New point: j(19)

21 An improved approach
Skyline heap: c(5), d(7), f(11), e(9), g(13)
Non-skyline heap: i(20)
Event list: c 20 DIS; d 22 DIS; e 24 DIS; f 26 DIS; g 28 DIS; i 20 RET; j 32 RET
New point: j(32)
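The event list in the walkthrough above can be held in a time-ordered priority queue. This is only a sketch of that one data structure, not the improved algorithm itself: the entries are the ones shown on slide 21, the DIS and RET tags are carried as opaque labels because the slides do not define them, and the helper name `pop_due` is hypothetical.

```python
import heapq
from typing import List, Tuple

Event = Tuple[int, str, str]  # (time, point name, tag as printed on the slide)

# Event list from slide 21.
events: List[Event] = [
    (20, "c", "DIS"), (22, "d", "DIS"), (24, "e", "DIS"),
    (26, "f", "DIS"), (28, "g", "DIS"), (20, "i", "RET"), (32, "j", "RET"),
]

heap = list(events)
heapq.heapify(heap)  # min-heap ordered by time, then by point name


def pop_due(queue: List[Event], now: int) -> List[Event]:
    """Remove and return every event whose time is <= now, earliest first."""
    due = []
    while queue and queue[0][0] <= now:
        due.append(heapq.heappop(queue))
    return due


due_at_20 = pop_due(heap, 20)
print(due_at_20, len(heap))
```

Only the events scheduled for the current time are touched, so nothing has to be scanned between events; ties at the same time are broken here by point name, which is an artifact of tuple ordering rather than something the slides specify.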

22 Validations
Methodology
- Theorem-based proofs of correctness and completeness
- Experiments to analyze performance
Validation criteria
- Query response time

23 Experimental Analysis

24 Rewrite today
Improvements
- A better technique for k-dominance
- Conduct detailed experiments with network object generators
- Think about how to find (spatial) skylines in road networks

25 References
1. Yufei Tao, Dimitris Papadias: Maintaining Sliding Window Skylines on Data Streams. IEEE Trans. Knowl. Data Eng. 18(2), 2006.
2. Chee Yong Chan, H. V. Jagadish, Kian-Lee Tan, Anthony K. H. Tung, Zhenjie Zhang: Finding k-dominant skylines in high dimensional space. SIGMOD Conference 2006.
3. M. Sharifzadeh, C. Shahabi: The Spatial Skyline Queries. In Proceedings of VLDB.
4. Michael D. Morse, Jignesh M. Patel, William I. Grosky: Efficient Continuous Skyline Computation. ICDE 2006.
5. Zhiyong Huang, Hua Lu, Beng Chin Ooi, Anthony K. H. Tung: Continuous Skyline Queries for Moving Objects. IEEE Transactions on Knowledge and Data Engineering, vol. 18, no. 12, Dec.
6. S. Borzsonyi, D. Kossmann, K. Stocker: The Skyline Operator. In Proceedings of ICDE.
7. D. Kossmann, F. Ramsak, S. Rost: Shooting Stars in the Sky: An Online Algorithm for Skyline Queries. In Proceedings of VLDB'02.