Selectivity Estimation Example Mohammad Farhan Husain.

Slides:



Advertisements
Similar presentations
1 gStore: Answering SPARQL Queries Via Subgraph Matching Presented by Guan Wang Kent State University October 24, 2011.
Advertisements

1 Evaluating Conjunctive Triple Pattern Queries over Large Structured Overlay Networks Erietta Liarou, Stratos Idreos, and Manolis Koubarakis Waled.
© Copyright IBM Corporation 2014 Getting started with Rational Engineering Lifecycle Manager queries Andy Lapping – Technical sales and solutions Joanne.
RDF and RDB 1 Some slides adapted from a presentation by Ivan Herman at the Semantic Technology & Business Conference, 2012.
SPARQL for Querying PML Data Jitin Arora. Overview SPARQL: Query Language for RDF Graphs W3C Recommendation since 15 January 2008 Outline: Basic Concepts.
David Konopnicki Choosing Access Path ä The basic methods. ä The access paths and when they are available. ä How the optimizer chooses among the.
Virtual techdays INDIA │ 9-11 February 2011 SQL 2008 Query Tuning Praveen Srivatsa │ Principal SME – StudyDesk91 │ Director, AsthraSoft Consulting │ Microsoft.
Improved Approximation Bounds for Planar Point Pattern Matching (under rigid motions) Minkyoung Cho Department of Computer Science University of Maryland.
Semantic Web Query Processing with Relational Databases Artem Chebotko Department of Computer Science Wayne State University.
Graph Data Management Lab, School of Computer Scalable SPARQL Querying of Large RDF Graphs Xu Bo
Section 11.2 Systems of Linear Equations
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
LINKED DATA AS TRANSFORMATION Philip E. Schreur Stanford University Coalition for Networked Information April 3, 2012 Philip Schreur/Stanford University.
Data Intensive Query Processing for Large RDF Graphs Using Cloud Computing Tools Mohammad Farhan Husain, Latifur Khan, Murat Kantarcioglu and Bhavani Thuraisingham.
ISBD for the Semantic Web: namespaces, elements, vocabularies, application profile Gordon Dunsire Presented at Centar zu Stalno Stručno Usavršavanje (CSSU),
KIT – University of the State of Baden-Württemberg and National Large-scale Research Center of the Helmholtz Association Institute of Applied Informatics.
Storage and Retrieval of Large RDF Graph Using Hadoop and MapReduce Mohammad Farhan Husain, Pankil Doshi, Latifur Khan, Bhavani Thuraisingham University.
Lesley Charles November 23, 2009.
3.4 Linear Programming p Optimization - Finding the minimum or maximum value of some quantity. Linear programming is a form of optimization where.
Projectile Motion Students will be able to match up projectile motion graphs with the correct verbal description.
Constant-Time LCA Retrieval Presentation by Danny Hermelin, String Matching Algorithms Seminar, Haifa University.
Chapter 4 Displaying Quantitative Data *histograms *stem-and-leaf plots *dotplot *shape, center, spread.
Random Sampling Approximations of E(X), p.m.f, and p.d.f.
Chapter 2 Organizing Data
Subjects and Predicates. Subject Tells whom or what the sentence is about Example: The black pan is very hot.
Counting nCr = n!/r!(n-r)!=nC(n-r) This equation reflects the fact that selecting r items is same as selecting n-r items in forming a combination from.
Alg 2 - Chapter 3 Jeopardy Solving Systems by Graphing Solving Systems Algebraically Graph & Solving Systems of Linear Inequalities Linear Programming.
Ontology based e-Real Estate Agency Information System By Moein Mehrolhasani Bijan Zamanian cmpe 588.
I can write and graph an equation of a direct variation.
Math Jeopardy Chapters 9-12 Measurement Geometry.
Module 2.9 Solving Addition Word Problems. In your group create a story problem to match your assigned equation. How do you know that the story matches.
Nonparametric Density Estimation Riu Baring CIS 8526 Machine Learning Temple University Fall 2007 Christopher M. Bishop, Pattern Recognition and Machine.
Continuous Random Variables Much of the material contained in this presentation can be found at this excellent website
File Processing : Query Processing 2008, Spring Pusan National University Ki-Joune Li.
Subject Predicate Subject Predicate Auxiliary verb.
Bin Packing Algorithms. Bin Packing Consider a set of bins, all the same cross section and height. The bin packing problem is to pack into the bins a.
. Suppose I had a protractor and I wanted to determine the uncertainty of a measurement I made with it. If I measured an angle to be 30 degrees and noticed.
Keyword Search over RDF Graphs
Basics of histograms and frequency tables
Selectivity Estimation Example
ASSIGNMENT NO.-2.
Temperature: Comparing degrees Celsius (C) and degrees Fahrenheit (F)
Accessing nearby copies of replicated objects
Twenty Questions Subject:.
Logics for Data and Knowledge Representation
3.2 Linear Programming 3 Credits AS
Using a histogram to estimate the median
Using a histogram to estimate the median
Equation Review Given in class 10/4/13.
Lu Xing CS59000GDM Sept 7th, 2018.
8.4 Linear Programming p
G-CORE: A Core for Future Graph Query Languages
Sungho Kang Yonsei University
Суури мэдлэг Basic Knowledge
Twenty Questions Subject:.
A set of 3 whole numbers that satisfy the equation
STORE MANAGER RESPONSIBILITIES.
Twenty Questions Subject:.
Equation Review.
Twenty Questions Subject:.
Twenty Questions Subject:.
Twenty Questions Subject:.
Twenty Questions Subject:.
Twenty Questions Subject:.
Switching Lemmas and Proof Complexity
' · · ,.-.., '' !'",. -,..._ ·-.·-...;.· -
X ⦁ X = 64 ±8 ±14 X ⦁ X ⦁ X =
Twenty Questions Subject:.
Twenty Questions Subject:.
Presentation transcript:

Selectivity Estimation Example Mohammad Farhan Husain

Example Data SubjectPredicateObject R1P1L1 R2P1L2 R3P1R4 R5P1R2 R6P1L3 R7P2L4 R8P2R1 R3P2L5 R1, R2, …, R8 are resources i.e. URIs P1 and P2 are predicates, also URIs L1, L2, …, L5 are literals R = Total number of unique resources = 8 T = Total number of triples = 8 T P1 = Total number of triples having predicate P1 = 5 T P2 = Total number of triples having predicate P2 = 3 For any query: Selectivity of a bound subject s = sel(s) = 1 / R = 1 / 8 = Selectivity of predicate P1 = sel(P1) = T P1 / T = 5 / 8 = Selectivity of predicate P2 = sel(P2) = T P2 / T = 3 / 8 = Selectivity of unbound subject and predicate and object = 1.0

Example Histogram for P1 Suppose there is a hash function which assigns the object values of triples having predicate P1 in two bins in the following manner: Bin 1 contains: L1, L2 and R2 Bin 2 contains: R4 and L3

Example Histogram for P2 Suppose the same hash function assigns the object values of triples having predicate P2 in two bins in the following manner: Bin 1 contains: L5 Bin 2 contains: L4 and R1

Estimation Approach – Base Equations EquationNotes sel(t) = sel(s) * sel(p) * sel(o)t refers to a triple pattern sel(s) = 1/RR - No. of unique Resources in knowledge store sel(p) = Tp/T T – Total No. of triples, Tp – Triples matching predicate p sel(o) = hc(p,oc)/Tpwhere (p,oc) represents the class of the histogram for predicate p in which object o falls sel(?a) = 1when ?a is unbound subject, predicate, or object

Selectivity Estimation for Triple Pattern Example with Bound Predicate Triple Pattern: ?s P1 L2 Estimated selectivity = sel(s) x sel(P1) x sel(L2) = 1.0 x x sel(P1, L2) = 1.0 x x (h 1 (P1, L2) / T P1 ) = 1.0 x x (Height of Bin 1 / T P1 ) = 1.0 x x (3 / 5) = Here, h 1 (P1, L2) denotes the bin of the histogram of predicate P1 where the hash function puts L2 in.

Selectivity Estimation for Triple Pattern Example with Unbound Predicate Triple Pattern: ?s ?p L2 Estimated selectivity = sel(s) x sel(p) x sel(L2) = 1.0 x 1.0 x {∑ Pi ϵ P sel(Pi, L2)} = 1.0 x 1.0 x {sel(P1, L2) + sel(P2, L2)} = 1.0 x 1.0 x {h 1 (P1, L2) / T P1 + h 1 (P2, L2) / T P2 } = 1.0 x 1.0 x {Height of Bin 1 of P1 Histogram / T P1 + Height of Bin 1 of P2 Histogram / T P2 } = 1.0 x 1.0 x {3 / / 3} = Note that the hash function always puts the value L2 into bin 1. That is why we pick the height of Bin 1 of the histogram for P2 even though P2 does not have the value L2 as its object in any of the triples.

Selectivity Estimation for Triple Pattern Example with Unbound Object Triple Pattern: ?s P1 ?o Estimated selectivity = sel(s) x sel(P1) x sel(o) = 1.0 x x 1.0 = 0.625