Multidimensional Index Structures One dimensional index structures assume a single search key, and retrieve records that match a given search-key value.

Slides:



Advertisements
Similar presentations
Circle Theorems Bingo!. Use any 9 of these numbers 105 o 240 o 55 o 14 o 65 o 50 o 90 o 45 o 60 o 40 o 110 o 155 o 15 o 74 o 120 o 12 o.
Advertisements

Artrelle Fragher & Robert walker. 1 you look for the median 1 you look for the median 2 then you look for the min and max 2 then you look for the min.
Name: Date: Read temperatures on a thermometer Independent / Some adult support / A lot of adult support
Ozone Level ppb (parts per billion)
Database Management Systems, R. Ramakrishnan and J. Gehrke1 File Organizations and Indexing Chapter 8 How index-learning turns no student pale Yet holds.
Objective: Know the sum of angles at a point, on a straight line and in a triangle. Vocabulary:
/4/2010 Box and Whisker Plots Objective: Learn how to read and draw box and whisker plots Starter: Order these numbers.
File Organizations and Indexing Chapter 8
Making Changes in the Balance Sheet UNIT 1C A Lot to Learn.
Introduction to Indexes Rui Zhang The University of Melbourne Aug 2006.
Shade the rectangles with correct answer with a colour pencil of your choice.
Comparison of X-ray diffraction patterns of La 2 CuO 4+   from different crystals at room temperature Pia Jensen.
CpSc 3220 File and Database Processing Lecture 17 Indexed Files.
Introduction to Database Systems1 Records and Files Storage Technology: Topic 3.
CS4432: Database Systems II Hash Indexing 1. Hash-Based Indexes Adaptation of main memory hash tables Support equality searches No range searches 2.
©Silberschatz, Korth and Sudarshan12.1Database System Concepts Chapter 12: Part C Part A:  Index Definition in SQL  Ordered Indices  Index Sequential.
Multidimensional Indexing
Access Methods for Advanced Database Applications.
File Processing : Hash 2015, Spring Pusan National University Ki-Joune Li.
CS 245Notes 51 CS 245: Database System Principles Hector Garcia-Molina Notes 5: Hashing and More.
Multidimensional Data. Many applications of databases are "geographic" = 2­dimensional data. Others involve large numbers of dimensions. Example: data.
Multidimensional Data
Multidimensional Data. Many applications of databases are "geographic" = 2­dimensional data. Others involve large numbers of dimensions. Example: data.
COMP 451/651 B-Trees Size and Lookup Chapter 1.
Spatial Indexing I Point Access Methods. PAMs Point Access Methods Multidimensional Hashing: Grid File Exponential growth of the directory Hierarchical.
BTrees & Bitmap Indexes
Multiple-key indexes Index on one attribute provides pointer to an index on the other. If V is a value of the first attribute, then the index we reach.
1 File Organizations and Indexing Module 4, Lecture 2 “How index-learning turns no student pale Yet holds the eel of science by the tail.” -- Alexander.
Spring 2003 ECE569 Lecture ECE 569 Database System Engineering Spring 2003 Yanyong Zhang
Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial Access Methods (SAMs) PAM: index only point data Hierarchical.
Spatial Indexing I Point Access Methods.
1 Geometric index structures April 15, 2004 Based on GUW Chapter , [Arge01] Sections 1, 2.1 (persistent B- trees), 3-4 (static versions.
CS 245Notes 51 CS 245: Database System Principles Hector Garcia-Molina Notes 5: Hashing and More.
COMP 451/651 Multiple-key indexes
Spring 2004 ECE569 Lecture ECE 569 Database System Engineering Spring 2004 Yanyong Zhang
CS 277 – Spring 2002Notes 51 CS 277: Database System Implementation Arthur Keller Notes 5: Hashing and More.
CS 4432lecture #71 CS4432: Database Systems II Lecture #7 Professor Elke A. Rundensteiner.
Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial Access Methods (SAMs) PAM: index only point data Hierarchical.
CPSC-608 Database Systems Fall 2008 Instructor: Jianer Chen Office: HRBB 309B Phone: Notes #9.
Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial Access Methods (SAMs) PAM: index only point data Hierarchical.
Multidimensional Data Many applications of databases are ``geographic'' = 2­dimensional data. Others involve large numbers of dimensions. Example: data.
Layers of a DBMS Query optimization Execution engine Files and access methods Buffer management Disk space management Query Processor Query execution plan.
Database Management Systems, R. Ramakrishnan and J. Gehrke1 File Organizations and Indexing Chapter 8.
Indexing for Multidimensional Data An Introduction.
Database Management Systems, R. Ramakrishnan and J. Gehrke1 File Organizations and Indexing Chapter 8 “How index-learning turns no student pale Yet holds.
Multidimensional Indexes Applications: geographical databases, data cubes. Types of queries: –partial match (give only a subset of the dimensions) –range.
Hashing and Hash-Based Index. Selection Queries Yes! Hashing  static hashing  dynamic hashing B+-tree is perfect, but.... to answer a selection query.
CS 245Notes 51 CS 245: Database System Principles Hector Garcia-Molina Notes 5: Hashing and More.
Database Management Systems, R. Ramakrishnan and J. Gehrke1 Hash-Based Indexes Chapter 10.
Layers of a DBMS Query optimization Execution engine Files and access methods Buffer management Disk space management Query Processor Query execution plan.
Spatial Indexing Techniques Introduction to Spatial Computing CSE 5ISC Some slides adapted from Spatial Databases: A Tour by Shashi Shekhar Prentice Hall.
Chapter 5 Multidimensional Indexes. One dimensional index can be used to support multidimensional query. F1=‘abcd’ F2= 123‘abcd#123’
1 CPS216: Advanced Database Systems Notes 05: Operators for Data Access (contd.) Shivnath Babu.
Multi-dimensional Search Trees CS302 Data Structures Modified from Dr George Bebis.
File Processing : Multi-dimensional Index 2015, Spring Pusan National University Ki-Joune Li.
Chapter 5 Record Storage and Primary File Organizations
Grid Files Multi-dimensional Index Structures. Jaruloj Chongstitvatana 2006Grid Files 2 Properties of Grid Files  Support multi-dimensional data, but.
1 Ullman et al. : Database System Principles Notes 5: Hashing and More.
Database Management Systems, R. Ramakrishnan and J. Gehrke1 File Organizations and Indexing Chapter 8 Jianping Fan Dept of Computer Science UNC-Charlotte.
CPSC 8620Notes 61 CPSC 8620: Database Management System Design Notes 6: Hashing and More.
Multidimensional Access Structures COMP3017 Advanced Databases Dr Nicholas Gibbins –
Em Spatiotemporal Database Laboratory Pusan National University File Processing : Hash 2004, Spring Pusan National University Ki-Joune Li.
Indexing Goals: Store large files Support multiple search keys
Multidimensional Access Structures
Spatial Indexing I Point Access Methods.
Multidimensional Indexes
2018, Spring Pusan National University Ki-Joune Li
File Processing : Multi-dimensional Index
CPSC-608 Database Systems
File Organizations and Indexing
Presentation transcript:

Multidimensional Index Structures One dimensional index structures assume a single search key, and retrieve records that match a given search-key value. (The search key can be a single field or a combination of fields.) One dimensional index structures assume a single search key, and retrieve records that match a given search-key value. (The search key can be a single field or a combination of fields.) Many applications, e.g. GIS, OLAP, require us to view data as existing in a space of two or more dimensions. Many applications, e.g. GIS, OLAP, require us to view data as existing in a space of two or more dimensions.

Multidimensional Index Structures The queries to be supported on such data: The queries to be supported on such data: partial-match queries: specify values for a subset of the dimensions partial-match queries: specify values for a subset of the dimensions range queries: give the range for each dimension. range queries: give the range for each dimension. nearest-neighbor queries: ask for the closest point to the given point. nearest-neighbor queries: ask for the closest point to the given point.

Grid File The grid file slices the space of points in each of the dimensions. The grid file slices the space of points in each of the dimensions. Each of the regions can be thought of as a bucket, and each of the points in that region has its record placed in a block belonging to that bucket. If needed, overflow blocks can be used to increase the size of a bucket. Each of the regions can be thought of as a bucket, and each of the points in that region has its record placed in a block belonging to that bucket. If needed, overflow blocks can be used to increase the size of a bucket.

Age Salary * * * * * * * * * * ** Age Salary

25, 60 25, , , 75 45, 60 50, , , , , , ,

Searching A grid directory consists of a k- dimensional array (grid array) and k one- dimensional arrays (linear scales). A grid directory consists of a k- dimensional array (grid array) and k one- dimensional arrays (linear scales). To search for a record: To search for a record: determine the positions of the record in each of the dimensions according to the linear scales determine the positions of the record in each of the dimensions according to the linear scales locate the proper bucket in the grid array locate the proper bucket in the grid array access the data block access the data block

Insertion Locate the bucket for the new record, then place the record in the block of the bucket if there is room. Locate the bucket for the new record, then place the record in the block of the bucket if there is room. If no room, two general approaches: If no room, two general approaches: add an overflow block as needed; add an overflow block as needed; refine the grid partition by splitting blocks; there is a choice of dimension and point of split. refine the grid partition by splitting blocks; there is a choice of dimension and point of split.

Performance Assumption: Assumption: the bucket matrix can be kept in main memory the bucket matrix can be kept in main memory the array of partitioning values in each dimension can be kept in main memory the array of partitioning values in each dimension can be kept in main memory Point query Point query the number of I/Os is what is necessary to read the bucket. the number of I/Os is what is necessary to read the bucket. For insertion and deletion, an additional disk write is needed. For insertion and deletion, an additional disk write is needed.

More about Grid File Partial-match query Partial-match query Range query Range query Nearest-neighbor query Nearest-neighbor query Reference: Reference: J. Nievergelt, et al., The grid file: an adaptable, symmetric, multikey file structure, ACM Trans. on Database Systems 9:1(1984), pp J. Nievergelt, et al., The grid file: an adaptable, symmetric, multikey file structure, ACM Trans. on Database Systems 9:1(1984), pp