Rudolf Bayer Technische Universität München B-Trees and Databases, Past & Future.

Slides:



Advertisements
Similar presentations
Chapter 13: Query Processing
Advertisements

Analysis of Computer Algorithms
Vorlesung Datawarehousing Table of Contents Prof. Rudolf Bayer, Ph.D. Institut für Informatik, TUM SS 2002.
Kapitel 21 Astronomie Autor: Bennett et al. Galaxienentwicklung Kapitel 21 Galaxienentwicklung © Pearson Studium 2010 Folie: 1.
Chapter 1: The Database Environment
Chapter 6 Cost and Choice. Copyright © 2001 Addison Wesley LongmanSlide 6- 2 Figure 6.1 A Simplified Jam-Making Technology.
Foundations of Relational Implementation (1) IS 240 – Database Management Lecture #13 – Prof. M. E. Kabay, PhD, CISSP Norwich University
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Copyright © 2006 by The McGraw-Hill Companies,
Chapter 1 The Study of Body Function Image PowerPoint
Copyright © 2011, Elsevier Inc. All rights reserved. Chapter 5 Author: Julia Richards and R. Scott Hawley.
1 Copyright © 2010, Elsevier Inc. All rights Reserved Fig 2.1 Chapter 2.
By D. Fisher Geometric Transformations. Reflection, Rotation, or Translation 1.
Relational Database and Data Modeling
Business Transaction Management Software for Application Coordination 1 Business Processes and Coordination.
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Title Subtitle.
My Alphabet Book abcdefghijklm nopqrstuvwxyz.
0 - 0.
ALGEBRAIC EXPRESSIONS
DIVIDING INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
MULT. INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
Addition Facts
Year 6 mental test 5 second questions
Year 6 mental test 10 second questions Numbers and number system Numbers and the number system, fractions, decimals, proportion & probability.
Limitations of the relational model 1. 2 Overview application areas for which the relational model is inadequate - reasons drawbacks of relational DBMSs.
1 Term 2, 2004, Lecture 9, Distributed DatabasesMarian Ursu, Department of Computing, Goldsmiths College Distributed databases 3.
Chapter 6 File Systems 6.1 Files 6.2 Directories
ZMQS ZMQS
BT Wholesale October Creating your own telephone network WHOLESALE CALLS LINE ASSOCIATED.
Databasteknik Databaser och bioinformatik Data structures and Indexing (II) Fang Wei-Kleiner.
CS 440 Database Management Systems RDBMS Architecture and Data Storage 1.
Chapter 18 Methodology – Monitoring and Tuning the Operational System Transparencies © Pearson Education Limited 1995, 2005.
ABC Technology Project
Yong Choi School of Business CSU, Bakersfield
Database Modeling Past and Present
1 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. An Introduction to Data.
© Charles van Marrewijk, An Introduction to Geographical Economics Brakman, Garretsen, and Van Marrewijk.
Chapter 6 File Systems 6.1 Files 6.2 Directories
Squares and Square Root WALK. Solve each problem REVIEW:
Do you have the Maths Factor?. Maths Can you beat this term’s Maths Challenge?
Database System Concepts and Architecture
© 2012 National Heart Foundation of Australia. Slide 2.
Introduction to Databases
Executional Architecture
1 Databases 2013/2014 Introduction. 2 The menu for today Organisational aspects Introduction to database technology The relational model.
Chapter 5 Test Review Sections 5-1 through 5-4.
SIMOCODE-DP Software.
GG Consulting, LLC I-SUITE. Source: TEA SHARS Frequently asked questions 2.
Addition 1’s to 20.
25 seconds left…...
Test B, 100 Subtraction Facts
Week 1.
We will resume in: 25 Minutes.
©Brooks/Cole, 2001 Chapter 12 Derived Types-- Enumerated, Structure and Union.
Chapter 15 A Table with a View: Database Queries.
A SMALL TRUTH TO MAKE LIFE 100%
1 Unit 1 Kinematics Chapter 1 Day
TASK: Skill Development A proportional relationship is a set of equivalent ratios. Equivalent ratios have equal values using different numbers. Creating.
1 PART 1 ILLUSTRATION OF DOCUMENTS  Brief introduction to the documents contained in the envelope  Detailed clarification of the documents content.
How Cells Obtain Energy from Food
Management Information Systems, 10/e
Databases MMG508. DB Properties  Definition of a database: “A database is a collection of interrelated data items that are managed as a single unit”
Using Relational Databases and SQL Steven Emory Department of Computer Science California State University, Los Angeles Lecture 1: Introduction to Relational.
DATABASE & INFORMATION MODELS. Data vs Information Data: Collection of letters, numbers or facts Information: Processed data that provides value.
10/16/2015 1Yan Huang - Introduction Chapter 1: Introduction What is a DBMS? What is a DBMS? A little history of DB A little history of DB Major Components.
Chapter 15 Relational Implementation with DB2 David M. Kroenke Database Processing © 2000 Prentice Hall.
Performance. Performance Performance is a critical issue especially in a multi-user environment. Benchmarking is one way of testing this.
Latihan Create a separate table with the same structure as the Booking table to hold archive records. Using the INSERT statement, copy the records from.
Chapter 1 Introduction to Database Processing
Presentation transcript:

Rudolf Bayer Technische Universität München B-Trees and Databases, Past & Future

R. BayerB-Trees and Databases Factor main memory 200 KB 200 MB 10 3 cache 20 KB 20 MB 10 3 cache pages <10 3 disk size 7.5 MB 20 GB3*10 3 disk/memory size transfer rate 150 KB/s 15 MB/s 10 2 random access 50 ms 5 ms 10 scanning full disk 130 s 1300 s -10 (accessibility) Computing Technology in 1969 vs 2001

R. BayerB-Trees and Databases 3 Challenge of Applications in 1969 Space Industry Supersonic Transport SST C5A Boeing 747 Manufacturing parts explosion (spare) parts mangement Commerce bank check management credit card management

R. BayerB-Trees and Databases 4 Basics of B-Trees

R. BayerB-Trees and Databases 5 Basics of B-Trees: Insertion

R. BayerB-Trees and Databases 6 Basics of B-Trees: the Split

R. BayerB-Trees and Databases 7 Basics of B-Trees: recursive Split

R. BayerB-Trees and Databases 8 Basics of B-Trees: Growth at Root

R. BayerB-Trees and Databases 9 Scientific American 1984

R. BayerB-Trees and Databases 10 Fundamental Properties of B-Trees Time & I/O Complexity: O(log k n); k > 400 for all elementary operations find insert & delete Storage Utilization ~ 83% Growth: heightnodessize 11 8 KB MB 316* GB 464* GB < 4 logical I/O per operation !!

R. BayerB-Trees and Databases 11 Independence of DB Size cached... Index part 1% of file remember: since 1969 disk size memory size const 100 < 2 physical I/O per operation !!

R. BayerB-Trees and Databases 12 DB-Models in 1969 IMS: hierarchical, commercial success CODASYL: network model, M. Senko, C. Bachmann Relational: E. F. Codd, theory only Senko, Codd in same department: Information Systems Department IBM Research Lab, San José Senko Codd Efficiency Simplicity

R. BayerB-Trees and Databases 13 Relational DB-Model, Ted Codd Research in 1969, published in 1970 CACM Relational Algebra = Tables + Operators today: x set operators Codd: lossless restriction by table tie algebraic laws for query optimization (Codd does not mention this aspect)

R. BayerB-Trees and Databases 14 2 Languages imperative, procedural: algebraic expressions declarative, non-procedural: applied predicate calculus, DSL/Alpha (1971) no implementation of acceptable efficiency in sight!

R. BayerB-Trees and Databases 15 Hard Questions from which model? which language? which implementation? infighting, Codd to Systems Department defer decisions: rel. Storage System RSS to support all models and languages Leonard Liu: CS Dept 1974Cargese Workshop, Frank King

R. BayerB-Trees and Databases 16 Which Language? DL/I: IMS CODASYL: COBOL + pointer chasing and currency indicators, Chamberlin Rel. Algebra: Codd DSL/Alpha: Codd SQUARE: Chamberlin, et al. SEQUEL: Chamberlin, Boyce, Reisner QBE: Moshe Zloof Rendezvous: Codd 3 survivors: DL/I, SQL, Rel.Algebra

R. BayerB-Trees and Databases 17 Implementation: System R, IBM SQL Chamberlin, Reisner Schemata: normalization, Codd, Boyce Rel. Algebra: Codd et al. Optimization: Blasgen, Selinger, Eswaran Cost Models: Transactions: Gray, Traiger B-Trees: Bayer, Schkolnick, Blasgen Recovery: Lorie, Putzolu

R. BayerB-Trees and Databases 18 Factors for Product Success simple, formalized model simple user interface: SQL algebra + laws for optimization performance: B-trees multiuser: transactions (Gray) robustness: transactions + recovery, self-organization of B-trees scalability: B-trees with logarithmic growth, parallelism

R. BayerB-Trees and Databases 19 Prefix B-Trees... (Smith, Bernie), (Smith, Henry) (Smith, C)... (Smith, Bernie) store shortest separators: Simple Prefix B-trees trim common prefixes:Prefix B-trees (Smith, Henry)

R. BayerB-Trees and Databases 20 Concurrency and B-Trees Bayer, Schkolnick: Acta Informatica 9, 1-21 (1977)... everybody reads root root almost never changes low probability of conflicts near leaves combination of synchronization protocols no chance of testing real general case

R. BayerB-Trees and Databases 21 UB-Trees: Multidimensional Indexing geographic databases (GIS) Data-Warehousing: Star Schema all relational databases with n:m relationships RS nm (r, s) XML mobile, location based applications

R. BayerB-Trees and Databases 22 Basic Idea of UB-Tree linearize multidimensional space by space filling curve, e.g. Z-curve or Hilbert Use Z-address to store objects in B-Tree Response time for query is proportional to size of the answer!

R. BayerB-Trees and Databases 23 UB-Tree: Regions and Query-Box Z-region [0.1 : 1.1.1]

R. BayerB-Trees and Databases 24 World as self balancing UB-Tree