Distributed Database Management Systems

Slides:



Advertisements
Similar presentations
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide
Advertisements

Distributed Database Systems
ISOM Distributed Databases Arijit Sengupta. ISOM Learning Objectives Understand the concept and necessity of distributed databases Understand the types.
Distributed Database Systems Dr. Mohamed Osman Hegazi.
Distributed Databases: Review May 2003Yangjun Chen1 Distributed Databases System Architecture Distributed Database Design Semantic Data Control Distributed.
CS 347Notes 021 CS 347: Parallel and Distributed Data Management Notes02: Distributed DB Design Hector Garcia-Molina.
1 Minggu 12, Pertemuan 23 Introduction to Distributed DBMS (Chapter , 22.6, 3rd ed.) Matakuliah: T0206-Sistem Basisdata Tahun: 2005 Versi: 1.0/0.0.
Institut für Scientific Computing – Universität WienP.Brezany Fragmentation Univ.-Prof. Dr. Peter Brezany Institut für Scientific Computing Universität.
1 Distributed Databases Review CS347 June 6, 2001.
1 Distributed Databases CS347 Lecture 13 May 23, 2001.
Distributed Databases
H.Lu/HKUST L04: Physical Database Design (2)  Introduction  Index Selection  Partitioning & Denormalization.
Database Design – Lecture 16
DISTRIBUTED DATABASES IN ADBMS Shilpa Seth
DISTRIBUTED DATABASE DESIGN
PMIT-6102 Advanced Database Systems By- Jesmin Akhter Assistant Professor, IIT, Jahangirnagar University.
PMIT-6102 Advanced Database Systems By- Jesmin Akhter Assistant Professor, IIT, Jahangirnagar University.
Massively Distributed Database Systems - Distributed DBS Spring 2014 Ki-Joune Li Pusan National University.
Database Systems: Design, Implementation, and Management Ninth Edition Chapter 12 Distributed Database Management Systems.
PMIT-6102 Advanced Database Systems By- Jesmin Akhter Assistant Professor, IIT, Jahangirnagar University.
DDBMS Distributed Database Management Systems Fragmentation
ASMA AHMAD 28 TH APRIL, 2011 Database Systems Distributed Databases I.
Distributed Database. Introduction A major motivation behind the development of database systems is the desire to integrate the operational data of an.
PMIT-6101 Advanced Database Systems By- Jesmin Akhter Assistant Professor, IIT, Jahangirnagar University.
Lecture 15- Parallel Databases (continued) Advanced Databases Masood Niazi Torshiz Islamic Azad University- Mashhad Branch
CS573 Data Privacy and Security Secure data outsourcing – Combining encryption and fragmentation.
PMIT-6101 Advanced Database Systems By- Jesmin Akhter Assistant Professor, IIT, Jahangirnagar University.
1 ICS 214B: Transaction Processing and Distributed Data Management Lecture 9: Fragmentation and Distributed Query Processing Professor Chen Li.
Topic Distributed DBMS Database Management Systems Fall 2012 Presented by: Osama Ben Omran.
Physical Database Design Purpose- translate the logical description of data into the technical specifications for storing and retrieving data Goal - create.
Lec 7 Practical Database Design and Tuning Copyright © 2004 Pearson Education, Inc.
Distributed Database Management Systems. Reading Textbook: Ch. 1, Ch. 3 Textbook: Ch. 1, Ch. 3 For next class: Ch. 4 For next class: Ch. 4 FarkasCSCE.
1 Distributed Databases architecture, fragmentation, allocation Lecture 1.
 Distributed Database Concepts  Parallel Vs Distributed Technology  Advantages  Additional Functions  Distribution Database Design  Data Fragmentation.
CS 440 Database Management Systems Lecture 5: Query Processing 1.
Distributed Database Design Bayu Adhi Tama, MTI Fasilkom-Unsri Adapted from Connolly, et al., Database Systems 4 th Edition, Pearson Education Limited,
CMS Advanced Database and Client-Server Applications Distributed Databases slides by Martin Beer and Paul Crowther Connolly and Begg Chapter 22.
1 Chapter 22 Distributed DBMSs - Concepts and Design Simplified Transparencies © Pearson Education Limited 1995, 2005.
CS742 – Distributed & Parallel DBMSPage 2. 1M. Tamer Özsu Outline Introduction & architectural issues  Data distribution  Fragmentation  Data Allocation.
Practical Database Design and Tuning
CS 540 Database Management Systems
CS 440 Database Management Systems
Database Management System
Physical Database Design and Performance
Distributed Database Management Systems
Physical Database Design for Relational Databases Step 3 – Step 8
Chapter 19: Distributed Databases
Relational Algebra Chapter 4, Part A
Evaluation of Relational Operations
CHAPTER 5: PHYSICAL DATABASE DESIGN AND PERFORMANCE
Lecture 17: Distributed Transactions
G063 - Distributed Databases
Database Management System
Practical Database Design and Tuning
External Joins Query Optimization 10/4/2017
Relational Algebra Chapter 4, Sections 4.1 – 4.2
(Two-Pass Algorithms)
Databases.
Outline Introduction Background Distributed DBMS Architecture
Distributed Database Management Systems
Vertical Fragmentation
Bioinformatics Algorithms and Data Structures
Distributed Database Management Systems
Distributed Database Management Systems
Distributed Database Management Systems
Distributed Database Design
Distributed Database Management Systems
Outline Introduction Background Distributed DBMS Architecture
Outline Introduction Background Distributed DBMS Architecture
Presentation transcript:

Distributed Database Management Systems Lecture 20

In the Previous Lecture Continued with VF Computed CA Partitioning Algorithm

In this Lecture Continue with VF Hybrid Fragmentation Allocation Problem Replication

A1 A3 A2 A4 45 53 5 3 80 75 78 A1 A2 A3 A4 q1 1 q2 q3 q4 S1 S2 S3 q1 15 20 10 q2 5 q3 25 q4 3 CA refj(qi) accj(qi) z2 = 3311 z1 = 0 – 452 z3= 0 - 782

A1= jNo A2= jName A3= budget A4= loc V1 = {jNo, budget} V2 = {jNo, jName, loc}

VF- Two Problems 1- Clusters not in the sides, rather in the middle of CA 2- m-way partitioning

VF Correctness

A relation R, defined over attribute set A and key K, generates the vertical partitioning FR = {R1, R2 , …, Rr } Completeness: The following should be true for A A =U Ri

Reconstruction: can be achieved by R = ⋈K Ri, ∀Ri ∈ FR Disjointness: TID's are not considered to be overlapping since they are maintained by the system PK is exception

Hybrid Fragmentation

Practically, applications require the fragmentation of both the types to be combined

So the nesting of fragmentations, i. e So the nesting of fragmentations, i.e., one following the other, it becomes sort of a tree

Disjoint ness and completeness have to be assured at each step, and reconstruction can be obtained by applying Join and Union in reverse order

CUST Beta Delta1 Delta2 A/C# Name Bal Branch AB101 Saeed 4535 MTN Laeeq 45632.34 LHR AB203 Salma 67839.87 AB109 Shaan 45.32 CUST Beta = ΠA/C#, Bal (CUST) Delta1 = σ Loc = “MTN” (ΠA/C#, Name, Branch (CUST)) Delta2 = σ Loc = “LHR” (ΠA/C#, Name, Branch (CUST)) Beta A/C# Bal AB101 4535 AB202 45632.34 AB203 67839.87 AB109 45.32 Delta1 Delta2 A/C# Name Branch AB101 Saeed MTN AB109 Shaan A/C# Name Branch AB202 Laeeq LHR AB203 Salma

Allocation

Find the "optimal" distribution of F to S. Given F = {F1, F2 , …, Fn} fragments S ={S1 , S2 , …, Sm} network sites Q = {q1, q2 ,…, qq } applications Find the "optimal" distribution of F to S.

Optimality Minimize the processing cost and maximize the system throughput at each site

It is a complex problem to be solved mathematically, to make the things very simple, consider the allocation of a single fragment Fk,

set of read only queries on Fk from Si; T = {t1, t2, …, tm} set of update queries U on Fk from Si; U= {u1, u2, .., um}

Communication Cost C(T) = {c1,2, c1,3, …., c1,m, ….cm-1, m} C’(T) = {c’1,2, c’1,3, …., c’1,m, ….c’m-1, m} Storage Cost D = {d1, d2, ……., dm}

Allocation problem is to find the cites out of set of sites S, where the copy of Fk will be stored.

The specification of the allocation problem will be 0 otherwise xj = 1 if the fragment Fk is assigned to site Sj The specification of the allocation problem will be min

That concludes our discussion on Fragmentation Lets summarize it

Fragmentation is splitting a table into smaller tables Alternatives Horizontal Vertical Hybrid

Horizontal Fragmentation

Splits a table into horizontal subsets (row wise) Primary and Derived Horizontal Fragmentation

We need major simple predicates (Pr); should be complete & minimal Pr is transformed into Pr’ Minterm (M) predicates from Pr’

Correctness of PHF depends on the Pr’ Derived Horizontal Fragmentation is based on Owner-member link

Vertical Fragmentation is more complicated due to more options Based on attributes’ affinities

AA is transformed into CA using BEA Calculated using usage data and access frequencies from different sites AA is transformed into CA using BEA

CA establishes clusters of attributes that are split for efficient access Hybrid Fragmentation combines HF and VF That concludes Fragmentation

Thanks