Database Systems: Design, Implementation, and Management

Slides:



Advertisements
Similar presentations
1 Senn, Information Technology, 3 rd Edition © 2004 Pearson Prentice Hall James A. Senns Information Technology, 3 rd Edition Chapter 7 Enterprise Databases.
Advertisements

Chapter 1: The Database Environment
Distributed databases 1. 2 Outline introduction principles / objectives problems.
1 Term 2, 2004, Lecture 9, Distributed DatabasesMarian Ursu, Department of Computing, Goldsmiths College Distributed databases 3.
ICS 434 Advanced Database Systems
Database System Concepts and Architecture
Chapter 10: The Traditional Approach to Design
Systems Analysis and Design in a Changing World, Fifth Edition
Distributed Database Management Systems
©Brooks/Cole, 2001 Chapter 12 Derived Types-- Enumerated, Structure and Union.
PSSA Preparation.
Chapter 24 Replication and Mobile Databases Transparencies © Pearson Education Limited 1995, 2005.
Database Architectures and the Web
Distributed databases
Distributed Database Systems Dr. Mohamed Osman Hegazi.
Transaction.
MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management Dave Salisbury ( )
Chapter 13 (Web): Distributed Databases
1 Minggu 12, Pertemuan 23 Introduction to Distributed DBMS (Chapter , 22.6, 3rd ed.) Matakuliah: T0206-Sistem Basisdata Tahun: 2005 Versi: 1.0/0.0.
Distributed Database Management Systems
DISTRIBUTED DATABASE. Centralized & Distributed Database  Single site database – centralized database –A database is located at a single site or distributed.
Chapter 9 : Distributed Database.
Overview Distributed vs. decentralized Why distributed databases
Distributed Database Management Systems
Chapter 12 Distributed Database Management Systems
©Silberschatz, Korth and Sudarshan18.1Database System Concepts Centralized Systems Run on a single computer system and do not interact with other computer.
Definition of terms Definition of terms Explain business conditions driving distributed databases Explain business conditions driving distributed databases.
DISTRIBUTED DATABASE MANAGEMENT SYSTEM CHAPTER 07.
Distributed databases
DATABASE MANAGEMENT SYSTEMS 2 ANGELITO I. CUNANAN JR.
Distributed Databases
Distributed Databases and DBMSs: Concepts and Design
DISTRIBUTED DATABASES AND DDBMS.  Understand the concept of “Distributed Data”  Describe various Distributed Data and DDBMS implementations  Explain.
ITEC 3220A Using and Designing Database Systems
Client/Server Databases and the Oracle 10g Relational Database
1 Distributed and Parallel Databases. 2 Distributed Databases Distributed Systems goal: –to offer local DB autonomy at geographically distributed locations.
12 1 Chapter 12 Distributed Database Management Systems Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
Database Design – Lecture 16
III. Current Trends: 1 - Distributed DBMSsSlide 1/32 III. Current Trends Part 1: Distributed DBMSs: Concepts and Design Lecture 12 (2 hours) Lecturer:
Database Systems Design, Implementation, and Management Coronel | Morris 11e ©2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or.
DISTRIBUTED DATABASES IN ADBMS Shilpa Seth
Database Systems: Design, Implementation, and Management Tenth Edition
Session-8 Data Management for Decision Support
10 1 Chapter 10 Distributed Database Management Systems Database Systems: Design, Implementation, and Management, Sixth Edition, Rob and Coronel.
Database Systems: Design, Implementation, and Management Tenth Edition Chapter 12 Distributed Database Management Systems.
Database Systems: Design, Implementation, and Management Ninth Edition Chapter 12 Distributed Database Management Systems.
Week 5 Lecture Distributed Database Management Systems Samuel ConnSamuel Conn, Asst Professor Suggestions for using the Lecture Slides.
Chapter 10 Distributed Database Management System
10 1 Chapter 10 Distributed Database Management Systems Database Systems: Design, Implementation, and Management, Sixth Edition, Rob and Coronel.
The Evolution of Distributed DBMS 4Social and Technical Changes in the 1980’s u Business operations became more decentralized geographically. u Competition.
Kjell Orsborn UU - DIS - UDBL DATABASE SYSTEMS - 10p Course No. 2AD235 Spring 2002 A second course on development of database systems Kjell.
Distributed Databases
Chapter 12 Distributed Database Management Systems.
1 Distributed Databases BUAD/American University Distributed Databases.
Distributed Database. Introduction A major motivation behind the development of database systems is the desire to integrate the operational data of an.
Chapter 10 Distributed Database Management System
Distributed DB CSE2132 Database Systems Week 12 Lecture Distributed Database.
Distributed database system
Topic Distributed DBMS Database Management Systems Fall 2012 Presented by: Osama Ben Omran.
Chapter 12 Distributed Data Bases. Learning Objectives What a distributed database management system (DDBMS) is and what its components are How database.
Introduction to Distributed Databases Yiwei Wu. Introduction A distributed database is a database in which portions of the database are stored on multiple.
 Distributed Database Concepts  Parallel Vs Distributed Technology  Advantages  Additional Functions  Distribution Database Design  Data Fragmentation.
1 Chapter 22 Distributed DBMS Concepts and Design CS 157B Edward Chen.
1 Information Retrieval and Use De-normalisation and Distributed database systems Geoff Leese September 2008, revised October 2009.
Distributed DBMSs – Concepts and Design Chapter 24 in Textbook.
DISTRIBUTED DATABASES AND DDBMS. Learning Objectives  Describe various DDBMS implementations  Explain how database design affects the DDBMS environment.
Distributed Databases
1 Chapter 22 Distributed DBMSs - Concepts and Design Simplified Transparencies © Pearson Education Limited 1995, 2005.
Chapter 12 Distributed Database Management Systems
Introduction of Week 14 Return assignment 12-1
Presentation transcript:

Database Systems: Design, Implementation, and Management CHAPTER 10 Distributed Database Management System 1

Chapter Objectives Understand concepts of distributed DBMS Understand various transparency features of distributed databases Understand distributed database design issues 2

What Is A Distributed DBMS? Decentralization of business operations and globalization of businesses created a demand for distributing the data and processes across multiple locations. Distributed database management systems (DDBMS) are designed to meet the information requirements of such multi-location organizations. A DDBMS manages the storage and processing of logically related data over interconnected computer systems in which both data and processing functions are distributed among several sites. Distributed processing shares the database’s logical processing among two or more physically independent sites that are connected through a network. 14

DDBMS Advantages Data located near site with greatest demand Faster data access Faster data processing Growth facilitation Improved communications Reduced operating costs User-friendly interface Less danger of single-point failure Processor independence

DDBMS Disadvantages Complexity of management and control Security Lack of standards Increased storage requirements Greater difficulty in managing data environment Increased training costs

Distributed Processing Shares database’s logical processing among physically, networked independent sites Figure 10.1 Distributed Processing Environment 11

Distributed Database Distributed database stores a logically related database over two or more physically independent sites connected via a computer network.

Distributed Database Stores logically related database over physically independent sites Figure 10.2

Distributed Database vs. Distributed Processing Does not require distributed database May be based on a single database on single computer Copies or parts of database processing functions must be distributed to all data storage sites Distributed database Requires distributed processing Both Require a network to connect components

Functions of DDBMS Application/end user interface Validation to analyze data requests Transformation to determine request components Query optimization to find the best access strategy Mapping to determine the data location I/O interface to read or write data Formatting to prepare the data for presentation Security to provide data privacy Backup and recovery DB Administration Concurrency Control Transaction Management

What Is A Distributed DBMS? Figure 10.3 Centralized Database Management System 16

What Is A Distributed DBMS? Figure 10.4 Fully Distributed Database Management System 17

DDBMS Components Computer workstations that form the network system. Network hardware and software components that reside in each workstation. Communications media that carry the data from one workstation to another. Transaction processor (TP) receives and processes the application’s data requests. Data processor (DP) stores and retrieves data located at the site. Also known as data manager (DM). 18

Distributed Database Components Figure 10.5

Levels of Data & Process Distribution Depending on the levels of data and process distribution we can envisage three different configurations: SPSD: Single site process, single site data (Centralized) MPSD: Multiple site processing, single site data MPMD: Multiple site processing, multiple site data (Fully distributed) SPMD: Single site processing, multiple site data (Logically unsound)

Levels of Data & Process Distribution Single-Site Processing, Single-Site Data (SPSD) All processing is done on a single CPU or host computer. All data are stored on the host computer’s local disk. The DBMS is located on the host computer. The DBMS is accessed by dumb terminals. This is an example of a centralized DBMS 22

Levels of Data & Process Distribution Figure 10.6 Nondistributed (Centralized) DBMS 23

Levels of Data & Process Distribution Multiple-Site Processing, Single-Site Data (MPSD) Typically, MPSD requires a network file server on which conventional applications are accessed through a LAN. A popular variation of the MPSD approach is known as a client/server architecture. 24

Levels of Data & Process Distribution Figure 10.7 Multiple-Site Processing, Single-Site Data 25

Levels of Data & Process Distribution Multiple-Site Processing, Multiple-Site Data (MPMD) Fully distributed DBMS with support for multiple DPs and TPs at multiple sites. Homogeneous DDBMS integrate only one type of centralized DBMS over the network. Heterogeneous DDBMS integrate different types of centralized DBMSs over a network. 26

Distributed DB Transparency A DDBMS ensures that the database operations are transparent to the end user. Different types of transparencies are: Distribution transparency Transaction transparency Failure transparency Performance transparency Heterogeneity transparency 28

Distribution Transparency Distribution transparency allows us to manage a physically dispersed database as though it were a centralized database. Three Levels of Distribution Transparency Fragmentation transparency Location transparency Local mapping transparency 29

Distribution Transparency Example: Employee data (EMPLOYEE) are distributed over three locations: New York, Atlanta, and Miami. Depending on the level of distribution transparency support, three different cases of queries are possible: Distributed DBMS Employee Table Fragment E1 E2 E3 Location New York Atlanta Miami 32

Distribution Transparency When a DBMS support fragmentation transparency the user views a single logical database SELECT * FROM EMPLOYEE WHERE SALARY > 50000;

Distribution Transparency When the DBMS supports location transparency the user needs to know the fragment names but need not know the actual location of the fragments SELECT * FROM E1 WHERE SALARY > 50000 UNION FROM E2 WHERE SALARY > 50000 FROM E3 WHERE SALARY > 50000;

Distribution Transparency When the DBMS supports local mapping transparency the user needs to know the fragment names as well as the actual location of the fragments SELECT * FROM E1 NODE NY WHERE SALARY > 50000 UNION SELECT * FROM E2 NODE ATL WHERE SALARY > 50000 UNION SELECT * FROM E3 NODE MIA WHERE SALARY > 50000;

Distribution Transparency Distribution transparency is supported by a distributed data dictionary which captures the distributed global schema. A local transaction processor uses this global schema to translate user requests into subqueries (remote requests) that will be processed by different data processors.

Transaction Transparency A distributed transaction updates and/or requests data from multiple remote sites. Transaction transparency ensures that the transaction will be completed only if all database sites involved in the transaction complete their part of the transaction. It maintains database integrity of a distributed database. Giving a 5% raise to all employees in the previous example involves updating the database at multiple locations. If the transaction cannot be committed in one location, it must be rolled back in all locations.

Distributed DB Transparency Failure Transparency ensures that failure of a node will not affect the operation of a DDBMS Performance Transparency ensures that the system performance will not degrade because of the distributed nature of the database. Query optimization becomes very complex in a distributed database due to fragmentation and replication of data in multiple remote nodes. Heterogeneity Transparency allows the integration of different types of DBMSs (multi vendor, multi model) under a common global schema. The DDBMS transparently translates the user requests from one local schema to another.

Distributed Database Design All design principles and concepts discussed in the context of a centralized database also apply to a distributed database. Three additional issues are relevant to the design of a distributed database: data fragmentation data replication data allocation

Data Fragmentation Data fragmentation allows us to break a single object (a database or a table) into two or more fragments. Three type of fragmentation strategies are available to distribute a table:Horizontal, Vertical, Mixed. Horizontal fragmentation divides a table into fragments consisting of sets of tuples Each fragment has unique rows and is stored at a different node Example: A bank may distribute its customer table by location

Data Fragmentation Vertical fragmentation divides a table into fragments consisting of sets of columns Each fragment is located at a different node and consists of unique columns - with the exception of the primary key column, which is common to all fragments Example: The Customer table may be divided into two fragments, one fragment consisting of Cust ID, name, and address may be located in the Service building and the other fragment with Cust ID, credit limit, balance, dues may be located in the Collection building.

Data Fragmentation Mixed fragmentation combines the horizontal and vertical strategies. A fragment may consist of a subset of rows and a subset of columns of the original table. Example: Customer table may be divided by state and grouped by columns. The service building in Texas will store Customer service related information for customers from Texas.

Data Replication Data replication involves storing multiple copies of a fragment in different locations. For example, a copy may be stored in New York and another in San Francisco. It improves response time and data availability. Data replication requires the DDBMS to maintain data consistency among the replicas. A fully replicated database stores multiple copies of each database fragment. A partially replicated database stores multiple copies of some database fragments at multiple sites.

Data Allocation Data allocation decision involves determining the location of the fragments so as to achieve the design goals of cost, response time and availability. Three data allocation strategies are: centralized, partitioned and replicated. A centralized allocation strategy stores the entire database in a single location. A partitioned strategy divides the database into disjointed parts (fragments) and allocates the fragments to different locations. In a replicated strategy copies of one or more database fragments are stored at several sites.