Database Design – Lecture 16

Slides:



Advertisements
Similar presentations
Database Systems: Design, Implementation, and Management
Advertisements

Distributed Database Management Systems
Database Architectures and the Web
Distributed Databases John Ortiz. Lecture 24Distributed Databases2  Distributed Database (DDB) is a collection of interrelated databases interconnected.
Distributed databases
Transaction.
MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management Dave Salisbury ( )
Chapter 13 (Web): Distributed Databases
1 Minggu 12, Pertemuan 23 Introduction to Distributed DBMS (Chapter , 22.6, 3rd ed.) Matakuliah: T0206-Sistem Basisdata Tahun: 2005 Versi: 1.0/0.0.
ABCSG - Distributed Database 1 Data Management Distributed Database Data Replication.
Distributed Database Management Systems
DISTRIBUTED DATABASE. Centralized & Distributed Database  Single site database – centralized database –A database is located at a single site or distributed.
Chapter 9 : Distributed Database.
Overview Distributed vs. decentralized Why distributed databases
Distributed Database Management Systems
©Silberschatz, Korth and Sudarshan19.1Database System Concepts Lecture-10 Distributed Database System A distributed database system consists of loosely.
Chapter 12 Distributed Database Management Systems
©Silberschatz, Korth and Sudarshan18.1Database System Concepts Centralized Systems Run on a single computer system and do not interact with other computer.
Definition of terms Definition of terms Explain business conditions driving distributed databases Explain business conditions driving distributed databases.
Distributed Databases
DISTRIBUTED DATABASE MANAGEMENT SYSTEM CHAPTER 07.
Distributed databases
Chapter 4 Database Management Systems. Chapter 4Slide 2 What is a Database Management System (DBMS)?  Database An organized collection of related data.
DATABASE MANAGEMENT SYSTEMS 2 ANGELITO I. CUNANAN JR.
Distributed Databases
Distributed Databases and DBMSs: Concepts and Design
Distributed Databases Dr. Lee By Alex Genadinik. Distributed Databases? What is that!?? Distributed Database - a collection of multiple logically interrelated.
ITEC 3220A Using and Designing Database Systems
Client/Server Databases and the Oracle 10g Relational Database
1 Distributed and Parallel Databases. 2 Distributed Databases Distributed Systems goal: –to offer local DB autonomy at geographically distributed locations.
IMS 4212: Distributed Databases 1 Dr. Lawrence West, Management Dept., University of Central Florida Distributed Databases Business needs.
Database Design - Lecture 1
12 1 Chapter 12 Distributed Database Management Systems Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
III. Current Trends: 1 - Distributed DBMSsSlide 1/32 III. Current Trends Part 1: Distributed DBMSs: Concepts and Design Lecture 12 (2 hours) Lecturer:
Database Systems Design, Implementation, and Management Coronel | Morris 11e ©2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or.
1 Adapted from Pearson Prentice Hall Adapted form James A. Senn’s Information Technology, 3 rd Edition Chapter 7 Enterprise Databases and Data Warehouses.
Session-9 Data Management for Decision Support
Database Systems: Design, Implementation, and Management Tenth Edition
Session-8 Data Management for Decision Support
10 1 Chapter 10 Distributed Database Management Systems Database Systems: Design, Implementation, and Management, Sixth Edition, Rob and Coronel.
Database Systems: Design, Implementation, and Management Tenth Edition Chapter 12 Distributed Database Management Systems.
Database Systems: Design, Implementation, and Management Ninth Edition Chapter 12 Distributed Database Management Systems.
Week 5 Lecture Distributed Database Management Systems Samuel ConnSamuel Conn, Asst Professor Suggestions for using the Lecture Slides.
Chapter 10 Distributed Database Management System
Distributed Database Systems Overview
G063 - Distributed Databases. Learning Objectives: By the end of this topic you should be able to: explain how databases may be stored in more than one.
10 1 Chapter 10 Distributed Database Management Systems Database Systems: Design, Implementation, and Management, Sixth Edition, Rob and Coronel.
The Evolution of Distributed DBMS 4Social and Technical Changes in the 1980’s u Business operations became more decentralized geographically. u Competition.
DDBMS Distributed Database Management Systems Fragmentation
Kjell Orsborn UU - DIS - UDBL DATABASE SYSTEMS - 10p Course No. 2AD235 Spring 2002 A second course on development of database systems Kjell.
Chapter 12 Distributed Database Management Systems.
ASMA AHMAD 28 TH APRIL, 2011 Database Systems Distributed Databases I.
1 Distributed Databases BUAD/American University Distributed Databases.
Distributed Database. Introduction A major motivation behind the development of database systems is the desire to integrate the operational data of an.
Chapter 10 Distributed Database Management System
Distributed database system
Topic Distributed DBMS Database Management Systems Fall 2012 Presented by: Osama Ben Omran.
Chapter 12 Distributed Data Bases. Learning Objectives What a distributed database management system (DDBMS) is and what its components are How database.
Introduction to Distributed Databases Yiwei Wu. Introduction A distributed database is a database in which portions of the database are stored on multiple.
 Distributed Database Concepts  Parallel Vs Distributed Technology  Advantages  Additional Functions  Distribution Database Design  Data Fragmentation.
1 Chapter 22 Distributed DBMS Concepts and Design CS 157B Edward Chen.
1 Information Retrieval and Use De-normalisation and Distributed database systems Geoff Leese September 2008, revised October 2009.
IT 5433 LM1. Learning Objectives Understand key terms in database Explain file processing systems List parts of a database environment Explain types of.
Distributed DBMSs – Concepts and Design Chapter 24 in Textbook.
CMS Advanced Database and Client-Server Applications Distributed Databases slides by Martin Beer and Paul Crowther Connolly and Begg Chapter 22.
1 Chapter 22 Distributed DBMSs - Concepts and Design Simplified Transparencies © Pearson Education Limited 1995, 2005.
Chapter 12 Distributed Database Management Systems
Chapter 19: Distributed Databases
Replication.
Introduction of Week 14 Return assignment 12-1
Presentation transcript:

Database Design – Lecture 16 Distributed Databases

Lecture Objectives Distributed Processing and Distributed Databases Distributed Database Management System (DDBMS) Distributed Database Design

Distributed Processing Shares the database’s logical processing among two or more physically independent sites that are connected through a network. Note: data resides at only one site and is shared by other sites (“centralized”)

Distributed Databases Stores a logically related database over two or more physically independent sites. The sites are connected by a computer network. Note: database is composed of several parts know as database fragments. These fragments are located at several different sites.

Distributed Processing and Distributed Databases In a distributed database environment, the users do not need to know the name or location of each database fragment in order to access the database – transparent to the user Distributed processing does not require a distributed database but a distributed database requires distributed processing Both distributed processing and distributed databases require a network to connect all components

Lecture Objectives Distributed Processing and Distributed Databases Distributed Database Management System (DDBMS) Distributed Database Design

DDBMS Advantages Data are located near/at “greatest demand” site – improved performance Improved reliability – data replication Growth facilitation Reduced operating costs

DDBMS Disadvantages Complexity Cost Database design more complex

Distributed Database Management System(DDBMS) Governs the storage and processing of a single logically related database over interconnected computer systems in which both data and processing functions are distributed among several sites.

Distributed Database Management System(DDBMS) A DDBMS must have at least the following functions to be classified as distributed: - Application Interface - Validation - Transformation - Query Optimization - Mapping - I/O Interface - Formatting - Security - Backup & Recovery - DB Administration - Concurrency Control - Transaction Management - Computer Workstations (sites or nodes) - Network Hardware & Software - Communications Media

Distributed Database Management System(DDBMS) A DDBMS must have at least the following functions to be classified as distributed: Application Interface Allows the interaction with the end user or application programs and with other DBMSs within the distributed database Validation Able to analyze data requests Transformation To determine which data request components are distributed and which ones are local

Distributed Database Management System(DDBMS) A DDBMS must have at least the following functions to be classified as distributed: Query Optimization To find the best access strategy Mapping To determine the data location of local and remote fragments I/O Interface To read or write data from or to permanent local storage

Distributed Database Management System(DDBMS) A DDBMS must have at least the following functions to be classified as distributed: Formatting To prepare the data for presentation to the end user or an application program Security To provide data privacy at both local and remote databases Backup and Recovery To ensure the availability and recoverability of the database in case of a failure

Distributed Database Management System(DDBMS) A DDBMS must have at least the following functions to be classified as distributed: DB Administration To allow the Database Administrator to maintain the databases Concurrency Control To manage simultaneous data access and ensure data consistency across database fragments in the DDBMS Transaction Management To ensure that the data move from on consistent state to another – synchronizing transactions

Distributed Database Management System(DDBMS) A DDBMS must have at least the following components: Computer Workstations (sites or nodes) Form the network system Network Hardware and Software Components that reside in each workstation Allows all sites to interact and exchange data Communications media Carries data from one workstation to another

Distributed Database Management System(DDBMS) A DDBMS must have at least the following components: Transaction Processor (TP) Software component found in each computer that requests data Receives and processes the application’s data requests (remote and local) Data Processor (DP) Software component residing on each computer that stores and retrieves data located at the site

Distributed Database Environment

Lecture Objectives Distributed Processing and Distributed Databases Distributed Database Management System (DDBMS) Distributed Database Design

Distributed Database Design Designing for a relational data base structure does not change – start with a top down approach HOWEVER, need to consider the following as well: How to partition the database into fragments Which fragments to replicate Where to locate those fragments and replicas More frequently used fragments should be stored locally Fragments used by all users should be stored centrally

Distributed Database Design Data Fragmentation: Allows a single object to be broken into two or more segments or fragments Each fragment can be stored at any site on the network Data fragmentation information is stored in the distributed data catalog (DDC), from which it is accessed by the TP to process user requests

Distributed Database Design Types of Data Fragmentation: Horizontal Vertical Mixed

Distributed Database Design Types of Data Fragmentation: Horizontal The division of a relation into tuples (rows) Each fragment is stored at a different node and each fragment has unique rows Each tuple has the same attributes (columns) but the rows are fragmented

Distributed Database Design Example of horizontal fragmentation Original structure: 5th Edition Fragmented structure: Split by state 6th Edition

Distributed Database Design Example of horizontal fragmentation Resulting structure: Fragmented structure: Split by state 5th Edition

Distributed Database Design Types of Data Fragmentation: Vertical The division of a relation into subsets by attributes (column) Each subset is stored at a different node, and each fragment has unique columns – with the exception of the key column, which is common to all fragments Transaction issues here because same record may need to be inserted into two tables (part of record into 1 table and other part into another table). If only 1 insert is successful; end up with inconsistent data.

Distributed Database Design Original structure: 5th Edition Fragmented structure: Split by location 6th Edition

Distributed Database Design Original structure: 5th Edition Example of Vertical Fragmentation Fragmented structure: Split by location 5th Edition

Distributed Database Design Types of Data Fragmentation: Mixed A combination of horizontal and vertical strategies

Distributed Database Design Example of Mixed Fragmentation:

Distributed Database Design Example of Mixed Fragmentation:

Data Replication Storage of data copies at multiple sites served by a computer network Fragment copies can be stored at several sites to serve specific information requirements Can enhance data availability and response time Can help to reduce communication and total query costs

Replication Scenarios Fully replicated database: Stores multiple copies of each database fragment at multiple sites Can be impractical due to amount of overhead Partially replicated database: Stores multiple copies of some database fragments at multiple sites Most DDBMSs are able to handle the partially replicated database well Unreplicated database: Stores each database fragment at a single site No duplicate database fragments

Data Allocation Deciding where to locate data Allocation strategies: Centralized data allocation Entire database is stored at one site Partitioned data allocation Database is divided into several disjointed parts (fragments) and stored at several sites Replicated data allocation Copies of one or more database fragments are stored at several sites Data distribution over a computer network is achieved through data partition, data replication, or a combination of both

Distributed Database Design How is a distributed database managed? Distributed Data Catalog (DDC) Contains the description of the entire database as seen by the DBA Translates user requests into sub-queries (remote requests) that will be processed by different DPs DDC is distributed and replicated at network nodes (the location of a database fragment)

Examples of Distributed Databases Banking Account data distributed at each local branch Loan data distributed at each local branch Corporate data at head office (summarized branch information) Insurance Policy data with each branch Corporate data at head office

Examples of Distributed Databases Retail Inventory data distributed at each local store Employee Scheduling data at each store Corporate data at head office (summarized store information) Payroll data at head office Utilities Utility monitoring data at each location (I.e. nuclear station monitoring – air, water etc at each location) Corporate data at head office

Distributed Database vs Client Server Client/Server is really an architecture which models a computerized solution based on the distribution of functions between servers and clients. A client requests specific services from a server and a server provides requested services to clients Distributed processing could be one aspect of client/server architecture – data ‘centralized’ The DDBMS distributes data to different locations – could be used in a Client/Server architecture

Distributed Database Design Steps: Always start with a centralized view design Consider horizontal fragmentation of a centralized database Consider vertical fragmentation of a horizontally fragmented database Re-consider PK for all fragments of the database Define data replication rules (scenarios) Complete Design