Session-8 Data Management for Decision Support

Presentation transcript:

Session-8 Data Management for Decision Support: DDBMS Architecture

DDBMS Architecture
- DDBMS and Distribution Transparency
- Architecture Alternatives
- DDBMS Components

Distributed Database Management System
- A distributed database is a collection of multiple, logically interrelated databases that stores data on multiple computers (nodes) across a network and permits access to the joint data from any node.
- A distributed database management system (DDBMS) is a software system that manages the distributed database and makes the distribution transparent to users.

Reasons for Data Distribution
Several factors have led to the development of DDBSs:
- Distributed nature of some database applications
- Increased reliability and availability
- Data sharing while maintaining some measure of local control
- Improved performance

Distributed DBMS Environment
[Figure: Sites 1 through 6 connected by a communication network]

Additional Functionality of DDBMS
- Distribution leads to increased complexity in system design and implementation.
- A DDBMS must provide functions beyond those of a centralized DBMS. Some of these are:
  - Accessing remote sites and transmitting queries and data among the sites
  - Keeping track of data distribution and replication
  - Devising execution strategies for queries
  - Copy identification
  - Maintaining consistency among copies of a replicated data item
  - Maintaining the global conceptual schema of the distributed database
  - Recovering from individual site crashes

What is not a Distributed Database System?
A DDBS is not a "collection of files" that can be individually stored at each node of a computer network:
- the files are not logically related
- there is no access via a common interface

Centralized DBMS on a Network
- Data resides at only one node.
- Database management is no different from that of a centralized DBMS.
- Remote processing: single server, multiple clients.
[Figure: Sites 1 through 6 connected by a communication network]

Distributed Database System Technology
Distributed database technology attempts to achieve integration without centralization.
[Figure labels: Database Technology (integration), Computer Networks (distributed computing), Distributed Database Systems (integration without centralization)]

Example
Multinational manufacturing company:
- headquarters in New York
- manufacturing plants in Chicago and Montreal
- warehouses in Phoenix and Edmonton
- R&D facilities in San Francisco
Data and information:
- employee records (working location)
- projects (R&D)
- engineering data (manufacturing plants, R&D)
- inventory (manufacturing, warehouse)

Promises of Distributed DBMS
- Transparent management of distributed, fragmented, and replicated data
- Improved reliability and availability through distributed transactions
- Improved performance
- Higher system extendibility
motivation and major issues

Transparency
- Transparency refers to the separation of the higher-level semantics of a system from its lower-level implementation details.
- It ranges from data independence in a centralized DBMS to fragmentation transparency in a DDBMS.
- Issues:
  - Who should provide transparency?
  - What is the state of the art in the industry?

Improved Reliability
- A distributed DBMS can use replicated components to eliminate single points of failure.
- With "proper care", users can still access part of the distributed database even when some of the data is unreachable.
- Distributed transactions help maintain a consistent database state even when failures occur.

Improved Performance
- Since each site handles only a portion of the database, contention for CPU and I/O resources is less severe.
- Data localization reduces communication overhead.
- The inherent parallelism of distributed systems may be exploited:
  - inter-query parallelism
  - intra-query parallelism
- Performance models are not sufficiently developed.

Easier System Expansion
- New sites, data, and users can be added over time without major restructuring.
- Huge centralized database systems (mainframes) are history (almost!).
- The PC revolution (Compaq buying Digital, 1998) will make distributed processing environments natural.
- New applications (such as supply chain) are naturally distributed; centralized systems will just not work.

Disadvantages of DDBMSs
- Lack of experience: no true distributed database systems are in operation.
- Complexity: DDBMS problems are inherently more complex than those of a centralized DBMS.
- Cost: more hardware, software, and people costs.
- Distribution of control: synchronization and coordination problems in maintaining data consistency.
- Security: database security plus network security.
- Difficult to convert: no tools exist to convert centralized DBMSs to DDBMSs.

Complicating Factors
- Data may be replicated in a distributed environment. Consequently, the DDBMS is responsible for:
  - choosing one of the stored copies of the requested data for access in the case of retrievals
  - making sure that the effect of an update is reflected on each and every copy of that data item
- If a site or link fails while an update is being executed, the DDBMS must make sure that the effects are reflected on the data residing at the failed or unreachable sites as soon as the system recovers from the failure.
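
The two responsibilities above (read any one copy, propagate each update to every copy, and catch up failed sites on recovery) can be sketched in a few lines of Python. This is a minimal illustration, not a description of any real DDBMS: the `ReplicatedItem` class, its method names, and the idea of queuing missed updates per site are all assumptions made for the example. The site names are borrowed from the manufacturing-company example.

```python
class ReplicatedItem:
    """One logical data item stored as copies at several sites (hypothetical model)."""

    def __init__(self, sites, value):
        self.copies = {site: value for site in sites}   # site -> stored value
        self.up = {site: True for site in sites}        # site -> currently reachable?
        self.pending = {site: [] for site in sites}     # updates missed while unreachable

    def read(self):
        # Retrieval: any single reachable copy is enough.
        for site, reachable in self.up.items():
            if reachable:
                return self.copies[site]
        raise RuntimeError("no reachable copy")

    def write(self, value):
        # Update: apply to every reachable copy, queue it for the others.
        for site in self.copies:
            if self.up[site]:
                self.copies[site] = value
            else:
                self.pending[site].append(value)

    def fail(self, site):
        self.up[site] = False

    def recover(self, site):
        # On recovery, replay the missed updates in their original order.
        for value in self.pending[site]:
            self.copies[site] = value
        self.pending[site].clear()
        self.up[site] = True


item = ReplicatedItem(["New York", "Chicago", "Montreal"], value=100)
item.fail("Montreal")
item.write(120)
stale = item.copies["Montreal"]   # the unreachable copy still holds the old value
item.recover("Montreal")          # the missed update is applied on recovery
```

A real system would also have to handle concurrent writers and failures during the replay itself, which this sketch ignores.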

Complicating Factors (continued)
- Maintaining the consistency of distributed and replicated data.
- Since a site cannot have instantaneous information on the actions currently being carried out at other sites, synchronizing transactions across multiple sites is harder than in a centralized system.

Distributed DBMS Issues
- Distributed Database Design
- Distributed Query Processing
- Distributed Directory Management
- Distributed Concurrency Control
- Distributed Deadlock Management
- Reliability of Distributed Databases
- Operating Systems Support
- Heterogeneous Databases

Distributed Database Design
- The problem is how the database and the applications that run against it should be placed across the sites.
- The two fundamental design issues are fragmentation (the separation of the database into partitions called fragments) and allocation (the optimal distribution of fragments across sites).
- The general problem is NP-hard.
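
As a rough illustration of the two design steps, the sketch below horizontally fragments an employee relation by working location and then places each fragment with a naive rule. The function names, the sample rows, and the "store the fragment at the matching site, otherwise at a default site" rule are assumptions for the example only; the rule is a simple heuristic and makes no attempt at the optimal allocation, which is the NP-hard part.

```python
def horizontal_fragments(rows, attribute):
    """Split rows into fragments, one per distinct value of `attribute`."""
    fragments = {}
    for row in rows:
        fragments.setdefault(row[attribute], []).append(row)
    return fragments


def allocate(fragments, sites, default_site):
    """Place each fragment at the site matching its key, else at default_site."""
    return {key: (key if key in sites else default_site) for key in fragments}


# Hypothetical sample data: employee records with a working location.
employees = [
    {"id": 1, "location": "Chicago"},
    {"id": 2, "location": "Montreal"},
    {"id": 3, "location": "Chicago"},
    {"id": 4, "location": "Phoenix"},
]
sites = {"New York", "Chicago", "Montreal"}

fragments = horizontal_fragments(employees, "location")
placement = allocate(fragments, sites, default_site="New York")

# A correct horizontal fragmentation is complete and disjoint, so the
# union of the fragments reconstructs the original relation.
reconstructed = sorted(
    (row for fragment in fragments.values() for row in fragment),
    key=lambda row: row["id"],
)
```

Here the Phoenix fragment falls back to New York because Phoenix is not in the assumed set of database sites.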

Distributed Query Processing
- Query processing deals with designing algorithms that analyze queries and convert them into a series of data manipulation operations.
- The problem is how to decide on a strategy for executing each query over the network in the most cost-effective way, however cost is defined.
- The objective is to find an execution strategy that exploits the inherent parallelism to improve the performance of executing the transaction.
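
To make "most cost-effective strategy" concrete, the sketch below compares two ways of joining a relation R at one site with a relation S at another, taking cost to be bytes sent over the network. Both the cost measure and every number are assumptions chosen for illustration; a real optimizer would also weigh CPU, I/O, and response time. The semijoin estimate further assumes each R row contributes one join-key value.

```python
def cost_ship_whole_relation(s_rows, s_row_bytes):
    """Send all of S to the site holding R and join there."""
    return s_rows * s_row_bytes


def cost_semijoin(r_rows, key_bytes, matching_s_rows, s_row_bytes):
    """Send R's join keys to S's site, then send back only the matching S rows."""
    return r_rows * key_bytes + matching_s_rows * s_row_bytes


def cheapest(costs):
    """Return the name of the strategy with the lowest estimated cost."""
    return min(costs, key=costs.get)


# Hypothetical statistics for the two relations.
costs = {
    "ship_whole_relation": cost_ship_whole_relation(s_rows=10_000, s_row_bytes=200),
    "semijoin": cost_semijoin(r_rows=1_000, key_bytes=8,
                              matching_s_rows=500, s_row_bytes=200),
}
best = cheapest(costs)
```

With these figures, shipping S costs 2,000,000 bytes while the semijoin costs 8,000 + 100,000 = 108,000 bytes, so the semijoin is chosen. With a much larger fraction of matching rows the comparison could go the other way.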

Distributed Directory Management
- A directory contains information (such as descriptions and locations) about the data items in the database.
- A directory may be global to the entire DDBMS or local to each site, and it may be centralized or distributed, with a single copy or multiple copies.
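
A directory of this kind can be pictured as a mapping from data items to their descriptions and storage sites. The `Directory` class below is a hypothetical sketch, not the catalog structure of any particular system: `locate` answers the question a global directory would answer, and `local_view` shows the subset of entries that a purely local directory at one site would hold.

```python
class Directory:
    """Hypothetical global directory: data item -> description and storage sites."""

    def __init__(self):
        self._entries = {}

    def register(self, item, description, site):
        # Record that a copy (or fragment) of `item` is stored at `site`.
        entry = self._entries.setdefault(
            item, {"description": description, "sites": set()}
        )
        entry["sites"].add(site)

    def locate(self, item):
        """All sites that store `item`, in alphabetical order."""
        return sorted(self._entries[item]["sites"])

    def local_view(self, site):
        """Items that a purely local directory at `site` would list."""
        return sorted(
            item for item, entry in self._entries.items() if site in entry["sites"]
        )


directory = Directory()
directory.register("inventory", "stock levels per part", "Phoenix")
directory.register("inventory", "stock levels per part", "Edmonton")
directory.register("projects", "R&D project records", "San Francisco")

inventory_sites = directory.locate("inventory")
phoenix_items = directory.local_view("Phoenix")
```

Whether this structure lives at one site, is copied to all sites, or is itself fragmented is the design choice the slide refers to.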

Distributed Concurrency Control
- Concurrency control involves synchronizing accesses to the distributed database so that the integrity of the database is maintained.
- One has to worry not only about the integrity of a single database but also about the consistency of multiple copies of the database (mutual consistency).
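
One simple way to picture the synchronization is an exclusive lock per data item: a transaction may update the copies only while it holds the lock, so two transactions cannot interleave their writes and leave the copies disagreeing. The sketch below assumes a single hypothetical `LockManager` for all sites (roughly a centralized locking scheme) and is only one of several possible approaches; the names and values are illustrative.

```python
class LockManager:
    """Hypothetical lock table granting one exclusive lock per data item."""

    def __init__(self):
        self._owner = {}   # item -> transaction currently holding the lock

    def acquire(self, item, txn):
        # Grant the lock if it is free or already held by the same transaction.
        holder = self._owner.setdefault(item, txn)
        return holder == txn

    def release(self, item, txn):
        if self._owner.get(item) == txn:
            del self._owner[item]


def mutually_consistent(copies):
    """True when every copy of the item holds the same value."""
    return len(set(copies.values())) <= 1


locks = LockManager()
copies = {"Chicago": 10, "Montreal": 10}   # two copies of one data item

got_t1 = locks.acquire("stock", "T1")      # T1 obtains the lock
got_t2 = locks.acquire("stock", "T2")      # T2 is refused while T1 holds it

if got_t1:
    for site in copies:                    # T1 updates every copy under the lock
        copies[site] = 7
locks.release("stock", "T1")

got_t2_retry = locks.acquire("stock", "T2")   # T2 succeeds once the lock is free
```

After T1 finishes, both copies hold the same value, which is the mutual consistency the slide describes.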