Distributed Database Management Systems. Reading Textbook: Ch. 4 Textbook: Ch. 4 FarkasCSCE 824 - Spring 20112.

Slides:



Advertisements
Similar presentations
Three-Step Database Design
Advertisements

C6 Databases.
Introduction to Databases
Systems Development Life Cycle
Relational Databases Chapter 4.
Distributed DBMS© M. T. Özsu & P. Valduriez Ch.4/1 Outline Introduction Background Distributed Database Design Database Integration ➡ Schema Matching ➡
Interactive Generation of Integrated Schemas Laura Chiticariu et al. Presented by: Meher Talat Shaikh.
Chapter 6 Methodology Logical Database Design for the Relational Model Transparencies © Pearson Education Limited 1995, 2005.
Information Integration. Modes of Information Integration Applications involved more than one database source Three different modes –Federated Databases.
Chapter 3 Database Management
File Systems and Databases
1 Lecture 13: Database Heterogeneity Debriefing Project Phase 2.
Methodology Logical Database Design for the Relational Model
1 Lecture 13: Database Heterogeneity. 2 Outline Database Integration Wrappers Mediators Integration Conflicts.
Article by: Farshad Hakimpour, Andreas Geppert Article Summary by Mark Vickers.
CHAPTER 2: MODELING DATA IN THE ORGANIZATION © 2013 Pearson Education, Inc. Publishing as Prentice Hall 1 Modern Database Management 11 th Edition Jeffrey.
Data Modeling Introduction. Learning Objectives Define key data modeling terms –Entity type –Attribute –Multivalued attribute –Relationship –Degree –Cardinality.
6 Chapter 6 Database Design Hachim Haddouti. 6 2 Hachim Haddouti and Rob & Coronel, Ch6 In this chapter, you will learn: That successful database design.
Distributed databases
Distributed Databases and DBMSs: Concepts and Design
Michael F. Price College of Business Chapter 6: Logical database design and the relational model.
Chapter 14 & 15 Conceptual & Logical Database Design Methodology
Ontology Matching Basics Ontology Matching by Jerome Euzenat and Pavel Shvaiko Parts I and II 11/6/2012Ontology Matching Basics - PL, CS 6521.
Entity-relationship Modeling Transparencies 1. ©Pearson Education 2009 Objectives How to use ER modeling in database design. The basic concepts of an.
Lecture 2 The Relational Model. Objectives Terminology of relational model. How tables are used to represent data. Connection between mathematical relations.
Chapter 4 The Relational Model Pearson Education © 2014.
5.1 © 2007 by Prentice Hall 5 Chapter Foundations of Business Intelligence: Databases and Information Management.
Intro to MIS – MGS351 Databases and Data Warehouses Chapter 3.
Ontology Alignment/Matching Prafulla Palwe. Agenda ► Introduction  Being serious about the semantic web  Living with heterogeneity  Heterogeneity problem.
Research Topics in Computing Data Modelling for Data Schema Integration 1 March 2005 David George.
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
1 Database Systems: Design, Implementation, and Management CHAPTER 6 Database Design.
CHAPTER 8: MANAGING DATA RESOURCES. File Organization Terms Field: group of characters that represent something Record: group of related fields File:
Session-9 Data Management for Decision Support
10/3/2012ISC329 Isabelle Bichindaritz1 Logical Design.
Massively Distributed Database Systems - Distributed DBS Spring 2014 Ki-Joune Li Pusan National University.
C6 Databases. 2 Traditional file environment Data Redundancy and Inconsistency: –Data redundancy: The presence of duplicate data in multiple data files.
Chapters 15 &16 Conceptual and Logical Database Design Methodology.
Distributed Databases Midterm review. Lectures covered Everything until (including) March 2 nd Everything until (including) March 2 nd Focus on distributed.
Relational Database. Database Management System (DBMS)
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
DataBase Management System What is DBMS Purpose of DBMS Data Abstraction Data Definition Language Data Manipulation Language Data Models Data Keys Relationships.
Technology Guide 3 Data & Database. Agenda Definition File processing problems Database Selection criteria.
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
DDBMS Distributed Database Management Systems Fragmentation
Object Oriented Multi-Database Systems An Overview of Chapters 4 and 5.
Organizing Data and Information
Chapter 9 Logical Database Design : Mapping ER Model To Tables.
Foundations of Business Intelligence: Databases and Information Management.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Physical Database Design Purpose- translate the logical description of data into the technical specifications for storing and retrieving data Goal - create.
3/6: Data Management, pt. 2 Refresh your memory Relational Data Model
The Relational Model. 2 Relational Model Terminology u A relation is a table with columns and rows. –Only applies to logical structure of the database,
Distributed Database Management Systems. Reading Textbook: Ch. 1, Ch. 3 Textbook: Ch. 1, Ch. 3 For next class: Ch. 4 For next class: Ch. 4 FarkasCSCE.
 Distributed Database Concepts  Parallel Vs Distributed Technology  Advantages  Additional Functions  Distribution Database Design  Data Fragmentation.
1 Chapter 22 Distributed DBMS Concepts and Design CS 157B Edward Chen.
The Relational Model © Pearson Education Limited 1995, 2005 Bayu Adhi Tama, M.T.I.
Presented by Kyumars Sheykh Esmaili Description Logics for Data Bases (DLHB,Chapter 16) Semantic Web Seminar.
Logical Design 12/10/2009GAK1. Learning Objectives How to remove features from a local conceptual model that are not compatible with the relational model.
The Relational Model Lecture #2 Monday 21 st October 2001.
Of 24 lecture 11: ontology – mediation, merging & aligning.
Managing Data Resources File Organization and databases for business information systems.
Distributed Database Management Systems
Review of Week 1 Database DBMS File systems vs. database systems
Distributed Database Management Systems
Distributed Database Management Systems
Presentation transcript:

Distributed Database Management Systems

Reading Textbook: Ch. 4 Textbook: Ch. 4 FarkasCSCE Spring 20112

Design Issues Placing of data and programs (DBMS and application) Placing of data and programs (DBMS and application) Network issues Network issues FarkasCSCE Spring 20113

Level of Sharing No sharing No sharing Data sharing Data sharing Data and program sharing Data and program sharing FarkasCSCE Spring Heterogeneous environment!

Top-Down Design Global Conceptual schema  distribution Global Conceptual schema  distribution –Fragmentation –Replication –Allocation Figure 3.2 Figure 3.2 FarkasCSCE Spring 20115

Correctness of Fragmentation 1. Completeness: F R ={R 1, …, R n } 2. Reconstruction: R=  R i,  R i  R 3. Disjointness: –Horizontal: does not  d j  R i such that d j  R k where k  i –Vertical: same as horizontal for non- primary key attributes FarkasCSCE Spring &2: Lossless-join (normalization)

Data Directory Global vs. local conceptual schemas Global vs. local conceptual schemas –How to search? –Where to store? –Single vs. multiple copies? FarkasCSCE Spring 20117

Current Research Allocation: new requirements, technology, etc. Allocation: new requirements, technology, etc. Where to store the fragments? Where to store the fragments? Dynamic environment Dynamic environment –Usage pattern –Application characteristics –Network changes –Security FarkasCSCE Spring 20118

Bottom-Up Approach Multi-database systems Multi-database systems How to integrate them into 1 database? How to integrate them into 1 database? –Interoperability FarkasCSCE Spring 20119

Database Integration Physical integration Physical integration –Materialized database: data warehouses –Extract-transform-load (ETL) tools Logical integration Logical integration –Virtual (not materialized) integration –Enterprise Information Integration FarkasCSCE Spring

Data Warehouses On-line Analytical Processing (OLAP) applications: On-line Analytical Processing (OLAP) applications: –Decision support systems –Trend analysis and forecasting Complex queries, large databases Complex queries, large databases Materialized view maintanence Materialized view maintanence FarkasCSCE Spring

Logical Integration No materialized global database No materialized global database Virtual integration: data remains at the local (operational) databases Virtual integration: data remains at the local (operational) databases Global conceptual schema may not contain everything from local schemas Global conceptual schema may not contain everything from local schemas Autonomous and heterogeneous local systems Autonomous and heterogeneous local systems FarkasCSCE Spring

Bottom-Up Design Global Conceptual Schema (GCS or mediated schema) Global Conceptual Schema (GCS or mediated schema) –Defined first: local conceptual schemas (LCS) are mapped to GCS –Defined during the integration of the LCSs and develop the corresponding mappings from LCSs to the GCS FarkasCSCE Spring

GCS Defined First Local-as-view (LAV) systems Local-as-view (LAV) systems –Each LCS is treated as a view over the GCS –Query results: constrained to the objects in the local DBs while the GCS definition may be richer –Potential incomplete answers Global-as-view GCS is defined as a set of views over the LCSs Global-as-view GCS is defined as a set of views over the LCSs –View definition defines how to derive elements of the GCS –Query results: constrained to the GCS while the local DBs might be richer FarkasCSCE Spring

Design Tasks Schema translation Schema translation Schema generation Schema generation Figure 4.3 Figure 4.3 FarkasCSCE Spring

Intermediate Canonical Representation Expressive to incorporate all concepts in the local databases Expressive to incorporate all concepts in the local databases Simple, intuitive, practical, etc. Simple, intuitive, practical, etc. Example: E/R model, relational model, graph/tree models, etc. Example: E/R model, relational model, graph/tree models, etc. Tools Tools FarkasCSCE Spring

Schema Generation Schema matching: syntax and semantics Schema matching: syntax and semantics Integration of common schema elements Integration of common schema elements Schema mapping Schema mapping See example 4.1, 4.2 See example 4.1, 4.2 FarkasCSCE Spring

Schema Matching Defined or discovered (e.g., web data) Defined or discovered (e.g., web data) Rules: Rules: –Correspondence between 2 elements –Predicate whether the correspondence holds or not –Similarity value between the 2 elements FarkasCSCE Spring

Finding Correspondence Difficult process due to schema heterogeneity Difficult process due to schema heterogeneity Can be automated? Can be automated? –Insufficient schema and instance information –Unavailability of schema documentation –Subjectivity of matching FarkasCSCE Spring

Matching Algorithm Issues Schema vs. instance matching Schema vs. instance matching –Concept match –Data instance: semantic inconsistencies Element-level vs. structure-level mapping Element-level vs. structure-level mapping –Element name  semantics –Multiple attribute mapping? Matching cardinality Matching cardinality –One-to-one, one-to-many, many-to-many FarkasCSCE Spring

Semantic Schema Heterogeneity Semantic: meaning, interpretation, and intended use of data Semantic: meaning, interpretation, and intended use of data –Synonyms, homonyms, hypernyms –Different ontologies –Imprecise wording FarkasCSCE Spring

Structural Schema Heterogeneity –Type conflict: attribute vs. entity –Dependency conflict: mapping cardinality inconsistencies –Key conflict: different primary keys –Behavioral conflict: modeling assumptions, e.g., referential integrity, deletion, etc. Farkas CSCE Spring

Schema Integration Binary Binary N-ary N-ary FarkasCSCE Spring

Schema Mapping How the data from local databases can be mapped to GCS How the data from local databases can be mapped to GCS Mapping creating Mapping creating Mapping maintanence Mapping maintanence FarkasCSCE Spring

Mapping Creation Input: LCS, GCS, M (schema matches) Input: LCS, GCS, M (schema matches) Output: Q={Q 1, …, Q k } such that Output: Q={Q 1, …, Q k } such that –DB GCS =  Q(DB CLS ) FarkasCSCE Spring

Security Objectives Confidentiality Confidentiality Integrity Integrity Availability Availability FarkasCSCE Spring

Question 1 How distributed databases impact the security objectives? How distributed databases impact the security objectives? –Confidentiality in traditional vs. distributed DBs –Integrity in traditional vs. distributed DBs –Availability in traditional vs. distributed DBs FarkasCSCE Spring

Integrity Correctness criteria Correctness criteria –Top-down design –Bottom-up design FarkasCSCE Spring

Availability What are the issues related to availability when dealing with What are the issues related to availability when dealing with –Top-down design –Bottom-up design FarkasCSCE Spring

Confidentiality (will be covered in 2 nd part of semester but…) (will be covered in 2 nd part of semester but…) Centralized vs. distributed security policy Centralized vs. distributed security policy –Top-down design –Bottom-up design FarkasCSCE Spring

FarkasCSCE Spring Next Class Semantics-based Database Integration