1 Information Retrieval and Use De-normalisation and Distributed database systems Geoff Leese September 2008, revised October 2009.

Slides:



Advertisements
Similar presentations
1 Term 2, 2004, Lecture 9, Distributed DatabasesMarian Ursu, Department of Computing, Goldsmiths College Distributed databases 3.
Advertisements

Database Systems: Design, Implementation, and Management
Database Management3-1 L3 Database Management Santa R. Susarapu Ph.D. Student Virginia Commonwealth University.
Distributed databases
The Relational Database Model
Transaction.
MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management Dave Salisbury ( )
Chapter 13 (Web): Distributed Databases
Topic Denormalisation S McKeever Advanced Databases 1.
Distributed Databases Logical next step in geographically dispersed organisations goal is to provide location transparency starting point = a set of decentralised.
Databases. Database Information is not useful if not organized In database, data are organized in a way that people find meaningful and useful. Database.
Chapter 3 Database Management
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide
Database Management: Getting Data Together Chapter 14.
Distributed Database Management Systems
DISTRIBUTED DATABASE. Centralized & Distributed Database  Single site database – centralized database –A database is located at a single site or distributed.
Database management concepts Database Management Systems (DBMS) An example of a database (relational) Database schema (e.g. relational) Data independence.
Chapter 9 : Distributed Database.
Overview Distributed vs. decentralized Why distributed databases
Organizing Data & Information
Data Storage and Data Processing Architectures The difficulty is in the choice George Moore, 1900.
Chapter 12 Distributed Database Management Systems
©Silberschatz, Korth and Sudarshan18.1Database System Concepts Centralized Systems Run on a single computer system and do not interact with other computer.
Definition of terms Definition of terms Explain business conditions driving distributed databases Explain business conditions driving distributed databases.
Parallel and Distributed Databases CS263 Lecture 16.
Centralized and Client/Server Architecture and Classification of DBMS
DATABASE MANAGEMENT SYSTEMS 2 ANGELITO I. CUNANAN JR.
Distributed Databases
Distributed Database and Replication. Distributed Database A logically interrelated collection of shared data and a description of this data physically.
Distributed Databases and DBMSs: Concepts and Design
1 Distributed and Parallel Databases. 2 Distributed Databases Distributed Systems goal: –to offer local DB autonomy at geographically distributed locations.
Database System Concepts and Architecture Lecture # 3 22 June 2012 National University of Computer and Emerging Sciences.
IMS 4212: Distributed Databases 1 Dr. Lawrence West, Management Dept., University of Central Florida Distributed Databases Business needs.
12 1 Chapter 12 Distributed Database Management Systems Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
Data Storage and Data Processing Architectures The difficulty is in the choice George Moore, 1900.
Database Design – Lecture 16
III. Current Trends: 1 - Distributed DBMSsSlide 1/32 III. Current Trends Part 1: Distributed DBMSs: Concepts and Design Lecture 12 (2 hours) Lecturer:
CHAPTER 8: MANAGING DATA RESOURCES. File Organization Terms Field: group of characters that represent something Record: group of related fields File:
Massively Distributed Database Systems - Distributed DBS Spring 2014 Ki-Joune Li Pusan National University.
10 1 Chapter 10 Distributed Database Management Systems Database Systems: Design, Implementation, and Management, Sixth Edition, Rob and Coronel.
Database Systems: Design, Implementation, and Management Tenth Edition Chapter 12 Distributed Database Management Systems.
Database Systems: Design, Implementation, and Management Ninth Edition Chapter 12 Distributed Database Management Systems.
Week 5 Lecture Distributed Database Management Systems Samuel ConnSamuel Conn, Asst Professor Suggestions for using the Lecture Slides.
Lecture2: Database Environment Prepared by L. Nouf Almujally & Aisha AlArfaj 1 Ref. Chapter2 College of Computer and Information Sciences - Information.
© Pearson Education Limited, Chapter 15 Physical Database Design – Step 7 (Consider Introduction of Controlled Redundancy) Transparencies.
MIS 327 Database Management system 1 MIS 327: DBMS Dr. Monther Tarawneh Dr. Monther Tarawneh Week 2: Basic Concepts.
Lecture2: Database Environment Prepared by L. Nouf Almujally 1 Ref. Chapter2 Lecture2.
G063 - Distributed Databases. Learning Objectives: By the end of this topic you should be able to: explain how databases may be stored in more than one.
Lecture # 3 & 4 Chapter # 2 Database System Concepts and Architecture Muhammad Emran Database Systems 1.
Kjell Orsborn UU - DIS - UDBL DATABASE SYSTEMS - 10p Course No. 2AD235 Spring 2002 A second course on development of database systems Kjell.
1 Distributed Databases BUAD/American University Distributed Databases.
Databases Illuminated
Distributed Database. Introduction A major motivation behind the development of database systems is the desire to integrate the operational data of an.
Distributed database system
Management Information Systems, 4 th Edition 1 Chapter 8 Data and Knowledge Management.
Topic Distributed DBMS Database Management Systems Fall 2012 Presented by: Osama Ben Omran.
Object storage and object interoperability
Introduction to Distributed Databases Yiwei Wu. Introduction A distributed database is a database in which portions of the database are stored on multiple.
 Distributed Database Concepts  Parallel Vs Distributed Technology  Advantages  Additional Functions  Distribution Database Design  Data Fragmentation.
Distributed DBMS, Query Processing and Optimization
1 Chapter 22 Distributed DBMS Concepts and Design CS 157B Edward Chen.
uses of DB systems DB environment DB structure Codd’s rules current common RDBMs implementations.
Distributed Databases
1 Chapter 22 Distributed DBMSs - Concepts and Design Simplified Transparencies © Pearson Education Limited 1995, 2005.
CHAPTER 25 - Distributed Databases and Client–Server Architectures
Distributed DBMS Concepts of Distributed DBMS
Chapter 19: Distributed Databases
MANAGING DATA RESOURCES
Database management concepts
Presentation transcript:

1 Information Retrieval and Use De-normalisation and Distributed database systems Geoff Leese September 2008, revised October 2009

2 Mapping the logical model onto physical design n Entities become tables ä More often than not! n Attributes become fields (columns) n Unique identifiers become primary keys n Relationships implemented by foreign key columns n Resolve M:N relationships by inserting intersection table

3 Mapping considerations n Independence n Privacy n Efficiency of queries

4 Denormalisation n Joins take time! n Split or merge normalised entities based on frequent associated use ä Remove redundant relationships ä Merge entities with 1:1 relationships ä Use summary fields ä Use summary tables and views

5 Using summary field(1) n Consider running a query “give the total value of all orders for customer X” How many joins?

6 Using summary field (2) n Note summary field in Orders table How many joins now ?

7 Distributed database systems n Special rules apply!

8 The traditional model n One centralised database n Terminals at remote locations n Disadvantages ä Networks are slow (esp WANS!) ä Central machine does all processing ä If central machine fails, database is down (Integrity, redundancy and disaster recovery considered in later lectures!)

9 The Client/Server model n Client – application – “front end” n Server – DBMS – “back end” n Still dependent on central database

10 Client responsibilities n Manages user interface n Accepts user data n Has local processing capability within the application n Generates database requests and transmits them via network to server n Receives results from server and formats them as required by application

11 Server responsibilities n Accepts database requests from client n Processes database requests ä Handles security issues ä Deals with concurrency issues ä Optimizes queries ä Handles recovery/rollback issues n Returns results to client

12 Distributed database architecture n A collection of logically related “sites”, connected together so that the users view is that of a single database at a single location. n Each site is a database in it’s own right n Not necessarily physically or geographically separated, but often are – and are logically separated.

13 Advantages n Organisations are distributed, why shouldn’t their data be? n Improved efficiency ä Store data close to where it’s used

14 Types of DDS n Homogenous – same type of RDBMS at each site (easy!) n Heterogeneous – different types of DBMS at each site (not so easy!)

15 Implementation methods (1) n Fragmentation – splitting data between sites ä Horizontal – row based – e.g. store all employee records for a location at that location ä Vertical – column based – e.g. store all payroll columns in payroll department, all other employee data in HR n Either way, fragments must be able to be put back together!

16 Implementation methods (2) n Replication ä Controlled duplication of data at more than one site n Update propagation?

17 Objectives (1) n Local autonomy ä Local data locally owned and managed – minimal data requirements from remote sites. n No reliance on central site n Continuous operation ä Reliability ä Availability

18 Objectives (2) n Location independence ä From user’s view, all data is at their site. n Fragmentation independence ä Needs joins and unions to put fragments back together n Replication independence

19 Objectives (3) n Distributed query processing n Distributed transaction management ä Transactions carried out by “agents” at distributed sites ä Two-phase commit ä Locking issues (later lecture)

20 Objectives (4) n Hardware independence n Operating system independence n Network independence n DBMS independence

21 DDS issues n Query processing ä Optimisation even more important n Catalogue (data dictionary) management ä Centralised? ä Fully replicated? ä Partitioned? ä Combination of first and third?

22 DDS issues n Update propagation ä An issue where replication is used. ä “Primary copy” system n Recovery ä Two-phase commit n Recovery ä Locking strategies

23 Summary n Mapping the logical model n Denormalisation n Traditional database architecture n Client/server model n Distributed Database systems ä Advantages ä Objectives ä Implementation methods ä Issues

24 Further reading n Rolland chapter 10 n Hoffer chapters 12 n Denormalisation - click to follow the link! Denormalisation - click to follow the link!