Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 9: DATA WAREHOUSING.

Slides:



Advertisements
Similar presentations
Chapter 11: Data Warehousing
Advertisements

Author: Graeme C. Simsion and Graham C. Witt Chapter 11 Logical Database Design.
Dimensional Modeling.
MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management
CHAPTER OBJECTIVE: NORMALIZATION THE SNOWFLAKE SCHEMA.
IS 4420 Database Fundamentals Chapter 11: Data Warehousing Leon Chen
BY LECTURER/ AISHA DAWOOD DW Lab # 2. LAB EXERCISE #1 Oracle Data Warehousing Goal: Develop an application to implement defining subject area, design.
DAVID M. KROENKE’S DATABASE PROCESSING, 10th Edition © 2006 Pearson Prentice Hall 15-1 David M. Kroenke Database Processing Chapter 15 Business Intelligence.
Data Warehouse Architecture Sakthi Angappamudali Data Architect, The Oregon State University, Corvallis 16 th May, 2005.
Dimensional Modeling CS 543 – Data Warehousing. CS Data Warehousing (Sp ) - Asim LUMS2 From Requirements to Data Models.
Data Warehouse IMS5024 – presented by Eder Tsang.
Decision Support and Data Warehouse. Decision supports Systems Components Data management function –Data warehouse Model management function –Analytical.
Chapter 11: Data Warehousing
© 2007 by Prentice Hall 1 Chapter 11: Data Warehousing Modern Database Management 8 th Edition Jeffrey A. Hoffer, Mary B. Prescott, Fred R. McFadden.
Data Warehousing - 3 ISYS 650. Snowflake Schema one or more dimension tables do not join directly to the fact table but must join through other dimension.
MIS 451 Building Business Intelligence Systems Logical Design (3) – Design Multiple-fact Dimensional Model.
CHAPTER 2: MODELING DATA IN THE ORGANIZATION © 2013 Pearson Education, Inc. Publishing as Prentice Hall 1 Modern Database Management 11 th Edition Jeffrey.
Data Warehousing. On-Line Analytical Processing (OLAP) Tools The use of a set of graphical tools that provides users with multidimensional views of their.
Chapter 13 The Data Warehouse
1 © Prentice Hall, 2002 Chapter 11: Data Warehousing.
Data Warehousing DSCI 4103 Dr. Mennecke Introduction and Chapter 1.
Chapter 1: Data Warehousing
Components of the Data Warehouse Michael A. Fudge, Jr.
Chapter 4 Data Warehousing.
Chapter 9: Data Warehousing
Data Warehousing.
Chapter 9: data warehousing
Intro to MIS – MGS351 Databases and Data Warehouses Chapter 3.
Chapter 1: The Database Environment and Development Process
MBA 664 Database Management Systems Dave Salisbury ( )
Program Pelatihan Tenaga Infromasi dan Informatika Sistem Informasi Kesehatan Ari Cahyono.
Data Warehousing Concepts, by Dr. Khalil 1 Data Warehousing Design Dr. Awad Khalil Computer Science Department AUC.
© 2009 Pearson Education, Inc. Publishing as Prentice Hall 1 Chapter 1: The Database Environment Modern Database Management 9 th Edition Jeffrey A. Hoffer,
1 Data Warehouses BUAD/American University Data Warehouses.
MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management
1 Data Warehousing. 2Definition Data Warehouse Data Warehouse: – A subject-oriented, integrated, time-variant, non- updatable collection of data used.
© 2009 Pearson Education, Inc. Publishing as Prentice Hall 1 Unit 1: Background and Terminology Chapters 1 + 2: Modern Database Management 9 th Edition.
The Data Warehouse “A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile collection of “all” an organisation’s data in support.
Data Warehousing.
Chapter 9: data warehousing
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
Ch3 Data Warehouse Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009.
Designing a Data Warehousing System. Overview Business Analysis Process Data Warehousing System Modeling a Data Warehouse Choosing the Grain Establishing.
UNIT-II Principles of dimensional modeling
Chapter 5 DATA WAREHOUSING Study Sections 5.2, 5.3, 5.5, Pages: & Snowflake schema.
The Data Warehouse “A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile collection of “all” an organisation’s data in support.
Chapter 11: Data Warehousing Modern Database Management 6 th Edition Jeffrey A. Hoffer, Mary B. Prescott, Fred R. McFadden.
Business Intelligence Transparencies 1. ©Pearson Education 2009 Objectives What business intelligence (BI) represents. The technologies associated with.
Chapter 4 Logical & Physical Database Design
June 08, 2011 How to design a DATA WAREHOUSE Linh Nguyen (Elly)
Data Warehousing.
Business Intelligence Training Siemens Engineering Pakistan Zeeshan Shah December 07, 2009.
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
Carnegie Mellon University © Robert T. Monroe Management Information Systems Data Warehousing Management Information Systems Robert.
Chapter 1 © 2013 Pearson Education, Inc. Publishing as Prentice Hall Chapter 1: The Database Environment and Development Process Modern Database Management.
Chapter 4, Part A: Logical Database Design and the Relational Model
 Definition of terms  Reasons for need of data warehousing  Describe three levels of data warehouse architectures  Describe two components of star.
The Need for Data Analysis 2 Managers track daily transactions to evaluate how the business is performing Strategies should be developed to meet organizational.
© 2009 Pearson Education, Inc. Publishing as Prentice Hall 1 Lecture 14: Data Warehousing Modern Database Management 9 th Edition Jeffrey A. Hoffer, Mary.
1 LM 7 Data Warehouse Dr. Lei Li. Learning Objectives Describe the needs for data warehouse Describe the three levels of a data warehouse Explain the.
1 Data Warehousing Data Warehousing. 2 Objectives Definition of terms Definition of terms Reasons for information gap between information needs and availability.
1 HCMC UT, 2008 Data Warehousing 1.Basic Concepts of data warehousing 2.Data warehouse architectures 3.Some characteristics of data warehouse data 4.The.
Summarized from various resources Modern Database Management
Chapter 11: Data Warehousing
Data Warehouse.
Overview and Fundamentals
Competing on Analytics II
Chapter 1: Data Warehousing
An Introduction to Data Warehousing
Presentation transcript:

Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 9: DATA WAREHOUSING

Chapter 9 Copyright © 2016 Pearson Education, Inc. 9-2 DEFINITIONS  Data Warehouse  A subject-oriented, integrated, time-variant, non- updatable collection of data used in support of management decision-making processes  Subject-oriented: data organized by customers, patients, students, products  Integrated: consistent naming conventions, formats; from multiple data sources  Time-variant: can study trends and changes  Non-updatable: read-only, periodically refreshed  Data Mart  A data warehouse that is limited in scope

Chapter 9 Copyright © 2016 Pearson Education, Inc. 9-3 NEED FOR DATA WAREHOUSING  Integrated, company-wide view of high- quality information (from disparate databases)  Separation of operational and informational systems and data (for improved performance)  Operational system – real time, transaction processing systems  Informational system – decision support, read only.

Chapter 9 Copyright © 2016 Pearson Education, Inc. 9-4 ISSUES WITH COMPANY-WIDE VIEW  Inconsistent key structures  Synonyms  Free-form vs. structured fields  Inconsistent data values  Missing data See figure 9-1 for example

Chapter 9 Copyright © 2016 Pearson Education, Inc. 9-5 Figure 9-1 Examples of heterogeneous data

Chapter 9 Copyright © 2016 Pearson Education, Inc. 9-6 Figure 9-2 Independent data mart data warehousing architecture Data marts: Mini-warehouses, limited in scope E T L Separate ETL for each independent data mart Data access complexity due to multiple data marts

Chapter 9 Copyright © 2016 Pearson Education, Inc. 9-7 Figure 9-3 Dependent data mart with operational data store: a three-level architecture E T L Single ETL for (EDW) enterprise data warehouse (EDW) Simpler data access ODS ODS provides option for obtaining current data Dependent data marts loaded from EDW

Chapter 9 Copyright © 2016 Pearson Education, Inc. 9-8 E T L Near real-time ETL for Data Warehouse ODS data warehouse ODS and data warehouse are one and the same Data marts are NOT separate databases, but logical views of the data warehouse  Easier to create new data marts Figure 9-4 Logical data mart and real time warehouse architecture

Chapter 9 Copyright © 2016 Pearson Education, Inc. 9-9 DATA CHARACTERISTICS STATUS VS. EVENT DATA Status Event = a database action (create/ update/ delete) that results from a transaction Figure 9-6 Example of DBMS log entry

Chapter 9 Copyright © 2016 Pearson Education, Inc With transient data, changes to existing records are written over previous records, thus destroying the previous data content. Figure 9-7 Transient operational data DATA CHARACTERISTICS TRANSIENT VS. PERIODIC DATA

Chapter 9 Copyright © 2016 Pearson Education, Inc Periodic data are never physically altered or deleted once they have been added to the store. Figure 9-8 Periodic warehouse data DATA CHARACTERISTICS TRANSIENT VS. PERIODIC DATA

Chapter 9 Copyright © 2016 Pearson Education, Inc STAR SCHEMA (DIMENSIONAL MODEL)  Fact and dimension tables.  Each dimension table has a one-to-many relationship to the central fact table.  The primary key of a dimension table is a foreign key in the fact table. The primary key of the fact table is a composite key consisting of all foreign keys.

Chapter 9 Copyright © 2016 Pearson Education, Inc star schema Figure 9-9 Components of a star schema Fact tables contain factual or quantitative data Dimension tables contain descriptions about the subjects of the business 1:N relationship between dimension tables and fact tables Excellent for ad-hoc queries, but bad for online transaction processing Dimension tables are denormalized to maximize performance

Chapter 9 Copyright © 2016 Pearson Education, Inc Figure 9-10 Star schema example Fact table provides statistics for sales broken down by product, period and store dimensions

Chapter 9 Copyright © 2016 Pearson Education, Inc Figure 9-11 Star schema with sample data

Chapter 9 Copyright © 2016 Pearson Education, Inc ISSUES REGARDING STAR SCHEMA  Dimension table keys should be surrogate (non- intelligent and non-business related), because:  Keys may change over time  Length/format consistency  Granularity of fact table: level of detail (in time, location, product grouping, etc.) for each record in the fact table  Transactional grain – finest level  Aggregated grain – more summarized  Finer grain  more dimension tables, more rows in fact table  Duration of the database – how much history should be kept?

Chapter 9 Copyright © 2016 Pearson Education, Inc VARIATIONS OF THE STAR SCHEMA  Multiple Facts Tables  Can improve performance  Often used to store facts for different combinations of dimensions  Conformed dimensions  Factless Facts Tables  No nonkey data, but foreign keys for associated dimensions  Used for:  Tracking events  Inventory coverage

Chapter 9 Copyright © 2016 Pearson Education, Inc Figure 9-13 Conformed dimensions Conformed dimension Associated with multiple fact tables Two fact tables  two (connected) start schemas.

Chapter 9 Copyright © 2016 Pearson Education, Inc Figure 9-14a Factless fact table showing occurrence of an event No data in fact table, just keys associating dimension records Fact table forms an n-ary relationship between dimensions

Chapter 9 Copyright © 2016 Pearson Education, Inc  Include all information for a dimension table in a single de- normalized table  Normalize the dimension into a snowflake schema NORMALIZATION ISSUES IN DIMENSION TABLES

Chapter 9 Copyright © 2016 Pearson Education, Inc. 9-21