Big Data, Data Warehouses, and Business Intelligence Systems Chapter Eight DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 7 th Edition.

Slides:



Advertisements
Similar presentations
Chapter 1 Getting Started with Access Databases. Objectives Identify Good Database Design Create a Table and Define Fields in a New Blank Database Change.
Advertisements

Data Modeling and the Entity-Relationship Model
Database Administration Chapter Six DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 4 th Edition.
Data Modeling and the Entity-Relationship Model
Database Design Chapter Five DATABASE CONCEPTS, 6th Edition
DAVID M. KROENKE’S DATABASE PROCESSING, 10th Edition © 2006 Pearson Prentice Hall COS 236 Day 25.
Data Modeling and the Entity-Relationship Model
Data Modeling and the Entity-Relationship Model
Database Processing Applications and Business Intelligence Chapter Seven DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 3 rd Edition.
The Relational Model Chapter Two DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 6 th Edition.
Chapter 9 Business Intelligence Systems
Chapter 9 Competitive Advantage with Information Systems for Decision Making © 2008 Pearson Prentice Hall, Experiencing MIS, David Kroenke.
DAVID M. KROENKE’S DATABASE PROCESSING, 10th Edition © 2006 Pearson Prentice Hall COS 346 Day 26.
Data Modeling and the Entity-Relationship Model Chapter Four DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 5 th Edition.
Information Technology in Organizations
SQL Views Chapter 3A DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 5 th Edition.
Database Design Chapter Five DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 7 th Edition.
Database Processing for Business Intelligence Systems
Big Data, Data Warehouses, and Business Intelligence Systems
Getting Started with Microsoft Visio 2010 Appendix G DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 6 th Edition.
Getting Started Chapter One DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 5 th Edition.
Getting Started with Microsoft SQL Server 2012 Express Edition Appendix A DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 6 th Edition.
Getting Started with Web Servers, PHP, and the Eclipse PDT Appendix I DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 6 th Edition.
David M. Kroenke and David J. Auer Database Processing—12 th Edition Fundamentals, Design, and Implementation Chapter One: Introduction KROENKE AND AUER.
Getting Started Chapter One DATABASE CONCEPTS, 7th Edition
Advanced Topics Chapter Seven DAVID M. KROENKE’S DATABASE CONCEPTS, 2 nd Edition.
Getting Started with Microsoft Access The Access Workbench: Section One DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 4 th Edition.
Copyright © 2014 Pearson Education, Inc. Publishing as Prentice Hall. 1 by Mary Anne Poatsy, Keith Mulbery, Lynn Hogan, Amy Rutledge, Cyndi Krebs, Eric.
Getting Started with Oracle Database 11g Release 2 Express Edition Appendix B DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 6 th Edition.
Getting Started Chapter One DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 6 th Edition.
Chapter 11 Designing Distributed and Internet Systems Modern Systems Analysis and Design Sixth Edition Jeffrey A. Hoffer Joey F. George Joseph S. Valacich.
Computers Are Your Future Tenth Edition Chapter 12: Databases & Information Systems Copyright © 2009 Pearson Education, Inc. Publishing as Prentice Hall1.
GO! with Office 2013 Volume 1 By: Shelley Gaskin, Alicia Vargas, and Carolyn McLellan Excel Chapter 2 Using Functions, Creating Tables, and Managing Large.
Chapter 9 Business Intelligence and Information Systems for Decision Making.
© 2008 Pearson Prentice Hall, Experiencing MIS, David Kroenke Slide 1 Chapter 9 Competitive Advantage with Information Systems for Decision Making.
Technology in Action Alan Evans Kendall Martin Mary Anne Poatsy Twelfth Edition.
Chapter 9 Designing Databases Modern Systems Analysis and Design Sixth Edition Jeffrey A. Hoffer Joey F. George Joseph S. Valacich.
1 Copyright © 2014 Pearson Education, Inc. Publishing as Prentice Hall. Access Module 1 Workshop 1 The Four Main Database Objects Series Editor Amy Kinser.
SQL Views Appendix E DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 6 th Edition.
© 2009 Pearson Education, Inc. Publishing as Prentice Hall 1 Chapter 1: The Database Environment Modern Database Management 9 th Edition Jeffrey A. Hoffer,
Chapter 14 Sharing Enterprise Data David M. Kroenke Database Processing © 2000 Prentice Hall.
1 Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall.
Advanced Topics Chapter Seven DAVID M. KROENKE’S DATABASE CONCEPTS, 2 nd Edition.
GO! with Office 2013 Volume 1 By: Shelley Gaskin, Alicia Vargas, and Carolyn McLellan Access Chapter 3 Forms, Filters, and Reports.
SQL Views Chapter 3A DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 4 th Edition.
5-1 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved.
Business Intelligence Systems Appendix J DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 6 th Edition.
1 Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall.
Chapter Six Competitor Analysis and Sources of Advantage.
1 Technology in Action Chapter 11 Behind the Scenes: Databases and Information Systems Copyright © 2010 Pearson Education, Inc. Publishing as Prentice.
Getting Started Chapter One DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 4 th Edition.
1 Copyright © 2014 Pearson Education, Inc. Publishing as Prentice Hall. by Mary Anne Poatsy, Keith Mulbery, Eric Cameron, Jason Davidson, Rebecca Lawson,
Skills for Success with Microsoft Office 2013 Volume 1 Copyright © 2014 Pearson Education, Inc. Publishing as Prentice Hall. by Kris Townsend, Catherine.
Skills for Success with Microsoft Office 2013 Volume 1 Copyright © 2014 Pearson Education, Inc. Publishing as Prentice Hall. by Kris Townsend, Catherine.
KROENKE and AUER - DATABASE CONCEPTS (3 rd Edition) © 2008 Pearson Prentice Hall 6-1 Chapter Objectives Understand the need for and importance of database.
+ Big Data. + Chapter Objectives Learn the basic concepts of Big Data, structured storage, and the MapReduce process Learn the basic concepts of data.
© 2012 Pearson Education, Inc. publishing Prentice Hall. Note 9 The Product Life Cycle.
GO! with Office 2013 Volume 1 By: Shelley Gaskin, Alicia Vargas, and Carolyn McLellan Excel Chapter 3 Analyzing Data with Pie Charts, Line Charts, and.
© 2002 by Prentice Hall 1 David M. Kroenke Database Processing Eighth Edition Chapter 17 Sharing Enterprise Data.
David M. Kroenke and David J. Auer Database Processing: Fundamentals, Design, and Implementation Chapter One: Introduction.
David M. Kroenke and David J. Auer Database Processing Fundamentals, Design, and Implementation Chapter Twelve: Big Data, Data Warehouses, and Business.
David M. Kroenke and David J. Auer Database Processing Fundamentals, Design, and Implementation Appendix J: Business Intelligence Systems.
Computers Are Your Future Tenth Edition Spotlight 5: Microsoft Office Copyright © 2009 Pearson Education, Inc. Publishing as Prentice Hall1.
David M. Kroenke and David J. Auer Database Processing: Fundamentals, Design, and Implementation Chapter Ten: Managing Databases with SQL Server 2012,
TECHNOLOGY IN ACTION. Chapter 11 Behind the Scenes: Databases and Information Systems.
David M. Kroenke and David J
GO! with Microsoft® Access e
David M. Kroenke and David J
Getting Started Chapter One DATABASE CONCEPTS, 5th Edition
Getting Started Chapter One DATABASE CONCEPTS, 4th Edition
Presentation transcript:

Big Data, Data Warehouses, and Business Intelligence Systems Chapter Eight DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 7 th Edition

Chapter Objectives Learn the basic concepts of Big Data, structured storage, and the MapReduce process Learn the basic concepts of data warehouses and data marts Learn the basic concepts of dimensional databases Learn the basic concepts of business intelligence (BI) systems Learn the basic concepts of Online Analytical Processing (OLAP) 8-2 KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

Big Data The rapidly expanding amount of data being stored and used in enterprise information systems Search tools –Google –Bing Web 2.0 social networks –Facebook –LinkedIn –Twitter 8-3 KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

Storage Capacity Terms 8-4 Figure 8-1: Storage Capacity Terms KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

Heather Sweeney Designs Review: Database Design 5-5 KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

Heather Sweeney Designs Review: HSD Database Diagram in SQL Server KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Educations, Inc. Publishing as Prentice Hall Figure 8-2: The HSD Database Diagram

Business Intelligence Systems Business intelligence (BI) systems are information systems that –Assist managers and other professionals in the analysis of current and past activities and in the prediction of future events. –Do not support operational activities, such as the recording and processing of orders. These are supported by transaction processing systems. –Support management assessment, analysis, planning and control. BI systems fall into two broad categories: –Reporting systems that sort, filter, group, and make elementary calculations on operational data. –Data mining applications that perform sophisticated analyses on data; analyses that usually involve complex statistical and mathematical processing. 8-7 KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

The Relationship Among Operational and BI Applications 8-8 Figure 8-3: The Relationship Between Operational and BI Applications KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

Characteristics of Business Intelligence Applications 8-9 Figure 8-4: Characteristics of Business Intelligence Applications KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

Components of a Data Warehouse 8-10 Figure 8-5: Components of a Data Warehouse KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

Problems with Operational Data “Dirty Data” –Example – “G” for Gender –Example – “213” for Age Missing Values Inconsistent Data –Example – data that has changed, such as a customer’s phone number 8-11 KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

Problems with Operational Data (Continued) Nonintegrated Data –Example – data from two or more sources that need to be combined Incorrect Format –Example – time data in hours when needed in minutes Too Much Data –Example – An excess number of columns 8-12 KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

ETL Data Transformation Data may need to be transformed for use in a data warehouse. –Example {CountryCode  CountryName} “US”  “United States” –Example address to domain  8-13 KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

Characteristics of a Data Mart 8-14 Figure 8-6: Data Warehouses and Data Marts KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

Enterprise Data Warehouse (EDW) Architecture Combines the data warehouse structure and the data mart structures shown above Expensive to create, staff and operate Smaller organizations use subsets of the EDW architecture 8-15 KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

Dimensional Databases A non-normalized database structure used for data warehouses May use slowly changing dimensions –Values change infrequently Phone Number Address Use a Date or Time dimension 8-16 Figure 8-7: Characteristics of Operational and Dimensional Databases KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

Star Schema 8-17 Figure 8-8: The Star Schema KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

HSD-DW Star Schema 8-18 Figure 8-9: The HSD-DW Star Schema KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

Two-Dimensional Matrix 8-19 Figure 8-13: The Two-Dimensional ProductNumber–CustomerID Matrix KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

Three-Dimensional Matrix 8-20 Figure 8-14: The Three-Dimensional Time–ProductNumber–CustomerID Cube KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

Conformed Dimensions and the Extended HSD-DW Schema 8-21 Figure 8-15: The Extended HSD-DW Star Schema KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

OnLine Analytical Processing (OLAP) OnLine Analytical Processing (OLAP) is a technique for dynamically examining database data. –OLAP uses arithmetic functions such as Sum and Average KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

OLAP Reports OLAP systems produce an OLAP report, also know as an OLAP cube. The OLAP report uses inputs called dimensions. The OLAP report calculates outputs called measures. Excel PivotTables can be used to create OLAP reports KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

SQL Query for OLAP Data 8-24 KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

SQL View for OLAP Data 8-25 KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

Excel PivotTable OLAP Report I 8-26 Figure 8-17: OLAP ProductNumber by City Report KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

Excel PivotTable OLAP Report II 8-27 Figure 8-18: OLAP ProductNumber by City, Customer, and Year Report KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

Excel PivotTable OLAP Report III 8-28 Figure 8-19: OLAP City by ProductNumber, Customer, and Year Report KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

Distributed Database Processing A database is distributed when it is: –Partitioned –Replicated –Both partitioned and replicated This is fairly straightforward for read- only replicas, but it can be very difficult for other installations KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

Type of Distributed Databases 6-30 Figure 8-20: Types of Distributed Databases KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

Type of Distributed Databases (Cont’d) 8-31 Figure 8-20 Types of Distributed Databases (Cont’d) KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

Object-Relational Database Management Object-oriented programming (OOP) is based on objects, and OOP is now used as the basis of many computer programming languages: –Java –VisualBasic.Net –C++ –C# 6-32 KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

Objects Object classes have –Identifiers –Properties These are data items associated with the object. –Methods These are programs that allow the object to perform tasks. The only difference between entity classes and object classes are the methods KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

Object Persistence Object persistence means that values of the object properties are storable and retrievable. Object persistence can be achieved by various techniques. –A main technique is database technology. –Relational databases can be used, but require substantial programming KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

OODBMS Object-Oriented DBMSs (OODBMSs) have been developed. –Never achieved commercial success It would be too expensive to transfer existing data from relational and other legacy databases. The OODBMSs were, therefore, not cost justifiable KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

Object-Relational DBMSs Some relational DBMS vendors have added object-oriented features to their products. –Example: Oracle These products are known as object-relational DBMSs and support object-relational databases KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

The NoSQL Movement I The NoSQL movement is a movement to use non-relational databases. These databases are often classified into four categories: –Key-Value—Dynamo and MemcacheDB –Document—Couchbase and MongoDB –Column Family—Apache Cassandra and HBase –Graph—Neo4J and AllegroGraph 7-37 KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

Column Family Databases: A Column 8-38 (a) A Column Figure 8-21: A Generalized Column Family Database Storage System KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

Column Family Databases: A Super Column 8-39 (b) A Super Column Figure 8-21: A Generalized Column Family Database Storage System (Cont’d) KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

Column Family Databases: A Column Family 8-40 Figure 8-21: A Generalized Column Family Database Storage System (Cont’d) (c) A Column Family KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

Column Family Databases: A Super Column Family 8-41 Figure 8-21: A Generalized Column Family Database Storage System (Cont’d) (d) A Super Column Family KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

The MapReduce Process 8-42 Figure 8-22: MapReduce KROENKE and AUER - DATABASE CONCEPTS (7th Edition) Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall

All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording, or otherwise, without the prior written permission of the publisher. Printed in the United States of America. Copyright © 2015 Pearson Education, Inc. Publishing as Prentice Hall DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 7 th Edition

Big Data, Data Warehouses, and Business Intelligence Systems End of Presentation on Chapter Eight DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 7 th Edition