Data Warehouse and Business Intelligence Dr. Minder Chen Fall 2009.

Slides:



Advertisements
Similar presentations
Dimensional Modeling.
Advertisements

MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management
IS 4420 Database Fundamentals Chapter 11: Data Warehousing Leon Chen
BY LECTURER/ AISHA DAWOOD DW Lab # 2. LAB EXERCISE #1 Oracle Data Warehousing Goal: Develop an application to implement defining subject area, design.
Dimensional Modeling Business Intelligence Solutions.
ICS 421 Spring 2010 Data Warehousing (1) Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 3/18/20101Lipyeow.
Data Warehouse IMS5024 – presented by Eder Tsang.
Data Warehouse and Business Intelligence Dr. Minder Chen Fall 2008.
Data Warehouse and Business Intelligence
Data Warehouse and Business Intelligence Dr. Minder Chen Spring 2010.
Data Warehousing - 3 ISYS 650. Snowflake Schema one or more dimension tables do not join directly to the fact table but must join through other dimension.
MIS 451 Building Business Intelligence Systems Logical Design (3) – Design Multiple-fact Dimensional Model.
COMP 578 Data Warehousing And OLAP Technology Keith C.C. Chan Department of Computing The Hong Kong Polytechnic University.
Data Warehousing. On-Line Analytical Processing (OLAP) Tools The use of a set of graphical tools that provides users with multidimensional views of their.
1 © Prentice Hall, 2002 Chapter 11: Data Warehousing.
Business Intelligence
Data Warehousing DSCI 4103 Dr. Mennecke Introduction and Chapter 1.
Data warehousing theory and modelling techniques Building Dimensional Models.
Business Intelligence
DWH – Dimesional Modeling PDT Genči. 2 Outline Requirement gathering Fact and Dimension table Star schema Inside dimension table Inside fact table STAR.
©Silberschatz, Korth and Sudarshan18.1Database System Concepts - 5 th Edition, Aug 26, 2005 Buzzword List OLTP – OnLine Transaction Processing (normalized,
DW-1: Introduction to Data Warehousing. Overview What is Database What Is Data Warehousing Data Marts and Data Warehouses The Data Warehousing Process.
Datawarehouse & Datamart OLAPs vs. OLTPs Dimensional Modeling Creating Physical Design Using SQL Mgt. Studio Module II: Designing Datamarts 1.
Business Intelligence Zamaneh Jahed. What is Business Intelligence? Business Intelligence (BI) is a broad category of applications and technologies for.
Data warehousing and online analytical processing- Ref Chap 4) By Asst Prof. Muhammad Amir Alam.
DIMENSIONAL MODELLING. Overview Clearly understand how the requirements definition determines data design Introduce dimensional modeling and contrast.
Chapter 1 Adamson & Venerable Spring Dimensional Modeling Dimensional Model Basics Fact & Dimension Tables Star Schema Granularity Facts and Measures.
Data Warehouse. Design DataWarehouse Key Design Considerations it is important to consider the intended purpose of the data warehouse or business intelligence.
1 Data Warehouses BUAD/American University Data Warehouses.
OLAP & DSS SUPPORT IN DATA WAREHOUSE By - Pooja Sinha Kaushalya Bakde.
Data Warehousing.
1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.
October 28, Data Warehouse Architecture Data Sources Operational DBs other sources Analysis Query Reports Data mining Front-End Tools OLAP Engine.
Building Data and Document-Driven Decision Support Systems How do managers access and use large databases of historical and external facts?
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
Normalized model vs dimensional model
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
Ch3 Data Warehouse Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009.
Designing a Data Warehousing System. Overview Business Analysis Process Data Warehousing System Modeling a Data Warehouse Choosing the Grain Establishing.
CISB113 Fundamentals of Information Systems Data Management.
UNIT-II Principles of dimensional modeling
Decision supports Systems Components
Foundations of Business Intelligence: Databases and Information Management.
Business Intelligence Transparencies 1. ©Pearson Education 2009 Objectives What business intelligence (BI) represents. The technologies associated with.
Data Warehousing.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
Data Resource Management Agenda What types of data are stored by organizations? How are different types of data stored? What are the potential problems.
The Need for Data Analysis 2 Managers track daily transactions to evaluate how the business is performing Strategies should be developed to meet organizational.
© 2009 Pearson Education, Inc. Publishing as Prentice Hall 1 Lecture 14: Data Warehousing Modern Database Management 9 th Edition Jeffrey A. Hoffer, Mary.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 9: DATA WAREHOUSING.
The Concepts of Business Intelligence Microsoft® Business Intelligence Solutions.
1 Data Warehousing Data Warehousing. 2 Objectives Definition of terms Definition of terms Reasons for information gap between information needs and availability.
Chapter 3 Building Business Intelligence Chapter 3 DATABASES AND DATA WAREHOUSES Building Business Intelligence 6/22/2016 1Management Information Systems.
Data Mining and Data Warehousing: Concepts and Techniques What is a Data Warehouse? Data Warehouse vs. other systems, OLTP vs. OLAP Conceptual Modeling.
Data warehouse and OLAP
Data Warehouse.
Star Schema.
Data Warehouse and Business Intelligence
Competing on Analytics II
Retail Sales is used to illustrate a first dimensional model
Data Warehouse and OLAP
CHAPTER SIX OVERVIEW SECTION 6.1 – DATABASE FUNDAMENTALS
Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009
Retail Sales is used to illustrate a first dimensional model
Retail Sales is used to illustrate a first dimensional model
DWH – Dimesional Modeling
Data Warehouse and OLAP
Data Warehouse and OLAP Technology
Presentation transcript:

Data Warehouse and Business Intelligence Dr. Minder Chen Fall 2009

Data Warehouse - 2 © Minder Chen, BI Business Intelligence (BI) is the process of gathering meaningful information to answer questions and identify significant trends or patterns, giving key stakeholders the ability to make better business decisions. “The key in business is to know something that nobody else knows.” -- Aristotle Onassis PHOTO: HULTON-DEUTSCH COLLHULTON-DEUTSCH COLL “To understand is to perceive patterns.” — Sir Isaiah Berlin "The manager asks how and when, the leader asks what and why. " — “ On Becoming a Leader” by Warren Bennis

Data Warehouse - 3 © Minder Chen, BI Questions What happened? –What were our total sales this month? What’s happening? – Are our sales going up or down, trend analysis Why? –Why have sales gone down? What will happen? –Forecasting & “What If” Analysis What do I want to happen? –Planning & Targets Source: Bill Baker, Microsoft

Data Warehouse - 4 © Minder Chen, Increasing potential to support business decisions (MIS) End User Business Analyst Data Analyst DBA Making Decisions Data Presentation Visualization Techniques Data Mining Information Discovery Data Exploration OLAP, MDA, Statistical Analysis, Querying and Reporting Data Warehouses / Data Marts Data Sources (Paper, Files, Information Providers, Database Systems, OLTP) Business Intelligence

Data Warehouse - 5 © Minder Chen, Inmon's Definition of Data Warehouse – Data View A warehouse is a –subject-oriented, –integrated, –time-variant and –non-volatile collection of data in support of management's decision making process. –Bill Inmon in 1990 Source:

Data Warehouse - 6 © Minder Chen, Inmon's Definition Explain Subject-oriented: They are organized around major subjects such as customer, supplier, product, and sales. Data warehouses focus on modeling and analysis to support planning and management decisions vs. operations and transaction processing. Integrated: Data warehouses involve an integration of sources such as relational databases, flat files, and on- line transaction records. Processes such as data cleansing and data scrubbing achieve data consistency in naming conventions, encoding structures, and attribute measures. Time-variant: Data contained in the warehouse provide information from an historical perspective. Nonvolatile: Data contained in the warehouse are physically separate from data present in the operational environment.

Data Warehouse - 7 © Minder Chen, The Data Warehouse Process Data Marts and cubes DataWarehouse SourceSystems Clients Design the PopulateCreateQuery Data Warehouse Data WarehouseOLAP CubesData Design the PopulateCreateQuery Data Warehouse Data WarehouseOLAP CubesData 34 Query Tools ReportingAnalysis Data Mining 2 1 Extract, Transform, & Loading ETL

Data Warehouse - 8 © Minder Chen, OLTP Normalized Design Ordering Process Ware- house POS Process Chain Retailer Retailer Returns Retailer Payments Store Product Brand GL Account Clerk Retail Cust Cash Register Retail Promo

Data Warehouse - 9 © Minder Chen, OLTP Versus Business Intelligence: Who asks what? OLTP Questions When did that order ship? How many units are in inventory? Does this customer have unpaid bills? Are any of customer X’s line items on backorder? Analysis Questions What factors affect order processing time? How did each product line (or product) contribute to profit last quarter? Which products have the lowest Gross Margin? What is the value of items on backorder, and is it trending up or down over time?

Data Warehouse - 10 © Minder Chen, OLTP vs. OLAP Source:

Data Warehouse - 11 © Minder Chen, Dimensional Design Process Select the business process to model Declare the grain of the business process/data in the fact table Choose the dimensions that apply to each fact table row Identify the numeric facts that will populate each fact table row Business Requirements Data Realities

Data Warehouse - 12 © Minder Chen, Select a business process to model Not business departments or business functions Cross-functional business processes Business events Examples: –Raw materials purchasing –Order fulfillment process –Shipments –Invoicing –Inventory –General ledger

Data Warehouse - 13 © Minder Chen, Requirements

Data Warehouse - 14 © Minder Chen, Identifying Measures and Dimensions The attribute varies continuously: Balance Unit Sold Cost Sales The attribute is perceived as a constant or discrete value: Description Location Color Size DimensionsMeasures Performance Measures for KPI Performance Drivers

Data Warehouse - 15 © Minder Chen, A Dimensional Model for a Grocery Store Sales

Data Warehouse - 16 © Minder Chen, Product Dimension SKU: Stock Keeping Unit Hierarchy: –Department  Category  Subcategory  Brand  Product

Data Warehouse - 17 © Minder Chen, Creating Dimensional Model Identify fact tables Translate business measures into fact tables Analyze source system information for additional measures Identify base and derived measures Document additivity of measures Identify dimension tables Link fact tables to the dimension tables Create views for users

Data Warehouse - 18 © Minder Chen, Transaction Level Order Item Fact Table Always has a date dimension

Data Warehouse - 19 © Minder Chen, Inside a Dimension Table Dimension table key: Uniquely identify each row. Use surrogate key (integer). Table is wide: A table may have many attributes (columns). Textual attributes. Descriptive attributes in string format. No numerical values for calculation. Attributes not directly related: E.g., product color and product package size. No transitive dependency. Not normalized (star schemar). Drilling down and rolling up along a dimension. One or more hierarchy within a dimension. Fewer number of records.

Data Warehouse - 20 © Minder Chen, Fact Tables Fact tables have the following characteristics: Contain numeric measures (metric) of the business May contain summarized (aggregated) data May contain date-stamped data Are typically additive Have key value that is typically a concatenated key composed of the primary keys of the dimensions Joined to dimension tables through foreign keys that reference primary keys in the dimension tables