Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales.

Slides:



Advertisements
Similar presentations
An overview of Data Warehousing and OLAP Technology Presented By Manish Desai.
Advertisements

ITEC 423 Data Warehousing and Data Mining Lecture 3.
Data Warehouse Architecture Sakthi Angappamudali Data Architect, The Oregon State University, Corvallis 16 th May, 2005.
Database – Part 3 Dr. V.T. Raja Oregon State University External References/Sources: Data Warehousing – Mr. Sakthi Angappamudali.
ICS 421 Spring 2010 Data Warehousing (1) Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 3/18/20101Lipyeow.
Database – Part 2b Dr. V.T. Raja Oregon State University External References/Sources: Data Warehousing – Sakthi Angappamudali at Standard Insurance; BI.
Ch1: File Systems and Databases Hachim Haddouti
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Warehouse Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
Data Warehousing DSCI 4103 Dr. Mennecke Introduction and Chapter 1.
An Overview of Data Warehousing and OLTP Technology Presenter: Parminder Jeet Kaur Discussion Lead: Kailang.
A Comparsion of Databases and Data Warehouses Name: Liliana Livorová Subject: Distributed Data Processing.
By N.Gopinath AP/CSE. Why a Data Warehouse Application – Business Perspectives  There are several reasons why organizations consider Data Warehousing.
Database Systems: Design, Implementation, and Management Ninth Edition
Basic Concepts of Datawarehousing An Overview Prasanth Gurram.
DATA WAREHOUSING Introduction and Overview. What is a Data Warehouse? A complete repository of corporate data extracted from transaction systems that.
Understanding Data Warehousing
Bab 3 Data Warehousing. Why Data Warehouse? Scenario 1 ABC Pvt Ltd is a company with branches at Mumbai, Delhi, Chennai and Banglore. The Sales Manager.
DW-1: Introduction to Data Warehousing. Overview What is Database What Is Data Warehousing Data Marts and Data Warehouses The Data Warehousing Process.
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie.
AN OVERVIEW OF DATA WAREHOUSING
© 2007 by Prentice Hall 1 Introduction to databases.
Data warehousing and online analytical processing- Ref Chap 4) By Asst Prof. Muhammad Amir Alam.
Data Warehouse Fundamentals Rabie A. Ramadan, PhD 2.
1 Data Warehouses BUAD/American University Data Warehouses.
2 Copyright © Oracle Corporation, All rights reserved. Defining Data Warehouse Concepts and Terminology.
OLAP & DSS SUPPORT IN DATA WAREHOUSE By - Pooja Sinha Kaushalya Bakde.
The Data Warehouse “A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile collection of “all” an organisation’s data in support.
1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.
Dr. Abdul Basit Siddiqui Assistant Professor FUIEMS (Lecture Slides Week # 2)
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
CISB594 – Business Intelligence
October 28, Data Warehouse Architecture Data Sources Operational DBs other sources Analysis Query Reports Data mining Front-End Tools OLAP Engine.
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
Ch3 Data Warehouse Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009.
CISB594 – Business Intelligence Data Warehousing Part I.
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide
CISB594 – Business Intelligence Data Warehousing Part I.
The Data Warehouse “A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile collection of “all” an organisation’s data in support.
Data Mining Data Warehouses.
Pooja Sharma Shanti Ragathi Vaishnavi Kasala. BUSINESS BACKGROUND Lowe's started as a single hardware store in North Carolina in 1946 and since then has.
CISB594 – Business Intelligence Data Warehousing Part I.
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse.
Introduction to Data Warehousing. Why Data Warehouse? Necessity is the mother of invention.
The Need for Data Analysis 2 Managers track daily transactions to evaluate how the business is performing Strategies should be developed to meet organizational.
Introduction DATAWAREHOUSE What is Data Warehousing? A process of transforming data into information and making it available to.
Chapter 8: Data Warehousing. Data Warehouse Defined A physical repository where relational data are specially organized to provide enterprise- wide, cleansed.
Data Warehouse Data Mart Elahe Soroush. Agenda  Data Warehouse definition  Concepts  Logical transformation  Physical transformation  DW components.
Data Warehouse – Your Key to Success. Data Warehouse A data warehouse is a  subject-oriented  Integrated  Time-variant  Non-volatile  Restructure.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 9: DATA WAREHOUSING.
Business Intelligence and Decision Support Systems (9 th Ed., Prentice Hall) Chapter 8: Data Warehousing.
2 Copyright © 2006, Oracle. All rights reserved. Defining Data Warehouse Concepts and Terminology.
BUSINESS INTELLIGENCE. The new technology for understanding the past & predicting the future … BI is broad category of technologies that allows for gathering,
1 Data Warehousing Data Warehousing. 2 Objectives Definition of terms Definition of terms Reasons for information gap between information needs and availability.
Introduction to Data Warehousing. Subject: Data Warehousing.
Data Mining and Data Warehousing: Concepts and Techniques What is a Data Warehouse? Data Warehouse vs. other systems, OLTP vs. OLAP Conceptual Modeling.
Business Intelligence Overview
Advanced Applied IT for Business 2
Data warehouse and OLAP
Data Warehouse—Subject‐Oriented
Data Warehouse.
Data Warehouse and OLAP
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie
An Introduction to Data Warehousing
Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie
Data Warehousing Concepts
Data Warehouse and OLAP
Data Warehouse and OLAP Technology
Presentation transcript:

Why Data Warehouse?

Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales report. Each branch has a separate operational system.

Scenario 1 : ABC Pvt Ltd. Mumbai Delhi Chennai Banglore Sales Manager Sales per item type per branch for first quarter.

Solution 1:ABC Pvt Ltd. Extract sales information from each database. Store the information in a common repository at a single site.

Solution 1:ABC Pvt Ltd. Mumbai Delhi Chennai Banglore Data Warehouse Sales Manager Query & Analysis tools Report

Scenario 2 One Stop Shopping Super Market has huge operational database. Whenever Executives wants some report the OLTP system becomes slow and data entry operators have to wait for some time.

Scenario 2 : One Stop Shopping Operational Database Data Entry Operator Management Wait Report

Solution 2 Extract data needed for analysis from operational database. Store it in another system, the data warehouse. Refresh warehouse at regular intervals so that it contains up to date information for analysis. Warehouse will contain data with historical perspective.

Solution 2 Operational database Data Warehouse Extract data Data Entry Operator Data Entry Operator Manager Transaction Report

Scenario 3 Cakes & Cookies is a small, new company. The chairman of this company wants his company to grow. He needs information so that he can make correct decisions.

Solution 3 Improve the quality of data before loading it into the warehouse. Perform data cleaning and transformation before loading the data. Use query analysis tools to support adhoc queries.

Solution 3 Query & Analysis tool Chairman Expansio n Improvemen t sales time Data Warehouse

Summing up? Why do you need a warehouse? Operational systems could not provide strategic information Executive and managers need such strategic information for Making proper decision Formulating business strategies Establishing goals Setting objectives Monitoring results

Why operational data is not capable of producing valuable information? Data is spread across incompatible structures and systems Not only that, improvements in technology had made computing faster, cheaper and available

FAILURES OF PAST DECISION- SUPPORT SYSTEMS OLTP systems

Decision support systems

Operational and informational

What is Data Warehouse?? Is it the only viable solution

Business intelligence at DW

Functional definition of a DW The data warehouse is an informational environment that Provides an integrated and total view of the enterprise Makes the enterprise’s current and historical information easily available for decision making Makes decision-support transactions possible without hindering operational systems Renders the organization’s information consistent Presents a flexible and interactive source of strategic information

Questions???? Describe five differences between operational systems and informational systems A data warehouse in an environment, not a product. Discuss.

)A data warehouse is not a single software or hardware product that provide the strategic information. It is a computing environment where the users can find strategic information, an environment where the users are put directly in touch with the data to make better decisions. It is a user centric environment. The characteristics of this computing environment are, 1. An ideal environment for data analysis and decision support. 2. Fluid, flexible and interactive 3. It is hundreds percent user-driven 4. Provide the ability to discover answer to complex and unpredictable questions

Inmons’s definition A data warehouse is - subject-oriented, - integrated, - time-variant, - nonvolatile collection of data in support of management’s decision making process.

Subject-oriented Data warehouse is organized around subjects such as sales, product, customer. It focuses on modeling and analysis of data for decision makers. Excludes data not useful in decision support process.

Integration Data Warehouse is constructed by integrating multiple heterogeneous sources. Data Preprocessing are applied to ensure consistency. RDBMS Legacy System Data Warehouse Flat File Data Processing Data Transformation

Integration In terms of data. encoding structures. Measurement of attributes. physical attribute. of data naming conventions. Data type format remarks

Time-variant Provides information from historical perspective, e.g. past 5-10 years Every key structure contains either implicitly or explicitly an element of time, i.e., every record has a timestamp. The time-variant nature in a DW Allows for analysis of the past Relates information to the present Enables forecasts for the future

Non-volatile Data once recorded cannot be updated. Data warehouse requires two operations in data accessing Initial loading of data Incremental loading of data load access

Data Granularity In an operational system, data is usually kept at the lowest level of detail. In a DW, data is summarized at different levels. Three data levels in a banking data warehouse Daily DetailMonthly SummaryQuaterly Summary Account Activity DateMonth AmountNo. of transactions Deposit/ WithdrawWithdrawals Deposits Beginning Balance Ending Balance

Operational v/s Information System FeaturesOperationalInformation CharacteristicsOperational processingInformational processing OrientationTransactionAnalysis UserClerk,DBA,database professional Knowledge workers FunctionDay to day operationDecision support Data ContentCurrentHistorical, archived, derived ViewDetailed, flat relationalSummarized, multidimensional DB designApplication orientedSubject oriented Unit of workShort,simple transactionComplex query AccessRead/writeRead only

Operational v/s Information System FeaturesOperationalInformation FocusData inInformation out No. of records accessed tens/ hundredsmillions Number of usersthousandshundreds DB size100MB to GB100 GB to TB UsagePredictable, repetitiveAd hoc, random, heuristic Response TimeSub-secondsSeveral seconds to minutes PriorityHigh performance,high availability High flexibility,end- user autonomy MetricTransaction throughputQuery throughput

Two approaches in designing a DW Top-down approachBottom-up approach Enterprise view of dataNarrow view of data Inherently architectedInherently incremental Single, central storage of dataFaster implementation of manageable parts Centralized rules and controlEach datamart is developed independently Takes longer time to buildComparatively less time than a DW Higher risk to failureLess risk of failure Needs higher level of cross-functional skills Unmanageable interfaces

Bottom Up Approach

Top Down Approach

A Practical Approach-Kimball 1. Plan and Define requirements 2. Create a surrounding architecture 3. Conform and Standardize the data Content 4. Implement Data Warehouse as series of super-mart one at a time.

Individual Architected Data Marts Common Logical Subject Area ERD Common Business Dimensions Common Business Rules Common Business Metrics Glossary Sales Distribution Product Marketing Customer Accounts Finance Inventory Vendors An Incremental Approach

ArchitectedEnterpriseFoundation Sales Distribution Product Marketing Customer Accounts Finance Inventory Vendors Enterprise Data Warehouse The Eventual Result

Data Warehouse: Holds multiple subject areas Holds very detailed information Works to integrate all data sources Does not necessarily use a dimensional model but feeds dimensional models. Data Mart Often holds only one subject area- for example, Finance, or Sales May hold more summarised data (although many hold full detail) Concentrates on integrating information from a given subject area or set of source systems Is built focused on a dimensional model using a star schema.

Data Warehouse verses data marts