Data Warehouse Fundamentals Rabie A. Ramadan, PhD 2.

Data Warehouse Fundamentals Rabie A. Ramadan, PhD 2

2 Your Assignment
For an airline company, how can strategic information increase the number of frequent flyers? Discuss, giving specific details.
You are a Senior Analyst in the IT department of a company manufacturing automobile parts. The marketing heads are complaining about the poor response by IT in providing strategic information. Draft a proposal to them explaining the reasons for the problems and why a data warehouse would be the only viable solution.

3 Lecture Objectives
Review formal definitions of a data warehouse
Discuss the defining features
Distinguish between data warehouses and data marts
Study each component or building block that makes up a data warehouse

4 What is a Data Warehouse? (a practitioner’s viewpoint)
“A data warehouse is simply a single, complete, and consistent store of data obtained from a variety of sources and made available to end users in a way they can understand and use it in a business context” – Barry Devlin, IBM Consultant
“A data warehouse is a database of data gathered from many systems and intended to support management reporting and decision making” – Michael Corey et al, CTO of OneWarranty.com

5 What is a Data Warehouse? (a classical viewpoint)
According to W. H. Inmon (Building the Data Warehouse, 1992): “A DW is a subject oriented, integrated, time varying, non-volatile collection of data that is used primarily in organizational decision making.”

6 WHAT IS DATA WAREHOUSING
A data warehouse is typically a dedicated database system for decision making that is separate from the production database(s) used operationally.
It differs from a production system in that:
it covers a much longer time horizon than transaction systems;
it includes data from multiple databases, processed so that the warehouse’s data are defined uniformly (i.e., ‘clean’ data);
it is optimized for answering complex queries from managers and analysts.

7 Standard DB v. DW

8 CHARACTERISTICS

10 Characteristics of a Data Warehouse

11 Characteristics of a Data Warehouse

12 SUBJECT ORIENTATION Data is organized around major subjects of the enterprise.

13 Subject Oriented
Data warehouses are designed to help you analyze data. For example, to learn more about your company's sales data, you can build a warehouse that concentrates on sales. Using this warehouse, you can answer questions like "Who was our best customer for this item last year?" This ability to define a data warehouse by subject matter, sales in this case, makes the data warehouse subject oriented. Similarly, claims data are organized around the subject of claims rather than around the individual Auto Insurance and Workers’ Comp applications.
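
The sample question above ("Who was our best customer for this item last year?") can be sketched in a few lines of Python. This is only an illustration: the SALES rows, the customer and item names, and the best_customer function are all made up for this example and do not come from the slides.

```python
from collections import defaultdict

# Hypothetical sales facts: (customer, item, year, amount).
# All names and figures are invented for illustration.
SALES = [
    ("Acme", "widget", 2023, 1200.0),
    ("Bolt", "widget", 2023, 900.0),
    ("Acme", "gear", 2023, 300.0),
    ("Bolt", "widget", 2023, 500.0),
    ("Acme", "widget", 2022, 5000.0),
]


def best_customer(sales, item, year):
    """Return (customer, total) for the biggest buyer of `item` in `year`.

    Returns None when no sale matches.
    """
    totals = defaultdict(float)
    for customer, sold_item, sold_year, amount in sales:
        if sold_item == item and sold_year == year:
            totals[customer] += amount
    if not totals:
        return None
    # Pick the customer with the largest summed amount.
    return max(totals.items(), key=lambda pair: pair[1])


if __name__ == "__main__":
    print(best_customer(SALES, "widget", 2023))
```

Because every row is about the one subject (sales), the question reduces to a filter and a sum, regardless of which operational application originally recorded each sale.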

14 Class Activity
A data warehouse is subject oriented. What would be the major critical business subjects for a local community bank as a business unit?
Customer
Profit
Loans

15 Integrated Integration is closely related to subject orientation. Data warehouses must put data from disparate sources into a consistent format. They must resolve such problems as naming conflicts and inconsistencies among units of measure. When they achieve this, they are said to be integrated.
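
A minimal sketch of what integration can look like, assuming two hypothetical source systems that name the same fields differently and record weight in different units. The field names (cust_nm, wt_lb, customer, weight_kg) and the functions are assumptions made for this example only.

```python
# Exact pound-to-kilogram conversion factor.
LB_TO_KG = 0.45359237


def from_system_a(record):
    """Map a hypothetical system A row (cust_nm, wt_lb) to the common format."""
    return {
        "customer_name": record["cust_nm"].strip().title(),
        "weight_kg": round(record["wt_lb"] * LB_TO_KG, 3),
    }


def from_system_b(record):
    """Map a hypothetical system B row (customer, weight_kg) to the common format."""
    return {
        "customer_name": record["customer"].strip().title(),
        "weight_kg": round(float(record["weight_kg"]), 3),
    }


def integrate(system_a_rows, system_b_rows):
    """Return all rows in one format: same field names, same unit."""
    rows = [from_system_a(r) for r in system_a_rows]
    rows += [from_system_b(r) for r in system_b_rows]
    return rows


if __name__ == "__main__":
    a = [{"cust_nm": " acme corp ", "wt_lb": 10}]
    b = [{"customer": "BOLT LTD", "weight_kg": "4.5"}]
    for row in integrate(a, b):
        print(row)
```

Each source gets its own small mapping function, so a naming conflict or unit mismatch is resolved once, at load time, rather than in every query.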

16 Non volatile Non-volatile means that, once entered into the warehouse, data are not changed/updated. This is logical because the purpose of a warehouse is to enable you to analyze what has occurred.
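
One way to picture non-volatility is a store that can only be loaded and read. The AppendOnlyStore class below is a hypothetical illustration, not how any particular warehouse product works: it offers no update or delete operation, and it hands out copies so that readers cannot alter what was loaded.

```python
class AppendOnlyStore:
    """Toy non-volatile store: rows can be loaded and read, never changed."""

    def __init__(self):
        self._rows = []

    def load(self, row):
        # Store a copy so later changes to the caller's dict
        # do not alter what was loaded.
        self._rows.append(dict(row))

    def rows(self):
        # Return copies so readers cannot modify stored history.
        return [dict(r) for r in self._rows]


if __name__ == "__main__":
    store = AppendOnlyStore()
    store.load({"order_id": 1, "amount": 10.0})
    snapshot = store.rows()
    snapshot[0]["amount"] = 99.0      # changes only the reader's copy
    print(store.rows()[0]["amount"])  # stored value is unchanged
```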

17 Time Variant In order to discover trends in business, analysts need large amounts of data. This is very much in contrast to online transaction processing (OLTP) systems, where performance requirements demand that historical data be moved to an archive. The data are kept for many years so they can be used for trends, forecasting, and comparisons over time. A data warehouse's focus on change over time is what is meant by the term time variant.
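
To make the contrast concrete: an operational system typically holds only the current value, while a warehouse keeps one value per period so that changes can be compared. The sketch below assumes hypothetical month-end snapshots of a single balance; the SNAPSHOTS data and the period_over_period function are invented for illustration.

```python
# Hypothetical month-end snapshots: (period, balance).
SNAPSHOTS = [
    ("2023-03", 90.0),
    ("2023-01", 100.0),
    ("2023-02", 120.0),
]


def period_over_period(snapshots):
    """Return (period, change from the previous period) in time order."""
    # "YYYY-MM" strings sort chronologically.
    ordered = sorted(snapshots)
    return [
        (current[0], current[1] - previous[1])
        for previous, current in zip(ordered, ordered[1:])
    ]


if __name__ == "__main__":
    for period, change in period_over_period(SNAPSHOTS):
        print(period, change)
```

With only the latest balance (90.0), neither the rise in February nor the fall in March could be recovered; keeping the history is what makes the trend visible.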

18 Data Granularity

19 DATA MARTS
Data Mart: a scaled-down version of the data warehouse.
A data mart is a small warehouse designed for the Small Business Unit (SBU) or department level.
It is often a way to gain entry and provide an opportunity to learn.
Major problem: if data marts differ from department to department, they can be difficult to integrate enterprise-wide.
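
One way to reduce the integration problem, shown here as an assumption rather than something the slides prescribe, is to derive each department's mart from a single warehouse so that every mart shares the same field names. The WAREHOUSE rows and the build_mart function are hypothetical.

```python
# Hypothetical enterprise warehouse rows with one shared set of field names.
WAREHOUSE = [
    {"dept": "sales", "customer_id": 1, "measure": "revenue", "value": 500.0},
    {"dept": "claims", "customer_id": 1, "measure": "claim_paid", "value": 200.0},
    {"dept": "sales", "customer_id": 2, "measure": "revenue", "value": 300.0},
]


def build_mart(warehouse, dept):
    """Return a department-level subset of the warehouse, as copies."""
    return [dict(row) for row in warehouse if row["dept"] == dept]


if __name__ == "__main__":
    sales_mart = build_mart(WAREHOUSE, "sales")
    claims_mart = build_mart(WAREHOUSE, "claims")
    print(len(sales_mart), len(claims_mart))
```

Marts built this way can still be joined on customer_id, because the key and the field layout come from the same source.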

20 Data Warehouse and Data Mart

21 Data Mart and Data Warehouse

22 Data Warehouse COST
Data warehouses are not cheap.
Median cost to create (not including operating cost): $2.2M.
Multimillion-dollar costs are common.
Their design and implementation are still an art, and they require considerable time to create.

23 Data Warehouse SIZE
Because data warehouses are designed for the enterprise, so that everyone has a common data set, they are large and grow over time.
Typical storage sizes run from 50 gigabytes to several terabytes.

24 APPLICATION - DATA MINING
Also known as Knowledge Discovery in Databases (KDD).
Data mining refers to finding answers about a business from the data warehouse, to questions that the executive or analyst had not thought to ask.

25 Data Warehouse Architectures

26 Data Warehouse Architectures: Basic

27 Data Warehouse Architectures: with a Staging Area

28 Data Warehouse Architectures: with a Staging Area and Data Marts

29 A General Architecture for Data Warehousing

31 Problems and Issues

32 Data Systems Supporting DW

33 Class Activity

34 Class Activity What are the main components of a data warehouse for your school system?

35 Project
Egypt Election System
Governorates’ database system
Multiple databases on multiple servers
Summarization system
Metadata
Data warehouse server
Web page with query-based system