DWH-Ahsan Abdullah 1 Data Warehousing Lecture-2 Introduction and Background Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for.

Slides:



Advertisements
Similar presentations
Lecture-19 ETL Detail: Data Cleansing
Advertisements

Lecture-1 Introduction and Background
DWH-Ahsan Abdullah 1 Data Warehousing Lecture-5 Types & Typical Applications of DWH Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center.
Workload Management BMO Financial Group Case Study IRMAC, January 2008 Sorina Faur, Database Development Manager.
Data Management for Decision Support Session - 1 Prof. Bharat Bhasker.
ICS 421 Spring 2010 Data Warehousing (1) Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 3/18/20101Lipyeow.
Business Intelligence Michael Gross Tina Larsell Chad Anderson.
Data Warehousing - 3 ISYS 650. Snowflake Schema one or more dimension tables do not join directly to the fact table but must join through other dimension.
DWH-Ahsan Abdullah 1 Data Warehousing Lab Lect-1 DTS: Introduction Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
Lecture-33 DWH Implementation: Goal Driven Approach (1)
Introduction to Data Warehousing Enrico Franconi CS 636.
Presented To: Madam Nadia Gul Presented By: Bi Bi Mariam.
Lecture-1 Introduction and Background
DWH-Ahsan Abdullah 1 Data Warehousing Lab Lect-2 Lab Data Set Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
D ATABASE S YSTEMS D ATA W AREHOUSING I Asma Ahmad 29 th April, 2011.
Ahsan Abdullah 1 Data Warehousing Lecture-12 Relational OLAP (ROLAP) Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
CIS 429—Chapter 8 Accessing Organizational Information—Data Warehouse.
Ahsan Abdullah 1 Data Warehousing Lecture-17 Issues of ETL Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
© 2008 Pearson Prentice Hall, Experiencing MIS, David Kroenke Slide 1 Chapter 9 Competitive Advantage with Information Systems for Decision Making.
INTRODUCTION TO DATA MINING MIS2502 Data Analytics.
Computer Science Centre University of Indonesia Chapter 1 Introduction to Decision Support Systems Mallach, 2000.
Ahsan Abdullah 1 Data Warehousing Lecture-11 Multidimensional OLAP (MOLAP) Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for.
Data Warehousing 1 Lecture-24 Need for Speed: Parallelism Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
DWH-Ahsan Abdullah 1 Data Warehousing Lecture-37 Case Study: Agri-Data Warehouse Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center.
1 Data Warehousing Lecture-13 Dimensional Modeling (DM) Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics Research.
Ahsan Abdullah 1 Data Warehousing Lecture-7De-normalization Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
DWH-Ahsan Abdullah 1 Data Warehousing Lecture-4 Introduction and Background Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for.
Data warehousing and online analytical processing- Ref Chap 4) By Asst Prof. Muhammad Amir Alam.
Data Warehouse Fundamentals Rabie A. Ramadan, PhD 2.
1 Data Warehouses BUAD/American University Data Warehouses.
Ahsan Abdullah 1 Data Warehousing Lecture-18 ETL Detail: Data Extraction & Transformation Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. &
Ahsan Abdullah 1 Data Warehousing Lecture-9 Issues of De-normalization Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
Data Warehousing 1 Lecture-28 Need for Speed: Join Techniques Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
1 Data Warehousing Lecture-14 Process of Dimensional Modeling Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
Data Warehousing Lecture-2 Introduction and Background 1.
Dr. Abdul Basit Siddiqui Assistant Professor FUIEMS (Lecture Slides Week # 2)
Ahsan Abdullah 1 Data Warehousing Lecture-10 Online Analytical Processing (OLAP) Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center.
Data Warehousing Lecture-31 Supervised vs. Unsupervised Learning Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
Ahsan Abdullah 1 Data Warehousing Lecture-16 Extract Transform Load (ETL) Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for.
1 Technology in Action Chapter 11 Behind the Scenes: Databases and Information Systems Copyright © 2010 Pearson Education, Inc. Publishing as Prentice.
1 Data Warehousing Lecture-15 Issues of Dimensional Modeling Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
Data Warehousing Lecture-30 What can Data Mining do? Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics Research.
DWH-Ahsan Abdullah 1 Data Warehousing Lecture-29 Brief Intro. to Data Mining Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center.
DWH-Ahsan Abdullah 1 Data Warehousing Lecture-22 DQM: Quantifying Data Quality Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center.
MIS2502: Data Analytics Advanced Analytics - Introduction.
Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse.
Data Warehousing/Mining 1 Data Warehousing/Mining Introduction.
Ahsan Abdullah 1 Data Warehousing Lecture-8 De-normalization Techniques Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
The Need for Data Analysis 2 Managers track daily transactions to evaluate how the business is performing Strategies should be developed to meet organizational.
Data Warehouse – Your Key to Success. Data Warehouse A data warehouse is a  subject-oriented  Integrated  Time-variant  Non-volatile  Restructure.
DWH-Ahsan Abdullah 1 Data Warehousing Lecture-21 Introduction to Data Quality Management (DQM) Virtual University of Pakistan Ahsan Abdullah Assoc. Prof.
BUSINESS INTELLIGENCE. The new technology for understanding the past & predicting the future … BI is broad category of technologies that allows for gathering,
Business Intelligence Overview
Lecture-3 Introduction and Background
MIS2502: Data Analytics Advanced Analytics - Introduction
Data warehouse.
Data Warehousing CIS 4301 Lecture Notes 4/20/2006.
Lecture-32 DWH Lifecycle: Methodologies
Data warehouse and OLAP
Data Mining.
Information Technology for Management
Introduction to Data Warehousing
Lecture-38 Case Study: Agri-Data Warehouse
Lecture-35 DWH Implementation: Pitfalls, Mistakes, Keys
Business Intelligence
Data Warehousing Concepts
Data Warehouse and OLAP Technology
Data Warehousing & DATA MINING (SE-409) Lecture-1 Introduction and Background Huma Ayub Software Engineering department University of Engineering and Technology,
Presentation transcript:

DWH-Ahsan Abdullah 1 Data Warehousing Lecture-2 Introduction and Background Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics Research FAST National University of Computers & Emerging Sciences, Islamabad

DWH-Ahsan Abdullah 2 Introduction and Background

DWH-Ahsan Abdullah 3 Why a Data Warehouse (DWH)?  Data recording and storage is growing.  History is excellent predictor of the future.  Gives total view of the organization.  Intelligent decision-support is required for decision-making.

DWH-Ahsan Abdullah 4  Data Sets are growing. How Much Data is that? 1 MB 2 20 or 10 6 bytes Small novel – 3 1 / 2 Disk 1 GB 2 30 or 10 9 bytes Paper rims that could fill the back of a pickup van 1 TB 2 40 or bytes 50,000 trees chopped and converted into paper and printed 2 PB 1 PB = 2 50 or bytes Academic research libraries across the U.S. 5 EB 1 EB = 2 60 or bytes All words ever spoken by human beings Reason-1: Why a Data Warehouse?

DWH-Ahsan Abdullah 5 Reason-1: Why a Data Warehouse?  Size of Data Sets are going up .  Cost of data storage is coming down .  The amount of data average business collects and stores is doubling every year  Total hardware and software cost to store and manage 1 Mbyte of data  1990: ~ $15  2002: ~ ¢15 (Down 100 times)  By 2007: < ¢1 (Down 150 times)

DWH-Ahsan Abdullah 6 Reason-1: Why a Data Warehouse?  A Few Examples  WalMart: 24 TB  France Telecom: ~ 100 TB  CERN: Up to 20 PB by 2006  Stanford Linear Accelerator Center (SLAC): 500TB

DWH-Ahsan Abdullah 7Caution! A Warehouse of Data is NOT a Data Warehouse

DWH-Ahsan Abdullah 8Caution! Size is NOT Everything

DWH-Ahsan Abdullah 9  Businesses demand Intelligence (BI).  Complex questions from integrated data.  “Intelligent Enterprise” Reason-2: Why a Data Warehouse?

DWH-Ahsan Abdullah 10 Reason-2: Why a Data Warehouse? List of all items that were sold last month? List of all items purchased by Tariq Majeed? The total sales of the last month grouped by branch? How many sales transactions occurred during the month of January? DBMS Approach

DWH-Ahsan Abdullah 11 Reason-2: Why a Data Warehouse? Which items sell together? Which items to stock? Where and how to place the items? What discounts to offer? How best to target customers to increase sales at a branch? Which customers are most likely to respond to my next promotional campaign, and why? Intelligent Enterprise

DWH-Ahsan Abdullah 12  Businesses want much more…  What happened?  Why it happened?  What will happen?  What is happening?  What do you want to happen? Reason-3: Why a Data Warehouse? Stages of Data Warehouse

DWH-Ahsan Abdullah 13 What is a Data Warehouse? A complete repository of historical corporate data extracted from transaction systems that is available for ad-hoc access by knowledge workers.

DWH-Ahsan Abdullah 14 What is a Data Warehouse? Complete repository History Transaction System Ad-Hoc access Knowledge workers

DWH-Ahsan Abdullah 15 What is a Data Warehouse? Transaction System  Management Information System (MIS)  Could be typed sheets (NOT transaction system) Ad-Hoc access  D ose not have a certain access pattern.  Queries not known in advance.  Difficult to write SQL in advance. Knowledge workers  Typically NOT IT literate (Executives, Analysts, Managers).  NOT clerical workers.  Decision makers.

DWH-Ahsan Abdullah 16 Another View of a DWH Subject Oriented Integrated Time Variant Non Volatile