Advanced Applied IT for Business 2

Slides:



Advertisements
Similar presentations
 Data Warehouse Architecture By: Harrison Reid. Outline  What is a Data Warehouse Architecture  Five Main Data Warehouse Architectures  Factors That.
Advertisements

Data Warehouse IMS5024 – presented by Eder Tsang.
Data Warehousing - 3 ISYS 650. Snowflake Schema one or more dimension tables do not join directly to the fact table but must join through other dimension.
Chapter 5 DATA WAREHOUSING.
Business Intelligence: A Managerial Approach (2nd Edition)
13 Chapter 13 The Data Warehouse Hachim Haddouti.
Chapter 2: Data Warehousing
Chapter 2: Data Warehousing
Chapter 13 The Data Warehouse
Data Warehousing ISYS 650. What is a data warehouse? A data warehouse is a subject-oriented, integrated, nonvolatile, time-variant collection of data.
Chapter 8: Data Warehousing
Chapter 2 Data Warehousing
© 2003, Prentice-Hall Chapter Chapter 2: The Data Warehouse Modern Data Warehousing, Mining, and Visualization: Core Concepts by George M. Marakas.
2nd semester 2010 Dr. Qusai Abuein
Week 6 Lecture The Data Warehouse Samuel Conn, Asst. Professor
Chapter 2 Data Warehousing
Data Warehouse & Data Mining
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
Intro. to Data Warehouse
Marakas: Decision Support Systems, 2nd Edition © 2003, Prentice-Hall Chapter Chapter 10: The Data Warehouse Decision Support Systems in the 21 st.
1 Data Warehouses BUAD/American University Data Warehouses.
Business Intelligence and Decision Support Systems (9 th Ed., Prentice Hall) Chapter 8: Data Warehousing.
OLAP & DSS SUPPORT IN DATA WAREHOUSE By - Pooja Sinha Kaushalya Bakde.
The Data Warehouse “A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile collection of “all” an organisation’s data in support.
CISB594 – Business Intelligence
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
CISB594 – Business Intelligence Data Warehousing Part I.
CISB594 – Business Intelligence Data Warehousing Part I.
 Understand the basic definitions and concepts of data warehouses  Describe data warehouse architectures (high level).  Describe the processes used.
UNIT-II Principles of dimensional modeling
Chapter 5 DATA WAREHOUSING Study Sections 5.2, 5.3, 5.5, Pages: & Snowflake schema.
CISB594 – Business Intelligence Data Warehousing Part I.
Creating a Data Warehouse Data Acquisition: Extract, Transform, Load Extraction Process of identifying and retrieving a set of data from the operational.
Decision Support Systems Data Warehousing. Modified from Decision Support Systems and Business Intelligence Systems 9E. 1-2 Learning Objectives Understand.
1 ISQS 3358, Business Intelligence Data Warehousing Zhangxi Lin Texas Tech University 1.
Chapter 2 Data Warehousing. Learning Objectives  Understand the basic definitions and concepts of data warehouses  Describe data warehouse architectures.
Chapter 2 Data Warehousing. Learning Objectives  Understand the basic definitions and concepts of data warehouses  Understand data warehousing architectures.
CISB594 – Business Intelligence Data Warehousing Part I.
 Understand the basic definitions and concepts of data warehouses  Describe data warehouse architectures (high level).  Describe the processes used.
Business Intelligence and Analytics: Systems for Decision Support Global Edition (10 th Edition) Chapter 3: Data Warehousing.
Chapter 8: Data Warehousing. Data Warehouse Defined A physical repository where relational data are specially organized to provide enterprise- wide, cleansed.
Business Intelligence and Decision Support Systems (9 th Ed., Prentice Hall) Chapter 5: Data Warehousing.
Business Intelligence and Decision Support Systems (9 th Ed., Prentice Hall) Chapter 8: Data Warehousing.
Business Intelligence and Decision Support Systems (9 th Ed., Prentice Hall) Chapter 8: Data Warehousing.
DATA WAREHOUSING. Learning Objectives  Understand the basic definitions and concepts of data warehouses  Understand data warehousing architectures 
Business Intelligence and Decision Support Systems (9 th Ed., Prentice Hall) Chapter 8: Data Warehousing.
Data Warehouse/Data Mart It’s all about the data.
Business Intelligence Overview
Dr.S.Sridhar,Ph.D., RACI(Paris),RZFM(Germany),RMR(USA),RIEEEProc.
Business Intelligence: A Managerial Approach (2nd Edition)
Chapter 2 Data Warehousing
Data warehouse and OLAP
Chapter 13 The Data Warehouse
Chapter 5 Data Management
Data Warehousing and Data Mining By N.Gopinath AP/CSE
Data Warehouse.
Chapter 8: Data Warehousing
Chapter 13 – Data Warehousing
Data Warehouse Architecture
المحاضرة 4 : مستودعات البيانات (Data warehouse)
Data Warehouse and OLAP
An Introduction to Data Warehousing
Introduction of Week 9 Return assignment 5-2
Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition.
Data Warehouse.
Chapter 3 DATA WAREHOUSING.
Data Warehousing Concepts
Technical Architecture
Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition.
Data Warehouse and OLAP
Presentation transcript:

Advanced Applied IT for Business 2

What is a Data Warehouse? A physical repository where relational data are specially organized to provide enterprise-wide, cleansed data in a standardized format “The data warehouse is a collection of integrated, subject-oriented databases designed to support DSS functions, where each unit of data is non-volatile and relevant to some moment in time”

Characteristics of DW Subject oriented Integrated Time-variant (time series) Nonvolatile Summarized Not normalized Metadata Web based, relational/multi-dimensional Client/server Real-time and/or right-time (active)

A departmental data warehouse that stores only relevant data Data Mart A departmental data warehouse that stores only relevant data Dependent data mart A subset that is created directly from a data warehouse Independent data mart A small data warehouse designed for a strategic business unit or a department

Data Warehousing Definitions Operational data stores (ODS) A type of database often used as an interim area for a data warehouse Oper marts An operational data mart Enterprise data warehouse (EDW) A data warehouse for the enterprise Metadata Data about data. In a data warehouse, metadata describe the contents of a data warehouse and the manner of its acquisition and use

DW Framework

DW Architecture Three-tier architecture Two-tier architecture Data acquisition software (back-end) The data warehouse that contains the data & software Client (front-end) software that allows users to access and analyze data from the warehouse Two-tier architecture First 2 tiers in three-tier architecture is combined into one Sometimes there is only one tier

DW Architectures 3-tier architecture 2-tier architecture

A Web-based DW Architecture

Data Warehousing Architectures Issues to consider when deciding which architecture to use: Which database management system (DBMS) should be used? Will parallel processing and/or partitioning be used? Will data migration tools be used to load the data warehouse? What tools will be used to support data retrieval and analysis?

Alternative DW Architectures

Alternative DW Architectures

Alternative DW Architectures Independent Data Marts Data Mart Bus Architecture Hub-and-Spoke Architecture Centralized Data Warehouse Federated Data Warehouse Each has pros and cons!

Teradata Corp. DW Architecture

Data Warehousing Architectures Ten factors that potentially affect the architecture selection decision: Information interdependence between organizational units Upper management’s information needs Urgency of need for a data warehouse Nature of end-user tasks Constraints on resources Strategic view of the data warehouse prior to implementation Compatibility with existing systems Perceived ability of the in- house IT staff Technical issues Social/political factors

Data Integration and the Extraction, Transformation, and Load (ETL) Process Integration that comprises three major processes: data access, data federation, and change capture Enterprise application integration (EAI) A technology that provides a vehicle for pushing data from source systems into a data warehouse Enterprise information integration (EII) An evolving tool space that promises real-time data integration from a variety of sources, such as relational databases, Web services, and multidimensional databases

Data Integration and the Extraction, Transformation, and Load (ETL) Process

ETL Issues affecting the purchase of ETL tool Data transformation tools are expensive Data transformation tools may have a long learning curve Important criteria in selecting an ETL tool Ability to read from and write to an unlimited number of data sources/architectures Automatic capturing and delivery of metadata A history of conforming to open standards An easy-to-use interface for the developer and the functional user

Data Warehouse Development Data warehouse development approaches Inmon Model: EDW approach (top-down) Kimball Model: Data mart approach (bottom- up) Which model is best? There is no one-size-fits-all strategy to DW One alternative is the hosted warehouse Data warehouse structure: The Star Schema vs. Relational Real-time data warehousing?

Hosted Data Warehouses Benefits: Requires minimal investment in infrastructure Frees up capacity on in-house systems Frees up cash flow Makes powerful solutions affordable Enables powerful solutions that provide for growth Offers better quality equipment and software Provides faster connections Enables users to access data remotely Allows a company to focus on core business Meets storage needs for large volumes of data

Representation of Data in DW Dimensional Modeling – a retrieval-based system that supports high-volume query access Star schema – the most commonly used and the simplest style of dimensional modeling Contain a fact table surrounded by and connected to several dimension tables Fact table contains the descriptive attributes (numerical values) needed to perform decision analysis and query reporting Dimension tables contain classification and aggregation information about the values in the fact table Snowflakes schema – an extension of star schema where the diagram resembles a snowflake in shape

Multidimensionality Multidimensionality Multidimensional presentation The ability to organize, present, and analyze data by several dimensions, such as sales by region, by product, by salesperson, and by time (four dimensions) Multidimensional presentation Dimensions: products, salespeople, market segments, business units, geographical locations, distribution channels, country, or industry Measures: money, sales volume, head count, inventory profit, actual versus forecast Time: daily, weekly, monthly, quarterly, or yearly

Star vs Snowflake Schema