COMP 578 Data Warehousing And OLAP Technology Keith C.C. Chan Department of Computing The Hong Kong Polytechnic University.

Slides:



Advertisements
Similar presentations
Chapter 11: Data Warehousing
Advertisements

MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management
OLAP Tuning. Outline OLAP 101 – Data warehouse architecture – ROLAP, MOLAP and HOLAP Data Cube – Star Schema and operations – The CUBE operator – Tuning.
Technical BI Project Lifecycle
Chapter 18: Data Analysis and Mining Kat Powell. Chapter 18: Data Analysis and Mining ➔ Decision Support Systems ➔ Data Analysis and OLAP ➔ Data Warehousing.
Data Warehousing CPS216 Notes 13 Shivnath Babu. 2 Warehousing l Growing industry: $8 billion way back in 1998 l Range from desktop to huge: u Walmart:
OLAP Services Business Intelligence Solutions. Agenda Definition of OLAP Types of OLAP Definition of Cube Definition of DMR Differences between Cube and.
Decision Support and Data Warehouse. Decision supports Systems Components Data management function –Data warehouse Model management function –Analytical.
OLAP. Overview Traditional database systems are tuned to many, small, simple queries. Some new applications use fewer, more time-consuming, analytic queries.
Introduction to Data Warehousing. From DBMS to Decision Support DBMSs widely used to maintain transactional data Attempts to use of these data for analysis,
Online Analytical Processing. On-Line Analytical Processing (OLAP) Tools The use of a set of graphical tools that provides users with multidimensional.
Data Sources Data Warehouse Analysis Results Data visualisation Analytical tools OLAP Data Mining Overview of Business Intelligence Data visualisation.
Data Cube and OLAP Server
Business Intelligence. On-Line Analytical Processing (OLAP) Tools The use of a set of graphical tools that provides users with multidimensional views.
Data Warehousing. On-Line Analytical Processing (OLAP) Tools The use of a set of graphical tools that provides users with multidimensional views of their.
CSE6011 Warehouse Models & Operators  Data Models  relations  stars & snowflakes  cubes  Operators  slice & dice  roll-up, drill down  pivoting.
Chapter 13 The Data Warehouse
1 © Prentice Hall, 2002 Chapter 11: Data Warehousing.
Ch3 Data Warehouse part2 Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009.
DATA WAREHOUSE (Muscat, Oman).
Online Analytical Processing (OLAP) Hweichao Lu CS157B-02 Spring 2007.
OLAP OPERATIONS. OLAP ONLINE ANALYTICAL PROCESSING OLAP provides a user-friendly environment for Interactive data analysis. In the multidimensional model,
Chetan Bhirud Raza Mohammad Abinash Sahoo Online Marketing Giant.
Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke1 Decision Support Chapter 23.
Data Warehouse & Data Mining
Online Analytical Processing. On-Line Analytical Processing (OLAP) Tools The use of a set of graphical tools that provides users with multidimensional.
DIMENSIONAL MODELLING. Overview Clearly understand how the requirements definition determines data design Introduce dimensional modeling and contrast.
1 Data Warehouses BUAD/American University Data Warehouses.
OLAP & DSS SUPPORT IN DATA WAREHOUSE By - Pooja Sinha Kaushalya Bakde.
Data Warehouse & OLAP Kuliah 1 Introduction Slide banyak mengambil dari acuan- acuan yang dipakai.
Data Warehousing.
October 28, Data Warehouse Architecture Data Sources Operational DBs other sources Analysis Query Reports Data mining Front-End Tools OLAP Engine.
By N.Gopinath AP/CSE. There are 5 categories of Decision support tools, They are; 1. Reporting 2. Managed Query 3. Executive Information Systems 4. OLAP.
Ayyat IT Group Murad Faridi Roll NO#2492 Muhammad Waqas Roll NO#2803 Salman Raza Roll NO#2473 Junaid Pervaiz Roll NO#2468 Instructor :- “ Madam Sana Saeed”
1 On-Line Analytic Processing Warehousing Data Cubes.
OLAP in DWH Ján Genči PDT. 2 Outline OLAP Definitions and Rules The term OLAP was introduced in a paper entitled “Providing On-Line Analytical.
Decision supports Systems Components
CMPE 226 Database Systems October 21 Class Meeting Department of Computer Engineering San Jose State University Fall 2015 Instructor: Ron Mak
Data Warehousing Multidimensional Analysis
Data Mining Data Warehouses.
Chapter 11: Data Warehousing Modern Database Management 6 th Edition Jeffrey A. Hoffer, Mary B. Prescott, Fred R. McFadden.
Business Intelligence Transparencies 1. ©Pearson Education 2009 Objectives What business intelligence (BI) represents. The technologies associated with.
Data Warehousing.
Advanced Database Concepts
Dr. Chen, Data Mining  A/W & Dr. Chen, Data Mining Chapter 6 The Data Warehouse Jason C. H. Chen, Ph.D. Professor of MIS School of Business Administration.
12 1 Database Systems: Design, Implementation, & Management, 6 th Edition, Rob & Coronel 12.4 Online Analytical Processing OLAP creates an advanced data.
Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke1 Data Warehousing and Decision Support.
 Definition of terms  Reasons for need of data warehousing  Describe three levels of data warehouse architectures  Describe two components of star.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Data Warehousing and Decision Support Chapter 25.
1 Database Systems, 8 th Edition Star Schema Data modeling technique –Maps multidimensional decision support data into relational database Creates.
Introduction to OLAP and Data Warehouse Assoc. Professor Bela Stantic September 2014 Database Systems.
© 2009 Pearson Education, Inc. Publishing as Prentice Hall 1 Lecture 14: Data Warehousing Modern Database Management 9 th Edition Jeffrey A. Hoffer, Mary.
Pindaro Demertzoglou Data Resource Management – MGMT 4170 Lally School of Management Rensselaer Polytechnic Institute.
Data Warehousing and OLAP Outline u Models & operations u Implementing a warehouse u Future directions.
1 Data Warehousing Data Warehousing. 2 Objectives Definition of terms Definition of terms Reasons for information gap between information needs and availability.
CMPE 226 Database Systems April 12 Class Meeting Department of Computer Engineering San Jose State University Spring 2016 Instructor: Ron Mak
Data Mining & OLAP What is Data Mining? Data Mining is the set of activities used to find new, hidden, or unexpected patterns in data.
Data Warehousing CIS 4301 Lecture Notes 4/20/2006.
Chapter 13 Business Intelligence and Data Warehouses
On-Line Analytic Processing
Chapter 13 The Data Warehouse
Data Warehouse.
المحاضرة 4 : مستودعات البيانات (Data warehouse)
Data Warehouse and OLAP
University of Houston-Clear Lake Kaiser Permanente San Jose
Introduction of Week 9 Return assignment 5-2
Data Warehousing Concepts
Data Warehouse and OLAP
Online Analytical Processing
Presentation transcript:

COMP 578 Data Warehousing And OLAP Technology Keith C.C. Chan Department of Computing The Hong Kong Polytechnic University

Winter, 2001Keith C.C. Chan2 A Generic Three-Level Architecture Data Warehouse Extract Transform Load Refresh Monitor & Integrator Metadata Data Marts Operational DBs other sources OLAP Server Selection & Aggregation

Winter, 2001Keith C.C. Chan3 The User Interface Even a well-designed data mart used only when provided with powerful, intuitive interface for access and analysis of those data. A variety of tools are available to query and analyze data stored in data warehouses and data marts. These tools may be classified as follows: – Traditional query and reporting tools. – OLAP tools. – Data mining tools. – Data visualization tools.

Winter, 2001Keith C.C. Chan4 Role of DM Metadata To build a user-friendly interface requires: – A set of metadata that describes the data in the data mart in business terms that users can easily understand. – Metadata associated with data marts are “data catalog” or “ data directory” or similar term. – Metadata serve as kind of a yellow pages directory to the data in the data marts. The metadata should allow users to easily answer questions such as: – what subjects are described in the data mart? (E.g. customers, patients, students, products, courses, etc.)

Winter, 2001Keith C.C. Chan5 Role of DM Metadata (2) – What dimensions and facts are included in the data mart? What is the grain of the fact table? – How are the data in the data mart derived from the EDW? What rules are used in the derivation? – How are the data in the EDW derived from operational data? What rules are used in this derivation? – What reports and predefined queries are available to view the data? – What drill-down and other data analysis techniques are available? Who is responsible for the quality of data in the data marts, and to whom are requests for changes made?

Winter, 2001Keith C.C. Chan6 OLAP Tools On-line analytical processing (OLAP) to contrast with the more traditional term OLTP. Also referred to as multidimensional analysis. Concerns with use of a set of graphical tools for multidimensional views of data. Allows users to analyze data using simple windowing techniques. Data can be viewed as a simple 3-D cube.

Winter, 2001Keith C.C. Chan7 A Hypercube

Winter, 2001Keith C.C. Chan8 OLAP Queries and SQL A large number of OLAP queries can be expressed using traditional SQL. This applies to simple operations as: – Roll-up and drill-down operations. – Slice_and_Dice operations – Pivoting. OLAP queries mainly use SQL aggregate functions and GROUP BY clause.

Winter, 2001Keith C.C. Chan9 OLAP queries and SQL

Winter, 2001Keith C.C. Chan10 Slicing a Cube Slicing the data cube to produce a simple two- dimensional table or view. – E.g. A slice is for the product named shoes. – Other views developed by simple “drag and drop”. – This type of operation is often called “slicing and dicing” the cube. Closely related to slicing and dicing is data pivoting. – This term refers to rotating the view for a particular data point, to obtain another perspective. – The analyst could pivot this view to obtain the sales of shoes by store for the same month.

Winter, 2001Keith C.C. Chan11 Example of Cube Slicing

Winter, 2001Keith C.C. Chan12 Dice Operation Slice_and_Dice operations reduce the number of dimensins by taking a projection of facts on a subset of dimensions and for some selected values of dimensions that are being dropped. Dice operation amounts to range select conditions.

Winter, 2001Keith C.C. Chan13 Dice Operation

Winter, 2001Keith C.C. Chan14 Pivoting In a fact that, each tuple corresponds to a measure value and there is one column for each of dimensions. The simplest view of pivoting is that it selects two dimensions to aggregate the measure. The aggregated values are often displayed in a grid where every point in the (x, y) coordinate system corresponds to an aggregated value of the measure. The x and y coordinate values are the values of the selected two dimensions The result of pivoting is called cross-tabulation.

Winter, 2001Keith C.C. Chan15 Pivoting on City & Day

Winter, 2001Keith C.C. Chan16 Roll-up and Drill Down Roll-up operation corresponds to taking the current aggregation level of fact values and doing a further aggregation on one (or more) of the dimensions. That is equivalent to doing GROUP BY to this dimension(s) by using attribute hierarchy. Roll-up operation can be understood as lowering the number of dimensions. In this case, the measure is calculated without regard to dimensions to be omitted.

Winter, 2001Keith C.C. Chan17 Drill-Down Analyzing a set of data at a finer level of detail. – E.g. A summary report for the total sales of three package sizes for a given brand of paper towels. – Further breakdown of sales by color within each of these package sizes. Breakdown obtained using a “point-and-click”. A drill-down presentation is equivalent to adding another column to the original report (in this case, a column was added for the attribute “color”).

Winter, 2001Keith C.C. Chan18 Drill-Down (2) Executing a drill-down may require that the OLAP tool “reach back” to the DW to obtain the detailed data necessary for the drill-down. This type of operation can be performed by an OLAP tool (without user participation) only if an integrated set of metadata is available to that tool. Some tools even permit the OLAP tool to reach back to the operational data if necessary for a given query.

Winter, 2001Keith C.C. Chan19 Example of OLAP Operations

Winter, 2001Keith C.C. Chan20 Data Visualization Human eye can best discern patterns when data are represented graphically. Data visualization is the representation of data in graphical and multimedia formats for human analysis. Benefits of data visualization include the ability to better observe trends and patterns, and to identify correlations and clusters. Data visualization is often used in conjunction with data mining and other analytical techniques.

Winter, 2001Keith C.C. Chan21

Winter, 2001Keith C.C. Chan22