Chapter 4 Tutorial.

Slides:



Advertisements
Similar presentations
Chapter 4 Tutorial.
Advertisements

Chapter 4 Tutorial.
Data Warehousing and Decision Support, part 2
OLAP Tuning. Outline OLAP 101 – Data warehouse architecture – ROLAP, MOLAP and HOLAP Data Cube – Star Schema and operations – The CUBE operator – Tuning.
Data Warehousing.
5.1Database System Concepts - 6 th Edition Chapter 5: Advanced SQL Advanced Aggregation Features OLAP.
Data Warehousing CPS216 Notes 13 Shivnath Babu. 2 Warehousing l Growing industry: $8 billion way back in 1998 l Range from desktop to huge: u Walmart:
Introduction to Data Warehousing CPS Notes 6.
1 Lecture 09: OLAP
Data Warehousing Xintao Wu. Evolution of Database Technology (See Fig. 1.1) 1960s: Data collection, database creation, IMS and network DBMS 1970s: Relational.
Dr. M. Sulaiman Khan Dept. of Computer Science University of Liverpool 2010 COMP207: Data Mining Data Warehousing COMP207: Data Mining.
Advanced Querying OLAP Part 2. Context OLAP systems for supporting decision making. Components: –Dimensions with hierarchies, –Measures, –Aggregation.
Business Intelligence. On-Line Analytical Processing (OLAP) Tools The use of a set of graphical tools that provides users with multidimensional views.
1 Lecture 10: More OLAP - Dimensional modeling
Lab3 CPIT 440 Data Mining and Warehouse.
By N.Gopinath AP/CSE. Two common multi-dimensional schemas are 1. Star schema: Consists of a fact table with a single table for each dimension 2. Snowflake.
Data Warehousing. On-Line Analytical Processing (OLAP) Tools The use of a set of graphical tools that provides users with multidimensional views of their.
CSE6011 Warehouse Models & Operators  Data Models  relations  stars & snowflakes  cubes  Operators  slice & dice  roll-up, drill down  pivoting.
Microsoft SQL Server 2012 Analysis Services (SSAS) Reporting Services (SSRS)
Ch3 Data Warehouse part2 Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009.
1 Data Warehousing and OLAP. 2 Data Warehousing & OLAP Defined in many different ways, but not rigorously.  A decision support database that is maintained.
CS346: Advanced Databases
Ch3 Data Warehouse Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2010.
1 Data Warehouses C hapter 2. 2 Chapter 2 Outline Chapter 2 Outline – Introduction –Data Warehouses –Data Warehouse in Organisation – OLTP vs. OLAP –Why.
Business Intelligence Instructor: Bajuna Salehe Web:
OLAP OPERATIONS. OLAP ONLINE ANALYTICAL PROCESSING OLAP provides a user-friendly environment for Interactive data analysis. In the multidimensional model,
Dr. Bernard Chen Ph.D. University of Central Arkansas
8/20/ Data Warehousing and OLAP. 2 Data Warehousing & OLAP Defined in many different ways, but not rigorously. Defined in many different ways, but.
Chetan Bhirud Raza Mohammad Abinash Sahoo Online Marketing Giant.
COURSE REGISTRATION SYSTEM Case study IST2101. Case Study: Course Registration (1) IST2102 You are helping Penn State create a course registration system.
Multi-Dimensional Databases & Online Analytical Processing This presentation uses some materials from: “ An Introduction to Multidimensional Database Technology,
Ahsan Abdullah 1 Data Warehousing Lecture-11 Multidimensional OLAP (MOLAP) Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for.
CS 157B: Database Management Systems II March 20 Class Meeting Department of Computer Science San Jose State University Spring 2013 Instructor: Ron Mak.
Data Warehousing Xintao Wu. Can You Easily Answer These Questions? What are Personnel Services costs across all departments for all funding sources? What.
Module 1: Introduction to Data Warehousing and OLAP
Roadmap 1.What is the data warehouse, data mart 2.Multi-dimensional data modeling 3.Data warehouse design – schemas, indices 4.The Data Cube operator –
BI Terminologies.
October 28, Data Warehouse Architecture Data Sources Operational DBs other sources Analysis Query Reports Data mining Front-End Tools OLAP Engine.
Some OLAP Issues CMPT 455/826 - Week 9, Day 2 Jan-Apr 2009 – w9d21.
Dr. N. MamoulisAdvanced Database Technologies1 Topic 6: Data Warehousing & OLAP Defined in many different ways, but not rigorously. A decision support.
SHIFALI CHOUBEY GISE LAB IITB Decision Support System For Farmers.
UNIT-II Principles of dimensional modeling
Shilpa Seth.  Multidimensional Data Model Concepts Multidimensional Data Model Concepts  Data Cube Data Cube  Data warehouse Schemas Data warehouse.
CMPE 226 Database Systems October 21 Class Meeting Department of Computer Engineering San Jose State University Fall 2015 Instructor: Ron Mak
Data Warehousing Multidimensional Analysis
Data Mining Data Warehouses.
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
The Data Warehouse Chapter Operational Databases = transactional database  designed to process individual transaction quickly and efficiently.
Dr. Chen, Data Mining  A/W & Dr. Chen, Data Mining Chapter 6 The Data Warehouse Jason C. H. Chen, Ph.D. Professor of MIS School of Business Administration.
GSK FMCG Data Warehouse Business definition GSK FMCG industry 10 October 2014 Pavan Kumar Mantha Vinod Tati Shourya Konda 1.
Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke1 Data Warehousing and Decision Support.
1 Online Analytical Processing (OLAP) Anjali Gupta Mithun Arora Aameek Singh Kranthi Kumar.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Data Warehousing and Decision Support Chapter 25.
Data Warehousing and OLAP Outline u Models & operations u Implementing a warehouse u Future directions.
CMPE 226 Database Systems April 12 Class Meeting Department of Computer Engineering San Jose State University Spring 2016 Instructor: Ron Mak
Information Management course
Data Warehousing CIS 4301 Lecture Notes 4/20/2006.
Data warehouse and OLAP
A multi-dimensional data model
What is OLAP OLAP allows to model data in a multidimensional way like a data cube in order to look for the data from many perspectives.
OLAP Concepts and Techniques
Databases & Data Warehouses
CMPE 226 Database Systems April 11 Class Meeting
Data warehouse Design Using Oracle
Data Warehouse and OLAP
DATA CUBES E0 261 Jayant Haritsa Computer Science and Automation
Chapter 13 The Data Warehouse
Fundamentals of Data Cube & OLAP Operations
Data Warehouse and OLAP
Presentation transcript:

Chapter 4 Tutorial

Modeling Data Warehouse A data warehouse is based on a multidimensional data model which views data in the form of a data cube A data cube allows data to be modeled and viewed in multiple dimensions Dimension tables Fact table contains measures and keys to related dimension tables

Cont. Star schema: A fact table in the middle connected to a set of dimension tables. Snowflake schema: represents dimensional hierarchy by normalizing the dimension tables. save storage reduces the effectiveness of browsing Fact constellations: Multiple fact tables share dimension tables

OLAP Operations by climbing up hierarchy or by dimension reduction Roll up (drill-up): summarize data by climbing up hierarchy or by dimension reduction Drill down (roll down): reverse of roll-up from higher level summary to lower level summary or detailed data, or introducing new dimensions Slice and dice: project and select Pivot (rotate): reorient the cube, visualization, 3D to series of 2D planes

Q3 Suppose that a data warehouse consists of the three dimensions time, doctor, and patient, and the two measures count and charge, where charge is the fee that a doctor charges a patient for a visit. (a) Enumerate three classes of schemas that are popularly used for modeling data warehouses. Star schema Snowflake schema Fact constellation schema

Q3 cont. (b) Draw a schema diagram for the above data warehouse using one of the schema classes listed in (a). Using a star schema.

Q3 cont. Star Schema time Fact Table Measures time_key doctor_id day day_of_the_week month quarter year time doctor_id doctor_name phone # address gender doctor Fact Table time_key doctor_id patient_id Charge Count patient_id patient_name phone # address gender patient Measures

Q3 cont. (c) Starting with the base cuboid [day, doctor, patient], what specific OLAP operations should be performed in order to list the total fee collected by each doctor in 2010? The operations to be performed are: Roll-up on time from day to year. Slice for time = 2010. Roll-up on patient from individual patient to all.

Q4 Suppose that a data warehouse for Big-University consists of the following four dimensions: student, course, semester, and instructor, and two measures count and avg. grade. When at the lowest conceptual level (e.g., for a given student, course, semester, and instructor combination), the avg grade measure stores the actual course grade of the student. At higher conceptual levels, avg grade stores the average grade for the given combination.

Q4 cont. Snowflake Schema course_id Course_name department Course student_id student_name area_id Major status university Student area_id City State country Area Sales Fact Table course_id student_id instructor_id semester_id Count Avg. grade semester_id semester year Semester instructor_id depatment rank Instructor

Q4. cont. Starting with the base cuboid [student, course, semester, instructor], what specific OLAP operations (e.g., roll-up from semester to year) should one perform in order to list the average grade of CS courses for each Big University student. Roll-up on course from course id to department. Roll-up on semester from semester id to all. Slice for course=“CS” .

Q5 Suppose that a data warehouse consists of the four dimensions, date, spectator, location, and game, and the two measures, count and charge, where charge is the fare that a spectator pays when watching a game on a given date. Spectators may be students, adults, or seniors, with each category having its own charge rate. Draw a star schema diagram for the data warehouse.

Q5 cont. Star Schema date Sales Fact Table date_id spectator_id day month quarter year date spectator_id spectator_name phone # address Status Charge rate spectator Sales Fact Table date_id spectator_id location_id game_id Charge Count location_id phone # Street city province country location game_id game_name description producer game

Q5. cont. Starting with the base cuboid [date, spectator, location, game], what specific OLAP operations should one perform in order to list the total charge paid by student spectators at GM Place in 2010? The specific OLAP operations to be performed are: Roll-up on date from date id to year. Roll-up on game from game id to all. Roll-up on location from location id to location name. Roll-up on spectator from spectator id to status. Dice with status=“students”, location name=“GM Place”, and year = 2010.