Adamson & Venerable Chapter 2 & working out a Homework 5/6 Solution Transforming Relational Databases into Dimensional Diagrams Spring 2012.

Slides:



Advertisements
Similar presentations
Information Systems Today: Managing in the Digital World
Advertisements

Dimensional Modeling.
CHAPTER OBJECTIVE: NORMALIZATION THE SNOWFLAKE SCHEMA.
BY LECTURER/ AISHA DAWOOD DW Lab # 2. LAB EXERCISE #1 Oracle Data Warehousing Goal: Develop an application to implement defining subject area, design.
Database Management3-1 L3 Database Management Santa R. Susarapu Ph.D. Student Virginia Commonwealth University.
Chapter 3 Database Management
Database Management An Introduction.
3-1 Chapter 3 Data and Knowledge Management
A Guide to SQL, Seventh Edition. Objectives Understand the concepts and terminology associated with relational databases Create and run SQL commands in.
A Guide to SQL, Seventh Edition. Objectives Introduce Premiere Products, a company whose database is used as the basis for many of the examples throughout.
Concepts of Database Management Sixth Edition
Database Design Chapter 2. Goal of all Information Systems  To add value –Reduce costs –Increase sales or revenue –Provide a competitive advantage.
Chapter 4: Database Management. Databases Before the Use of Computers Data kept in books, ledgers, card files, folders, and file cabinets Long response.
Chapter 14 The Second Component: The Database.
CSE6011 Warehouse Models & Operators  Data Models  relations  stars & snowflakes  cubes  Operators  slice & dice  roll-up, drill down  pivoting.
Access Lecture 1 Database Overview and Creating Tables Create an Employee Table.
Mgt 20600: IT Management & Applications Databases Tuesday April 4, 2006.
Data Warehousing DSCI 4103 Dr. Mennecke Introduction and Chapter 1.
Chapter 4 Relational Databases Copyright © 2012 Pearson Education 4-1.
Database Software Application
Software Development Unit 2 Databases What is a database? A collection of data organised in a manner that allows access, retrieval and use of that data.
1 DATABASE TECHNOLOGIES BUS Abdou Illia, Fall 2007 (Week 3, Tuesday 9/4/2007)
DBSQL 4-1 Copyright © Genetic Computer School 2009 Chapter 4 Database Design.
Copyright © 2003 by Prentice Hall Module 4 Database Management Systems 1.What is a database? Data hierarchy and data organization Field, record, file,
Systems analysis and design, 6th edition Dennis, wixom, and roth
DAY 15: ACCESS CHAPTER 2 Larry Reaves October 7,
A Guide to SQL, Eighth Edition 1 Chapter One Introduction to Premiere Products, Henry Books, and Alexamara Marina Group.
Database. Basic Definitions Database: A collection of related data. Database Management System (DBMS): A software package/ system to facilitate the creation.
Objectives Overview Define the term, database, and explain how a database interacts with data and information Define the term, data integrity, and describe.
A Guide to SQL, Eighth Edition Chapter Two Database Design Fundamentals.
Concepts of Database Management, Fifth Edition Chapter 4: The Relational Model 3: Advanced Topics.
Chapter 4 The Relational Model 3: Advanced Topics Concepts of Database Management Seventh Edition.
MIS 301 Information Systems in Organizations Dave Salisbury ( )
MIS 301 Information Systems in Organizations Dave Salisbury ( )
Database & Data Warehouse Assignments
© 2007 by Prentice Hall 1 Introduction to databases.
1 A Guide to MySQL 2 Database Design Fundamentals.
DIMENSIONAL MODELLING. Overview Clearly understand how the requirements definition determines data design Introduce dimensional modeling and contrast.
Discovering Computers Fundamentals Fifth Edition Chapter 9 Database Management.
Chapter 1 Adamson & Venerable Spring Dimensional Modeling Dimensional Model Basics Fact & Dimension Tables Star Schema Granularity Facts and Measures.
1 A Guide to MySQL 2 Database Design Fundamentals.
Data Warehousing.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
Computer Literacy BASICS: A Comprehensive Guide to IC 3, 5 th Edition Lesson 23 Getting Started with Access Essentials 1 Morrison / Wells / Ruffolo.
Database & Data Warehouse Assignments
A Guide to MySQL. 2 Objectives Introduce Premiere Products, a company whose database is used as the basis for many of the examples throughout the text.
Building Dashboards SharePoint and Business Intelligence.
Department of Industrial Engineering Sharif University of Technology Session# 9.
Database & Data Warehouse Assignments BCIS 4660 – Dr. Nick Evangelopoulos Spring 2012.
MIS 301 Information Systems in Organizations Dave Salisbury ( )
0 / Database Management. 1 / Identify file maintenance techniques Discuss the terms character, field, record, and table Describe characteristics.
© 2006 Pearson Education Canada Inc. 3-1 Chapter 3 Database Management PowerPoint Presentation Jack Van Deventer Ward M. Eagen.
© 2003 Prentice Hall, Inc.3-1 Chapter 3 Database Management Information Systems Today Leonard Jessup and Joseph Valacich.
DO NOT COPY --CONFIDENTIAL Homework 6b Partial Key Generating a Fact table for Henry Books BCIS 4660 Dr. Nick Evangelopoulos Spring 2012.
Copyright© 2014, Sira Yongchareon Department of Computing, Faculty of Creative Industries and Business Lecturer : Dr. Sira Yongchareon ISCG 6425 Data Warehousing.
DO NOT COPY --CONFIDENTIAL Homework 5 Partial Key Star Diagrams & Data Warehouse Design BCIS 4660 Dr. Nick Evangelopoulos Spring 2012.
1 Copyright © 2009, Oracle. All rights reserved. Oracle Business Intelligence Enterprise Edition: Overview.
Data Warehousing DSCI 4103 Dr. Mennecke Chapter 2.
Concepts of Database Management Seventh Edition Chapter 4 Keys and Relationship.
I am Xinyuan Niu I am here because I love to give presentations. Data Warehousing.
1 Management Information Systems M Agung Ali Fikri, SE. MM.
DATA SCIENCE MIS0855 | Spring 2016 Designing Data
Database & Data Warehouse Assignments
A Guide to MySQL.
A Guide to SQL, Eighth Edition
MIS2502: Data Analytics Relational Data Modeling
Information Systems Today: Managing in the Digital World
Data warehouse and OLAP
GO! with Microsoft Access 2016
DATABASE TECHNOLOGIES
Presentation transcript:

Adamson & Venerable Chapter 2 & working out a Homework 5/6 Solution Transforming Relational Databases into Dimensional Diagrams Spring 2012

Dimensional modeling in Sales In a DW designed to analyze SALES data, important component of a dimensional model is the Product Dimension. Product dimension includes important characteristics used to differentiate the product in the marketplace, called Discriminators.

Key Business Term: Discriminators Discriminators: Descriptive characteristics of a product that further describe it and are relevant to purchasing decisions. Tracking discriminators allows the business analyst to monitor performance of various product styles, influencing production and marketing plans. Discriminators for a men’s suit: Cloth, color, style/cut, weight, size Discriminators for vehicles: Model name, model styling package, line, category, exterior color, model year, interior color

Other Dimensions in Sales Time dimension: Time key, month, day, date, year, day of week, quarter. Customer_Demographic dimension: This does not require a row for each customer, but groups customers by different combinations of age, gender, income, and geography. The degree of demographic segmentation varies by industry. Dealer dimension: Data on dealer performance are needed, to support decisions on which dealers should be eased out of business. Method_Of_Payment dimension (lease, financing options, etc.)

Fact Table: Storing derived facts A commonly used derived fact should be stored, and not calculated in reports and queries. Cutting such “redundant” key measures from the fact table results in the following: 1.Development of reports gets more complex 2.Increased potential for errors in reports 3.Increased documentation requirements 4.A Hundred dollars’ worth of disk space is saved (40 MB of space savings for a 10-million row fact table)

Transformation Stages (Key for Homework #5/6) De-normalization Process –Start with Normalized Tables –Determine Dimensions and Fact Tables –Delete Relationships Rebuild Tables Rebuild Relationship Diagram as Star Diagram, a.k.a., Dimension Table Provides information needed to complete Homework 6, too!

Premiere Products ERD Redrawn to form most likely Star Diagram

Delete Relationships

Rebuild Tables Using copied operations database Be sure all ops. data is saved and backed up … multiple times. Data staging & cleansing –Denormalize extra relationships Order:OrderLine; Customer:SalesRep; Order:OrderLine  OrderDetail Customer:SalesRep  OrderDetail –Transform data for new tables in Access: Make Table –Export data files, if needed, to rebuild elsewhere (Excel) –Additional Transformations as needed in Excel –Create the Time dimension –Re-Import data files to new tables

Order_OrderLine Query

Restructured Data -- Stage 1

Preparing Rep:Customer

Join to Order_OrderLine

OrderDetail Query Order of Columns Not critical Sort order not critical but Good time to revise

Make-Table OrderDetail Query

OrderDetail Table Set Primary Keys

Data Cleansing Fix dates

Add Indexes

Build Star Diagram What’s Missing? Fact Table

Time Dimension Use SQL to eliminate Redundant dates

Built-in Functions

Built-in Date/Time Functions

Excel Time Table Create Time table using Excel Formulas: 1.Fill 2.Copy/paste 3.Compute Quarter 4.Conversion Formulas 5.Etc.

Import Time Table to Access

Time Table w/Indexes

Premier Products Star Diagram

PP -- Relation List Fact Table OrderDetail[OrderNum, PartNum, OrderDate, CustNo, RepNo, NumOrdered, Price] Dimension Tables Customer[CustNum, CustName, Street, City, State, Zip, Balance, CreditLimit] Rep[RepNum, LastName, FirstName, Street, City, State, Zip, Commission, Rate] Part[PartNum, Desc, OnHand, Class, Warehouse, Price] Time[TimeKey, Day_of_Week, Month, Year, JulianDate, Quarter, etc.]

Replacing OrderDate with Time_key

Tools/Analyze/Tables

Tools/Analyze/Documenter

CUSTOMER table

GUID Last modified: Thursday, June 20, 2002 Short for Globally Unique Identifier, a unique 128-bit number that is produced by the Windows OS or by some Windows applications to identify a particular component, application, file, database entry, and/or user. For instance, a Web site may generate a GUID and assign it to a user's browser to record and track the session. A GUID is also used in a Windows registry to identify COM DLLs. Knowing where to look in the registry and having the correct GUID yields a lot information about a COM object (i.e., information in the type library, its physical location, etc.). Windows also identifies user accounts by a username (computer/domain and username) and assigns it a GUID. Some database administrators even will use GUIDs as primary key values in databases.bitOSapplicationsdatabasebrowserCOMDLLs GUIDs can be created in a number of ways, but usually they are a combination of a few unique settings based on specific point in time (e.g., an IP address, network MAC address, clock date/time, etc.).IP addressMAC address

Tools/Analyze/Documenter OrderDetail table

Henry Books ERD (before) BookInventory

Henry Books ERD (before)

Update Query for Inventory Table [Physical Inventory Date]

Inventory Table: Date Updated

BookInventory Fact Table Note: 2 Time Keys [Pick one]

Henry Books Star Diagram

HB -- Relation List Fact Table BookTrans[BookCode, AuthorNum, BranchNum, Sequence, PubCode, OnHand, InventoryDate] Dimension Tables Book[BookCode, Title, Type, Price, Paperback] Publisher[PubCode, PubName, City, …State, Zip] Author[AuthorNum, LastName, FirstName, …] Branch[BranchNum, BranchName, Location, Employees] Time[TimeKey, Day_of_Week, Month, Year, JulianDate, Quarter, etc.]

Summary Complete Transformations How normal are the resulting tables? –1NF, 2NF, 3NF? Document Transformation maps Prepare for Appending Tables with new data Tools/Analyze/Documenter (next time)