Last Updated : 26th may 2003 Center of Excellence Data Warehousing Introductionto Data Modeling.

Slides:



Advertisements
Similar presentations
Dimensional Modeling.
Advertisements

CHAPTER OBJECTIVE: NORMALIZATION THE SNOWFLAKE SCHEMA.
Chapter 10: Designing Databases
BY LECTURER/ AISHA DAWOOD DW Lab # 2. LAB EXERCISE #1 Oracle Data Warehousing Goal: Develop an application to implement defining subject area, design.
Copyright © Starsoft Inc, Data Warehouse Architecture By Slavko Stemberger.
Concepts of Database Management Seventh Edition Chapter 6 Database Design : ERD Model.
Data Warehousing M R BRAHMAM.
Dimensional Modeling Business Intelligence Solutions.
Managing Data Resources
Dimensional Modeling CS 543 – Data Warehousing. CS Data Warehousing (Sp ) - Asim LUMS2 From Requirements to Data Models.
Data Warehouse IMS5024 – presented by Eder Tsang.
Client/Server Databases and the Oracle 10g Relational Database
Dimensional Modeling – Part 2
Data Warehousing Design Transparencies
Data Warehousing - 3 ISYS 650. Snowflake Schema one or more dimension tables do not join directly to the fact table but must join through other dimension.
“DOK 322 DBMS” Y.T. Database Design Hacettepe University Department of Information Management DOK 322: Database Management Systems.
CSE6011 Warehouse Models & Operators  Data Models  relations  stars & snowflakes  cubes  Operators  slice & dice  roll-up, drill down  pivoting.
Mgt 20600: IT Management & Applications Databases Tuesday April 4, 2006.
Data Warehousing DSCI 4103 Dr. Mennecke Introduction and Chapter 1.
Principles of Dimensional Modeling
DWH – Dimesional Modeling PDT Genči. 2 Outline Requirement gathering Fact and Dimension table Star schema Inside dimension table Inside fact table STAR.
Sayed Ahmed Logical Design of a Data Warehouse.  Free Training and Educational Services  Training and Education in Bangla: Training and Education in.
Copyright © 2003 by Prentice Hall Module 4 Database Management Systems 1.What is a database? Data hierarchy and data organization Field, record, file,
6-1 DATABASE FUNDAMENTALS Information is everywhere in an organization Information is stored in databases –Database – maintains information about various.
Copyright © 2003 by Prentice Hall Computers: Tools for an Information Age Chapter 13 Database Management Systems: Getting Data Together.
1 Advanced Computer Programming Databases. Overview What is a database? Database Basics Database Components Data Models Normalization Database Design.
Web-Enabled Decision Support Systems
Chapter 1 Overview of Database Concepts Oracle 10g: SQL
Information Systems: Databases Define the role of general information systems Describe the elements of a database management system (DBMS) Describe the.
Datawarehouse & Datamart OLAPs vs. OLTPs Dimensional Modeling Creating Physical Design Using SQL Mgt. Studio Module II: Designing Datamarts 1.
311: Management Information Systems Database Systems Chapter 3.
OnLine Analytical Processing (OLAP)
Program Pelatihan Tenaga Infromasi dan Informatika Sistem Informasi Kesehatan Ari Cahyono.
1 Data Warehousing Lecture-13 Dimensional Modeling (DM) Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics Research.
Data Warehousing Concepts, by Dr. Khalil 1 Data Warehousing Design Dr. Awad Khalil Computer Science Department AUC.
Lecturer: Gareth Jones. How does a relational database organise data? What are the principles of a database management system? What are the principal.
DIMENSIONAL MODELLING. Overview Clearly understand how the requirements definition determines data design Introduce dimensional modeling and contrast.
Chapter 1 Adamson & Venerable Spring Dimensional Modeling Dimensional Model Basics Fact & Dimension Tables Star Schema Granularity Facts and Measures.
Storing Organizational Information - Databases
Information Systems & Databases 2.2) Organisation methods.
The Data Warehouse “A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile collection of “all” an organisation’s data in support.
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
Basic Model: Retail Grocery Store
Designing a Data Warehousing System. Overview Business Analysis Process Data Warehousing System Modeling a Data Warehouse Choosing the Grain Establishing.
More Dimensional Modeling. Facts Types of Fact Design Transactional Periodic Snapshot –Predictable time period –Ex. Monthly, yearly, etc. Accumulating.
Organizing Data and Information
UNIT-II Principles of dimensional modeling
Management Information Systems, 4 th Edition 1 Chapter 8 Data and Knowledge Management.
Creating the Dimensional Model
Data Warehousing Multidimensional Analysis
1 DATABASE TECHNOLOGIES (Part 2) BUS Abdou Illia, Fall 2015 (September 9, 2015)
June 08, 2011 How to design a DATA WAREHOUSE Linh Nguyen (Elly)
1 Copyright © 2009, Oracle. All rights reserved. Oracle Business Intelligence Enterprise Edition: Overview.
Fundamentals of Information Systems, Sixth Edition Chapter 3 Database Systems, Data Centers, and Business Intelligence.
I am Xinyuan Niu I am here because I love to give presentations. Data Warehousing.
COMP 430 Intro. to Database Systems Denormalization & Dimensional Modeling.
1 Database Systems, 8 th Edition Star Schema Data modeling technique –Maps multidimensional decision support data into relational database Creates.
3 Copyright © 2006, Oracle. All rights reserved. Business, Logical, and Dimensional Modeling.
Building the Corporate Data Warehouse Pindaro Demertzoglou Data Resource Management.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 9: DATA WAREHOUSING.
Introduction: Databases and Database Systems Lecture # 1 June 19,2012 National University of Computer and Emerging Sciences.
Managing Data Resources File Organization and databases for business information systems.
Data warehouse and OLAP
CMPE 226 Database Systems April 11 Class Meeting
Retail Sales is used to illustrate a first dimensional model
Dimensional Modeling.
Retail Sales is used to illustrate a first dimensional model
Database Design Hacettepe University
Dimensional Model January 16, 2003
DWH – Dimesional Modeling
Presentation transcript:

Last Updated : 26th may 2003 Center of Excellence Data Warehousing Introductionto Data Modeling

Objectives At the end of this lesson, you will know :  Data Modeling for Data Warehouse  What are dimensions and facts  Star Schema and Snowflake Schemas  Coverage Tables  Factless Tables  What to look for in Modeling tools  Some modeling tools

Data Modeling for Data Warehouse How to structure the data in your data warehouse ? Process that produces abstract data models for one or more database components of the data warehouse Modeling for Warehouse is different from that for Operational database  Dimensional Modeling, Star Schema Modeling or Fact/Dimension Modeling

Modeling Techniques Entity-Relationship Modeling  Traditional modeling technique  Technique of choice for OLTP  Suited for corporate data warehouse Dimensional Modeling  Analyzing business measures in the specific business context  Helps visualize very abstract business questions  End users can easily understand and navigate the data structure

Entity-Relationship Modeling - Basic Concepts The ER modeling technique is a discipline used to illuminate the microscopic relationships among data elements. The highest art form of ER modeling is to remove all redundancy in the data. Created databases that cannot be queried !!!!!

An Order Processing ER Model Order Header Order Details Customer Table FK Item Table FK Salesrep table City Sales District Sales Region Sales Country Product Brand Product Category FK

Entity-Relationship Modeling - Basic Concepts Entity  Object that can be observed and classified by its properties and characteristics  Business definition with a clear boundary  Characterized by a noun  Example  Product  Employee

Entity-Relationship Modeling - Basic Concepts Relationship  Relationship between entities - structural interaction and association  described by a verb  Cardinality  1-1  1-M  M-M  Example : Books belong to Printed Media

Entity-Relationship Modeling - Basic Concepts Attributes  Characteristics and properties of entities  Example :  Book Id, Description, book category are attributes of entity “Book”  Attribute name should be unique and self-explanatory  Primary Key, Foreign Key, Constraints are defined on Attributes

Entity-Relationship Modeling – Why Not ? End users cannot understand or remember an ER model. No graphical user interface (GUI) that takes a general ER model and makes it usable by end users. Softwares cannot usefully query a general ER model. Use of the ER modeling technique defeats the basic allure of data warehousing, namely intuitive and high-performance retrieval of data.

Dimensional Modeling - Basic Concepts Represents the data in a standard, intuitive framework that allows for high-performance access; Schema designed to process large, complex, adhoc and data intensive queries. No concern for concurrency, locking and insert/update/delete performance Every dimensional model is composed of one table with a multipart key, called the fact table, and a set of smaller tables called dimension tables. This characteristic "star-like" structure is often called a star join.

CITY PRODUCT PERIOD CUSTOMER SALES AMOUNT UNITS Measures Dimension s REGION STATE DISTRICT CITY PRODUCT BRAND COLOR CATEGORY SIZE DAY MONTH YEAR QUARTER CUSTOMER CATEGORY CONTACT ADDRESS Star Schema

Dimensional Modeling - Basic Concepts Fact Tables  The most useful facts in a fact table are numeric and additive  Typically represents a business transaction, or event that can be used in analyzing business process  By nature fact tables are sparse  Usually very large - billions of records

Dimensional Modeling - Basic Concepts Dimension Tables  Each dimension table has a single-part primary key that corresponds exactly to one of the components of the multipart key in the fact table.  Dimension tables, most often contain descriptive textual information  Determine contextual background for facts  Examples :  Time  Location/Region  Customers

Dimensional Modeling - Basic Concepts Measures  A numeric attribute of a fact  Represents performance or behavior of the business relative to the dimensions  The actual numbers are called variables  Occupy very little space compared to Fact Tables  Examples :  Quantity supplied  Transaction amount  Sales volume

Fact Table & Dimension Tables Fact Tables Numerical Measurements of business are stored in Fact Tables. Dimensional Tables Dimensions are attributes about facts.

Conformed Dimensions Dimension that means the same thing with every possible fact table that it can be joined with Conformed dimensions most essential  For the Bus Architecture  Integrated function of the Data Warehouse Some common dimensions are :  Customer  Product  Location  Time

Surrogate Keys All tables (facts and dimensions) should not use production keys but Data Warehouse generated surrogate keys  Productions keys get reused sometimes  In case of mergers/acquisitions, protects you from different key formats  Production systems may change their systems to generalize key definitions  Using surrogate key will be faster  Can handle Slowly Changing dimensions well

Slowly Changing Dimensions Certain kinds of dimension attribute changes need to be handled differently in Data Warehouse Type I - Overwrite  e.g. Name Correction, Description changes Type II - Partition History  Packing change, Customer movement  Create a new dimension record with new surrogate key Type III - Organizational changes  Sales Force Reorganization  Show by sales broken by new and old organizations  Need to create an old and a new field

Factless Fact Tables For Event Tracking e.g. attendance Date_Key Student_Key Course_Key Teacher_Key Facility_Key Date Dimension Course Dimension Facility Dimension Student Dimension Teacher Dimension

Problem : To find out which Products on promotion did not sell? Date_Key Product_Key Store_Key Promotion_Key Dollars Sold Date Dimension Store Dimension Product Dimension Promotion Dimension Units Sold Fact Table Coverage Tables

Date_Key Product_Key Store_Key Promotion_Key Date Dimension Store Dimension Product Dimension Promotion Dimension Sales Promotion Coverage Table Coverage Tables Solution - Coverage Tables

Snowflake Schema Dimension tables are normalized by decomposing at the attribute level Each dimension has one key for each level of the dimension’s hierarchy Good performance when queries involve aggregation Complicated maintenance and metadata, explosion in number of table. Makes user representation more complex and intricate

Snowflake schema - Example Fact Table Dim Table Dim Table Dim Table Dim Table

Aggregates Pre-stored summaries in the database Significant Performance advantage Preferably should not be stored in fact tables. May take significant time to build aggregates Many tools can automatically navigate to most aggregated table that can service a query

Aggregate Navigators Automatically redirect queries to the most summarized table Some tools like Business Objects, Discoverer, Microstrategy, Metacube etc support this Native database support already available Aggregate Navigator DBMS LAN

Examples of Data Modeling Tools ERWIN  Supports Data Warehouse design as a modeling technique Powersoft WarehouseArchitect  Module of Power Designer specifically for DW Modeling Oracle Designer  Can be extended for Warehouse modeling Others like Infomodeler, Silverrun are also used

Questions