DWH-Ahsan Abdullah 1 Data Warehousing Lecture-21 Introduction to Data Quality Management (DQM) Virtual University of Pakistan Ahsan Abdullah Assoc. Prof.

Slides:



Advertisements
Similar presentations
Dr. Hamda Qotba, M.D,MFPH,FFPH
Advertisements

Quality Data for a Healthy Nation by Mary H. Stanfill, RHIA, CCS, CCS-P.
QA Programs for Local Health Departments
1 Managing Quality Quality defined Total cost of quality Strategic Quality –Total quality management (TQM) –Continuous improvement tools Quality assurance.
Lecture-19 ETL Detail: Data Cleansing
These slides are designed to accompany Software Engineering: A Practitioner’s Approach, 6/e (McGraw-Hill 2005). Slides copyright 2005 by Roger Pressman.1.
DWH-Ahsan Abdullah 1 Data Warehousing Lecture-5 Types & Typical Applications of DWH Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center.
Total Quality Management
1 Software Testing and Quality Assurance Lecture 33 – Software Quality Assurance.
Analysis Stage (Phase I) The goal: understanding the customer's requirements for a software system. n involves technical staff working with customers n.
DWH-Ahsan Abdullah 1 Data Warehousing Lab Lect-1 DTS: Introduction Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
Lecture-33 DWH Implementation: Goal Driven Approach (1)
1 Introduction to Data Quality Management (DQM). 2 What is Quality? Informally Some things are better than others i.e. they are of higher quality. How.
Lecture-1 Introduction and Background
1/28 Basic Concepts of Quality. 2/28 Basic Concepts of Quality What is Quality? ATTRIBUTES are used to describe QUALITY… examples: Beauty, Goodness, Freshness,
DWH-Ahsan Abdullah 1 Data Warehousing Lab Lect-2 Lab Data Set Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
Whitewater Strategies, Inc. EMERGENCY PREPAREDNESS QUIZ 1 Most accidents can be anticipated. False -- An accident by definition is something that occurs.
Sampling : Error and bias. Sampling definitions  Sampling universe  Sampling frame  Sampling unit  Basic sampling unit or elementary unit  Sampling.
Ahsan Abdullah 1 Data Warehousing Lecture-17 Issues of ETL Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
ENTREPRENEURSHIP Lecture No: 27 Resource Person: Malik Jawad Saboor Assistant Professor Department of Management Sciences COMSATS Institute of Information.
Quality Control Concepts. Outline 1.Introduction 2.Quality Control 3.Quality Assurance 4.Total Quality Management 5.Quality Tools 6.Summary.
Ahsan Abdullah 1 Data Warehousing Lecture-11 Multidimensional OLAP (MOLAP) Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for.
Data Warehousing 1 Lecture-24 Need for Speed: Parallelism Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
DWH-Ahsan Abdullah 1 Data Warehousing Lecture-37 Case Study: Agri-Data Warehouse Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center.
Ahsan Abdullah 1 Data Warehousing Lecture-7De-normalization Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
Total Quality Management. INTRODUCTION Total Quality Management (TQM) is customer oriented management philosophy and strategy. It is centered on quality.
Software Project Management Lecture # 10. Outline Quality Management (chapter 26)  What is quality?  Meaning of Quality in Various Context  Some quality.
DWH-Ahsan Abdullah 1 Data Warehousing Lecture-4 Introduction and Background Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for.
Software Project Management Lecture # 11. Outline Quality Management (chapter 26 - Pressman)  What is quality?  Meaning of Quality in Various Context.
Ahsan Abdullah 1 Data Warehousing Lecture-18 ETL Detail: Data Extraction & Transformation Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. &
Ahsan Abdullah 1 Data Warehousing Lecture-9 Issues of De-normalization Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
Data Warehousing 1 Lecture-28 Need for Speed: Join Techniques Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
1 Data Warehousing Lecture-14 Process of Dimensional Modeling Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
Ahsan Abdullah 1 Data Warehousing Lecture-20 Data Duplication Elimination & BSN Method Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head.
DWH-Ahsan Abdullah 1 Data Warehousing Lecture-2 Introduction and Background Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for.
Chapter 11 – Part I Total Quality Management COB 300 Busing.
Introduction to Quality Imran Hussain. Project Development Costs Around 63% of software projects exceed their cost estimates. The top four reasons for.
About Quality Pre paired By: Muhammad Azhar. Scope What is Quality Quality Attributes Conclusion on software Quality Quality Concepts Quality Costs.
Data Warehousing Lecture-31 Supervised vs. Unsupervised Learning Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
Ahsan Abdullah 1 Data Warehousing Lecture-16 Extract Transform Load (ETL) Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for.
1 Data Warehousing Lecture-15 Issues of Dimensional Modeling Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
Copyright 2006 John Wiley & Sons, Inc. Beni Asllani University of Tennessee at Chattanooga Operations Management - 5 th Edition Chapter 3 Roberta Russell.
Data Warehousing Lecture-30 What can Data Mining do? Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics Research.
DWH-Ahsan Abdullah 1 Data Warehousing Lecture-29 Brief Intro. to Data Mining Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center.
CHAPTER 6 TOTAL QUALITY MANAGEMENT. QUALITY AS A PHILOSOPHY As competitive weapon that must be produced efficiently : high performance design and consistency.
Chapter 16 Implementing Quality Concepts Cost Accounting Foundations and Evolutions Kinney, Prather, Raiborn.
Software Engineering Lecture # 1.
DWH-Ahsan Abdullah 1 Data Warehousing Lecture-22 DQM: Quantifying Data Quality Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center.
Foundations of Information Systems in Business. System ® System  A system is an interrelated set of business procedures used within one business unit.
Copyright 2010, The World Bank Group. All Rights Reserved. Recommended Tabulations and Dissemination Section B.
CHAPTER 7 STATISTICAL PROCESS CONTROL. THE CONCEPT The application of statistical techniques to determine whether the output of a process conforms to.
Ahsan Abdullah 1 Data Warehousing Lecture-6Normalization Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
SCOPE DEFINITION,VERIFICATION AND CONTROL Ashima Wadhwa.
HSA 171 CAR. 1436/5/10 3  Concept Of Controlling.  Definition.  Controlling Process. 4.
Quality Control Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill.
Ahsan Abdullah 1 Data Warehousing Lecture-8 De-normalization Techniques Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics.
Certified Internal Auditor-NABH. Terms and definitions  Audit/ Assessment: Systematic, independent and documented process for obtaining evidence and.
Demonstrating Institutional Effectiveness Documenting Using SPOL.
MATH BY MEAGHAN, ROWEN, ELSIE. CONTENT LIST ▪ INTRODUCTION : Past vs Present ▪ SELECTING APPROPRIATE MATH : Math Standards ▪ RESEARCH ON MATH INSTRUCTION.
Lecture-3 Introduction and Background
Lecture-32 DWH Lifecycle: Methodologies
TOTAL QUALITY MANAGEMENT
9 Management of Quality.
Introduction to Quality
Software Quality Engineering
Lecture-38 Case Study: Agri-Data Warehouse
Lecture-35 DWH Implementation: Pitfalls, Mistakes, Keys
Chapter # 1 Overview of Software Quality Assurance
Organizational Aspects of Data Management
Presentation transcript:

DWH-Ahsan Abdullah 1 Data Warehousing Lecture-21 Introduction to Data Quality Management (DQM) Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics Research National University of Computers & Emerging Sciences, Islamabad

DWH-Ahsan Abdullah 2 Introduction to Data Quality Management (DQM)

DWH-Ahsan Abdullah 3 What is Quality? Informally Some things are better than others i.e. they are of higher quality. How much “better” is better? Is the right item the best item to purchase? How about after the purchase? What is quality of service? The bank example

DWH-Ahsan Abdullah 4 What is Quality? Formally “Quality is conformance to requirements” P. Crosby, “Quality is Free” 1979 “Degree of excellence” Webster’s Third New International Dictionary

DWH-Ahsan Abdullah 5 What is Quality? Examples from Auto Industry Quality means meeting customer’s needs, not necessarily exceeding them. Quality means improving things customers care about, because that makes their lives easier and more comfortable. Why example from auto-industry?

DWH-Ahsan Abdullah 6 What is Data Quality?  Muhammad Khan Height = 5’8” Weight = 160 lbs Gender = Male Age = 35 yrs Emp_ID = 440 All data is an abstraction of something real What is Data? Note Change the picture

DWH-Ahsan Abdullah 7 What is Data Quality? Intrinsic Data Quality Electronic reproduction of reality. Realistic Data Quality Degree of utility or value of data to business.

DWH-Ahsan Abdullah 8 Data Quality & Organizations Intelligent Learning Organization: High-quality data is an open, shared resource with value- adding processes. The dysfunctional learning organization: Low-quality data is a proprietary resource with cost-adding processes. {Comment: Put picture of person in water holding round tube with data written on it}

DWH-Ahsan Abdullah 9 Law #1 - “Data that is not used cannot be correct!” Law #2 - “Data quality is a function of its use, not its collection!” Law #3 - “Data will be no better than its most stringent use!” Law #4 - “Data quality problems increase with the age of the system!” Law #5 – “The less likely something is to occur, the more traumatic it will be when it happens!” Orr’s Laws of Data Quality

DWH-Ahsan Abdullah 10 Total Quality Control (TQM) Philosophy of involving all for systematic and continuous improvement. It is customer oriented. Why? TQM incorporates the concept of product quality, process control, quality assurance, and quality improvement. Quality assurance is NOT Quality improvement

DWH-Ahsan Abdullah 11 Co$t of fixing data quality Lowest Quality Highest quality Cost of achieving quality Defect minimization is economical. Defect elimination is very very expensive. Exponential rise in cost

DWH-Ahsan Abdullah 12 Co$t of Data Quality Defects  Controllable Costs  Recurring costs for analyzing, correcting, and preventing data errors  Resultant Costs  Internal and external failure costs of business opportunities missed.  Equipment & Training Costs

DWH-Ahsan Abdullah 13 Where data quality is critical?  Almost everywhere, some examples:  Marketing communications.  Customer matching.  Retail house-holding.  Combining MIS systems after acquisition.

DWH-Ahsan Abdullah 14 Characteristics or Dimensions of Data Quality Data Quality Characteristic Definition Accuracy Qualitatively assessing lack of error, high accuracy corresponding to small error. Completeness The degree to which values are present in the attributes that require them.

DWH-Ahsan Abdullah 15 Completeness Vs Accuracy 95% accurate and 100% complete OR 100% accurate and 95% complete Which is better? Depends on data quality (i) tolerances, the (ii) corresponding application and the (iii) cost of achieving that data quality vs. the (iv) business value.

DWH-Ahsan Abdullah 16 Characteristics or Dimensions of Data Quality Data Quality Characteristic Definition Consistency A measure of the degree to which a set of data satisfies a set of constraints. Timeliness A measure of how current or up to date the data is. Uniqueness The state of being only one of its kind or being without an equal or parallel. Interpretability The extent to which data is in appropriate languages, symbols, and units, and the definitions are clear. Accessibility The extent to which data is available, or easily and quickly retrievable Objectivity The extent to which data is unbiased, unprejudiced, and impartial