Quality Data – An Improbable Dream? Quality Data An Improbable Dream? Elizabeth Vannan Centre for Education Information Victoria, BC, Canada.

Slides:



Advertisements
Similar presentations
Conducting your own Data Life Cycle Audit
Advertisements

Usage statistics in context - panel discussion on understanding usage, measuring success Peter Shepherd Project Director COUNTER AAP/PSP 9 February 2005.
ASYCUDA Overview … a summary of the objectives of ASYCUDA implementation projects and features of the software for the Customs computer system.
Pennsylvania BANNER Users Group 2007 Structuring a reporting environment for success.
Care and Feeding of a Data Warehouse
SolidWorks Enterprise PDM Data Loading Strategies
1. 2 August Recommendation 9.1 of the Strategic Information Technology Advisory Committee (SITAC) report initiated the effort to create an Administrative.
SO- You Have a Protocol- What ’ s NEXT? Bob Ensor and Dana York 1.
Course: e-Governance Project Lifecycle Day 1
C6 Databases.
Institutional Repositories It’s not Just the Technology New England Archivists Boston College March 11, 2006 Eliot Wilczek University Records Manager Tufts.
Basic guidelines for the creation of a DW Create corporate sponsors and plan thoroughly Determine a scalable architectural framework for the DW Identify.
3108: Enterprise Upgrade Lessons Learned
CHAPTER 10 & 13 IS within the Organization & Acquiring IS and Applications.
Managing Data Resources
Quality is about testing early and testing often Joe Apuzzo, Ngozi Nwana, Sweety Varghese Student/Faculty Research Day CSIS Pace University May 6th, 2005.
Managing Data Resources. File Organization Terms and Concepts Bit: Smallest unit of data; binary digit (0,1) Byte: Group of bits that represents a single.
Understanding Data Quality Issues: Finding Data Inaccuracies Art DeMaio Evoke Software VP Technical Sales Support.
Quality Manual for Interoperability Testing Morten Bruun-Rasmussen Presented by Jos Devlies, Eurorec.
PowerPoint Presentation for Dennis & Haley Wixom, Systems Analysis and Design Copyright 2000 © John Wiley & Sons, Inc. All rights reserved. Slide 1 Systems.
DATA RESOURCE MANAGEMENT.
by Ha Do Statistical Standard Methodology and ITC Department
What is Business Analysis Planning & Monitoring?
Database Systems: Design, Implementation, and Management Ninth Edition
Chapter 1 Database Systems. Good decisions require good information derived from raw facts Data is managed most efficiently when stored in a database.
S/W Project Management
AICT5 – eProject Project Planning for ICT. Process Centre receives Scenario Group Work Scenario on website in October Assessment Window Individual Work.
Condor Technology Solutions, Inc. Grace RFTS Application Extension Phase.
5.1 © 2007 by Prentice Hall 5 Chapter Foundations of Business Intelligence: Databases and Information Management.
The Data Attribution Abdul Saboor PhD Research Student Model Base Development and Software Quality Assurance Research Group Freie.
DBS201: DBA/DBMS Lecture 13.
International Council on Archives Section on University and Research Institution Archives Michigan State University September 7, 2005 Preserving Electronic.
AL-MAAREFA COLLEGE FOR SCIENCE AND TECHNOLOGY INFO 232: DATABASE SYSTEMS CHAPTER 1 DATABASE SYSTEMS (Cont’d) Instructor Ms. Arwa Binsaleh.
Lecture #9 Project Quality Management Quality Processes- Quality Assurance and Quality Control Ghazala Amin.
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
Chapter 5 Defining and Managing Project and Product Scope Copyright 2012 John Wiley & Sons, Inc. 5-1.
CS 474 Database Design and Application Terminology Jan 11, 2000.
Encounter Data Validation: Review and Project Update August 25, 2015 Presenters: Amy Kearney, BA Director, Research and Analysis Team Thomas Miller, MA.
Population Census carried out in Armenia in 2011 as an example of the Generic Statistical Business Process Model Anahit Safyan Member of the State Council.
Relationships July 9, Producers and Consumers SERI - Relationships Session 1.
Component 11/Unit 8b Data Dictionary Understanding and Development.
C6 Databases. 2 Traditional file environment Data Redundancy and Inconsistency: –Data redundancy: The presence of duplicate data in multiple data files.
5-1 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved.
Technology In Action Chapter 11 1 Databases and… Databases and their uses Database components Types of databases Database management systems Relational.
State of Wisconsin Department of Revenue Data Warehouse Presentation August 16, 2000.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
Search Engine Optimization © HiTech Institute. All rights reserved. Slide 1 What is Solution Assessment & Validation?
Small Agency Meeting June 24, 2008 Sunflower Project Statewide Financial Management System.
Managing Data Resources. File Organization Terms and Concepts Bit: Smallest unit of data; binary digit (0,1) Byte: Group of bits that represents a single.
Requirements Validation
National Enrolment Service (NES) Overview October 2015 – June 2016.
United Nations Oslo City Group on Energy Statistics OG7, Helsinki, Finland October 2012 ESCM Chapter 8: Data Quality and Meta Data 1.
Management Information System
August 14-15, 2003 Crystal Gateway Marriott Arlington, VA Software Developers Conference.
UN ECE Seminar on New Frontiers for Statistical Data Collection 31 Oct – 2 Nov 2012 Beyond 2011 The future of population statistics Andy Teague, Office.
Public Libraries Survey Data File Overview. What We’ll Talk About PLS: Public Libraries Survey State level data Public library data (Administrative Entities)
Public Libraries Survey Data File Overview. 2 What We’ll Talk About PLS: Public Library Survey State level data Public library data (Administrative Entities)
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 1 Database Systems.
Data Warehouse A place the information system department puts the data that is turned into information. Data must be properly prepared,organized,and presented.
Copyright, © 2006, eePulse, Inc. the measure of your success 1 Implementation for Data and Dialogue Driven Leadership™ Initial Discussion Document.
Unified Migration Analytical System Ministry of Justice Public Service Development Agency Secretariat of the State Commission on Migration Issues Tbilisi.
Managing Data Resources File Organization and databases for business information systems.
Sample Fit-Gap Kick-off
Project Planning: Scope and the Work Breakdown Structure
Seminar on Evaluation of Internal Control Systems
2 Selecting a Healthcare Information System.
Data Quality By Suparna Kansakar.
CIS12-3 IT Project Management
AICT5 – eProject Project Planning for ICT
Presentation transcript:

Quality Data – An Improbable Dream? Quality Data An Improbable Dream? Elizabeth Vannan Centre for Education Information Victoria, BC, Canada

Quality Data – An Improbable Dream? Information quality is a journey, not a destination - Larry P. English

Quality Data – An Improbable Dream? Agenda Data Definitions and Standards Project What is Quality Data? The Cost of Poor-Quality Data Improving Data Quality – Our Process Questions?

Quality Data – An Improbable Dream? BC Higher Education Canada’s Western-most province Population: Million Land Area: 366,795 Sq Miles Publicly Funded Post- Secondary System –22 Colleges –6 Universities

Quality Data – An Improbable Dream? CEISS The Centre for Education Information is an independent organization that provides research and technology services to improve the performance of the BC education system

Quality Data – An Improbable Dream? CEISS Implement and manage administrative systems Perform custom surveys, research and analysis Facilitate development and implementation of data standards Negotiate and manage province wide software contracts (Oracle, SCT Banner, Datatel)

Quality Data – An Improbable Dream? DDEF Project The Problem –Better data about the BC higher education sector needed for decision-making –No infrastructure in place to facilitate the collection of data electronically Data Definitions and Standards Project Initiated in 1995

Quality Data – An Improbable Dream? DDEF Project The Solution –Create data standards for all higher education information (Student, HR, Finance) –Develop a data warehouse based on standards for reporting –Implement a common technical infrastructure at all higher education institutions

Quality Data – An Improbable Dream? DDEF Project Project Goals –Improve the quantity and QUALITY of data available –Reduce the number of data and reporting requests –Develop business information system to support the management and evaluation of the BC Post-Secondary system

Quality Data – An Improbable Dream? How Are We Doing? 16 institutions implemented/implementing Institutions using data warehouses for internal reporting Data requests reduced Ministry using data

Quality Data – An Improbable Dream? Why Focus on Data Quality? Poor data quality in our data warehouse impacts: –Confidence –Decision making –Funding

Quality Data – An Improbable Dream? Quality Data Are… The Four Attributes of Data Quality

Quality Data – An Improbable Dream? Quality Data Are… Accurate –Free from errors –Representative

Quality Data – An Improbable Dream? Quality Data Are… Complete –All values are present

Quality Data – An Improbable Dream? Quality Data Are… Timely –Recorded immediately –Available when required

Quality Data – An Improbable Dream? Quality Data Are… Flexible –Data definitions understood –Can be used for multiple purposes

Quality Data – An Improbable Dream? Quality Data… Don’t have to be perfect Good enough to fill the business need at a price you’re willing to pay Our Challenge Defining Quality Criteria for Higher Education Data

Quality Data – An Improbable Dream? Cost of Poor-Quality Data Business Process Costs Incorrect Registrations Inaccurate Tuition Billings Payroll Errors

Quality Data – An Improbable Dream? Cost of Poor-Quality Data Rework Re-collect Data Correct Errors Data Verification

Quality Data – An Improbable Dream? Cost of Poor-Quality Data Missed Opportunities Substandard Customer Service Poor Decision Making Loss of Reputation

Quality Data – An Improbable Dream? Improving Data Quality Business Process Review Improve d Data Quality Data Quality Assessment Business Practice Change Data Cleansing

Quality Data – An Improbable Dream? Business Process Review When, where, how is data collected? Where is data stored? Who creates data? Who uses data? What outputs are required? What quality checks already exist?

Quality Data – An Improbable Dream? Business Process Review Involve all stakeholders! –For student data we involve Executive Registrars office IT Department Institutional Research

Quality Data – An Improbable Dream? Business Process Review Results –Understanding of business practices –Identification of data creators, custodians, users –Preliminary quality metrics –Problem business practices

Quality Data – An Improbable Dream? Data Quality Assessment Establish Metrics Apply metrics to data Review results

Quality Data – An Improbable Dream? Establish Metrics For each element determine quality criteria –Acceptable range of values –Acceptable syntax –Comparison to known values –Business rules –Thresholds

Quality Data – An Improbable Dream? Quality Metrics

Quality Data – An Improbable Dream? Applying Metrics Collect known information for comparison Develop queries to test each of your validation criteria –We use Oracle Discoverer, but other tools exist (MS Access, SQL)

Quality Data – An Improbable Dream? Applying Metrics Test 1 PEN must be 9 digits long. No characters, no shorter values acceptable

Quality Data – An Improbable Dream? Test 1 Results Two Student Records Contain Invalid PEN Numbers

Quality Data – An Improbable Dream? Test 1 Results Invalid PEN’s Data Entry Error? Can Identify specific students for data cleansing

Quality Data – An Improbable Dream? Applying Metrics Test 2 At least 80% of student records must have valid PEN number

Quality Data – An Improbable Dream? Test 2 Results This Institution Meets the Quality Threshold

Quality Data – An Improbable Dream? Applying Metrics Test 3 No Duplicate PEN’s

Quality Data – An Improbable Dream? Test 3 Results This institution has a BIG problem! Can we see more details?

Quality Data – An Improbable Dream? Test 3 Results Addition information reveals data loading problems

Quality Data – An Improbable Dream? Reviewing Results Systematic approach needed Develop strategy for data cleaning Identify source of data problems Deal with Disparate Data Shock!

Quality Data – An Improbable Dream? Reviewing Results Insert a quality review checklist

Quality Data – An Improbable Dream? Reviewing Results

Quality Data – An Improbable Dream? Data Cleansing Location –Administrative System? –Staging Area? Who Scope

Quality Data – An Improbable Dream? Typical Data Cleansing Correcting data entry errors Removing or correcting nonsensical dates Deleting “garbage” records Combining or deleting duplicates Updating and applying code sets

Quality Data – An Improbable Dream? Business Practice Change Two components –Implementing changes to improve data quality –Adopting ongoing data quality review process Changing Business Practices is a Challenge Get Stakeholder Support

Quality Data – An Improbable Dream? Business Practice Change Education Centralizing responsibility for codes Consolidating data collection Implementing validation routines Change business processes

Quality Data – An Improbable Dream? Quality Review Process Review data regularly Make someone responsible Establish procedures for correcting data problems Communicate quality improvements

Quality Data – An Improbable Dream? Some Changes in BC Creation of Data Manager position, responsible for code sets, data quality Regular education for registration clerks and other data creators Established relationships between data creators and users Re-engineered administrative systems

Quality Data – An Improbable Dream? Improvements to BC Data Improved data quality and quantity –Nonsensical dates almost eliminated –Completeness of key elements improved (from 50% to 80-90%) –Data now being collected for CE in standard format

Quality Data – An Improbable Dream? Final Thoughts… Quality Data are Probable if you are willing to… –Take a critical look at your existing data –Implement changes to how you collect and manage data –Invest the time to educate and communicate with data users and creators –Make data quality improvement an on- going process

Quality Data – An Improbable Dream? Recommended Reading Brackett, Michael H., Data Resource Quality, Turning Bad Habits into Good Practices (New York:Addison-Wesley, 2000) English, Larry P., Improving Data Warehouse and Business Information Quality (New York: John Wiley and Sons, 1999) Redman, Thomas C., Data Quality for the Information Age (Boston;Artech House, Inc., 1996)

Quality Data – An Improbable Dream? Thank You! Presentation Available At or