Quality of PSI Robbin te Velde Helsinki, 19-20 April 2007.

Slides:



Advertisements
Similar presentations
SDMX in the Vietnam Ministry of Planning and Investment - A Data Model to Manage Metadata and Data ETV2 Component 5 – Facilitating better decision-making.
Advertisements

Business Development Suit Presented by Thomas Mathews.
Corporate Administration Management System CAMS-ITech: Vertical CRM for the Administration/Finance Area CAMS-iTech™ is the technological answer developed.
UNIT-2 Data Preprocessing LectureTopic ********************************************** Lecture-13Why preprocess the data? Lecture-14Data cleaning Lecture-15Data.
StormingForce.com Motion. StormingForce.com StormingForce’s technology is significantly increasing productivity and quality of manual repetitive tasks.
C6 Databases.
TECHNICAL VOCATIONAL EDUCATIONAL AND TRAINING COLLEGES AN INTRODUCTION TO THE IMPEMENTATION OF A COMPLIANT RISK MANAGEMENT PROCESS July 2014.
Procserve Benefits of eCommerce © Procserve Holdings Limited. All rights reserved.
Konstanz, Jens Gerken ZuiScat An Overview of data quality problems and data cleaning solution approaches Data Cleaning Seminarvortrag: Digital.
Data-Sharing and Governance Consultation ANALYSIS OF RESPONSES.
Frank Yu Australian Bureau of Statistics Unstructured Data 1.
Regional Workshop for African Countries on Compilation of Basic Economic Statistics Pretoria, July 2007 Administrative Data and their Use in Economic.
Theoretical Structure of Financial Accounting
Managing Data Resources
1 Computerised National Land Book of Latvia Ints Lukss Project Manager MikroKods Ltd.
ISQA 479 MRP Quality Control. Materials Requirements Planning Bill of Material List of all parts necessary to make ONE BofM X Order = GROSS Req’mts Subtract.
EFET comments on obstacles to trading November 2007 Obstacles to Electricity Trading in Central & Eastern Europe – latest development.
Data Resource Management Data Concepts Database Management Types of Databases Chapter 5 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies,
Office of Government Contracting & Business Development -- Women Owned Small Business Program -- April 2012 “The WOSB Advantage” A Guide to the Women Owned.
Compliance System Validation - An Audit Based Approach December 2012 Uday Gulvadi, CPA, CIA, CISA, CAMS Director - Internal Audit, Risk and Compliance.
Data Governance Data & Metadata Standards Antonio Amorin © 2011.
Get More Value from Your Reference Data—Make it Meaningful with TopBraid RDM Bob DuCharme Data Governance and Information Quality Conference June 9.
5.1 © 2007 by Prentice Hall 5 Chapter Foundations of Business Intelligence: Databases and Information Management.
Visit our Focus Rooms Evaluation of Implementation Proposals by Dynamics AX R&D Solution Architecture & Industry Experts Gain further insights on Dynamics.
Re – use of PSI in Slovenia Kristina Kotnik Šumah Deputy of the Information Commisoner.
Slide 1 D2.TCS.CL5.04. Subject Elements This unit comprises five Elements: 1.Define the need for tourism product research 2.Develop the research to be.
International Congress and Convention Associationwww.iccaworld.com Strategic Plan – Mission Statement “ICCA is the global community for the meetings industry,
Management Accounting- Nature And Scope
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
What You Need before You Deploy Master Data Management Presented by Malcolm Chisholm Ph.D. Telephone – Fax
DIRECTIVE 2003/98/EC OF THE EUROPEAN PARLIAMENT AND OF THE COUNCIL of 17 November 2003 on the re-use of public sector information (PSI directive) Theory.
Conceptual Framework For Financial Reporting
Revise lecture Statement of cash flows – IAS 7 2.
MIS 301 Information Systems in Organizations Dave Salisbury ( )
Emerging Technologies Work Group Master Data Management (MDM) in the Public Sector Don Hoag Manager.
Case 2: Emerson and Sanofi Data stewards seek data conformity
ARDN to EPSI, Riga1 The market for re-usable PSI Presentation to EPSI Riga Adrian Norman.
Lecturer: Gareth Jones. How does a relational database organise data? What are the principles of a database management system? What are the principal.
CZECH STATISTICAL OFFICE Na padesátém 81, CZ Praha 10, Czech Republic 1 Subsystem QUALITY in Statistical Information System Czech.
1.file. 2.database. 3.entity. 4.record. 5.attribute. When working with a database, a group of related fields comprises a(n)…
C6 Databases. 2 Traditional file environment Data Redundancy and Inconsistency: –Data redundancy: The presence of duplicate data in multiple data files.
Revise Lecture 2 1. Revise Lecture The regulatory system 2.2. A conceptual framework 2.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
Health eDecisions Use Case 2: CDS Guidance Service Strawman of Core Concepts Use Case 2 1.
Master Data Management & Microsoft Master Data Services Presented By: Jeff Prom Data Architect MCTS - Business Intelligence (2008), Admin (2008), Developer.
Knowledge Management & Knowledge Management Systems By: Chad Thomison MIS 650.
Global and China Underfloor Heating Industry Report 2015 Website : No of Pages: 231 Published: November 2015 Single User PDF: US$ 2800.
Global Sports Nutrition Industry 2015 Market Research Report Website : No of Pages: 158 Published: November 2015 Single User PDF: US$
Global Food Flavors Industry 2015 Market Research Report Website : No of Pages: 159 Published: November 2015 Single User PDF: US$ 2800.
Global Guar Gum Industry 2015 Market Research Report
Statistics Netherlands’ modernization programme: the use of administrative data, lessons learned and the way ahead. Geert Bruinooge Assistant Director.
© 2012 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
1 Statistical business registers as a prerequisite for integrated economic statistics. By Olav Ljones Deputy Director General Statistics Norway
Chapter 8 Auditing in an E-commerce Environment
Copyright ©2005 by South-Western, a division of Thomson Learning. All rights reserved. Introduction to Marketing.
Information Resource Stewardship A suggested approach for managing the critical information assets of the organization.
Are the Standard Documentations really Quality Reports? European Conference on Quality in Official Statistics Helsinki, 3-6 May 2010 © STATISTIK AUSTRIA.
Introduction to Microeconomics. Meaning of Microeconomics Microeconomics is the study of the economic actions of individuals and small group of individuals.
Logical Database Design and the Rational Model
An Introduction to Quality
Risk Budgeting.
Amadeus Open Profile Suite
Literature Review: Conception to Completion
Data Quality By Suparna Kansakar.
Transaction Objects, Control Objects, Control tags and Tags Dynamics
6.1 Quality improvement Regional Course on
Contents Co-operation about one common register Public accessible
MGT601 SME MANAGEMENT.
Introduction to Quality
Accuracy and Precision
Presentation transcript:

Quality of PSI Robbin te Velde Helsinki, April 2007

2 of 12 Outline of the presentation Short (philosophical) introduction on quality Data management & data quality (practice and theory) Conventional data management PSI enlightened models Quality and pricing

3 of 12 Defining the elusive concept of Quality (I) Common definitions of quality (Garvin, 1984) : Transcendent: “quality is neither mind nor matter, but a third entity independent of the two […] even though Quality cannot be defined, you know what it is” (Pirsig, 1974) Product-based: “differences in quality amount to differences in the quantity of some desired ingredients or attribute” (Abbot, 1955) Manufacturing-based: “quality means conformance to requirements” (Crosby, 1984) Value-based: “quality means best for certain customer conditions. These conditions are (a) the actual use and (b) the selling price of the product” (Feigenbaum, 1961) User-based: “quality is fitness for use” (Juran, 1988)

4 of 12 Defining the elusive concept of Quality (II) There is no unambiguous definition of quality. Each definition stresses other dimensions in quality management [thus] the specific interpretation of quality is no neutral process but is both cause and effect of internal (management x staff) and external (organisation x customer; buyer x supplier) relations. In each era one particular definition has been dominant. Over the centuries there has been a shift from the transcendent to the product and manufacturing-based via the value-based back to the more transcendent user-based definition.

5 of 12 The grim reality of data quality Lack of Metadata Management –no common data definitions exists about what data means (e.g., shared vocabulary) No clarity on data ownership –Users create, modify and access data but nobody sees it as its responsibility to own it (fear of ‘blame culture’) Poor data quality –no common consistent way of validating data across applications Massive data redundancy and fractured inconsistent data across different systems –significant data re-keying –maintenance of master data attributes done in different systems –two-way data flow between systems to synchronise the same data Business process outsourcing occurring without process integration and/or integrated master data management Fractured unmanaged unstructured content –no CMS and/or taxonomy to organise the content

6 of 12 Data Quality Improvement as part of Data Management (I) Database Administration Data Security Management Data Architecture, Analysis & Design Metadata Management Data Warehousing & Business Intelligence Reference & Master Data Management Data Quality Improvement Unstructured Data Management Data Stewardship, Strategy & Governance Regulatory Compliance (SOX, etc.) Data Quality Analysis (including Data Profiling) Data Cleanup Campaigns and Programs Data Quality Requirements Analysis Data Quality Auditing and Certification

7 of 12 Conventional Data Management (II) This model is still very much within the manufacturing-based tradition of quality control Quality is defined as the accuracy of the product (the data) Assumes existence of ex ante, objective, uniform quality criteria Works well but only under certain conditions (stable, well- defined operational environment) Primary process accept reject Re-use Quality criteria (filter) Example I: Unique identification of companies at the Dutch Basic Business Register (BBR) [source: Human Inference] Primary process (MoE: tons) Re-use of information (investors) Example II: Lack of common data definition between Czech Statistical Office (CZSO) and Ministry of Environment (MoE) [source: prof. Jiri Hřebiček, Masaryk University, Brno, Czech Republic] Primary process (CZSO: kg) Same value hospital EC database

8 of 12 Limitations of conventional Data Management Ex ante objective criteria are never 100% complete –It is impossible to define beforehand all possible combinations –If you do not include enough combinations you miss the ‘fuzzy’ ones –If many combinations are included filtering takes too long The (futile) effort to go for 100% accuracy hampers process outsourcing Reference data has different meanings to different people; the quality of this reference data is related to the requirements of each user 350,000 objects x 3 types of address x 15 object categories Search options were only based on exact matches, so all ‘fuzzy’ duplicates (e.g., alternative spelled names) were not found the number of combinations was already so big that the filtering took several seconds This often resulted in duplicates because once users has searched for a few seconds without any results, they simply created a new record Example III: Address validation at RWTÜV AG (Germany) [source: Human Inference] The official Dutch government portal Overheid.nl has a strict policy not to allow any content from third (private) parties on the website. This is not a particularly citizen-centered approach but the official policy statement is that they only want “100% certified” information and that they thus do not accept content generated by processes which are not fully under their own control. Example IV: Content syndication at Dutch government portal (overheid.nl) Reference data has different meaning to different users and the quality of this data is related to the requirements of each user. Some reference data may be more critical than others depending on its use at the time. The solution choosen it to built a unique dynamic list of business rules for each user, based on the qualitative feedback obtained from that user. Example V: Use of business rules for UK security trading (private sector) [source: Finsoft Ltd]

9 of 12 Hidden assumptions of conventional (closed) model for PSI quality control There is a strict split between (public sector) generation of data and (private sector) re-use of that information The flow of data is unidirectional The generator of the data is solely responsible for the quality of the data Lack of quality of PSI is an important obstacle for re-use Primary process accept reject Re-use Quality criteria (filter) Example I: Unique identification of companies at the Dutch Basic Business Register (BBR) [source: Human Inference] Primary process Final use Example VI: Conventional (‘closed’) model for PSI quality control (cf. the Czech waste case) ex ante quality control Re-use ex post quality control public sector private sector Primary process accept reject Re-use Quality criteria (filter) Example I: Unique identification of companies at the Dutch Basic Business Register (BBR) [source: Human Inference] Primary process Final use Example VI*: Conventional (‘closed’) model for PSI quality control (ex post quality control outsourced to private sector, e.g. Acxiom) ex ante quality control Re-use ex post quality control public sector private sector

10 of 12 ‘Enlightened’ models for PSI quality control The generation, re-use and final use are intertwined The flow of data is multidirectional The public content holder does not have to be the generator but is always at least partly responsible for the quality of the data Lack of fitness for (re)use is an important obstacle for re-use (not lack of primary data quality per se) Example VII: Intertwined, multidirectional data flows Primary process Final use Re-use public sector private sector Primary process Final use Re-use Primary process accept reject Re-use Quality criteria (filter) Example I: Unique identification of companies at the Dutch Basic Business Register (BBR) [source: Human Inference] Primary process Final use Example VIII: Co-management of quality (geo- information Norway) Re-use public sector private sector Primary process accept reject Re-use Quality criteria (filter) Example I: Unique identification of companies at the Dutch Basic Business Register (BBR) [source: Human Inference] Primary process Final use Example IX: Co-management of quality by final user (Latvia) Re-use public sector private sector Example X: Central role of government in quality management Primary process Final use Re-use public sector private sector Primary process Final use Re-use

11 of 12 Quality and pricing Many public content holders fear that opening up their information (for free) to the public at large cannibalizes their income from commercial re-use. In general though low end and high end markets can very well co-exist. The price discrimination is based on differences in quality In the specific case of information goods, this quality does not always refer to the quality of the primary data itself but especially to the ‘fitness for re-use’. Primary process accept reject Re-use Quality criteria (filter) Example I: Unique identification of companies at the Dutch Basic Business Register (BBR) [source: Human Inference] Primary process Final use Example XI: Commercial re-use excludes free use for public at large Re-use public sector private sector First price no price Primary process accept reject Re-use Quality criteria (filter) Example I: Unique identification of companies at the Dutch Basic Business Register (BBR) [source: Human Inference] Primary process Final use Example XI: Commercial re-use excludes free use for public at large Re-use public sector private sector Primary process accept reject Re-use Quality criteria (filter) Example I: Unique identification of companies at the Dutch Basic Business Register (BBR) [source: Human Inference] Primary process Final use Example XII: High-end market (‘fit for re-use’) coexists with low end market Re-use public sector private sector high quality low quality ‘fit for re-use’ Final use’

12 of 12 Contact Robbin te Velde 20 April 2007