Kim Duckworth New Zealand Ministry of Fisheries The application of standardised data quality improvement methodologies to data describing marine fisheries.

Slides:



Advertisements
Similar presentations
Further Analysis MICS3 Regional Workshop on Data Archiving and Dissemination Alexandria, Egypt 3-7 March, 2007.
Advertisements

VARTAN – Validation Reporting Templates Jürgen Teutsch, NLR CAATS Workshop, 16-Feb-2006, Lanzarote.
Producing Quality Evidence in a Well Organised Portfolio Doc Ref: 20/04/09-portfolio-quality-evidence.
Organisation Of Data (1) Database Theory
Session Outline: 1. Research Strategy - the 8 steps including: Finding information on the subject guide Searching the library catalogue Searching online.
Quality Data for a Healthy Nation by Mary H. Stanfill, RHIA, CCS, CCS-P.
Enhancing Data Quality of Distributive Trade Statistics Workshop for African countries on the Implementation of International Recommendations for Distributive.
Social Research Methods
Development of a National Aquatic Biodiversity Information System for New Zealand Jacqui Burgess Senior Scientist Ministry of Fisheries New Zealand.
Chapter 9: Basic Information Systems Concepts. Definitions u A system is a set of interrelated components that must work together to achieve some common.
Plagiarism and the IWU Student. … I’ve been hearing about plagiarism since I was in preschool! … of course I know it’s wrong and I could get in trouble.
Plagiarism and the IWU Student. … I’ve been hearing about plagiarism since I was in preschool! … of course I know it’s wrong and I could get in trouble.
Medieval Sources, Digital Resources Mark Merry History Data Service
Software Quality Metrics
Week 1. What we will cover in this course General place of AIS in accounting Conceptual place of AIS Values and assumptions Documentation techniques Accounting.
Software Development Unit 2 Databases What is a database? A collection of data organised in a manner that allows access, retrieval and use of that data.
Evaluation of digital Libraries: Criteria and problems from users’ perspectives Article by Hong (Iris) Xie Discussion by Pam Pagels.
Creating a high performing School What the research says on how our best performing schools come out on top Courtesy of AITSL.
MEDIN Data Guidelines. Data Guidelines Documents with tables and Excel versions of tables which are organised on a thematic basis which consider the actual.
Controlled Vocabularies (Term Lists). Controlled Vocabs Literally - A list of terms to choose from Aim is to promote the use of common vocabularies so.
Purpose of study A high-quality computing education equips pupils to use computational thinking and creativity to understand and change the world. Computing.
Qualitative Analysis Information Studies Division Research Workshop Elisabeth Logan.
Database Management Exploring the Territory. Database vs Flat Files Flat Files –Characters-fields-records-files Files are not designed to work together.
Requirements Engineering
Copyright 2010, The World Bank Group. All Rights Reserved. Tourism statistics, 1 Business Statistics and Registers 1.
Register-Based Census 2011 in Slovenia – Some Quality Aspects Danilo Dolenc Statistical Office of the Republic of Slovenia UNECE-Eurostat Expert Group.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
United Nations Economic Commission for Europe Statistical Division Getting the Facts Right: Metadata for MDG and other indicators UNECE Baku, Azerbaijan,
What to Know: 9 Essential Things to Know About Web Searching Janet Eke Graduate School of Library and Information Science University of Illinois at Champaign-Urbana.
Data & Information Unit 2 Topic 2. A doctor will order various tests on a patient (data). The results from the tests will give the doctor information.
Data and information. Information and data By the end of this, you should be able to state the difference between DATE and INFORMAITON.
Databases. What is a database?  A database is used to store data. The word DATA is actually Latin for FACTS. A database is, therefore, a place, or thing.
Guidelines for ENSCONET partners in the use of the e-forum.
Developing Statistical Information Systems and XML Information Technologies - Possibilities and Practicable Solutions Geneva,
Supporting Researchers and Institutions in Exploiting Administrative Databases for Statistical Purposes: Istat’s Strategy G. D’Angiolini, P. De Salvo,
1 The Good, the Bad, and the Ugly: Collecting and Reporting Quality Performance Data.
1 CS 430: Information Discovery Sample Midterm Examination Notes on the Solutions.
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
A Metrics Program. Advantages of Collecting Software Quality Metrics Objective assessments as to whether quality requirements are being met can be made.
MarLIN - CSIRO Marine Laboratories Information Network.
Metadata By N.Gopinath AP/CSE Metadata and it’s role in the lifecycle. The collection, maintenance, and deployment of metadata Metadata and tool integration.
Lesson 9: Types of information system. Introduction  An MIS is a decision support system in which the form of input query and response is predetermined.
MICS Data Processing Workshop Multiple Indicator Cluster Surveys Data Processing Workshop Overview of the MICS Process.
CONCEPTUAL MODELLING OF STATISTICAL METADATA AND METADATA DATA MODEL IN CoSSI Geneva, 3-4 April 2006 Heikki Rouhuvirta, Statistical.
CS223: Software Engineering Lecture 2: Introduction to Software Engineering.
Verification & Validation
Census quality evaluation: Considerations from an international perspective Bernard Baffour and Paolo Valente UNECE Statistical Division Joint UNECE/Eurostat.
SESSION 6.4 Information Management System (IMS) initiatives Sixth Tuna Data Workshop (TDW-6) April 2012 SPC, Noumea, New Caledonia.
Auditing data management system Bruno Deprez Data Audit officer SPC, Oceanic Fisheries Program.
First Tuna Data Workshop (TDW-1) October 2006, Noumea, New Caledonia Oceanic Fisheries Programme (OFP) Secretariat of the Pacific Community (SPC)
PBIS DATA. Critical Features of PBIS SYSTEMS PRACTICES DATA Supporting Culturally Knowledgeable Staff Behavior Supporting Culturally Relevant Evidence-based.
The IUCN Species Information Service (SIS)
Plagiarism and the IWU Student
Component 1.6.
Quality assurance in official statistics
RECENT TRENDS IN METADATA GENERATION
REPORT WRITING REFERENCE : Pinner, D. & Pinner, D. (2003) Communication Skills, 4th ed. Pearson Longman, New Zealand, pp. 147 – 162.
CHAPTER 3 Architectures for Distributed Systems
REPORT WRITING REFERENCE : Pinner, D. & Pinner, D. (2003) Communication Skills, 4th ed. Pearson Longman, New Zealand, pp 147 – 162.
Literature review Lit. review is an account of what has been published on a topic by accredited scholars and researchers. Mostly it is part of a thesis.
TDW-11: 24-28th April 2017, Noumea, New Caledonia
Data Quality By Suparna Kansakar.
An introduction to MEDIN Data Guidelines.
Georg Umgiesser and Natalja Čerkasova
Progress in the implementation of RTMCF1 Action Plan.
Parallel Session: BR maintenance Quality in maintenance of a BR:
WJEC GCSE Computer Science
OBSERVER DATA MANAGEMENT PRINCIPLES AND BEST PRACTICE (Agenda Item 4)
Introduction to reference metadata and quality reporting
Database Design Chapter 7.
Presentation transcript:

Kim Duckworth New Zealand Ministry of Fisheries The application of standardised data quality improvement methodologies to data describing marine fisheries and biodiversity.

Why this topic ? Because it is easy to forget that disciplines other than our own also have information quality problems; and Because information quality is what I am passionate about.

Content: The management of marine biodiversity and fisheries information in NZ Structured information quality improvement methodologies: A few definitions The main concepts How we (NZ Ministry of Fisheries) have applied structured information quality improvement methodologies.

Fisheries and biodiversity information management in NZ One group controls the majority of NZ’s fisheries and marine biodiversity information Commercial catch logbook (“Catch Effort”) Fisheries observer (about 20 types of information) Distribution information (on GIS systems) Trawl survey Acoustic survey Fish length frequency Fish aging

Information brokerage Information producers Information Analysers (decision makers)

Fisheries and biodiversity information management in NZ NZ effectively has a national archive of fisheries and marine biodiversity data. Possibly this has meant that accessibility and interoperability have been less of an “issue” in NZ then in many other countries. The big issue for the management of NZ’s fisheries and marine biosecurity information has been improving information quality.

Information quality In New Zealand there are approximately 30 people employed (full time) on improving the quality of fisheries and biodiversity information. “Poor data quality is the norm rather than the exception, but most organisations are in a state of denial about this issue” (GartnerGroup, 1997) The management and improvement of information quality is slowly becoming a discipline (and profession) in itself.

Definitions Data - A representation of a thing or event in the real world Information – Data in context (the meaning of data) Information quality – How closely the representation matches the thing or event in the real world,

Definitions Data - A representation of a thing or event in the real world Information – Data in context (the meaning of data) Information quality – How closely the representation matches the thing or event in the real world, given the purpose(s) for which the data is being collected.

Implications A key aspect of our information quality improvement programmes is to establish and document the purposes for which the information will be used; Data can simultaneously be of both high and low quality; For us to provide someone with information we must give them with both data and context.

Definitions – characteristics of information quality Accuracy Precision Completeness Non-duplication Timeliness Currency Format Context “Rightness”

The information production chain Decision Start of production

The information production chain A (simplified) commercial catch logbook example: Create logbooks and create codes for use on logbooks, Create explanatory notes & train fishers Fishers fill in forms Fishers post forms to a central location Data entry staff enter data Computer systems check and “correct” data Humans check and “correct” data Store data in database Extract from database Analyse and interpret data

Implications: Planning and action needs to be on the basis that all weak links in the chain are identified and acted on. For example – With regard to NZ’s fisheries observer data we have identified over 100 purposes for which the information is used, 482 issues with the status quo and 33 projects which (if implemented) should address those issues.

The methodology Assess information quality Clean existing data Improve the processes that produce data Assess cost/risks of non-quality

Improving the processes that produce data Analyse root causes of errors. Minimise the things that produce errors. Prevent re- occurrence. For example – In NZ we are redesigning catch logbook forms specifically to make them “harder to get wrong”.

Form redesign Prototype forms were tested on “real fishers” Write the month and year on which you fished

Form redesign Prototype forms were tested on “real fishers” Write the month and year on which you fished Write the month (e.g. FEB) and year on which you fished

Context Three examples from NZ of projects to help decision makers understand the context of data: Reference library CD for commercial catch logbook data Information interpretation system for commercial catch logbook data Schematic form used to represent species distribution data on the Ministry’s marine biodiversity GIS

Catch Effort reference library Created because decision makers were having trouble getting hold of the documentation that they needed in order to make sense of the data. The Catch Effort reference library: is a website that runs off a CD provides a “one stop shop” for everything that a decision maker might ever want to know regarding how Catch Effort data is collected, processed, stored and managed contains the equivalent of 500 pages of documentation

Information Interpretation System Arose as a consequence of implementing a decision maker query-able data warehouse, and concerns that decision makers would not understand the context of the data; IIS is an application that stores (in a separate database) known “issues” with Catch Effort data, and retrieves relevant issues in parallel with extractions of data from the data warehouse; Decision makers cannot turn IIS off. They can prevent individual issues being re-displayed within the next 6 months.

Information Interpretation System – example search

Information Interpretation System – example of results

NABIS The National Aquatic Biodiversity Information System A queriable internet based GIS storing information about “what lives where” Aimed at: Decision makers who are not experts in marine bio-diversity The general public Scientists

Conclusions One prerequisite for information quality improvement is knowing the purpose(s) for which the information will be used; It is important for decision makers to be provided with “context” as well as data; Measure information quality; Assess costs/risks of “non-quality”; Address root causes of problems.

The end Questions ?