The Edit Anders Norberg, Statistics Sweden (SCB) Work Session on Statistical Data Editing Ljubljana, Slovenia, 9-11 May 2011.

Slides:



Advertisements
Similar presentations
Work Session on Statistical Data Editing Paris, France, April 2014 Topic (i): Selective editing / macro editing Experiences from Selective Editing.
Advertisements

SELEKT in the International Trade in Services. International Trade in Services Statistics on international trade in services, wages and transfers are.
Preparing Data for Quantitative Analysis
Deliverable 2.8: Outliers Gary Brown Office for National Statistics UK.
Quality Guidelines for statistical processes using administrative data European Conference on Quality in Official Statistics Q2014 Giovanna Brancato, Francesco.
Starting Out with C++, 3 rd Edition 1 Chapter 1. Introduction to Computers and Programming.
Modernisation of Statistical Processing at SURS Andreja Smukavec, SURS Rudi Seljak, SURS Workshop on Modernisation of Statistical Production Geneva, 15–17.
Implementation/Acceptance Testing / 1 Implementation and Acceptance Testing Physical Implementation Criteria: 1. Data availability 2. Data reliability.
Principles of Procedural Programming
1 Business Exchange Structures Concepts.
Editing of mixed source data for turnover statistics Jeffrey Hoogland (SN) Work Session on Statistical Data Editing (Ljubljana, Slovenia, 9-11 May 2011)
Maintenance of Selective Editing in ONS Business Surveys Daniel Lewis.
Database Design IST 7-10 Presented by Miss Egan and Miss Richards.
Copyright 2003 Scott/Jones Publishing Brief Version of Starting Out with C++, 4th Edition Chapter 1 Introduction to Computers and Programming.
United Nations Economic Commission for Europe Statistical Division Applying the GSBPM to Business Register Management Steven Vale UNECE
Environment Change Information Request Change Definition has subtype of Business Case based upon ConceptPopulation Gives context for Statistical Program.
Metadata driven application for aggregation and tabular protection Andreja Smukavec SURS.
DCT 1123 PROBLEM SOLVING & ALGORITHMS INTRODUCTION TO PROGRAMMING.
Simple Program Design Third Edition A Step-by-Step Approach
Automated Data Analysis National Center for Immunization & Respiratory Diseases Influenza Division Nishan Ahmed Data Management Training Cairo, Egypt April.
Use of survey (LFS) to evaluate the quality of census final data Expert Group Meeting on Censuses Using Registers Geneva, May 2012 Jari Nieminen.
Q2010, Helsinki Development and implementation of quality and performance indicators for frame creation and imputation Kornélia Mag László Kajdi Q2010,
A generic tool to assess impact of changing edit rules in a business survey – SNOWDON-X Pedro Luis do Nascimento Silva Robert Bucknall Ping Zong Alaa Al-Hamad.
Towards Drafting a Risk Ontology based on the IRIS Risk Glossary SUMMER ACADEMY Sep 1 st – Sep 4 th 2009 Nick Bassiliades, Dimitris Vrakas Logic Programming.
Support for design of statistical surveys at Statistics Sweden
Deliverable 2.6: Selective Editing Hannah Finselbach 1 and Orietta Luzi 2 1 ONS, UK 2 ISTAT, Italy.
The application of selective editing to the ONS Monthly Business Survey Emma Hooper Office for National Statistics
Current and Future Applications of the Generic Statistical Business Process Model at Statistics Canada Laurie Reedman and Claude Julien May 5, 2010.
Jeroen Pannekoek - Statistics Netherlands Work Session on Statistical Data Editing Oslo, Norway, 24 September 2012 Topic (I) Selective and macro editing.
© 2011 Pearson Addison-Wesley. All rights reserved. Addison Wesley is an imprint of Stewart Venit ~ Elizabeth Drake Developing a Program.
Eurostat Expression language (EL) in Eurostat SDMX - TWG Luxembourg, 5 Jun 2013 Adam Wroński.
Metadata driven application for data processing – from local toward global solution Rudi Seljak Statistical Office of the Republic of Slovenia.
Unit-1 Introduction Prepared by: Prof. Harish I Rathod
Statistical Data Editing Anders Norberg, Statistics Sweden (SCB)
Environment Change Information Request Change Definition has subtype of Business Case based upon ConceptPopulation Gives context for Statistical Program.
First Steps in Modularization. Simple Program Design, Fourth Edition Chapter 8 2 Objectives In this chapter you will be able to: Introduce modularization.
Construction treatment in conducting international comparison of the GDP and purchasing power parities (OECD-Eurostat and CIS approach)
1 Selective data editing Development & implementation Q 2010 Helsinki Jörgen Svensson Process Owner Statistics Sweden (SCB)
BSBPMG504A Manage Project Costs 7.1 Estimate Costs Adapted from PMBOK 4 th Edition InitiationPlanning ExecutionClose Monitor Control The process of developing.
StatisticalData Editing Anders Norberg, Statistics Sweden (SCB)
Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…
Topic (iii): Macro Editing Methods Paula Mason and Maria Garcia (USA) UNECE Work Session on Statistical Data Editing Ljubljana, Slovenia, 9-11 May 2011.
Outlining a Process Model for Editing With Quality Indicators Pauli Ollila (part 1) Outi Ahti-Miettinen (part 2) Statistics Finland.
Pilot Census in Poland Some Quality Aspects Geneva, 7-9 July 2010 Janusz Dygaszewicz Central Statistical Office POLAND.
Copyright 2010, The World Bank Group. All Rights Reserved. Managing Data Processing Section B.
Topic (i): Selective editing / macro editing Discussants Orietta Luzi - Italian National Statistical Institute Rudi Seljak - Statistical Office of Slovenia.
Chapter 1: Overview of SAS System Basic Concepts of SAS System.
A selective editing method considering both suspicion and potential impact, developed and applied to the Swedish foreign trade statistics Topic (ii), WP.
Advances In Software Inspection
On Implementing CSPA Specifications for Editing and Imputation Services Donato Summa, Monica Scannapieco, Diego Zardetto, Istat, Italy Istituto Nazionale.
GCSE Computing#BristolMet Session Objectives #23 MUST understand what is meant by the programming term iteration SHOULD describe methods of looping used.
Evaluating the benefits of using VAT data to improve the efficiency of editing in a multivariate annual business survey Daniel Lewis.
Public Libraries Survey Data File Overview. What We’ll Talk About PLS: Public Libraries Survey State level data Public library data (Administrative Entities)
Public Libraries Survey Data File Overview. 2 What We’ll Talk About PLS: Public Library Survey State level data Public library data (Administrative Entities)
Elaborating on the Business Architecture of SN Robbert Renssen Statistics Netherlands Standard Process Steps.
Maria Gunnarsson SCB Mikael Nordberg SCB PX-Web 2015 December.
Danish GFS compilation system Use of primary data in national accounts By Kristina Stæhr Vest Gitte Frej Knudsen Martin Rasmussen.
Danish GFS compilation system by Martin Rasmussen.
Chapter 1. Introduction to Computers and Programming
An R package for selective editing based on a latent class model
Improving the efficiency of editing in ONS business surveys
Tomaž Špeh, Rudi Seljak Statistical Office of the Republic of Slovenia
Jeroen Pannekoek, Sander Scholtus and Mark van der Loo
SDLC Phases Systems Design.
Parallel Session: BR maintenance Quality in maintenance of a BR:
Cohesion and Coupling.
Quick statistics - how to deal with quality?
New and Emerging Methods
New Editing Methods at Statistics Sweden
Hands-on GSIM Mauro Scanu ISTAT
Presentation transcript:

The Edit Anders Norberg, Statistics Sweden (SCB) Work Session on Statistical Data Editing Ljubljana, Slovenia, 9-11 May 2011

The environment of SELEKT Input, throughput, output, use

Suspicion

SELEKT 1.1 Survey specific cold adapter (SAS code) Data preparation SAS data set PRE-SELEKT Parameter specifications, Analysis of cold data AUTOSELEKT Score calculation & record flagging Records to FOLLOW-UP Process data and reports Input (hot) survey data Records to IMPUTATION Raw+edited past (cold) survey data Survey specific hot adapter (SAS code) Data preparation SAS data set Table of Parameters Table of Estimates Accepted records CLAN estimation software SNOWDON -X analysis of edits Edits

Glossary of Terms on Statistical Data Editing (1) “EDIT RULE SPECIFICATION CHECK RULE SPECIFICATION A set of check rules that should be applied in the given editing task.”

Glossary of Terms on Statistical Data Editing (2) “CHECKING RULE A logical condition or a restriction to the value of a data item or a data group which must be met if the data is to be considered correct. In various connections other terms are used, e.g. edit rule.”

Recommended Practices for Editing and Imputation in Cross- sectional Business Surveys “EDIT A logical condition or a restriction to the value of a data item or a data group which must be met if the data is to be considered correct. Also known as edit rule or checking rule.”

Example 1 if Occupation = ‘Doctor’ and not (29000 < Salary < 71000) then Errcode_A01 = ‘Flag’

Example 1 The test variable if Occupation = ‘Doctor’ and not (29000 < Salary < 71000) then Errcode_A01 = ‘Flag’

Example 1 The edit group if Occupation = ‘Doctor’ and not (29000 < Salary < 71000) then Errcode_A01 = ‘Flag’

Example 1 The acceptance region if Occupation = ‘Doctor’ and not (29000 < Salary < 71000) then Errcode_A01 = ‘Flag’

Example 2 The test variable i f Occupation = ‘Doctor’ and not (29000 < Salary < 71000) or Occupation = ‘Nurse’ and not (23300 < Salary < 43800) then Errcode_A02 = ‘Flag’

Example 2 The edit groups i f Occupation = ‘Doctor’ and not (29000 < Salary < 71000) or Occupation = ‘Nurse’ and not (23300 < Salary < 43800) then Errcode_A02 = ‘Flag’

Example 2 The acceptance regions i f Occupation = ‘Doctor’ and not (29000 < Salary < 71000) or Occupation = ‘Nurse’ and not (23300 < Salary < 43800) then Errcode_A02 = ‘Flag’

Edits EDIT GROUP AND ACCEPTANCE REGION Edit identification Edit group Acceptance region EDIT Edit identification Type of edit Active Section Internal error message External error message Instruction for data review Un-edited test variable Error flag

Edits EDIT GROUP AND ACCEPTANCE REGION Edit identification Edit group Acceptance region EDIT Edit identification Type of edit Active Section Internal error message External error message Instruction for data review Un-edited test variable Error flag LINK Edit identification Survey variable IMPACT ON STATISTICS Survey variable Potent. impact on statistics FLAGGING EDITS, VARIABLES AND UNITS EDIT PRACTICAL SUPPORT Edit identification Standard edit rule Edited test variable Suspicion probability value produced by the SELEKT system 2 1

My questions (1) Can most edits be described as consisting of the components –test variable –edit group –acceptance region ? What types of edits can not?

My questions (2) If the edits can be described this way, what arguments are there for saying that –one edit has only one edit group and one acceptance region –one edit can be composed of many edit groups with one acceptance region each?

My questions (3) Can you give me examples of similar modeling of edits metadata storage for edits edit script generator using a standard metadata storage for edits