Maintenance of Selective Editing in ONS Business Surveys Daniel Lewis.

Slides:



Advertisements
Similar presentations
Multiple Indicator Cluster Surveys Survey Design Workshop
Advertisements

Evaluating the Effects of Business Register Updates on Monthly Survey Estimates Daniel Lewis.
Overview of Sampling Methods II
1 Third Workshop on ICP Western Asia Beirut, October 2004 Design of ICP price survey Sultan Ahmad, Consultant Based on Keith.
The estimation strategy of the National Household Survey (NHS) François Verret, Mike Bankier, Wesley Benjamin & Lisa Hayden Statistics Canada Presentation.
Editing and Imputing VAT Data for the Purpose of Producing Mixed- Source Turnover Estimates Hannah Finselbach and Daniel Lewis Office for National Statistics,
Sampling Strategy for Establishment Surveys International Workshop on Industrial Statistics Beijing, China, 8-10 July 2013.
Deliverable 2.8: Outliers Gary Brown Office for National Statistics UK.
Tool for Assessing Impact of Changing Editing Rules On Cost & Quality Alaa Al-Hamad, Begoña Martín, Gary Brown Processing, Editing & Imputation Branch.
Examining the use of administrative data for annual business statistics Joanna Woods, Ria Sanderson, Tracy Jones, Daniel Lewis.
© 2003 Prentice-Hall, Inc.Chap 1-1 Business Statistics: A First Course (3 rd Edition) Chapter 1 Introduction and Data Collection.
QBM117 Business Statistics Statistical Inference Sampling 1.
Evaluation.
© 2004 Prentice-Hall, Inc.Chap 1-1 Basic Business Statistics (9 th Edition) Chapter 1 Introduction and Data Collection.
© 2002 Prentice-Hall, Inc.Chap 1-1 Statistics for Managers using Microsoft Excel 3 rd Edition Chapter 1 Introduction and Data Collection.
Sampling.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 8 Using Survey Research.
Evaluation.
Chapter 17 Additional Topics in Sampling
ISSUES RELATED TO SAMPLING Why Sample? Probability vs. Non-Probability Samples Population of Interest Sampling Frame.
STAT262: Lecture 5 (Ratio estimation)
A new sampling method: stratified sampling
Basic Business Statistics (8th Edition)
Sampling Design.
Unit 4: Monitoring Data Quality For HIV Case Surveillance Systems #6-0-1.
~ Draft version ~ 1 HOW TO CHOOSE THE NUMBER OF CALL ATTEMPTS IN A TELEPHONE SURVEY IN THE PRESENCE OF NONRESPONSE AND MEASUREMENT ERRORS Annica Isaksson.
Impact Evaluation Session VII Sampling and Power Jishnu Das November 2006.
1 Methods for detecting errors in VAT Turnover data Phil Lewis Processing, Editing and Imputation branch Business Statistics Methods-Survey Methodology.
CSCI 347 / CS 4206: Data Mining Module 06: Evaluation Topic 01: Training, Testing, and Tuning Datasets.
Work Package 5: Integrating data from different sources in the production of business statistics Daniel Lewis Office for National Statistics (UK)
Chapter 10 Hypothesis Testing
Introduction to plausible values National Research Coordinators Meeting Madrid, February 2010.
Copyright 2010, The World Bank Group. All Rights Reserved. Agricultural Census Sampling Frames and Sampling Section A 1.
IB Business and Management
From Sample to Population Often we want to understand the attitudes, beliefs, opinions or behaviour of some population, but only have data on a sample.
Chapter Nine Copyright © 2006 McGraw-Hill/Irwin Sampling: Theory, Designs and Issues in Marketing Research.
Multiple Indicator Cluster Surveys Survey Design Workshop Sampling: Overview MICS Survey Design Workshop.
A generic tool to assess impact of changing edit rules in a business survey – SNOWDON-X Pedro Luis do Nascimento Silva Robert Bucknall Ping Zong Alaa Al-Hamad.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Deliverable 2.6: Selective Editing Hannah Finselbach 1 and Orietta Luzi 2 1 ONS, UK 2 ISTAT, Italy.
A Strategy for Prioritising Non-response Follow-up to Reduce Costs Without Reducing Output Quality Gareth James Methodology Directorate UK Office for National.
The application of selective editing to the ONS Monthly Business Survey Emma Hooper Office for National Statistics
ICP Workshop, Tunis Nov. 03 Overview of the Sample Framework.
Population and Sampling
Jeroen Pannekoek - Statistics Netherlands Work Session on Statistical Data Editing Oslo, Norway, 24 September 2012 Topic (I) Selective and macro editing.
Overall Quality Assurance, Selecting and managing external consultants and outsourcing Baku Training Module.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Handbook on Residential Property Price Indices Chapter 5: Methods Jan de Haan UNECE/ILO Meeting, May 2010.
Evaluating generalised calibration / Fay-Herriot model in CAPEX Tracy Jones, Angharad Walters, Ria Sanderson and Salah Merad (Office for National Statistics)
1 Selective data editing Development & implementation Q 2010 Helsinki Jörgen Svensson Process Owner Statistics Sweden (SCB)
Design of the 2011 Census Coverage Survey Owen Abbott (ONS) James Brown (Institute of Education)
Chapter 6: 1 Sampling. Introduction Sampling - the process of selecting observations Often not possible to collect information from all persons or other.
1 Chapter 2: Sampling and Surveys. 2 Random Sampling Exercise Choose a sample of n=5 from our class, noting the proportion of females in your sample.
Bangor Transfer Abroad Programme Marketing Research SAMPLING (Zikmund, Chapter 12)
Basic Business Statistics, 8e © 2002 Prentice-Hall, Inc. Chap 1-1 Inferential Statistics for Forecasting Dr. Ghada Abo-zaid Inferential Statistics for.
Sampling technique  It is a procedure where we select a group of subjects (a sample) for study from a larger group (a population)
The 2011 Census: Estimating the Population Alexa Courtney.
Testing the use of administrative data to edit the 2009 Agriculture Census Dolores Lorca National Statistical Institute of Spain.
Evaluating the benefits of using VAT data to improve the efficiency of editing in a multivariate annual business survey Daniel Lewis.
RESEARCH METHODS Lecture 28. TYPES OF PROBABILITY SAMPLING Requires more work than nonrandom sampling. Researcher must identify sampling elements. Necessary.
Common Pitfalls in Randomized Evaluations Jenny C. Aker Tufts University.
FDI - Imputation. Overview Introduction Overview of Imputation Methods Overview of Outliering methods Overview of Estimation methods Aggregation Disclosure.
Data Science Credibility: Evaluating What’s Been Learned
Introduction to Inference
Chapter 6 Inferences Based on a Single Sample: Estimation with Confidence Intervals Slides for Optional Sections Section 7.5 Finite Population Correction.
RESEARCH METHODS Lecture 28
Improving the efficiency of editing in ONS business surveys
Cross-validation Brenda Thomson/ Peter Fox Data Analytics
Business Statistics: A First Course (3rd Edition)
Winsorisation for estimates of change
Presentation transcript:

Maintenance of Selective Editing in ONS Business Surveys Daniel Lewis

Overview Introduction Selective editing in ONS The need to review selective editing Threshold review Process for maintaining selective editing Next steps

Introduction ONS uses selective editing on nine key business surveys Aim to focus editing effort where it improves output quality Three different implementations: Original, based on prioritising edit failures New “standard”, inspired by Australian method Selekt – Swedish software Important to regularly review thresholds and maintain editing systems

Selective editing in ONS – standard method Calculate score for each key variable (item): weight * | returned value - expected value | * 100 previous period domain estimate Expected value is usually the previous period value, when available Otherwise use relationship to register variable Calculate unit score by taking average of item scores Follow up businesses with score larger than (domain) threshold

Selective editing in ONS – standard method Thresholds set for each output domain by analysis of past data Aim to keep “pseudo-bias” no greater than 1% for each domain i.e. bias introduced by not editing all suspicious responses

Selective editing in ONS - Selekt Standard method works well when there is high sample overlap between variables and not too many key variables Annual Business Survey (ABS) has many variables and less than 50% overlap Selekt used instead Flexible in making use of available data Score split into three parts: Suspicion Impact Importance

Selective editing in ONS – Selekt To use Selekt, need to specify many parameters and choose which variables (or combinations of variables) to include in the score Tested options using an iterative approach, aiming to minimise failures whilst keeping quality acceptable Settings also partially informed by survey team experience

The need to review selective editing Selective editing offers good efficiency savings, but systems are often set up based on initial analysis of data and not reviewed Over time problems often arise with the use of selective editing on a survey Need to consider: Unforeseen data issues Cultural issues Threshold review

Unforeseen issues and changes It may be necessary to review selective editing due to unforeseen data issues or changes to questions or coverage of a survey Possible to fix these issues, but need process to identify and deal with them quickly

Cultural issues Also important to address any cultural issues with the use of selective editing When editing or results team do not trust / understand the process they may perform their own additional editing Important to embed culture – workshops explaining methods and discussing issues often work well

Threshold review Key challenge in maintaining selective editing is keeping thresholds and parameters up to date Original analysis to set thresholds relies on fully edited dataset to evaluate performance Once selective editing is in place, we no longer have access to such a dataset Two options to deal with this Make model assumptions about data and their error structure Sample and re-contact some businesses that passed selective editing

Threshold review Developing method to re-set thresholds based on sample of businesses passing selective editing Initial study in 2012 to evaluate performance of thresholds in Retail Sales selective editing Average monthly sample size around 3500, with around 900 failing selective editing Agreed resource to re-contact an additional 600 businesses as a one-off exercise Sampled proportional to

Threshold review Three approaches considered for sampling businesses within each domain: 1. Simple random sample 2. Stratified random sample using unit score as stratification variable 3. Split domain into two strata based on unit score, fully enumerate largest scores Approaches tested using RSI data before selective editing implemented Suggestion from Sweden (not tested) to use Poisson sampling with probabilities proportional to score

Threshold review Performance of tested approaches assessed using two estimates of pseudo-bias Full sample estimate Sub-sample estimate

Threshold review Bias estimates compared for 35 months of RSI data Simple random sample within domain most accurate for estimating pseudo-bias Disadvantage that no guarantee of getting units that narrowly pass selective editing Implemented method in practice and discovered no domains with pseudo-bias above 1% In this case, no need to change thresholds Fortunate as we still need to develop method to enable new threshold analysis!

Process for maintaining selective editing Would like a well defined process to ensure thresholds are regularly reviewed, data issues are identified and dealt with, selective editing culture is properly embedded Agreed that sub-annual surveys will have thresholds agreed every three years, annual surveys every five years Any ongoing selective editing issues will be identified as part of new regular Survey Action Plan meetings

Next steps Develop and pilot method for testing thresholds using RSI data Review any other issues with RSI selective editing Following implementation of successful approach, create timetable for reviewing all surveys with selective editing Aim to fully implement process by end 2014