Evaluating the Effects of Business Register Updates on Monthly Survey Estimates Daniel Lewis.

Slides:



Advertisements
Similar presentations
Innovation data collection: Advice from the Oslo Manual South East Asian Regional Workshop on Science, Technology and Innovation Statistics.
Advertisements

Innovation Surveys: Advice from the Oslo Manual South Asian Regional Workshop on Science, Technology and Innovation Statistics Kathmandu,
Tests About a Population Mean
CHAPTER 2 – DISCRETE DISTRIBUTIONS HÜSEYIN GÜLER MATHEMATICAL STATISTICS Discrete Distributions 1.
Comparing Two Population Parameters
Editing and Imputing VAT Data for the Purpose of Producing Mixed- Source Turnover Estimates Hannah Finselbach and Daniel Lewis Office for National Statistics,
Riku Salonen Regression composite estimation for the Finnish LFS from a practical perspective.
Sampling Distributions Martina Litschmannová K210.
Estimation of Sample Size
Examining the use of administrative data for annual business statistics Joanna Woods, Ria Sanderson, Tracy Jones, Daniel Lewis.
QBM117 Business Statistics Statistical Inference Sampling 1.
1/55 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 10 Hypothesis Testing.
Hui-Hua Lee 1, Kevin R. Piner 1, Mark N. Maunder 2 Evaluation of traditional versus conditional fitting of von Bertalanffy growth functions 1 NOAA Fisheries,
STAT 4060 Design and Analysis of Surveys Exam: 60% Mid Test: 20% Mini Project: 10% Continuous assessment: 10%
1 Marketing Research Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides.
FINAL REPORT: OUTLINE & OVERVIEW OF SURVEY ERRORS
Maintenance of Selective Editing in ONS Business Surveys Daniel Lewis.
1/49 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 9 Estimation: Additional Topics.
 Deviation is a measure of difference for interval and ratio variables between the observed value and the mean.  The sign of deviation (positive or.
Eurostat Repeated surveys. Presented by Eva Elvers Statistics Sweden.
Improving Quality in the Office for National Statistics’ Annual Earnings Statistics Pete Brodie & Kevin Moore UK Office for National Statistics.
AM Recitation 2/10/11.
Work Package 5: Integrating data from different sources in the production of business statistics Daniel Lewis Office for National Statistics (UK)
Administrative Data at Statistics Canada – Current Uses and the Way Forward 27 th Voorburg Group Meeting Warsaw, Poland André Loranger October 4, 2012.
Chapter 10 Hypothesis Testing
Multiple Indicator Cluster Surveys Survey Design Workshop Sampling: Overview MICS Survey Design Workshop.
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
Review of Chapters 1- 5 We review some important themes from the first 5 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
Improvements in stratification in the UK's Office for National Statistics Pete Brodie, Martina Portanti & Emily Carless UK Office for National Statistics.
Use of Administrative Data in Statistics Canada’s Annual Survey of Manufactures Steve Matthews and Wesley Yung May 16, 2004 The United Nations Statistical.
Slide 1 Estimating Performance Below the National Level Applying Simulation Methods to TIMSS Fourth Annual IES Research Conference Dan Sherman, Ph.D. American.
© John M. Abowd 2007, all rights reserved Analyzing Frames and Samples with Missing Data John M. Abowd March 2007.
Basic Statistics for Engineers. Collection, presentation, interpretation and decision making. Prof. Dudley S. Finch.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Performance of Resampling Variance Estimation Techniques with Imputed Survey data.
Investigating improvements in quality of survey estimates by updating auxiliary information in the sampling frame using returned and modelled data Alan.
Lecture 2 Forestry 3218 Lecture 2 Statistical Methods Avery and Burkhart, Chapter 2 Forest Mensuration II Avery and Burkhart, Chapter 2.
Using administrative registers in sample surveys European Conference on Quality in Official Statistics 3-–6 May 2010 Kaja Sõstra Statistics Estonia.
CHAPTER 12 Descriptive, Program Evaluation, and Advanced Methods.
Copyright  2003 by Dr. Gallimore, Wright State University Department of Biomedical, Industrial Engineering & Human Factors Engineering Human Factors Research.
An Overview of Statistics Section 1.1. Ch1 Larson/Farber 2 Statistics is the science of collecting, organizing, analyzing, and interpreting data in order.
Evaluating generalised calibration / Fay-Herriot model in CAPEX Tracy Jones, Angharad Walters, Ria Sanderson and Salah Merad (Office for National Statistics)
Chapter 5 Parameter estimation. What is sample inference? Distinguish between managerial & financial accounting. Understand how managers can use accounting.
통계적 추론 (Statistical Inference) 삼성생명과학연구소 통계지원팀 김선우 1.
Index of Manufactured Exports Methodology Review 2012 Ratio Estimation methodology update Chainlinking and benchmarking to GCS / I-O (and SNAP) SIC 2007.
Replacing sample panels in the Index of Production with Ratio Estimates Gregor Boyd Office of the Chief Economic Adviser October 2008.
Sampling Sources: -EPIET Introductory course, Thomas Grein, Denis Coulombier, Philippe Sudre, Mike Catchpole -IDEA Brigitte Helynck, Philippe Malfait,
Inferential Statistics Introduction. If both variables are categorical, build tables... Convention: Each value of the independent (causal) variable has.
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
Evaluating the benefits of using VAT data to improve the efficiency of editing in a multivariate annual business survey Daniel Lewis.
Chapter 7 Introduction to Sampling Distributions Business Statistics: QMIS 220, by Dr. M. Zainal.
Backcasting the future Gary Brown Office for National Statistics, UK.
1 Collecting and Interpreting Quantitative Data Deborah K. van Alphen and Robert W. Lingard California State University, Northridge.
An Overview of Statistics Section 1.1 After you see the slides for each section, do the Try It Yourself problems in your text for that section to see if.
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
Inferential Statistics Psych 231: Research Methods in Psychology.
STA248 week 121 Bootstrap Test for Pairs of Means of a Non-Normal Population – small samples Suppose X 1, …, X n are iid from some distribution independent.
Statistics Statistics is that field of science concerned with the collection, organization, presentation, and summarization of data, and the drawing of.
Statistics in Management
Data Analysis.
Random Variable.
ESSnet on Consistency Workshop
Regression composite estimation for the Finnish LFS from a practical perspective Riku Salonen.
Random Variable.
ANALYSIS OF POSSIBILITY TO USE TAX AUTHORITY DATA IN STS. RESULTS
Psych 231: Research Methods in Psychology
Analyzing Reliability and Validity in Outcomes Assessment
Sampling and estimation
Collecting and Interpreting Quantitative Data
Presentation transcript:

Evaluating the Effects of Business Register Updates on Monthly Survey Estimates Daniel Lewis

Overview Introduction Different strategies for updating the business register Simulation method for testing updating strategies Simulation results Conclusions and comments

Introduction Inter-Departmental Business Register (IDBR) used by most ONS business surveys Key variables – employment, turnover, industry (SIC) Updated from different survey and admin sources Quality of updates affects survey estimates Desire to produce priority rules for updating Accuracy of updating strategies tested by simulation

Sources for updating the IDBR Employment: –Pay As You Earn tax data (PAYE) –Business Register Survey (BRS) –Imputed from turnover data (Annual) Turnover: –Value Added Tax data (VAT) –Imputed from employment data Standard Industrial Classification (SIC): –VAT –PAYE –BRS

Updating scenarios tested Employment: –Always favour PAYE –Always favour BRS –Favour BRS if less than 3 / 2 / 1 years old SIC: –Always favour PAYE –Always favour VAT –Always favour BRS –Favour BRS if less than 3 / 2 / 1 years old –Range of options for second priority if first unavailable Frequency of updates – monthly, quarterly, annually

Method for testing updating scenarios Assess effect on monthly turnover survey estimates Simulation method using four steps: 1.Simulate 12 months real world data 2.Create business register 3.Select and survey samples from register 4.Calculate estimates and compare to true values

1. Simulate real world data (i) Use January IDBR data as starting point Analyse 12 months of IDBR data to estimate: –Probability of a business dying –Probability of changing SIC –Probability of a change in turnover –Probability of a change in employment –Probability of a new business being born Probabilities used to randomly assign characteristics to businesses each month

1. Simulate real world data (ii) Changes in employment and turnover modelled based on observed means and standard deviations within strata Change in SIC randomly assigned based on probabilities of each type of SIC change Births and deaths also randomly introduced based on probabilities Monthly turnover data created by comparison with weighted monthly survey data from the same year

2. Create business register Create register variables from real world for each updating source with matching (realistic) quality deficiencies Quality parameters for each source derived by comparing average changes in employment and SIC before and after main register update Variables for each updating scenario derived by adding random variation to real world value:

3. Select and survey samples Samples drawn many times from the simulated business registers using typical business survey sample design Stratified by industry and employment Neyman allocation using annual turnover data Samples selected using Permanent Random Number sampling with different random start for each iteration and 15 month rotation period Turnover value collected for each sampled business

4. Calculate and compare estimates Estimates of total turnover calculated for each sample using separate ratio estimator with employment as auxiliary MSEs calculated for each updating scenario by comparison with true turnover in the real world:

Simulation results Simulation very time consuming –Only possible to draw 516 samples –Just sufficient for convergence Best updating strategy: –Update IDBR monthly –Give preference to PAYE for employment –Use BRS for SIC if less than 3 years old, otherwise VAT A few other options were not significantly worse

Conclusions and comments Method very computer intensive, but gave useful results Project time limited, so constrained to using simple methods Potential to extend the model to better reflect business survey populations and updating processes Then possible to test a wide range of business survey methods: –Sample designs, rotation rates, estimation, variance estimation, outlier treatment, …