A comparison of sample and register based survey: the case of labour market data De Gregorio C., Filipponi D., Martini A., Rocchetti I.

Slides:

Advertisements

Similar presentations

Helsinki 8-11 th June, 2005 NIESR. –National accounts –Comparability –Expertise Information needed –Industry disaggregation –Backdating –INDUSTRY CLASSIFICATION.

Advertisements

Département fédéral de lintérieur DFI Office fédéral de la statistique OFS Record Linkage : a key and challenging process for CATI surveys. ESSnet on Data.

ISTAT - Italian National Institute of Statistics Labour Force Survey Division Unit “Methods for LFS data treatment” 5 th Workshop on LFS methodology Paris,

1 European Conference on Quality in Official Statistics Rome, 8-11 July 2008 Improving the quality and the quality assessment of the Labour Force Survey.

Estimation of unrecorded employment using administrative data Ágota Scharle Bálint Szabó Ministry of Finance Budapest.

SCREENING FOR DISEASE Nigel Paneth. THREE KEY MEASURES OF VALIDITY 1.SENSITIVITY 2.SPECIFICITY 3.PREDICTIVE VALUE.

Beginning the Research Design

QMSS, Lugano, Lynn Control of Sampling Error Peter Lynn Institute for Social and Economic Research, University of Essex, UK.

1 Active Population, Employment and Unemployment in Slovenia.

The Census Data Enhancement Project Glenys Bishop.

What If You Reject H 0 ? How much different is Second Population from which Your Sample Is derived? d is a rati o (Comparison) of group differences to.

Impact Evaluation Session VII Sampling and Power Jishnu Das November 2006.

FINAL REPORT: OUTLINE & OVERVIEW OF SURVEY ERRORS

Trade and business statistics: use of administrative data Lunch Seminar Enrico Giovannini Italian National Statistical Institute (ISTAT) New York, February,

Arun Srivastava. Small Areas What is a small area? Sub - population Domain The Domain need not necessarily be geographical. Examples Geographical Subpopulations.

Guidance on Evaluation of Youth Employment Initiative

The new HBS Chisinau, 26 October Outline 1.How the HBS changed 2.Assessment of data quality 3.Data comparability 4.Conclusions.

Combining administrative and survey data: potential benefits and impact on editing and imputation for a structural business survey UNECE Work Session on.

Work Package 5: Integrating data from different sources in the production of business statistics Daniel Lewis Office for National Statistics (UK)

4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.

Innovations on methods and survey process for the 2011 Italian population census European Conference on Quality in Official Statistics 8-11 July, 2008.

1st NRC Meeting, October 2006, Amsterdam 1 ICCS Sampling Design.

Use of survey (LFS) to evaluate the quality of census final data Expert Group Meeting on Censuses Using Registers Geneva, May 2012 Jari Nieminen.

12th Meeting of the Group of Experts on Business Registers

DEFINITIONS 1 SAMPLE MEAN Z-TEST 1 SAMPLE MEAN T-TEST 1 PROPORTION Z-TEST 2 INDEPENDENT SAMPLES T-TEST 2 RELATED SAMPLES PAIRED DATA TYPE OF ERRORS Chapter.

Q2010, Helsinki Development and implementation of quality and performance indicators for frame creation and imputation Kornélia Mag László Kajdi Q2010,

Use of Administrative Data in Statistics Canada’s Annual Survey of Manufactures Steve Matthews and Wesley Yung May 16, 2004 The United Nations Statistical.

Record matching for census purposes in the Netherlands Eric Schulte Nordholt Senior researcher and project leader of the Census Statistics Netherlands.

Pushing forward with ASPIRE A System for Product Improvement, Review and Evaluation Heather Bergdahl, Paul Biemer, Dennis Trewin Q2014.

Use of web scraping and text mining techniques in the Istat survey on “Information and Communication Technology in enterprises” Giulio Barcaroli(*), Alessandra.

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 4 Gathering Data Section 4.2 Good and Poor Ways to Sample.

Sampling strategy for the dual-system correction of the under-coverage in the Register Supported 2011 Italian Population Census Loredana Di Consiglio,

European Conference on Quality in Official Statistics Roma, July 8-11, 2008 New Sampling Design of INSEE’s Labour Force Survey Sébastien Hallépée Vincent.

Implementation of quality indicators in the Finnish statistics production process Kari Djerf Statistics Finland Q2008, Rome Italy.

ISTAT - Italian National Institute of Statistics Labour Force Survey Division Unit “Methods for LFS data treatment” European Conference on Quality in Official.

© 2010 Pearson Prentice Hall. All rights reserved Chapter Data Collection 1.

Bias in Surveys MDM4U – Mathematics of Data Management.

One-Sample Tests of Hypothesis Chapter 10 McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved.

The Dutch Virtual Census of 2001 A New Approach by Combining Different Sources Eric Schulte Nordholt ECE Census meetings Geneva, November 2004.

National Accounts and Employment Data Group of Experts on National Accounts Geneva April 2006

for statistics based on multiple sources

4. Numerical Integration. Standard Quadrature We can find numerical value of a definite integral by the definition: where points x i are uniformly spaced.

USING A TABLE OF VALUES TO ESTIMATE A LIMIT. Use a table of values to estimate.

Inference about a Population Proportion BPS chapter 19 © 2010 W.H. Freeman and Company.

Raymond W. Stone Stone & McCarthy Research Associates Presentation: University of Richmond November 19, 2004 Topics Relating to Employment Statistics.

A discussion of Comparing register and survey wealth data ( F. Johansson and A. Klevmarken) & The Impact of Methodological Decisions around Imputation.

THE LFS REVIEW in the context of Eurostat programme for modernising social micro-data collections Anne CLEMENCEAU - Eurostat 9th Workshop on Labour Force.

Improved Register Data Matching and its Impact on Survey Population Estimates Steve Vale Office for National Statistics, UK.

Chapter 7 Point Estimation of Parameters. Learning Objectives Explain the general concepts of estimating Explain important properties of point estimators.

KNOMAD MIGRATION COST SURVEY Manolo Abella. Background Much anecdotal evidence that migration costs are high and rising attributed to growing wage differentials.

Application of ESeC in Estonian Social Surveys based on EU-SILC and LFS data Merle Paats Leading Statistician from the Social Statistics Department, Estonia.

1 1 Topics difficult to measure in a register-based census Harald Utne Census Project Statistics Norway UNECE-Eurostat Meeting on Population.

Representativity Indicators for Survey Quality Programme: Cooperation Theme: Socio-economic sciences and Humanities Activity: Socio-economic and scientific.

© John M. Abowd 2005, all rights reserved Assessing Data Quality John M. Abowd April 2005.

Multivariate selective editing via mixture models: first applications to Italian structural business surveys Orietta Luzi, Guarnera U., Silvestri F., Buglielli.

Evolution of Census Statistics on Enterprises in Italy : from the Traditional Census to a Register of Local Units Monica Consalvi, Luigi Costanzo,

Towards a Process Oriented View on Statistical Data Quality Michaela Denk, Wilfried Grossmann.

Hypothesis Testing Steps for the Rejection Region Method State H 1 and State H 0 State the Test Statistic and its sampling distribution (normal or t) Determine.

3-1Forecasting CHAPTER 3 Forecasting McGraw-Hill/Irwin Operations Management, Eighth Edition, by William J. Stevenson Copyright © 2005 by The McGraw-Hill.

SWBAT: Explain how undercoverage, nonresponse, and question wording can lead to bias in a sample survey. Do Now: An airline that wants to assess customer.

As a data user, it is imperative that you understand how the data has been generated and processed…

1 A theoretical framework for register-based statistics --- Can we carry on without it? Li-Chun Zhang Statistics Norway

Drop out statistics EU 2020 and the Labour Force Survey UOE (UNESCO/OECD/Eurostat) data The student register Danish measures of drop out.

On the Optimality of the Simple Bayesian Classifier under Zero-One Loss Pedro Domingos, Michael Pazzani Presented by Lu Ren Oct. 1, 2007.

Surveys Jan 21. Samples and populations Need for sampling Goal: representative sample Methods of sampling – Quota Sampling – Volunteer samples – Probability.

Random Sampling Error and Sample Size are Related

Analysis based on normal distributions

Section 1.5 Bias in Sampling.

Statistical Power.

Presentation transcript:

A comparison of sample and register based survey: the case of labour market data De Gregorio C., Filipponi D., Martini A., Rocchetti I.

Contents Survey(LFS) – ADMIN Strategic issue Previous ESS research Long term innovation process Our purposes Answers and new questions Innovation leverage in several fields

Microdata LFS vs. ADMIN Integration: labour input measurement Definition of employment, Regular vs Irregular First: employment status comparison ADMIN wrt: LFS reference week, Employed and Self-employed.

Our purposes Managing inconsistencies between LFS and ADMIN Measuring Regular and Irregular employment Assessing Accuracy of LFS and ADMIN (Assumed error models, MSE’s derivation and computation, No considered benchmark ) Estimating ADMIN Over-coverage (precision) Estimating ADMIN Under-coverage (irregular) Estimating LFS Under-coverage (understatement)

Our model: LFS sample “True” status REGULAR IRREGULAR NOT EMPLOYED “ADMIN employed” status “LFS employed” status

Inconsistencies REGULARIRREGULAR

Our model Hypotheses (to simplify) –If LFS employed then employed –If True Regular then ADMIN employed –No LFS Non-response or substitution bias –ADMIN exhaustive and with no error –No problems with record linkage Key estimates –Probability of being truly employed if “ADMIN employed” –Rate & number of LFS false negatives –Probability of being truly employed if “LFS not employed” Assume it’s OK!

Compare LFS and ADMIN MSE Error model for LFS employment status (z) given the true employment status (y) Error model for ADMIN employment status (x) and ADMIN under-coverage (irregular employment) ADMIN over-coverage (false employment signal)

MSE by domain LFS ADMIN >95% of total MSE - given “true” employment, population and sample size Linear locus of “low impact” on MSE

LFS MSE: depends on the probability of under-coverage ADMIN MSE : balance of two opposite errors

LFS & ADMIN both have errors LFS has sampling and under-coverage errors Apparently ADMIN performs better, as the sources of errors tend to compensate ADMIN worsens in the domains with higher irregularity rates ADMIN produces higher errors at micro-level For analysis purposes, survey and ADMIN data should be integrated further An efficient usage of exhaustive ADMIN data should count on survey based estimates of actual employment status To conclude