UNECE Work Session on Statistical Data Editing Vienna April 2008 Topic ii – Editing Administrative Data and Combined Sources.

Slides:



Advertisements
Similar presentations
Statistics NZs experience in using Administrative Data in an Integrated Programme of Economic Vince Galvin General Manager Strategy & Communications.
Advertisements

Paul Smith Office for National Statistics
Some considerations on developing a DWH for SBS estimates Orietta Luzi – Mauro Masselli Istat - Italy march 2013.
Migration of a large survey onto a micro-economic platform Val Cox April 2014.
Module B-4: Processing ICT survey data TRAINING COURSE ON THE PRODUCTION OF STATISTICS ON THE INFORMATION ECONOMY Module B-4 Processing ICT Survey data.
Input Data Warehousing Canada’s Experience with Establishment Level Information Presentation to the Third International Conference on Establishment Statistics.
Quality Guidelines for statistical processes using administrative data European Conference on Quality in Official Statistics Q2014 Giovanna Brancato, Francesco.
Results and next steps from the ESSnet Admin Data Alison Pritchard Business Outputs & Developments, Office for National Statistics, UK 4 December 2012.
1 Editing Administrative Data and Combined Data Sources Introduction.
Unido.org/statistics Analysis of Divergence of quarterly and Annual Index of Industrial Production Shyam Upadhyaya, Shohreh Mirzaei Yeganeh United Nations.
Trade and business statistics: use of administrative data Lunch Seminar Enrico Giovannini Italian National Statistical Institute (ISTAT) New York, February,
E&I of administrative data used for producing business statistics Vera Costa, Frances Krsinich, Rudi Van der Mescht 2008 UNECE Work Session on Statistical.
Use of administrative data in statistics - challenges and opportunities ICES III End Panel Discussion Montreal, June 2007 Heli Jeskanen-Sundström Statistics.
Vienna, 23 April 2008 UNECE Work Session on SDE Topic (v) Editing on results (post-editing) 1 Topic (v): Editing based on results Discussants: Maria M.
Role of editing and imputation in integration of sources for structural business statistics Svein Gåsemyr, Statistics Norway Svein Nordbotten, University.
Eurostat Repeated surveys. Presented by Eva Elvers Statistics Sweden.
Eurostat Statistical Data Editing and Imputation.
Combining administrative and survey data: potential benefits and impact on editing and imputation for a structural business survey UNECE Work Session on.
Administrative Data at Statistics Canada – Current Uses and the Way Forward 27 th Voorburg Group Meeting Warsaw, Poland André Loranger October 4, 2012.
Using survey data collection as a tool for improving the survey process Silvia Biffignandi, Antonio Laureti Giulio Perani University of Bergamo Istat Istat.
Carmela Pascucci – Istat - Italy Meeting of the Working Party on International Trade in Goods and Trade in Services Statistics (WPTGS) Linking business.
Joint UNECE/Eurostat Meeting on Population and Housing Censuses (28-30 October 2009) Accuracy evaluation of Nuts level 2 hypercubes with the adoption of.
Rudi Seljak, Metka Zaletel Statistical Office of the Republic of Slovenia TAX DATA AS A MEANS FOR THE ESSENTIAL REDUCTION OF THE SHORT-TERM SURVEYS RESPONSE.
Dutch Virtual Census Presentation at the International Seminar on Population and Housing Censuses; Beyond the 2010 Round November, 2012 Egon Gerards,
12th Meeting of the Group of Experts on Business Registers
Q2010, Helsinki Development and implementation of quality and performance indicators for frame creation and imputation Kornélia Mag László Kajdi Q2010,
1 Presentation to OG6 Canberra, Australia May 2011 Statistical Uses of Administrative Data in Canada.
Use of Administrative Data in Statistics Canada’s Annual Survey of Manufactures Steve Matthews and Wesley Yung May 16, 2004 The United Nations Statistical.
Record matching for census purposes in the Netherlands Eric Schulte Nordholt Senior researcher and project leader of the Census Statistics Netherlands.
Quality issues on the way from survey to administrative data: the case of SBS statistics of microenterprises in Slovakia Andrej Vallo, Andrea Bielakova.
Eurostat Overall design. Presented by Eva Elvers Statistics Sweden.
Deliverable 2.6: Selective Editing Hannah Finselbach 1 and Orietta Luzi 2 1 ONS, UK 2 ISTAT, Italy.
Copyright 2010, The World Bank Group. All Rights Reserved. Managing processes Core business of the NSO Part 2 Strengthening Statistics Produced in Collaboration.
Topic (ii): New and Emerging Methods Maria Garcia (USA) Jeroen Pannekoek (Netherlands) UNECE Work Session on Statistical Data Editing Paris, France,
The new multiple-source system for Italian Structural Business Statistics based on administrative and survey data Orietta Luzi, Ugo Guarnera, Paolo Righi.
Cristina Casciano, Viviana De Giorgi, Filippo Oropallo Istat Division for Structural Business Statistics, Agriculture, Foreign Trade and Consumer Prices.
for statistics based on multiple sources
Use of Administrative Data Seminar on Developing a Programme on Integrated Statistics in support of the Implementation of the SNA for CARICOM countries.
Session topic (iii) – Editing and Imputation in the context of data integration from multiple sources and mixed modes Discussants Felipa Zabala, Orietta.
Editing of linked micro files for statistics and research.
CBS-SSB STATISTICS NETHERLANDS – STATISTICS NORWAY Work Session on Statistical Data Editing Oslo, Norway, September 2012 Jeroen Pannekoek and Li-Chun.
© Federal Statistical Office Germany, Division IB, Institute for Research and Development in Federal Statistics Sheet 1 Surveys, administrative data or.
The challenge of a mixed-mode design survey and new IT tools application: the case of the Italian Structure Earning Surveys Fabiana Rocci Stefania Cardinleschi.
1/10 Editing Strategies for VAT Data Peter Kruiskamp.
Topic (iii): Macro Editing Methods Paula Mason and Maria Garcia (USA) UNECE Work Session on Statistical Data Editing Ljubljana, Slovenia, 9-11 May 2011.
Pilot Census in Poland Some Quality Aspects Geneva, 7-9 July 2010 Janusz Dygaszewicz Central Statistical Office POLAND.
1 For a Population Statistical Register Characteristics and Potentials for the Official Statistics Central department for administrative data and archives.
Integrated Approach Processing Marie Brodeur Director General, Industry Statistics Branch, Statistics Canada St. Lucia February, 2014 SNA seminar in the.
Topic (i): Selective editing / macro editing Discussants Orietta Luzi - Italian National Statistical Institute Rudi Seljak - Statistical Office of Slovenia.
Experience and response in developing countries: the twinning project with the Tunisian National Statistical Institute Monica Consalvi ISTAT, Division.
STS Compilation with Multiple Data Sources Anu Peltola Economic Statistics Section, UNECE UNECE Workshop on Short-Term Statistics (STS) and Seasonal Adjustment.
New and Emerging Methods UN/ECE Work Session on Statistical Data Editing Vienna April 21-23, 2008.
Ph. Brion Insee Redesigning French structural business statistics: first methodological studies Bonn, september 2006.
Administrative Data at Statistics Canada – Current Uses and the Way Forward Wesley Yung and Peter Lys, Statistics Canada.
COMBINING SURVEY AND ADMINISTRATIVE DATA IN THE ITALIAN EU-SILC EXPERIENCE: POSITIVE AND CRITICAL ASPECTS National Institute of Statistics - Italy Claudio.
How to deal with quality aspects in estimating national results Annalisa Pallotti Short Term Expert Asa 3st Joint Workshop on Pesticides Indicators Valletta.
4-6 September 2013, Vilnius Quality in Statistics: Administrative Data and Official Statistics USING ADMINISTRATIVE DATA SOURCES IN OFFICIAL.
Methods for Data-Integration
Implementation of Quality indicators for administrative data
Theme (v): Managing change
Theme (i): New and emerging methods
Estimation methods for the integration of administrative sources
Estimation methods for the integration of administrative sources
Quality Aspects and Approaches in Business Statistics
Prague EU-SILC Best Practice Workshop, 14th and 15th September 2017
Goals and objectives of Work package 2 of the ESSnet on Consistency of concepts and applied methods of business and trade-related statistics Norbert Rainer,
Italian situation in the following areas:
Parallel Session: BR maintenance Quality in maintenance of a BR:
The Swedish survey on turnover in the service sector
DIAGNOSTIC FRAMEWORK: National Accounts and Supporting Statistics
Presentation transcript:

UNECE Work Session on Statistical Data Editing Vienna April 2008 Topic ii – Editing Administrative Data and Combined Sources

Introduction Statistical Agencies rely on administrative data to improve the quality of statistics, reduce costs and response burden Administrative data are not originally designed for use as statistical data and need to undergo extensive processing and editing In recent years, more emphasis on the use of tax data to augment business statistics and register data to augment social and economic data

Introduction Combining multiple sources of data presents new challenges: ensuring quality in line with statistical standards and coherence across different sources. Papers cover: –Effective methods for adjusting administrative data to statistical use –Improving the usability of business and population registers –Construction of quality statistical databases using effective E&I strategies which ensure correct coverage, consistent and clean records –Enhancing the quality and efficiency of estimates from surveys

Introduction Papers: –WP.8Italy: The editing process in the Italian short- term survey on Labour Cost based on administrative data –WP.9 New Zealand: E&I of administrative data used for producing business statistics –WP.10 Norway: Role of edit and imputation in integration of sources for structural business statistics –WP.11 Norway: Prediction and imputation in ISEE: tools for more efficient use of combined data sources

Introduction Papers: –WP.12 Austria: Quality of administrative data – a challenge for the maintenance of the statistical business register –WP.13 France: The future system of French structural business statistics: the role of the estimates –WP.14 Italy: Combining survey and administrative data in the Italian EU-SILC experience: positive and critical aspects –WP.15 Netherlands: Editing Strategies for VAT Data

Presentations

The editing process in the Italian short-term survey on labour cost based on administrative data M. Carla Congia, Silvia Pacini and Donatella Tuzi – Italian National Statistical Institute (Istat) Steps in process: –Preliminary checks on administrative data and retrieval of statistical variables –Micro data editing (cross sectional and longitudinal checks) –Imputation of eligible unit non-responses –Large enterprise checks and combination with survey data –Macro editing based on time series analysis

The editing process in the Italian short-term survey on labour cost based on administrative data Interesting points –Integration of administrative and survey data and identifying errors –Combining many processes in an integrated setting –Recognition of the importance of metadata for administrative data – changes in concepts and definitions –Macro editing using time series methods for automated detection of outliers

E&I of administrative data used for producing business statistics Vera Costa, Frances Krsinich and Rudi Van der Mescht – Statistics New Zealand Challenges with using administrative data from the private sector –Electronic Card Transactions Data obtained as aggregated data from switch companies –Must rely on companies for ensuring quality data –Discussion of time series models for identifying outliers and carrying out imputation

E&I of administrative data used for producing business statistics Building a longitudinal business database –Integrating survey data, tax data and business sampling frame using a deterministic record linkage process –Donor imputation for missing/erroneous data from tax files –Expanding methods of imputation to take into account historical values and other fine-tuning mechanisms Interesting points –Advantages and disadvantages of time series methods for macro editing on aggregated administrative data –Need to consider practicality and feasibility for large scale production systems when analyzing imputation methods

The Role of E&I in integration of sources for structural business statistics Svein Gasemyr, Svein Nordbotten and Morton Anderson – Statistics Norway Integrated longitudinal business database from multiple sources –Estimate enterprise accounts distribution for complex enterprises –Aggregations from Job files –Imputation of input and output production variables, imputation of non-response and out of survey units –Corrections to enhance record linkage –Need for more computer based methods and support for editing

The Role of E&I in integration of sources for structural business statistics Standardized modules for editing and estimation –Imputation and estimation carried out interactively –Inspect effect of changed values on the estimates Interesting points –Quality information for integrated databases as opposed to single source databases with emphasis on errors in linking data and inconsistencies

Prediction and imputation in ISEE: tools for more efficient use of combined data sources Li-Chun Zhang and Svein Nordbotten – Statistics Norway Standardization of data processing for combined data sources –Editing individual data –Estimation of population parameters Integrating multiple sources by constructing a complete population data file –Imputation for non-response and out of sample –Nearest neighbour imputation method with restrictions on totals

Prediction and imputation in ISEE: tools for more efficient use of combined data sources Interesting points –Good review and discussion of imputation methods –Innovative new method for imputing out of sample units –Development of a generic statistical application

The Future System of French Structural Business Statistics: the Role of Estimates Philippe Brion - INSEE Combining administrative sources and a statistical survey –Breakdown of turnover and NACE code only available in the sample Analysis of statistical estimates produced by mass-imputation versus weighting –Imputation of APE code for out of sample units can be biased –Weights calibrated to 3-digit APE code with adjustments based on the survey outcome at the 4-digit level

The Future System of French Structural Business Statistics: the Role of Estimates Interesting points –Good discussion of advantages and disadvantages of mass imputation versus weighting to obtain population estimates –Consideration of editing strategies: micro edits and selective editing based on scores and “jack-knifed” ratios

Combining Survey and Administrative data in the Italian EU-SILC Experience Claudio Ceccarelli, Lucia Coppola, Andrea Cutillo and Davide Di Laurea Use of administrative data in the social survey EU- SILC –Tracking individuals for a longitudinal survey –Linking tax registers to reduce impact of item non- response and other measurement errors (recall effects, telescoping, etc.) Problems related to timeliness and comparability of data sources –Need for integrated processing systems, understanding of complexities, more time to process data

Combining Survey and Administrative data in the Italian EU-SILC Experience Interesting points –A good discussion is provided on the advantages and disadvantages of incorporating administrative data at different stages of the survey process –Interesting analysis of estimation methods for calculating survey weights

Quality of Administrative Data – a challenge for the maintenance of the Statistical Business Register Norbert Rainer – Statistics Austria Main administrative data sources for the Business Register Quality issues –Linking data sources –Different definitions –Continuity procedures –Missing data Improvement strategies

Quality of Administrative Data – a challenge for the maintenance of the Statistical Business Register Interesting points –Data issues leading to a need for using imputation Data available only for a higher aggregation level within the business Timeliness Annual data only, when monthly data is needed Not all activities covered Data available only for enterprises above a certain threshold

Editing Strategies for VAT Data Peter Kruiskamp – Statistics Netherlands VAT data –Current use: auxiliary variable –Future use: source for turnover data (small / medium businesses) Editing strategy –Micro editing on the fiscal units level –Data handling on the statistical units level –Macro editing on the aggregates level

Editing Strategies for VAT Data Data used for the production of the Short-Term Statistics – data frequency Interesting points –Consideration of time series model for estimating VAT to overcome seasonal effects –Discussion of cut off points for identifying outliers –Need to move from using VAT data as auxiliary versus VAT data as a source of data

Questions for discussion Editing Administrative Data –Combine collections then edit versus edit each collection then combine –Editing / imputation of back data –Keeping track of changes in administrative data definitions –Parameters for outlier detection under multiple sources of data –Use of time series models to identify outliers and impute/estimate unit record values: advantages and disadvantages –Automation of macro editing when a large number of series are produced

Questions for discussion Assessing quality –New statistical tools/methods for assessing coherency between sources and linking errors –Types of quality indicators for integrated databases –The impact of the timeliness of the data sources on the quality of the data –Does the need for practical and feasible production systems reduce the quality of the data –Variance estimation in combined data sources, especially when an auxiliary is an estimate or when massive imputation is carried out

Questions for discussion Weighting versus mass-imputation –Most papers opted for mass imputation and square datasets and a few papers opted for weighting – pros and cons –Methods for building “square” datasets when linking administrative sources to survey data

Thank you for your attention