An Overview of Editing and Imputation Methods for the next Italian Censuses Gianpiero Bianchi, Antonia Manzari, Alessandra Reale UNECE-Eurostat Meeting.

Presentation transcript:

An Overview of Editing and Imputation Methods for the next Italian Censuses
Gianpiero Bianchi, Antonia Manzari, Alessandra Reale
UNECE-Eurostat Meeting on Population and Housing Censuses, Geneva, May 2008

Outline
• Features of 2001 E&I strategy
• E&I strategy for 2011 Census
• Likely innovations for 2011 Census
• Impact on editing and validation procedures
• Conclusions

Features of 2001 E&I strategy
• Main E&I purpose: provide a complete and consistent set of data by performing plausible imputations and preserving the maximum amount of collected information
• E&I strategy: divide the E&I problem into simpler sub-problems and find appropriate solutions for each of them
• Overall E&I process composed of several (connected) procedures, each addressing a specific problem and implementing suitable methods
• Development and use of new techniques and software tools

E&I strategy for 2011 Census
• Built on the useful experience of the 2001 Census, taking account of:
  – The innovations in the survey design
  – Eurostat timeliness constraints
In particular:
• Census variables split into topics processed in a predetermined order (first demographic, then socio-economic) by appropriate procedures
• Adaptation of 2001 procedures to the innovations and development of new procedures by means of highly efficient algorithms
• Proper planning, implementation and management of the E&I procedures

Main elements of the 2011 strategy
• Use of the DIESIS* system, developed in 2001 by ISTAT and academic researchers (Department of Computer and Systems Science of the University of Roma “La Sapienza”). Based on optimization techniques, it allows:
  – Treatment of qualitative and quantitative variables
  – Between-unit and within-unit edit rules
  – Joint use of data-driven and minimum change approaches
• DIESIS will process the 2011 demographic variables and, probably, some socio-economic variables
* Data Imputation and Edit System - Italian Software

Main elements of the 2011 strategy
• Joint use of data-driven and minimum change approaches by the DIESIS system
• When the pool of donors is small, the data-driven approach can require imputing too many values
• The minimum change approach is used to minimize the number of values to be changed
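The interplay between the two approaches can be illustrated with a small Python sketch. It is not the DIESIS implementation, which relies on optimization techniques. It is a brute-force toy under stated assumptions: the function names, the record layout, and the single edit rule ("a married person must be at least 16") are all hypothetical. Imputed values are copied from one donor (data-driven), and the search prefers the smallest number of changed fields (minimum change), breaking ties by donor closeness.

```python
from itertools import combinations

# Hypothetical edit rule: a married person must be at least 16 years old.
EDITS = [lambda r: not (r["marital"] == "married" and r["age"] < 16)]

def passes(record, edits):
    """True if the record satisfies every edit rule."""
    return all(rule(record) for rule in edits)

def distance(a, b):
    """Number of fields on which two records disagree."""
    return sum(1 for field in a if a[field] != b[field])

def impute(record, donors, edits):
    """Copy values from a single donor, changing as few fields as possible.

    Returns (imputed_record, changed_fields), or (None, ()) if no donor helps.
    """
    if passes(record, edits):
        return dict(record), ()
    ranked = sorted(donors, key=lambda d: distance(record, d))
    fields = list(record)
    for size in range(1, len(fields) + 1):      # minimum change: fewest fields first
        for donor in ranked:                    # data-driven: closest donor first
            differing = [f for f in fields if record[f] != donor[f]]
            for subset in combinations(differing, size):
                candidate = dict(record)
                for f in subset:
                    candidate[f] = donor[f]
                if passes(candidate, edits):
                    return candidate, subset
    return None, ()

record = {"age": 5, "marital": "married", "sex": "F"}
donors = [
    {"age": 34, "marital": "married", "sex": "F"},
    {"age": 6, "marital": "single", "sex": "M"},
]
imputed, changed = impute(record, donors, EDITS)
print(imputed, changed)
```

In this example only `age` is changed, taken from the closest donor. A purely data-driven step that copied every differing field from a distant donor would overwrite more of the collected information.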

Main elements of the 2011 strategy
• Identification of the respondent path
• Respondent paths are used to:
  – Compute the Subset of Admissible Values (SAV) of Year of birth, a stratification variable for the imputation of demographic variables
  – Connect the demographic and socio-economic steps
  – Define strata for the imputation of socio-economic variables
• Missing responses or errors can make the identification of the right respondent path uncertain
• Automatic procedure for the identification of the most likely path, based on the analysis of the responses given to filter and dependent questions
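One simple way to picture the identification of the most likely path is to score each admissible path by how many filter and dependent questions are answered the way that path expects. The Python sketch below is only an illustration of that idea. The question names, the two paths, and the scoring rule are assumptions, not the procedure actually used by ISTAT.

```python
ANSWERED = object()  # marker: the path expects some response to this question
BLANK = object()     # marker: the path expects the question to be skipped

# Hypothetical paths through one filter question ("worked") and two
# dependent questions ("occupation", "job_search").
PATHS = {
    "employed": {"worked": "yes", "occupation": ANSWERED, "job_search": BLANK},
    "not_employed": {"worked": "no", "occupation": BLANK, "job_search": ANSWERED},
}

def matches(value, expected):
    """True if a response (None = missing) agrees with what a path expects."""
    if expected is ANSWERED:
        return value is not None
    if expected is BLANK:
        return value is None
    return value == expected

def most_likely_path(responses, paths=PATHS):
    """Return (best_path, scores); ties go to the path listed first."""
    scores = {
        name: sum(matches(responses.get(q), exp) for q, exp in pattern.items())
        for name, pattern in paths.items()
    }
    return max(scores, key=scores.get), scores

# The filter question is missing, but the dependent answers point to "employed".
responses = {"worked": None, "occupation": "teacher", "job_search": None}
print(most_likely_path(responses))
```

Here the filter question is missing, yet the responses to the dependent questions still single out one path, which could then be used to define the imputation strata.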

Main elements of the 2011 strategy
• Validation of Person 1 in the household
  – Based on optimization techniques implemented in the DIESIS system
  – The minimum change algorithm assigns the role of Person 1 to the person who minimizes the number of changes needed for the record to be consistent
• Identification of potential couples
  – Members of couples with a non-unique relationship to Person 1 are identified prior to editing
  – Score based on the responses provided to the demographic variables
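The minimum change idea behind the validation of Person 1 can be sketched as follows: try each household member as Person 1, count how many values would have to change for the household record to be consistent, and keep the cheapest candidate. The Python toy below makes several assumptions that are not in the source: the relationship codes, the two thresholds, and the handful of consistency rules are hypothetical, and the real procedure solves an optimization problem in DIESIS rather than enumerating candidates.

```python
MIN_AGE_PERSON1 = 18     # hypothetical minimum age for Person 1
MIN_GENERATION_GAP = 12  # hypothetical minimum parent-child age gap

def changes_needed(household, idx):
    """Count values that must change if member `idx` is taken as Person 1."""
    head = household[idx]
    changes = 0
    if head["rel"] != "ref":              # candidate must be recoded as Person 1
        changes += 1
    if head["age"] < MIN_AGE_PERSON1:     # candidate's age would be inconsistent
        changes += 1
    for j, person in enumerate(household):
        if j == idx:
            continue
        if person["rel"] == "ref":        # only one Person 1 is allowed
            changes += 1
        elif person["rel"] == "child" and head["age"] - person["age"] < MIN_GENERATION_GAP:
            changes += 1
        elif person["rel"] == "parent" and person["age"] - head["age"] < MIN_GENERATION_GAP:
            changes += 1
    return changes

def choose_person1(household):
    """Return (index of the cheapest candidate, list of costs per member)."""
    costs = [changes_needed(household, i) for i in range(len(household))]
    best = min(range(len(household)), key=costs.__getitem__)
    return best, costs

# A household in which nobody was recorded as Person 1 ("ref").
household = [
    {"rel": "child", "age": 47},
    {"rel": "spouse", "age": 45},
    {"rel": "child", "age": 20},
    {"rel": "child", "age": 17},
]
print(choose_person1(household))  # (0, [1, 2, 3, 4])
```

In this household the 47-year-old is selected, because recoding that single relationship value makes every other declared relationship consistent.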

Main elements of the 2011 strategy
• Special care in the E&I of small but important groups in the population, e.g. validation of centenarians
• 2001 procedure:
  – Automatic matching of individuals enumerated in the 2001 Census with the same individuals enumerated in the 1991 Census
  – Automatic check of the internal consistency of unlinked records
  – Manual check of some ambiguous cases against the questionnaire images
• New procedure supported by the availability of local population registers

Likely innovations for 2011
• Short-long form questionnaires
  – Short: (mainly) demographic variables
  – Long: demographic and socio-economic variables
• Availability of registers
  – Local population registers (residing individuals)
  – Integrative registers from auxiliary sources
  – Residential address lists
• Use of multi-mode data collection
  – Enumerators, CATI, mail, web

Impact on E&I and validation
• Socio-economic characteristics collected on a sample basis (by long form)
  – Two procedures for computing the SAV of Year of birth (one for the short form, one for the long form)
  – The reduced pool of donors for the imputation of long-form variables requires careful management of the data collection and donor pool selection phases
  – Sampling weights required for data validation after E&I of long-form variables

Impact on E&I and validation
• Availability of registers:
  – Improvement of the quantitative control of the forms
  – Imputation of missing or inconsistent census values by matching census data and register data (record linkage procedure), which relies on:
      – availability of unique record identifiers
      – same reference time as the census data
      – good quality of register data
  – Imputation of missing or inconsistent census values by adding register data to census data, thus enlarging the donor pool
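When unique record identifiers are available, imputation by record linkage reduces to an exact-key lookup. The Python sketch below shows that simplest case only. The field names (`person_id`, `birth_year`), the validity check, and the function names are assumptions made for illustration, and the sketch does not address reference-time alignment or register quality.

```python
def is_valid(field, value):
    """Hypothetical validity check; only birth_year has a range rule here."""
    if value is None:
        return False
    if field == "birth_year":
        return isinstance(value, int) and 1890 <= value <= 2011
    return True

def impute_from_register(census_records, register, fields):
    """Replace missing or invalid census values with valid register values.

    Records are linked on the (assumed) unique identifier "person_id".
    Census values that are already valid are never overwritten.
    """
    register_by_id = {r["person_id"]: r for r in register}
    result = []
    for rec in census_records:
        fixed = dict(rec)
        match = register_by_id.get(rec["person_id"])
        if match is not None:
            for field in fields:
                if not is_valid(field, fixed.get(field)) and is_valid(field, match.get(field)):
                    fixed[field] = match[field]
        result.append(fixed)
    return result

census = [
    {"person_id": 1, "birth_year": None, "sex": "F"},
    {"person_id": 2, "birth_year": 1875, "sex": None},
    {"person_id": 3, "birth_year": 1980, "sex": "M"},
]
register = [
    {"person_id": 1, "birth_year": 1950, "sex": "F"},
    {"person_id": 2, "birth_year": 1975, "sex": "M"},
]
print(impute_from_register(census, register, ["birth_year", "sex"]))
```

Unlinked records (person 3 above) are left untouched and would still go through the donor-based procedures.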

Impact on E&I and validation
• Use of multi-mode data collection
  – Improvement in the quality of the collected data, due to editing performed at the data capture stage (CATI, web)
  – A procedure to detect duplicate questionnaires is required
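A duplicate check of this kind can be sketched by grouping the received questionnaires on a household identifier and flagging any household with more than one questionnaire, for example one returned by mail and one completed on the web. The Python snippet below is a minimal illustration. The field names `household_id` and `mode` are assumptions, and how to resolve a flagged duplicate is left open.

```python
from collections import defaultdict

def find_duplicates(questionnaires):
    """Map each household_id with more than one questionnaire to its modes."""
    by_household = defaultdict(list)
    for q in questionnaires:
        by_household[q["household_id"]].append(q["mode"])
    return {hid: modes for hid, modes in by_household.items() if len(modes) > 1}

received = [
    {"household_id": "H001", "mode": "mail"},
    {"household_id": "H002", "mode": "web"},
    {"household_id": "H001", "mode": "web"},
    {"household_id": "H003", "mode": "CATI"},
]
print(find_duplicates(received))  # {'H001': ['mail', 'web']}
```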

Conclusions
• E&I strategy for the 2011 Census based on the 2001 experience
• The new survey design aims to reduce the respondent burden, but requires careful monitoring during production and a more complex E&I process
• Highly efficient procedures need to be developed in order to meet the timeliness requirement
• E&I is an achievable but hard task