Integrated Approach Processing Marie Brodeur Director General, Industry Statistics Branch, Statistics Canada St. Lucia February, 2014 SNA seminar in the.

Slides:



Advertisements
Similar presentations
Data Quality Assurance and Dissemination International Workshop on Energy Statistics Aguascalientes, Mexico.
Advertisements

Use of Tax Data in the Unified Enterprise Survey (UES) Workshop on Use of Administrative Data in Economics Statistics Marie Brodeur Moscow November, 2006.
1 ESTIMATION IN THE PRESENCE OF TAX DATA IN BUSINESS SURVEYS David Haziza, Gordon Kuromi and Joana Bérubé Université de Montréal & Statistics Canada ICESIII.
1 Sharing best practices for the redesign of three business surveys Charles Tardif, Business Survey Methods Division,Statistics Canada presented at the.
Innovation data collection: Advice from the Oslo Manual South East Asian Regional Workshop on Science, Technology and Innovation Statistics.
Data Imputation United Nations Statistics Division (UNSD) 16 March 2011 Santiago, Chile.
Integrated Business Statistics Program (IBSP) Introduction Daniela Ravindra Director, Enterprise Statistics Division November 9th, 2010.
Some considerations on developing a DWH for SBS estimates Orietta Luzi – Mauro Masselli Istat - Italy march 2013.
Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.
C6 Databases.
Editing And Imputation For Manufacturing Statistics At Statistics Canada Marie Brodeur Director General, Industry Statistics Branch Santiago, Chile March.
Input Data Warehousing Canada’s Experience with Establishment Level Information Presentation to the Third International Conference on Establishment Statistics.
The Many Ways of Improving the Industrial Coding for Statistics Canada’s Business Register Yanick Beaucage ICES III June 2007.
1 Editing Administrative Data and Combined Data Sources Introduction.
An Integrated Approach to Economic Statistics “ The Canadian Experience” UNSD – IBGE Workshop on Manufacturing Statistics Kevin Roberts Rio de Janeiro,
08/08/2015 Statistics Canada Statistique Canada Paradata Collection Research for Social Surveys at Statistics Canada François Laflamme International Total.
Jim Tebrake New York, March 2014 The Integrated Business Statistics Program at Statistics Canada Director General Statistics Canada Statistics Canada Statistique.
André Loranger New York, June 2014 The Integrated Business Statistics Program at Statistics Canada Presentation to the UNCEEA Assistant Chief Statistician.
United Nations Economic Commission for Europe Statistical Division Applying the GSBPM to Business Register Management Steven Vale UNECE
The implementation of tools to support the data quality of the survey frame Mario Ménard November 2008.
The Integrated Approach to Economic Statistics “The Canadian Approach” Friends of the Chair Group on Integrated Economic Statistics Marie Brodeur, Michel.
1 BUSINESS REGISTER CBS-ISRAEL. 2 LEGAL FRAME WORK in 1997 two inter-governmental committees issued: 1. LEGAL ASPECTS 2. PRACTICAL & TECHNICAL ASPECTS.
SNA seminar in the Caribbean Marie Brodeur Director General, Industry Statistics Branch, Statistics Canada St. Lucia February, 2014 Governance of an Integrated.
1 Business Register: Quality Practices Eddie Salyers
The Canadian Integrated Approach to Economic Surveys Marie Brodeur, Peter Koumanakos, Jean Leduc, Éric Rancourt, Karen Wilson Statistics Canada International.
Rudi Seljak, Metka Zaletel Statistical Office of the Republic of Slovenia TAX DATA AS A MEANS FOR THE ESSENTIAL REDUCTION OF THE SHORT-TERM SURVEYS RESPONSE.
1 Presentation to OG6 Canberra, Australia May 2011 Statistical Uses of Administrative Data in Canada.
Use of Administrative Data in Statistics Canada’s Annual Survey of Manufactures Steve Matthews and Wesley Yung May 16, 2004 The United Nations Statistical.
All the answers? Statistics New Zealand’s Integrated Data Infrastructure Paper by Felibel Zabala, Rodney Jer, Jamas Enright and Allyson Seyb Presented.
The Future of Administrative Data ICES III End Panel Discussion Don Royce Statistics Canada June 2007.
Impact of using fiscal data on the imputation strategy of the Unified Enterprise Survey of Statistics Canada Ryan Chepita, Yi Li, Jean-Sébastien Provençal,
Integration of Annual Economic Collections – The Australian Experience ICESIII, Canada, 2007 Presented by Eden Brinkley.
© John M. Abowd 2007, all rights reserved Analyzing Frames and Samples with Missing Data John M. Abowd March 2007.
A Strategy for Prioritising Non-response Follow-up to Reduce Costs Without Reducing Output Quality Gareth James Methodology Directorate UK Office for National.
Statistics Canada Statistique Canada Integrated Questionnaire Design Friends of the Chair Group on Integrated Economic Statistics Marie Brodeur, Michel.
Stop the Madness: Use Quality Targets Laurie Reedman.
National design, fieldwork and data harmonization for Labour Force Survey Irena Svetin Statistical Office of the Republic of Slovenia September 2014.
Prioritizing Follow-up of Non- Respondents Using Scores for the Canadian Quarterly Survey of Financial Statistics for Enterprises Pierre Daoust Statistics.
Prioritizing Follow-up for the Canadian Quarterly Survey of Financial Statistics for Enterprises Pierre Daoust Statistics Canada ICES III, Montréal Statistique.
Use of Administrative Data Seminar on Developing a Programme on Integrated Statistics in support of the Implementation of the SNA for CARICOM countries.
Methodology of Allocating Generic Field to its Details Jessica Andrews Nathalie Hamel François Brisebois ICESIII - June 19, 2007.
Quality Assurance Programme of the Canadian Census of Population Expert Group Meeting on Population and Housing Censuses Geneva July 7-9, 2010.
Editing of linked micro files for statistics and research.
Statistical Expertise for Sound Decision Making Quality Assurance for Census Data Processing Jean-Michel Durr 28/1/20111Fourth meeting of the TCG - Lubjana.
SNA seminar in the Caribbean Integrated questionnaires Marie Brodeur Director General, Industry Statistics Branch, Statistics Canada St. Lucia February,
A Quality Driven Approach to Managing Collection and Analysis
Topic (iii): Macro Editing Methods Paula Mason and Maria Garcia (USA) UNECE Work Session on Statistical Data Editing Ljubljana, Slovenia, 9-11 May 2011.
Developing and applying business process models in practice Statistics Norway Jenny Linnerud and Anne Gro Hustoft.
Experience and response in developing countries: the twinning project with the Tunisian National Statistical Institute Monica Consalvi ISTAT, Division.
United Nations Oslo City Group on Energy Statistics OG7, Helsinki, Finland October 2012 ESCM Chapter 8: Data Quality and Meta Data 1.
RECENT DEVELOPMENT OF SORS METADATA REPOSITORIES FOR FASTER AND MORE TRANSPARENT PRODUCTION PROCESS Work Session on Statistical Metadata 9-11 February.
Unified Enterprise Survey New Horizons International Conference on Establishment Surveys Daniela Ravindra and Marie Brodeur Montreal, June 2007 Statistics.
5.8 Finalise data files 5.6 Calculate weights Price index for legal services Quality Management / Metadata Management Specify Needs Design Build CollectProcessAnalyse.
Public Libraries Survey Data File Overview. What We’ll Talk About PLS: Public Libraries Survey State level data Public library data (Administrative Entities)
Public Libraries Survey Data File Overview. 2 What We’ll Talk About PLS: Public Library Survey State level data Public library data (Administrative Entities)
An Overview of Editing and Imputation Methods for the next Italian Censuses Gianpiero Bianchi, Antonia Manzari, Alessandra Reale UNECE-Eurostat Meeting.
METIS 2011 Workshop Session III – National Implementation of the GSBPM Alice Born and Tim Dunstan Thursday October 6, 2011 Implementation of the GSBPM.
Administrative Data at Statistics Canada – Current Uses and the Way Forward Wesley Yung and Peter Lys, Statistics Canada.
1 Overview of the U.S. Census Bureau’s Business Register Profiling Operations Presented to International Roundtable on Business Survey Frames– Wiesbaden.
R&D statistics in Denmark organization of data collection, and dissemination of R&D statistics.
Metadata models to support the statistical cycle: IMDB
Étienne Saint-Pierre and Serge Godbout, Statistics Canada
An Active Collection using Intermediate Estimates to Manage Follow-Up of Non-Response and Measurement Errors Jeannine Claveau, Serge Godbout and Claude.
Generic Statistical Business Process Model (GSBPM)
Unified Enterprise Survey
Metadata in the modernization of statistical production at Statistics Canada Carmen Greenough June 2, 2014.
Business Register Redesign Technology Strategy Plan
Guidelines on Integrated Economic Statistics
Étienne Saint-Pierre, Statistics Canada
Presentation transcript:

Integrated Approach Processing Marie Brodeur Director General, Industry Statistics Branch, Statistics Canada St. Lucia February, 2014 SNA seminar in the Caribbean

Why A Centralized Process?  Best Practices  Standardization of Processes Cross Survey Comparisons Enterprise Centric Processing/Coherence Analysis  Efficient use of Resources  Transportable Knowledge Across Survey Programs 2

Pre-Grooming Allocation / Estimation Edit & Imputation Records from Collection Data Service Center Subject Matter Review & Correction Tool Tax Data Business Register UES Post-Collection Processing 3

Collection  Precontact (Dec-Jan) –Mostly for Business Register (BR) births; verification of contact information (name, address, …) –By phone (in a few cases, a letter or a fact sheet is sent)  Mail-out of questionnaires (Jan-March) –2 or 3 mail-out dates  Follow-up in case of non-response for some units (begins about a month after mail-out) –Phone call, r or fax  Mail-back of questionnaires  Verifications of received questionnaires / Edits –Is the questionnaire complete or are some key variables missing? (Edit follow-up by phone in some cases) 4

Centralized Collection Mailout Pre-Contact Edit / Verification Receipt (75% target) Delinquent Follow-Up Capture / Imaging “Clean” Records Prioritize 5

Use Of Tax Data  Validation (comparison)  Verify dubious collected data against the equivalent tax data record  Imputation  One of the methods used for non-response  Estimation  Direct Data Replacement  Calibration Estimates  Update Business Register  Allocation of survey data (use tax revenues, salaries and expenses)

 Develop centralized systems Move away from stand-alone Single point of access for security  Integrated Questionnaire Metadata System  Edit and imputation  Allocation and Estimation  Data Warehouse Centralized Processing Systems And Databases

Enterprise Portfolio Managers  Top 350 enterprises in Canada  Status Platinum, Gold, Silver, Bronze  Personal visits  Enterprise Profiling  Coordination of mail-out and collection  Enterprise/ Establishment coherence  Holistic Response Management Strategic Response Unit Escalation Process / Statistics Act 8

What Is E & I?  Editing Verify that parts add-up to total Ensure that there are no missing values where parts add up to total There must be consistency between related variables  Imputation Changing values in fields which fail edit rules with a view to ensuring that the resulting data satisfy all edit rules. In practice, reported data will rarely be changed Impute for missing data or partially responded data Impute entire records in the case of total non- response 9

Why Is E&I Necessary?  To produce a complete and consistent data file that accounts for all sampled units  Both units that did not respond to the survey must be imputed and units that did not provide a complete response must be imputed  Correct erroneous responses 10

E&I Terminology  Data Group Groupings (defined by SM) of records that will be kept together for imputation purposesGroupings (defined by SM) of records that will be kept together for imputation purposes These groupings are based on multi dimensions:These groupings are based on multi dimensions:  industry (NAICS)  geography (province)  Data groups that will be used for a specific survey will depend on: initial sample design (number of units sampled and the level of stratification used)initial sample design (number of units sampled and the level of stratification used) number of records that respond to the survey (a minimum of 5 or 10 records are required)number of records that respond to the survey (a minimum of 5 or 10 records are required) 11

BANFF E & I System  Impute for missing key variables as specified by subject matter (i.e. total revenue, total expenses)  Impute for other missing variables: Apply Historical Trend Apply Current Year Trend Use donor (for partial imputation) 12

BANFF Algorithms  DIFTREND - Historical trend imputation  CURRATIO - Current ratio imputation  PREVALUE – Value from the previous period for the same unit is imputed  PREAUX – Historical value of a proxy variable for the same unit  CURAUX – Current value of a proxy variable for the same unit 13

Allocation - Definition & Purpose Definition:  Allocation is the distribution of survey and administrative data from their acquisition level (Collection Entity) to the targeted statistical units (Establishments or Locations) as defined on the survey frame. Purpose:  To provide fully-processed micro data on a fiscal year basis, for establishments or locations in-sample for the UES  Determine the distribution of value added by province 14

Sample Survey Allocation 15

11/12/2015 Statistics Canada Statistique Canada 16 Multi-Mode Collection Quality Indicators and Scores Follow-Up Editing Imputation Estimation Sampling Rolling Estimates Interpretation & Dissemination Automated Processing Active Management Manual Editing Overview of the IBSP Rolling Estimates Approach

11/12/2015 Statistics Canada Statistique Canada 17 Active Management – Strategy Settings  A subset of all Key Estimates is selected  All Key Estimates are: Ranked from the most to the least important Weighted relatively using an importance factor Assigned a Quality Target  Targets are set in line with the importance factor.  Active Collection ends for a Key Estimate when the Quality Indicator meets the Quality Target.  Active management and sampling strategies are coherent by design.

 Quality Indicator (QI) QI= Sampling CV & Imputation CV & Pseudo Relative Bias  Measure of Impact (MI) Score Impact of a unit on the QI for a given estimate Units imputed from a poor model or with reported/imputed values far from their predicted values will have high MIs. 11/12/2015 Statistics Canada Statistique Canada 18 Active Management – Definitions

 Parallel run for 47 Business Surveys  Four Rolling Estimates iterations  Total CV calculated for all key estimates (8,600) at each iteration 11/12/2015 Statistics Canada Statistique Canada 19 Empirical Study – RY2011 Prototype

11/12/2015 Statistics Canada Statistique Canada 20