Module B-4: Processing ICT survey data TRAINING COURSE ON THE PRODUCTION OF STATISTICS ON THE INFORMATION ECONOMY Module B-4 Processing ICT Survey data.

Slides:



Advertisements
Similar presentations
Calculation of Sampling Errors MICS3 Data Analysis and Report Writing Workshop.
Advertisements

1 ESTIMATION IN THE PRESENCE OF TAX DATA IN BUSINESS SURVEYS David Haziza, Gordon Kuromi and Joana Bérubé Université de Montréal & Statistics Canada ICESIII.
Innovation data collection: Advice from the Oslo Manual South East Asian Regional Workshop on Science, Technology and Innovation Statistics.
Innovation Surveys: Advice from the Oslo Manual South Asian Regional Workshop on Science, Technology and Innovation Statistics Kathmandu,
Innovation Surveys: Advice from the Oslo Manual National training workshop Amman, Jordan October 2010.
Data Imputation United Nations Statistics Division (UNSD) 16 March 2011 Santiago, Chile.
Introduction Simple Random Sampling Stratified Random Sampling
Unido.org/statistics International workshop on industrial statistics 8 – 10 July, Beijing Non response in industrial surveys Shyam Upadhyaya.
SAMPLING METHODS OR TECHNIQUES
Some considerations on developing a DWH for SBS estimates Orietta Luzi – Mauro Masselli Istat - Italy march 2013.
Designing and ICT Business Survey
Harvard Center for Population and Development Studies1 Census Editing and the Art of Motorcycle Maintenance Michael J. Levin Center for Population and.
9. Weighting and Weighted Standard Errors. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
Estimates and sampling errors for Establishment Surveys International Workshop on Industrial Statistics Beijing, China, 8-10 July 2013.
A Robust, Optimization-Based Approach for Approximate Answering of Aggregate Queries By : Surajid Chaudhuri Gautam Das Vivek Narasayya Presented by :Sayed.
MISUNDERSTOOD AND MISUSED
Credibility: Evaluating what’s been learned. Evaluation: the key to success How predictive is the model we learned? Error on the training data is not.
© John M. Abowd 2005, all rights reserved Analyzing Frames and Samples with Missing Data John M. Abowd March 2005.
Sampling.
Chapter 10 Sampling and Sampling Distributions
Chapter 17 Additional Topics in Sampling
Documentation and survey quality. Introduction.
11 Populations and Samples.
Stratified Simple Random Sampling (Chapter 5, Textbook, Barnett, V
STAT 4060 Design and Analysis of Surveys Exam: 60% Mid Test: 20% Mini Project: 10% Continuous assessment: 10%
FINAL REPORT: OUTLINE & OVERVIEW OF SURVEY ERRORS
Edit and Imputation of the 2011 Abu Dhabi Census Glenn Hui and Hanan AlDarmaki Statistics Centre - Abu Dhabi UNECE CES Work Session on Statistical Data.
Data Editing United Nations Statistics Division (UNSD) 16 March 2011 Santiago, Chile.
Environment Change Information Request Change Definition has subtype of Business Case based upon ConceptPopulation Gives context for Statistical Program.
Sample Design.
Arun Srivastava. Types of Non-sampling Errors Specification errors, Coverage errors, Measurement or response errors, Non-response errors and Processing.
Copyright 2010, The World Bank Group. All Rights Reserved. PROCESSING, Part 1 Data capture, editing, imputation and tabulation Quality assurance for census.
Eurostat Statistical Data Editing and Imputation.
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 13.
Chapter 2: Statistics of One Variable
Copyright 2010, The World Bank Group. All Rights Reserved. Business statistics surveys 3. Data processing 1 Business statistics and registers.
Copyright 2010, The World Bank Group. All Rights Reserved. Estimation and Weighting, Part I.
Rudi Seljak, Metka Zaletel Statistical Office of the Republic of Slovenia TAX DATA AS A MEANS FOR THE ESSENTIAL REDUCTION OF THE SHORT-TERM SURVEYS RESPONSE.
Q2010, Helsinki Development and implementation of quality and performance indicators for frame creation and imputation Kornélia Mag László Kajdi Q2010,
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
© John M. Abowd 2007, all rights reserved Analyzing Frames and Samples with Missing Data John M. Abowd March 2007.
Chapter 18 Additional Topics in Sampling ©. Steps in Sampling Study Step 1: Information Required? Step 2: Relevant Population? Step 3: Sample Selection?
European Conference on Quality in Official Statistics Session 26: Quality Issues in Census « Rome, 10 July 2008 « Quality Assurance and Control Programme.
Editing a Mixture of Canadian 2006 Census and Tax Data Mike Bankier Statistics Canada 2006 Work Session on Statistical Data Editing
Sampling Design and Analysis MTH 494 LECTURE-12 Ossam Chohan Assistant Professor CIIT Abbottabad.
United Nations Economic Commission for Europe Statistical Division Mapping Data Production Processes to the GSBPM Steven Vale UNECE
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Basic Sampling Issues CHAPTER twelve.
DATA PREPARATION: PROCESSING & MANAGEMENT Lu Ann Aday, Ph.D. The University of Texas School of Public Health.
CBS-SSB STATISTICS NETHERLANDS – STATISTICS NORWAY Work Session on Statistical Data Editing Oslo, Norway, September 2012 Jeroen Pannekoek and Li-Chun.
Notes 1.3 (Part 1) An Overview of Statistics. What you will learn 1. How to design a statistical study 2. How to collect data by taking a census, using.
Copyright 2010, The World Bank Group. All Rights Reserved. Managing Data Processing Section B.
Probability Sampling. Simple Random Sample (SRS) Stratified Random Sampling Cluster Sampling The only way to ensure a representative sample is to obtain.
Study of Editing and Imputation Practices at Statistics Finland Janika Konnu and Pauli Ollila Statistics Finland Q2010: Editing session Wednesday 5 th.
5.8 Finalise data files 5.6 Calculate weights Price index for legal services Quality Management / Metadata Management Specify Needs Design Build CollectProcessAnalyse.
FDI - Imputation. Overview Introduction Overview of Imputation Methods Overview of Outliering methods Overview of Estimation methods Aggregation Disclosure.
How to deal with quality aspects in estimating national results Annalisa Pallotti Short Term Expert Asa 3st Joint Workshop on Pesticides Indicators Valletta.
HW Page 23 Have HW out to be checked.
The treatment of uncertainty in the results
Information for marketing management
جمعیت –نمونه –روشهای نمونه گیری دکتر محسن عسکرشاهی دکترای آمار زيستی
Tomaž Špeh, Rudi Seljak Statistical Office of the Republic of Slovenia
Data Sampling Jerry Post Copyright © 1997
PRODCOM SURVEY IN THE UNITED KINGDOM
Data validation handbook
Disseminating ICT data
Data processing German foreign trade statistics
Mapping Data Production Processes to the GSBPM
Chapter One Data Collection
Presentation transcript:

Module B-4: Processing ICT survey data TRAINING COURSE ON THE PRODUCTION OF STATISTICS ON THE INFORMATION ECONOMY Module B-4 Processing ICT Survey data Unctad Manual Chapter 7

Module B4: Processing ICT survey data UNCTAD 2 Objectives After completing this module you will know how to do: Data processing Data weighting (grossing-up) Data editing Data analysis Contents of this module 4. Data processing and analysis 4.1 Data editing 4.2 Data weighting 4.3 Estimating ICT indicators

Module B4: Processing ICT survey data UNCTAD 3 Data editing  Statistical information provided by businesses can contain errors such as  Wrong or missing data,  Incorrect classifications  Inconsistent or illogical responses.  Solutions to minimize such errors  Ex ante optimize the effectiveness of data capture instruments collection procedures.  Ex post application of robust data editing techniques Editing! What is editing? B4.1. Data editing Page 82

Module B4: Processing ICT survey data UNCTAD 4 Phases of data processing Raw data Quality controls during data collection and entry Clean data file Data editing Treatment of internal errors and inconsistencies Estimation of missing data Outlier analysis Re-weighting procedures Editing of aggregates Micro-editing (input) Macro-editing (output ) Editing! B4.1. Data editing

Module B4: Processing ICT survey data UNCTAD 5 Internal inconsistencies and errors  Validity control of an individual data item requires: 1.To define a valid set of responses (in general, gender should be = 0 or 1, age should not be 110 years, etc; in ICT use of Internet by business should be 0 or 1) 2.To check questions against valid responses - Definition of rules based in relationships between questions (see Box 15 of the Manual: some logical tests) 3.Arithmetic checks during data entry or batch mode (totals, subtotals, frequencies) B4.3. Estimating ICT indicators Page 82

Module B4: Processing ICT survey data UNCTAD 6 Treatment of missing data  Final non-response (missing data) should be treated to avoid biased estimates.  Unit non-response treatment: Corrective weighting. Sample-based methods (the original weights are modified with sample information) Population-based method (the weights are modified with population information, the classical post stratification procedure) B4.3. Estimating ICT indicators Page 84

Module B4: Processing ICT survey data UNCTAD 7 Treatment of missing data (cont.)  Final non-response (missing data) should be treated in order to avoid biased estimates.  Item non-response treatment: Imputation. Deterministic imputation (a law). Hot deck imputation (let’s do it now). Cold deck imputation (using other information, models, econometrics…). Mean or modal value imputation ( it is clear). Historical imputation (long series). B4.3. Estimating ICT indicators Page 151 Annexe 5

Module B4: Processing ICT survey data UNCTAD 8 Misclassified units  Two cases of misclassification  Non-eligibility unit erroneously included This will reduce the effective sample size unless a reserve list is prepared  Eligible unit included in the wrong stratum or omitted from the frame altogether The technical solution consists of recalculating sample weights (see Box 17) B4.3. Estimating ICT indicators Page 86

Module B4: Processing ICT survey data UNCTAD 9 Some simple weighting methods  The sample average in stratum h is defined as  The estimate for the total for stratum h can be obtained by multiplying the stratum average by the total number of businesses in the stratum (Nh) B4.2. Data weighting

Module B4: Processing ICT survey data UNCTAD 10   The estimate for the total in the population is just or See boxes 18 and 19 pag 89 B4.3. Estimating ICT indicators Some simple weighting methods (cont.)

Module B4: Processing ICT survey data UNCTAD 11 Estimating proportions and ratios  A proportion:  Four different types of estimates are very usual  Simple random sampling of a non-stratified population  Stratified random sampling With one or several strata exhaustively investigated  Ratio estimates with simple random sampling  Ratio estimates with stratified random sampling A ratio : B4.3. Estimating ICT indicators n n ICT indicators are mainly proportions and ratios.

Module B4: Processing ICT survey data UNCTAD 12 CASE 1: Simple random sampling of a non- stratified population  The indicator can be expressed as the sample proportion:  The standard error (SE) of the sample proportion is estimated by :  SE expression valid with a sampling fraction of 10% or less B4.3. Estimating ICT indicators

Module B4: Processing ICT survey data UNCTAD 13 CASE 2: Stratified random sampling  An unbiased estimate of p is: Where, L : the number of strata Nh : the population in stratum h (h=1, 2,... L) nh : the sample size in stratum h (h=1, 2,... L)  The estimate of the SE of :  See Annex 4 of the Manual for more details B4.3. Estimating ICT indicators

Module B4: Processing ICT survey data UNCTAD 14 CASE 3: Ratio estimates with simple random sampling  The indicator to estimate is :  The natural estimate of ratio p is:  Finally, one approximation of the SE is: where is the sample average of n X observations, This is a reference outside the scope of our course B4.3. Estimating ICT indicators