Treatment Of Unit Non-response In Establishment Surveys ICES –III: June 18 -21, 2007 M.A. Hidiroglou Wesley Yung Statistics Canada.

Treatment Of Unit Non-response In Establishment Surveys ICES –III: June 18 -21, 2007 M.A. Hidiroglou Wesley Yung Statistics Canada

Outline 1.Why is it a problem? 2.Causes 3.Measurement 4.Follow-up 5.Score Function 6.Adjusting for nonresponse 7.Weight adjustment 8.Imputation 9.Summary

Why is it a Problem? Bias Non-respondents differ from respondents in the characteristics measured Sampling variance Increased Reduced effective sample size

Causes Frame quality Contact information name, address, telephone number and fax number Classification (industry/geography) Over-coverage: sampled unit not in scope to the survey - does not respond Under coverage: units declared out-of-scope – not contacted

Causes, c ont. Questionnaire Design and layout Coverage: complex businesses Language Length / time to fill out

Causes, c ont. Data collection method Did not adjust to respondent’s preferred contact mode Mail, personal interview, telephone interview, computer assisted interviewing, etc

Causes, c ont. Contact: Agency and respondent Lack of communication and follow-up Too much contact: editing checks Timing Best day and time Fiscal year end

Causes, c ont. Contact: Agency and respondent Data availability Response load Who else is asking? Legal obligations for respondents and statistical agency Confidentiality protection

Measurement Compile non-response rates Refusals Non-contact Out-of-scope Seasonality /death status (unknown) Mail returns Other reasons

Follow Up Follow-up non-respondents All and/or targeted sub-group Effective way to increase the response rate

Follow Up, c ont. Prioritise follow-up Who? Target large or significant units first Non-responding births Delinquent businesses How? Score-function

Follow Up, c ont. Annual business census type surveys Split non-respondents by into take-all and take-some strata Boundary Select with certainty ta units: Select n - ta remaining units from take-some stratum Largest Smallest Response Follow-up Non- Response

Follow Up, c ont. Hansen-Hurwitz (1946) Initial sample : Follow-up sample of non-respondents Estimator

Score Function Basic idea Follow-up non-responding units that have most impact on estimates Adaptation of Latouche and Berthelot (1992), McKenzie (2001), and Hedlin (2003).

Score Function, c ont. Key steps 1.Define and compute score function from past values 2.Determine score cut-off: minimize absolute standard bias 3.Follow-up units above score cut-off

Score Function, cont. 1.Define and compute score function

Score Function, cont. 2.Determine score cut-off

Score Function, cont. 2. Determine score cut-off 3.Follow-up units above score cut-off

Score Function, cont. Score-function (Latouche and Berthelot 1992) Establish threshold based on ASB Follow-up k-th unit if

Score Function, cont. Number of recontacts Absolute standard bias 0 Cut-off

Weight Adjustment, cont. Select sample s: Design weights Portion of sampled units that respond: Portion of sampled units that does not respond:

Adjusting for nonresponse Two options 1.Weight adjustment: Inverse of response probability Use of auxiliary data 2.Imputation: Impute for missing values to get a full data matrix

Weight Adjustment Used to reduce bias due to non-response Depends on the probability to respond Assumes independent of variable of interest, y Ignorable non-response Respondents behave same as non-respondents

Weight Adjustment, cont. If known, then adjustment is Unbiased estimator is However, not known Use estimates of : may be biased If are ‘good’, then estimates are approximately unbiased

Weight Adjustment, cont. Let true response mechanism be and If assume missing at random: Bias for estimated total:

Weight Adjustment, cont. How to estimate (approximate) ? Auxiliary variables Logistic regression Auxiliary data (discrete, continuous)

Weight Adjustment, cont. Logistic regression Define indicator response variable Probability that unit k responds Equivalent to:

Weight Adjustment, cont. Logistic regression Solve Response probability adjusted weight Reweighed estimator:

Weight Adjustment, cont. 127 sampled businesses 71 businesses respond Same : 0.56 Example: Logistic regression

Weight Adjustment, cont. Example Logistic regression

Weight Adjustment, cont. 127 sampled businesses 55 businesses respond Same : 0.43 Example: Logistic regression

Weight Adjustment, cont. Discrete (Count Adjustment) Assume that and for all i and j That is, everyone has the same probability of response and the probability of response is independent between individuals (Uniform Response Mechanism) Estimate of is

Weight Adjustment, cont. Discrete (Count Adjustment) Non-response adjustment is Non-response adjusted estimator is

Weight Adjustment, cont. Continuous ( Auxiliary Data) Suppose we have auxiliary data x i and the known population total X Estimate by either Under a Uniform Response Mechanism (URM), and provide approximately unbiased estimates

Weight Adjustment, cont. Continuous ( Auxiliary Data) Note that leads to a two-phase estimator and to the well known ratio estimator calibrates to the known total X

Weight Adjustment, cont. Continuous ( Auxiliary Data) If we have marginal totals for 2 auxiliary variables, X and Z, one can use raking MF 15-30??Z1 30-65??Z2 65+??Z3 X1X2

Weight Adjustment, cont. Continuous ( Auxiliary Data) Raking assumes that and Raking is an iterative procedure Rake to one margin then the other At convergence, get adjustment so that marginal totals are met

Weight Adjustment, cont. Continuous ( Auxiliary Data) Generalized Regression (GREG) estimator Weight adjustment not really an estimate of response probability Can show that bias is function of response probability and predictive power of X Unbiased under URM

Weight Adjustment, cont. Continuous (Auxiliary Data) Weight adjustment Adjusted estimator:

Weight Adjustment, cont. Weighting Classes Assumption of URM very strong and somewhat unrealistic Usually define weighting classes Mutually exclusive and exhaustive groups C 1, C 2, …,C R Assume URM within each class

Weight Adjustment, cont. How to define weighting classes? Using auxiliary data to group units so that within the weighting class Using auxiliary data and logistic regression models Obtain for all i Form groups so that

Weight Adjustment, cont. Weighting Classes If weighting class variable is good at predicting y and non-response, bias and variance will be reduced If weighting class variable unrelated to nonresponse but is good predictor of y, no bias reduction but variance reduced If weighting class variable unrelated to y, no bias reduction. Variance could increase if weighting class variable good predictor of non-response!

Imputation Usually used for item non-response Can be used for unit non-response also Several methods available Deductive imputation Class mean imputation Cold-deck imputation (earlier survey/ historical)

Imputation Hot-deck imputation (current survey) Random overall imputation Random imputation classes Sequential hot deck Distance function matching Regression imputation Simplest example is ratio

Imputation, cont. For business surveys, most commonly used methods involve auxiliary data Historical data If data available from previous time period, use it with a trend (last month / last year) If none available, use a mean imputation Administrative data (i.e. tax) Use tax data with or without an adjustment At Statistics Canada, annual tax data used to directly replace and monthly tax data adjusted before use

Summary Reduce non-response at front-end Frame Contact vehicle Editing Measure non-response Follow-up selectively and representatively Adjust for non-response Model (Weighting /imputing / Logistic Regression) Homogeneous classes

References Bethlehem, J.G. (1988) reduction of Nonresponse bias through regression estimation. Journal of Official Statistics, Vol. 4, No. 3, 251-260. Cochran, W.G. (1977): Sampling Techniques. Third Edition, Wiley, New York. Cornish J. (2004). Response Problems In Surveys: improving response and minimising the load for UNSD. Regional Seminar on 'Good Practices in the Organization and Management of Statistical Systems’ for ASEAN countries, Yangon Myanmar, 11-13 December 2002. DeLeeuw, Edith D (ed) (1999). Special issues on Survey Nonresponse Journal of Official Statistics 15, 2. Dillman, D. A. Procedures for Conducting Government-Sponsored Establishment Surveys: Comparisons of the Total Design Method (TDM), a Traditional Cost- Compensation Model, and Tailored Design, Washington State University. Ekholm, A. and Laaksonen, S. (1991). Weighting via Response Modeling in the Finnish Household Budget Survey. Journal of Official Statistics, 7, 325–337. Elliot, M.R., Little, R.J.A., and Lewitzky, S. (2000). Subsampling Callbacks to Improve Survey Efficiency. Journal of the American Statistical Association, 95, 730–838. Groves R M, Dillman D A, Eltinge J L & Little R J A (eds), Survey Nonresponse, 2002, Chichester: Wiley Hansen, M. H., and Hurwitz, W. N. (1946), The Problem of Nonresponse in Sample Surveys, Journal of the American Statistical Association, 41, 517–529. Hedlin, D. (2003).Score Functions to Reduce Business Survey Editing at the U.K. Office for National Statistics. Journal of Official Statistics, Vol.19, No.2, 177-199 Hidiroglou, M. A, Drew, D. J, and Gray, G. B, June 1993 A frameworkfor Measuring and Reducing Nonresponse in Surveys, Survey Methodology 19:81-94 International Conference on Survey Nonresponse (1999). http://jpsm.umd.edu/icsn/papers/Index.htm. Kalton G. and Flores-Cervantes I. (2003). Weighting Methods. Journal of Official Statistics, Vol.19, No.2, 2003. pp. 81-97

References Laaksonen, S. and Chambers, R. (2006). Survey Estimation under Informative Nonresponse with Follow-up. Journal of Official Statistics, Vol. 22, No. 1, 2006, 81–95. Latouche, M. and Berthelot, J.-M., (1992). Use of a Score Function to Prioritize and Limit Recontacts in Editing Business Surveys. Journal of Official Statistics, Vol.8, No.3, 1992. 389-400. Lawrence D. and McKenzie R. (2000).The General Application of Significance Editing. Journal of Official Statistics, Vol.16, No.3, 243-253 Little, R. (1986). Survey Nonresponse Adjustments for Estimates of Means. International Statistical Review, 54, 139–157. Lundstrom Sixten and Särndal C.-E. (1999). Calibration as a Standard Method for Treatment of Nonresponse. Journal of Official Statistics, Vol. 15, No. 2, 1999, 305-327. Lynn, Peter and Clarke, Paul, Separating refusal bias and con-contact bias: evidence from UK national surveys, The Statistician, 51, Part 3, 391-333. Madow, W.G., Nisselson, H., and Olkin, I. (eds.) (1983): Incomplete Data in Sample Surveys. Vol. 1: Report and Case Studies. Academic Press, New York. McKenzie, Richard. (2000). A Framework for Priority Contact of Non Respondents. In the Proceedings of The Second International Conference on Establishment Surveys, Buffalo, New York. 473 - 482. Rao, J.N.K.(1973 ).Double sampling for stratification and survey.Biometrika,Vol. 60, No. 1 : 125-133 Särndal, C.-E. and Swensson, B. (1987). A General View of Estimation for Two Phases of Selection with Applications to Two-Phase Sampling and Nonresponse. International Statistical Review, 55, 279–294. Strauss, E.E., and Hidiroglou, M.A. (1984). A Follow-up Procedure for Business Census Type Surveys. In Topics in Applied Statistics. Y.P. Chaubey and T.D. Dwivedi ed., 447-453. Published by Concordia University, Montréal. Valliant R. (2004) The Effect of Multiple Weighting Steps on Variance Estimation Journal of Official Statistics, Vol.20, No.1, 1-18. Wang, J.E. (2004). Non-response in the Norwegian Business Tendency Survey. Statistics Norway Department of Economic Statistics.

Score Function, cont No follow-up on occasion t-a Partial follow-up on occasion t-a Full follow-up on occasion t-a

Treatment Of Unit Non-response In Establishment Surveys ICES –III: June 18 -21, 2007 M.A. Hidiroglou Wesley Yung Statistics Canada.

Similar presentations

Presentation on theme: "Treatment Of Unit Non-response In Establishment Surveys ICES –III: June 18 -21, 2007 M.A. Hidiroglou Wesley Yung Statistics Canada."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Treatment Of Unit Non-response In Establishment Surveys ICES –III: June 18 -21, 2007 M.A. Hidiroglou Wesley Yung Statistics Canada.

Similar presentations

Presentation on theme: "Treatment Of Unit Non-response In Establishment Surveys ICES –III: June 18 -21, 2007 M.A. Hidiroglou Wesley Yung Statistics Canada."— Presentation transcript:

Similar presentations

About project

Feedback