Presentation is loading. Please wait.

Presentation is loading. Please wait.

Treatment Of Unit Non-response In Establishment Surveys ICES –III: June 18 -21, 2007 M.A. Hidiroglou Wesley Yung Statistics Canada.

Similar presentations


Presentation on theme: "Treatment Of Unit Non-response In Establishment Surveys ICES –III: June 18 -21, 2007 M.A. Hidiroglou Wesley Yung Statistics Canada."— Presentation transcript:

1 Treatment Of Unit Non-response In Establishment Surveys ICES –III: June 18 -21, 2007 M.A. Hidiroglou Wesley Yung Statistics Canada

2 Outline 1.Why is it a problem? 2.Causes 3.Measurement 4.Follow-up 5.Score Function 6.Adjusting for nonresponse 7.Weight adjustment 8.Imputation 9.Summary

3 Why is it a Problem? Bias Non-respondents differ from respondents in the characteristics measured Sampling variance Increased Reduced effective sample size

4 Causes Frame quality Contact information name, address, telephone number and fax number Classification (industry/geography) Over-coverage: sampled unit not in scope to the survey - does not respond Under coverage: units declared out-of-scope – not contacted

5 Causes, c ont. Questionnaire Design and layout Coverage: complex businesses Language Length / time to fill out

6 Causes, c ont. Data collection method Did not adjust to respondent’s preferred contact mode Mail, personal interview, telephone interview, computer assisted interviewing, etc

7 Causes, c ont. Contact: Agency and respondent Lack of communication and follow-up Too much contact: editing checks Timing Best day and time Fiscal year end

8 Causes, c ont. Contact: Agency and respondent Data availability Response load Who else is asking? Legal obligations for respondents and statistical agency Confidentiality protection

9 Measurement Compile non-response rates Refusals Non-contact Out-of-scope Seasonality /death status (unknown) Mail returns Other reasons

10 Follow Up Follow-up non-respondents All and/or targeted sub-group Effective way to increase the response rate

11 Follow Up, c ont. Prioritise follow-up Who? Target large or significant units first Non-responding births Delinquent businesses How? Score-function

12 Follow Up, c ont. Annual business census type surveys Split non-respondents by into take-all and take-some strata Boundary Select with certainty ta units: Select n - ta remaining units from take-some stratum Largest Smallest Response Follow-up Non- Response

13 Follow Up, c ont. Hansen-Hurwitz (1946) Initial sample : Follow-up sample of non-respondents Estimator

14 Score Function Basic idea Follow-up non-responding units that have most impact on estimates Adaptation of Latouche and Berthelot (1992), McKenzie (2001), and Hedlin (2003).

15 Score Function, c ont. Key steps 1.Define and compute score function from past values 2.Determine score cut-off: minimize absolute standard bias 3.Follow-up units above score cut-off

16 Score Function, cont. 1.Define and compute score function

17 Score Function, cont. 2.Determine score cut-off

18 Score Function, cont. 2. Determine score cut-off 3.Follow-up units above score cut-off

19 Score Function, cont. Score-function (Latouche and Berthelot 1992) Establish threshold based on ASB Follow-up k-th unit if

20 Score Function, cont. Number of recontacts Absolute standard bias 0 Cut-off

21 Weight Adjustment, cont. Select sample s: Design weights Portion of sampled units that respond: Portion of sampled units that does not respond:

22 Adjusting for nonresponse Two options 1.Weight adjustment: Inverse of response probability Use of auxiliary data 2.Imputation: Impute for missing values to get a full data matrix

23 Weight Adjustment Used to reduce bias due to non-response Depends on the probability to respond Assumes independent of variable of interest, y Ignorable non-response Respondents behave same as non-respondents

24 Weight Adjustment, cont. If known, then adjustment is Unbiased estimator is However, not known Use estimates of : may be biased If are ‘good’, then estimates are approximately unbiased

25 Weight Adjustment, cont. Let true response mechanism be and If assume missing at random: Bias for estimated total:

26 Weight Adjustment, cont. How to estimate (approximate) ? Auxiliary variables Logistic regression Auxiliary data (discrete, continuous)

27 Weight Adjustment, cont. Logistic regression Define indicator response variable Probability that unit k responds Equivalent to:

28 Weight Adjustment, cont. Logistic regression Solve Response probability adjusted weight Reweighed estimator:

29 Weight Adjustment, cont. 127 sampled businesses 71 businesses respond Same : 0.56 Example: Logistic regression

30 Weight Adjustment, cont. Example Logistic regression

31 Weight Adjustment, cont. 127 sampled businesses 55 businesses respond Same : 0.43 Example: Logistic regression

32 Weight Adjustment, cont. Discrete (Count Adjustment) Assume that and for all i and j That is, everyone has the same probability of response and the probability of response is independent between individuals (Uniform Response Mechanism) Estimate of is

33 Weight Adjustment, cont. Discrete (Count Adjustment) Non-response adjustment is Non-response adjusted estimator is

34 Weight Adjustment, cont. Continuous ( Auxiliary Data) Suppose we have auxiliary data x i and the known population total X Estimate by either Under a Uniform Response Mechanism (URM), and provide approximately unbiased estimates

35 Weight Adjustment, cont. Continuous ( Auxiliary Data) Note that leads to a two-phase estimator and to the well known ratio estimator calibrates to the known total X

36 Weight Adjustment, cont. Continuous ( Auxiliary Data) If we have marginal totals for 2 auxiliary variables, X and Z, one can use raking MF 15-30??Z1 30-65??Z2 65+??Z3 X1X2

37 Weight Adjustment, cont. Continuous ( Auxiliary Data) Raking assumes that and Raking is an iterative procedure Rake to one margin then the other At convergence, get adjustment so that marginal totals are met

38 Weight Adjustment, cont. Continuous ( Auxiliary Data) Generalized Regression (GREG) estimator Weight adjustment not really an estimate of response probability Can show that bias is function of response probability and predictive power of X Unbiased under URM

39 Weight Adjustment, cont. Continuous (Auxiliary Data) Weight adjustment Adjusted estimator:

40 Weight Adjustment, cont. Weighting Classes Assumption of URM very strong and somewhat unrealistic Usually define weighting classes Mutually exclusive and exhaustive groups C 1, C 2, …,C R Assume URM within each class

41 Weight Adjustment, cont. How to define weighting classes? Using auxiliary data to group units so that within the weighting class Using auxiliary data and logistic regression models Obtain for all i Form groups so that

42 Weight Adjustment, cont. Weighting Classes If weighting class variable is good at predicting y and non-response, bias and variance will be reduced If weighting class variable unrelated to non- response but is good predictor of y, no bias reduction but variance reduced If weighting class variable unrelated to y, no bias reduction. Variance could increase if weighting class variable good predictor of non-response!

43 Imputation Usually used for item non-response Can be used for unit non-response also Several methods available Deductive imputation Class mean imputation Cold-deck imputation (earlier survey/ historical)

44 Imputation Hot-deck imputation (current survey) Random overall imputation Random imputation classes Sequential hot deck Distance function matching Regression imputation Simplest example is ratio

45 Imputation, cont. For business surveys, most commonly used methods involve auxiliary data Historical data If data available from previous time period, use it with a trend (last month / last year) If none available, use a mean imputation Administrative data (i.e. tax) Use tax data with or without an adjustment At Statistics Canada, annual tax data used to directly replace and monthly tax data adjusted before use

46 Summary Reduce non-response at front-end Frame Contact vehicle Editing Measure non-response Follow-up selectively and representatively Adjust for non-response Model (Weighting /imputing / Logistic Regression) Homogeneous classes

47 References Bethlehem, J.G. (1988) reduction of Nonresponse bias through regression estimation. Journal of Official Statistics, Vol. 4, No. 3, 251-260. Cochran, W.G. (1977): Sampling Techniques. Third Edition, Wiley, New York. Cornish J. (2004). Response Problems In Surveys: improving response and minimising the load for UNSD. Regional Seminar on 'Good Practices in the Organization and Management of Statistical Systems’ for ASEAN countries, Yangon Myanmar, 11-13 December 2002. DeLeeuw, Edith D (ed) (1999). Special issues on Survey Nonresponse Journal of Official Statistics 15, 2. Dillman, D. A. Procedures for Conducting Government-Sponsored Establishment Surveys: Comparisons of the Total Design Method (TDM), a Traditional Cost- Compensation Model, and Tailored Design, Washington State University. Ekholm, A. and Laaksonen, S. (1991). Weighting via Response Modeling in the Finnish Household Budget Survey. Journal of Official Statistics, 7, 325–337. Elliot, M.R., Little, R.J.A., and Lewitzky, S. (2000). Subsampling Callbacks to Improve Survey Efficiency. Journal of the American Statistical Association, 95, 730–838. Groves R M, Dillman D A, Eltinge J L & Little R J A (eds), Survey Nonresponse, 2002, Chichester: Wiley Hansen, M. H., and Hurwitz, W. N. (1946), The Problem of Nonresponse in Sample Surveys, Journal of the American Statistical Association, 41, 517–529. Hedlin, D. (2003).Score Functions to Reduce Business Survey Editing at the U.K. Office for National Statistics. Journal of Official Statistics, Vol.19, No.2, 177-199 Hidiroglou, M. A, Drew, D. J, and Gray, G. B, June 1993 A frameworkfor Measuring and Reducing Nonresponse in Surveys, Survey Methodology 19:81-94 International Conference on Survey Nonresponse (1999). http://jpsm.umd.edu/icsn/papers/Index.htm. Kalton G. and Flores-Cervantes I. (2003). Weighting Methods. Journal of Official Statistics, Vol.19, No.2, 2003. pp. 81-97

48 References Laaksonen, S. and Chambers, R. (2006). Survey Estimation under Informative Nonresponse with Follow-up. Journal of Official Statistics, Vol. 22, No. 1, 2006, 81–95. Latouche, M. and Berthelot, J.-M., (1992). Use of a Score Function to Prioritize and Limit Recontacts in Editing Business Surveys. Journal of Official Statistics, Vol.8, No.3, 1992. 389-400. Lawrence D. and McKenzie R. (2000).The General Application of Significance Editing. Journal of Official Statistics, Vol.16, No.3, 243-253 Little, R. (1986). Survey Nonresponse Adjustments for Estimates of Means. International Statistical Review, 54, 139–157. Lundstrom Sixten and Särndal C.-E. (1999). Calibration as a Standard Method for Treatment of Nonresponse. Journal of Official Statistics, Vol. 15, No. 2, 1999, 305-327. Lynn, Peter and Clarke, Paul, Separating refusal bias and con-contact bias: evidence from UK national surveys, The Statistician, 51, Part 3, 391-333. Madow, W.G., Nisselson, H., and Olkin, I. (eds.) (1983): Incomplete Data in Sample Sur­veys. Vol. 1: Report and Case Studies. Academic Press, New York. McKenzie, Richard. (2000). A Framework for Priority Contact of Non Respondents. In the Proceedings of The Second International Conference on Establishment Surveys, Buffalo, New York. 473 - 482. Rao, J.N.K.(1973 ).Double sampling for stratification and survey.Biometrika,Vol. 60, No. 1 : 125-133 Särndal, C.-E. and Swensson, B. (1987). A General View of Estimation for Two Phases of Selection with Applications to Two-Phase Sampling and Nonresponse. International Statistical Review, 55, 279–294. Strauss, E.E., and Hidiroglou, M.A. (1984). A Follow-up Procedure for Business Census Type Surveys. In Topics in Applied Statistics. Y.P. Chaubey and T.D. Dwivedi ed., 447-453. Published by Concordia University, Montréal. Valliant R. (2004) The Effect of Multiple Weighting Steps on Variance Estimation Journal of Official Statistics, Vol.20, No.1, 1-18. Wang, J.E. (2004). Non-response in the Norwegian Business Tendency Survey. Statistics Norway Department of Economic Statistics.

49 Score Function, cont No follow-up on occasion t-a Partial follow-up on occasion t-a Full follow-up on occasion t-a


Download ppt "Treatment Of Unit Non-response In Establishment Surveys ICES –III: June 18 -21, 2007 M.A. Hidiroglou Wesley Yung Statistics Canada."

Similar presentations


Ads by Google