Presentation is loading. Please wait.

Presentation is loading. Please wait.

Optimal Sampling Strategies for Multidomain, Multivariate Case with different amount of auxiliary information Piero Demetrio Falorsi, Paolo Righi 

Similar presentations


Presentation on theme: "Optimal Sampling Strategies for Multidomain, Multivariate Case with different amount of auxiliary information Piero Demetrio Falorsi, Paolo Righi "— Presentation transcript:

1 Optimal Sampling Strategies for Multidomain, Multivariate Case with different amount of auxiliary information Piero Demetrio Falorsi, Paolo Righi  falorsi@istat.it, parighi@istat.it  Italian National Statistical Institute Seminar UNECE, 12 June 2012

2 Outline Aim of the talk Statement of the problem (The unified approach for) sampling design (Mgreg) Estimator Experimental results Conclusions

3 Aim of the talk An overall strategy

4 Statement of the problem

5 Statement of the problem: Challenging informative context Multiple sources of auxiliary information

6 Statement of the problem: Design

7 Statement of the problem: Estimation  Standard solution for estimation (calibration estimators) may allow for calibrating at domain level only for the register variables and does not calibrate on the domain existing totals deriving from auxiliary data sources  Main drawback: Too small sample size for some domains Risk that the estimation of variables that could derive from administrative Data Source are significantly different from known totals Biased estimation for small domains Effect of non response or measurement error

8 Sampling Design: Multiple sources of auxiliary information

9

10

11 Estimation: Multiple sources of auxiliary information

12 Estimation:The Working model

13 Estimation:The Mgreg Estimator

14 Estimation: Properties

15

16 Estimation: Properties - auxiliary=interest

17 Empirical Results: Population of simulation - 1999 Italian enterprises from 1 to 99 employees- Computer and related economic activities (2-digits NACE Rev.1) ITACOSM 2011 - 27-29 June 2011, Pisa, Italy - 12 Populatio n size Number of cross-classified strata Cumulative (%) distribution 16818.89 23729.17 3-56346.67 6-105060.56 11-10011993.61 More than 100 23100.00 The domains of interest (44): (1) geographical region with 20 marginal domains (DOM1); (2) economic activity group by Size class (24 domains)

18 Empirical Results: Simulation: allocation comparison between the one- way and multi-way design  Prediction models: M1M1 M2M2 61.065.1 64.168.1 Value addedLabour cost % Model

19 Sampling distributions over the partition with different auxiliary information Empirical Results: multiple sources of auxiliary information: example – efficiency of the proposed strategy

20 Conclusions

21 The last result (The unified approach) of a research that has lasted almost 6 years Survey Methodology (2008) Statistics in Transition (2006) 2 books published by Franco Angeli illustrating the main findings of a research of strategic interest financed by the Ministry of University and Research Presentations NTTS (2011), Neuchatel (2011) Invited talk to the next scientific conference of the Italian Society of Statistics Accepted talk for the ICES

22 References  Bethel J. (1989) Sample Allocation in Multivariate Surveys, Survey Methodology, 15, 47-57.  Chromy J. (1987). Design Optimization with Multiple Objectives, Proceedings of the Survey Research Methods Sec-tion. American Statistical Association, 194-199.  Deville J.-C., Tillé Y. (2004) Efficient Balanced Sampling: the Cube Method, Biometrika, 91, 893-912.  Deville J.-C., Tillé Y. (2005) Variance approximation under balanced sampling, Journal of Statistical Planning and Inference, 128, 569-591  Falorsi P. D., Righi P. (2008) A Balanced Sampling Approach for Multi-way Stratification Designs for Small Area Estimation, Survey Methodology, 34, 223- 234  Falorsi P. D., Orsini D., Righi P., (2006) Balanced and Coordinated Sampling Designs for Small Domain Estimation, Statistics in Transition, 7, 1173-1198  Isaki C.T., Fuller W.A. (1982) Survey design under a regression superpopulation model, Journal of the American Statistical Association, 77, 89-96


Download ppt "Optimal Sampling Strategies for Multidomain, Multivariate Case with different amount of auxiliary information Piero Demetrio Falorsi, Paolo Righi "

Similar presentations


Ads by Google