Small-area estimation in Official Statistics: ICT survey in Enterprises of the Basque Country Jorge Aramendi, Jose Miguel Escalada, Elena Goni & Anjeles.

Slides:



Advertisements
Similar presentations
Innovation data collection: Methodological procedures & basic forms Regional Workshop on Science, Technology and Innovation (STI) Indicators.
Advertisements

Innovation data collection: Advice from the Oslo Manual South East Asian Regional Workshop on Science, Technology and Innovation Statistics.
Innovation Surveys: Advice from the Oslo Manual South Asian Regional Workshop on Science, Technology and Innovation Statistics Kathmandu,
Innovation Surveys: Advice from the Oslo Manual National training workshop Amman, Jordan October 2010.
1 Measuring ICT4D: ITUs Focus on Household and Individual Market, Economics & Finance Unit Telecommunication Development Bureau.
Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.
Estimates and sampling errors for Establishment Surveys International Workshop on Industrial Statistics Beijing, China, 8-10 July 2013.
Sampling Strategy for Establishment Surveys International Workshop on Industrial Statistics Beijing, China, 8-10 July 2013.
17 September SME Statistics OECD Workshop SME data and methodologies in the EU - item 5 Paul Feuvrier / Eurostat.
Enhancing Data Quality of Distributive Trade Statistics Workshop for African countries on the Implementation of International Recommendations for Distributive.
Survey vehicles for ICT indicators of the business sector Joint UNCTAD – ITU – UNESCAP Regional Workshop on Information Society Measurements in Asia-Pacific.
Riku Salonen Regression composite estimation for the Finnish LFS from a practical perspective.
Discussion Sampling Methods
2006 August Labour statistics The usage of administrative data sources for Lithuanian data of earnings Milda Šličkutė-Šeštokienė Statistics Lithuania.
Global Event on Measuring the Information Society, May 2008, Palais des Nations, Geneva Session 4: Measurement of ICT Impact ICT and labour productivity,
Measuring and Monitoring Poverty for the MDGs Johan A. Mistiaen Economist-Statistician Development Data Group The World Bank Overview of the Approach and.
Challenges in designing mixed mode in business surveys Dr Mojca Noc Razinger, Statistical Office of the Republic of Slovenia.
UNECE Workshop on Confidentiality Manchester, December 2007 Comparing Fully and Partially Synthetic Data Sets for Statistical Disclosure Control.
Arun Srivastava. Small Areas What is a small area? Sub - population Domain The Domain need not necessarily be geographical. Examples Geographical Subpopulations.
ICVS IN SLOVENIA Tatjana Škrbec. Content of presentation  Short history  Crime victim survey 2001 within SORS  Methodology and content of questionnaire.
1 The ICT Statistics of Business Sector in China By Lu Haiqi International Statistical Information Center, National Bureau of Statistics People’s Republic.
PREPARATION, ORGANISATION AND CONDUCTING OF THE POST- ENUMERATION SURVEY IN THE STATE STATISTICAL OFFICE OF THE REPUBLIC OF MACEDONIA Skopje, May 2008.
1 Item 7: National Accounts And Employment Data Using Employment Statistics in the Russian National Accounts Alexander Surinov Deputy Head of Rosstat Joint.
9 th Workshop on Labour Force Survey Methodology – Rome, May 2014 The Italian LFS sampling design: recent and future developments 9 th Workshop on.
Improving the Design of UK Business Surveys Gareth James Methodology Directorate UK Office for National Statistics.
Use of Administrative Data in Statistics Canada’s Annual Survey of Manufactures Steve Matthews and Wesley Yung May 16, 2004 The United Nations Statistical.
ICT adoption in Irish manufacturing: An example of merging two CSO business microdata sets Stefanie Haller CSO Business Statistics Seminar, Dublin 26 February.
Use of web scraping and text mining techniques in the Istat survey on “Information and Communication Technology in enterprises” Giulio Barcaroli(*), Alessandra.
A Centralised Management Tool for Quality Indicators in EUSTAT Jorge Aramendi, Anjeles Iztueta, Cristina Prado, Marta Mas EUSTAT. Basque Statistics Office.
Linking micro data for the analysis of ICT effects Mika Maliranta, ETLA Istat – Stat Fin Workshop, June 26th and 27th, Rome.
Can We Continue to Exclude Small Single-Establishment Businesses from Data Collection in the Annual Retail Trade Survey and the Service Annual Survey?
Valentina Stoevska ILO Department of Statistics Workshop on MDG Data Reconciliation: Employment Indicators, Beirut, July
Using administrative registers in sample surveys European Conference on Quality in Official Statistics 3-–6 May 2010 Kaja Sõstra Statistics Estonia.
Use of Administrative Data Seminar on Developing a Programme on Integrated Statistics in support of the Implementation of the SNA for CARICOM countries.
Sampling Error Estimation – SORS practice Rudi Seljak, Petra Blažič Statistical Office of the Republic of Slovenia.
1 Experience of Thai NSO in ICT Statistics. Paper presented in Joint UNCTAD-ITU-UNESCAP Regional Workshop on Information Society Measurements in Asia-Pacific,
Energy use in manufacturing sector - data sources and data collection 7 th Oslo Group meeting, Helsinki Merja Eskelinen.
Impact of updating weights on tracking performance and volatility: Industry survey G. Bruno, L. Crosilla, P. Margani, A. Righi EU Workshop on Recent Developments.
RENEWING THE EUSTAT TOURISM SURVEY: NEW COLLECTION METHODS AND DESIGN FOR MORE DETAILED ESTIMATES Elena Goni Jorge Aramendi Marta Salvador Anjeles Iztueta.
ICES 2007 Labour Cost Index and Sample Allocation Outi Ahti-Miettinen and Seppo Laaksonen Statistics Finland (+ University of Helsinki) Labour cost index.
Learning Objectives Explain the role of sampling in the research process Distinguish between probability and nonprobability sampling Understand the factors.
© Federal Statistical Office Germany, Division IB, Institute for Research and Development in Federal Statistics Sheet 1 Surveys, administrative data or.
United Nations Statistics Division Work Programme on Economic Census Vladimir Markhonko, Chief Trade Statistics Branch, UNSD Youlia Antonova, Senior Statistician,
The challenge of a mixed-mode design survey and new IT tools application: the case of the Italian Structure Earning Surveys Fabiana Rocci Stefania Cardinleschi.
Improving of Household Sample Surveys Data Quality on Base of Statistical Matching Approaches Ganna Tereshchenko Institute for Demography and Social Research,
Impact of e-commerce on the UK economy Cecil Prescott Head of Research& Development and Information Technology, ONS #UKecomStats.
Performance Indicators Workshop for African countries on the Implementation of International Recommendations for Distributive Trade Statistics May.
Multivariate selective editing via mixture models: first applications to Italian structural business surveys Orietta Luzi, Guarnera U., Silvestri F., Buglielli.
Ph. Brion Insee The contribution of different ways of dealing with non-responses in french business surveys.
Improving visualisation tools in EUSTAT: Explaining the data Jorge Aramendi Rique (Eustat), Anjeles Iztueta Azkue (Eustat), Cristina Prado Valle (Eustat),
1 Optimal Number of Replicates for Variance Estimation Mansour Fahimi, Darryl Creel, Peter Siegel, Matt Westlake, Ruby Johnson, and Jim Chromy Third International.
Workshop on MDG, Bangkok, Jan.2009 MDG 3.2: Share of women in wage employment in the non-agricultural sector National and global data.
Regional Seminar on Developing a Program for the Implementation of the 2008 SNA and Supporting Statistics Cenker Burak METİN September 2013 Ankara.
IAOS Shanghai – Reshaping Official Statistics Some Initiatives on Combining Data to Support Small Area Statistics and Analytical Requirements at.
Suphalak Prasertsang Pornpan Kaewsringam Krongsuk Chayakul Tanaporn Kongprasert Pannee Pattanapradit NATIONAL STATISTICAL OFFICE, THAILAND MAY 2010.
Building a tourism intelligence system using big data Jon Kepa Gerrikagoitia, Ph.D. OPTIMA / Optimization Modelling & Analytics ICT - European Software.
Measuring e-commerce - the Eurostat and OECD approach and the Statistics Finland experience Aarno Airaksinen Regional Workshop, Strengthening.
Big Data: Automatic hotel prices collection on the Internet for the Tourism Survey in the Basque Country EUSTAT. Euskal Estatistika Erakundea – Basque.
Informal Sector Statistics
Regression composite estimation for the Finnish LFS from a practical perspective Riku Salonen.
The European Statistical Training Programme
ADMINISTRATIVE DATA IN ANNUAL BUSINESS STATISTICS OF LATVIA
Estimation of Employment for Cities, Towns and Rural Districts
Implementation of NACE Rev. 2
Regional Seminar on Developing a Program for the Implementation of the 2008 SNA and Supporting Statistics Gülçin ERDOĞAN September 2013 Ankara.
Marie Reijo, Population and Social Statistics
Sampling and estimation
European Conference on Quality in Official Statistics
Task Force on Small and Medium Sized Enterprise Data (SMED)
SMALL AREA ESTIMATION FOR CITY STATISTICS
Presentation transcript:

Small-area estimation in Official Statistics: ICT survey in Enterprises of the Basque Country Jorge Aramendi, Jose Miguel Escalada, Elena Goni & Anjeles Iztueta EUSTAT. Basque Statistics Office European Establishment Statistics Workshop Nuremberg – Germany 9 September 2013

Index Review and Historical Context ICT Survey Design Main Objective Methodology Variables Auxiliary information Estimators Accuracy Conclusion and Results

Review and Historical Context From 2003 onwards, partnership with the UPNA (Navarra Public University) to study small area techniques in EUSTAT surveys In 2004, work began on studying the first survey Industrial Survey (2005) Labour Force Survey (2008) Information Society Survey in Families (2009) Technological Innovation Survey (2010) Information and Communications Technologies (ICT) Survey in Enterprises (2012)

ICT Survey design The ICT (Information and Communications Technologies ) Survey in Enterprises was implemented in 2001 to find out the level of use of new technologies in the Basque economy The ICT Survey is a panel of around 7500 establishments with an annual renewal between 15% and 20% of the elements. Stratified design with optimal allocation according to three variables: province, activity and employment stratum. 3 provinces: Araba, Bizkaia and Gipuzkoa 65 branches of activity, based on NACE 6 employment strata: 0-5, 6-9, 10-19, 20-49, 50-99, >=100

Currently estimates calculated using the direct Horvitz- Thompson estimator for 3 provinces, 65 activity sectors and 3 employment stratum. OBJECTIVE : Obtaining district estimations (20 districts and 3 capital cities) for the main ICT variables. PROBLEM: Insufficient sample size for direct estimates So we decided to improve the estimation methodology of this survey and to introduce small area estimation techniques. Main Objective Bilbao Vitoria - Gasteiz San Sebastián

Methodology Variables : Selection and analysis of the target variables Definition of the small area: 20 districs and 3 capital cities Auxiliary information: Availability of sources, quality of the data,.. Estimators: Select best estimators for the data and asses them

Methodology: Variables Selection and analysis of the target variables: Internet and computer (%) e-commerce (sales or purchases) (%) web (%) freeware operating-systems (%) electronic data exchange (%) web proceedings with the public administration (%).

Methodology: Auxiliary Information Analysis of the auxiliary information: DIRAE – Our Directory for economic establishments establishments Updated with different surveys and administrative sources Employment and Activity are key variables regularly updated

Assess several estimators Design based estimators: Direct Postestratified Synthethic Composite estimators Model based estimators: Logistic regression model, linear mixed model and no-linear mixed model Analyze the consistency and stability of the aggregated estimates at province level Methodology: Model selection

First level model: where p1 is the proportion of establishments that responds affirmatively in a certain variable. β0 is the intercept. β1, …, β19 are the coefficients of explanatory variables for the 20 districts. β20, β21 are the coefficients of explanatory variables for the 3 employment strata. β22, …, β47 are the coefficients of explanatory variables for the 27 categories in the activity classification. Methodology: Final Model

Second level model: where p2 is the proportion of establishments that responds affirmatively in a certain variable. β0 is the intercept. β1, β2 are the coefficients of explanatory variables for the 3 provinces. β3, β4 are the coefficients of explanatory variables for the 3 employment strata. β5, …, β30 are the coefficients of explanatory variables for the 27 categories in the activity classification. Methodology: Model

Mean Squared Error is calculated using the bootstrap resampling method. where: is the total population of a certain variable in region d. R number of repetitions used in the bootstrap method (R = 200) i is the repeated number (i = 1, 2, …, R) d is the region the estimator in the i-th bootstrap sample of said total in region d Methodology: Accuracy

Conclusion and Results Estimates are consistent and stable. The results offer acceptable levels of quality in terms of accuracy. The estimated coefficients of variation (CV) are not excessively high, the majority of the CV-s obtained in the estimations do not exceed 15%., given the relatively small samples and population in some districts. As a result of the study we have a computer programme based on SAS that is used to analyse this methodology and to apply the mentioned estimators and the calculation of the mean squared errors. We have disseminated distric-level information for 3 years 2010, 2011 and 2012

Eskerrik asko Thank you Danke Schön Jorge Aramendi