The European Statistical Training Programme (ESTP)

Slides:



Advertisements
Similar presentations
1 Incorporating Statistical Process Control and Statistical Quality Control Techniques into a Quality Assurance Program Robyn Sirkis U.S. Census Bureau.
Advertisements

Quality indicators for measuring and enhancing the composition of survey response Q2008 – Special topic session, July 9 Jelke Bethlehem and Barry Schouten.
Methodology for a school- leavers’ survey Irena Kogan MZES, University of Mannheim.
Quantitative Methods in the Social Sciences (QMSS) Lugano August 2005 Increasing response rates Ineke Stoop SCP.
REPRESENTATIVE RESPONSE Some Examples ITSEW Peter Lundquist Statistics Sweden.
National Household Survey: collection, quality and dissemination Laurent Roy Statistics Canada March 20, 2013 National Household Survey 1.
The new HBS Chisinau, 26 October Outline 1.How the HBS changed 2.Assessment of data quality 3.Data comparability 4.Conclusions.
TURKISH STATISTICAL INSTITUTE Social Sector Statistics Department Tourism Statistics Group
Current Population Survey Sponsor: Bureau of Labor Statistics Collector: Census Bureau Purpose: Monthly Data for Analysis of Labor Market Conditions –CPS.
Economics and Statistics Administration U.S. CENSUS BUREAU U.S. Department of Commerce Assessing the “Year of Naturalization” Data in the American Community.
ISR Training Jan. 21,  Canada’s largest survey  Complete population count  Gathers information on the demographic, social and economic conditions.
Centraal Bureau voor de Statistiek Challenges of redesigning household surveys and maintaining output quality Menno Cuppen Paul van der Laan Wim van Nunspeet.
Measuring Disability: Results from the 2001 Census and the 2001 Post-Censal Disability Survey Statistics Canada January 10, 2003.
The Dutch travel survey Mixed-mode experiences from the Netherlands Ilona Bouhuijs Netherlands Statistics June 17th 2013 Disclaimer: the views expressed.
Implementing cawi into the data collection process Kees van Berkel Mariëtte Vosmer Jerusalem, July 2013.
Regional Roundtable on
Conversely Mixed Mode in the Swedish Crime Survey Sanna Wallin, researcher The Swedish National Council for Crime Prevention Thank you! It’s very nice.
The Language of Sampling
Rose Krebill-Prather, PhD
UNECE Work Session on Gender Statistics Belgrade November, 2017
SAMPLING (Zikmund, Chapter 12.
LISA, Anticipating the Next Generation of Longitudinal Data
NCN module on cultural participation:
The second wave of the new design of the Dutch EU-SILC: Possibilities and challenges Judit Arends.
The Language of Sampling
Nonresponse Bias in a Nationwide Dual-Mode Survey
Chapter 2: The nonresponse problem
LISA, Anticipating the Next Generation of Longitudinal Data
The European Statistical Training Programme (ESTP)
Effect of Panel Length and Following Rules on Cross-Sectional Estimates of Income Distribution: Empirical Evidence from FI-SILC Marjo Pyy-Martikainen Workshop.
Chapter 7: Reducing nonresponse
The European Statistical Training Programme (ESTP)
The European Statistical Training Programme (ESTP)
Chapter 14: Mixed-mode datacollection
The European Statistical Training Programme (ESTP)
2011 POPULATION AND HOUSING CENSUS PREPARATORY WORKS
Chapter 8: Weighting adjustment
Chapter 12: Other nonresponse correction techniques
The European Statistical Training Programme (ESTP)
Chapter 11: Adjustment for different types of nonresponse
Assessing Quality of Paradata to Better Understand the Data Collection Process for CAPI Social Surveys François Laflamme Milana Karaganis European Conference.
Chapter 10: Selection of auxiliary variables
Do now – identify the sampling method
National needs for AES Purpose - describe participation in learning during a 12 months period. The main parameters are; Participation rates in different.
Census Planning and Management
SAMPLING (Zikmund, Chapter 12).
The European Statistical Training Programme (ESTP)
Workshop on migration statistics
Biography Hans Schmeets is senior researcher at Statistics Netherlands (Division of Social and Spatial Statistics, Heerlen) and professor at the University.
Chapter: 9: Propensity scores
Metro ACEs Data 2018 Community Health Needs Assessment
Chapter 3: Response models
New Techniques and Technologies for Statistics 2017  Estimation of Response Propensities and Indicators of Representative Response Using Population-Level.
Implementing mixed mode questionnaire in FI-SILC
The European Statistical Training Programme (ESTP)
Chapter 4: Missing data mechanisms
The European Statistical Training Programme (ESTP)
Collecting the Data Tim Vizard, Office for National Statistics.
Week 9 Sampling a population
The European Statistical Training Programme (ESTP)
Chapter 6: Measures of representativity
The European Statistical Training Programme (ESTP)
Chapter 13: Item nonresponse
The Application of Statistical Matching to the 2010 ESF Leavers Survey
Census topics selection
Chapter 2: The nonresponse problem
Adaptive mixed-mode design WP1
Chapter 5: The analysis of nonresponse
Dealing with nonresponse
Stratification, calibration and reducing attrition rate in the Dutch EU-SILC Judit Arends.
Presentation transcript:

The European Statistical Training Programme (ESTP)

Chapter 5: The analysis of nonresponse Handbook: chapter 5 How to detect a bias? Where to find auxiliary information An example: POLS 1998 Global analysis A more detailed analysis Conclusions

How to detect a bias? The question: How to detect whether estimates for a target variable are biased if you don’t know its values for the nonrespondents? The answer: Use auxiliary variables! Auxiliary variables: Have been measured in the survey Population (or complete sample) distribution is available Correlated with target variables Important: Decision to include auxiliary variables in survey at design stage.

How to detect a bias? Possible relationships between target variable, response behaviour and auxiliary variable Missing Completely At Random (MCAR): No relation between X and R. No bias. No problem. Missing At Random (MAR): Relation between X and R. Bias can be reduced by using X. Not Missing At Random (MAR): Direct relation between Y and R. Bias cannot be reduced. Response behaviour R Auxiliary variable X Target variable Y

Where to find auxiliary variables? In the sampling frame itself Example: Survey on Household Living Conditions 1998 Two-stage sample, first stage: municipalities, second stage: persons from population register Auxiliary variables: gender, age, marital status, geographical area

Where to find auxiliary variables? Observations by interviewers Example: Housing Demand Survey 1977/1978 Estimate of building period of house, Type of house, Number of floors.

Where to find auxiliary variables? National Statistical Institute Example: Survey on Household Living Conditions 1998 Population by sex, age, marital status and region

Survey on Household Living Conditions 1998 (POLS) Continuous survey Every month a new sample, thematic modules Fieldwork First month CAPI, second month CATI Fieldwork results Frequency Percentage Sample size 39 302 100.0 % Response Immediate response Converted refusers Other response 24 008 9718 14 275 15 61.1 % 24.7 % 36.3 % 0.0 % Nonresponse Unprocessed cases Non contact (not-at-home) Non contact (moved) Not able (illness, handicap) Not able (language problem) Refusal Other nonresponse 15 294 2 514 2 093 376 735 416 8 918 242 38.9 % 6.4 % 5.3 % 1.0 % 1.9 % 1.1 % 22.7 % 0.6 %

Survey on Household Living Conditions 1998 (POLS) Composition of nonresponse

Survey on Household Living Conditions 1998 (POLS) Auxiliary variables Social Statistics Database of Statistics Netherlands Data on demography, geography, income, labour, education, health, social protection Variables used: sex, age, province, ethnic group, employment, social security benefits Data on postal code areas 420,000 area’s, homogeneous with respect to socio-economic characteristics Variables used: urbanisation, town size, percentage non-natives, average housing value Fieldwork data Information about all contact attempts Variables used: result of contact attempt, type of nonresponse, interviewer district, telephone number

Global nonresponse analysis of POLS 1998 Age

Global nonresponse analysis of POLS 1998 Marital status

Global nonresponse analysis of POLS 1998 Household size

Global nonresponse analysis of POLS 1998 Degree of urbanisation

Global nonresponse analysis of POLS 1998 Province

Global nonresponse analysis of POLS 1998 Average house value in neighbourhood

Global nonresponse analysis of POLS 1998 Percentage of non-natives in neighbourhood

Global nonresponse analysis of POLS 1998 Listed fixed-line telephone number

Detailed analysis Ingredients for a detailed analysis of nonresponse Respondent characteristics Neighbourhood characteristics Interviewer and data collection characteristics

Detailed analysis Respondent characteristics Demographics like age, gender, marital status, type of household Socio-economics like income, employment, education Neighbourhood characteristics Type of dwelling and neighbourhood, degree of urbanisation Interviewer and data collection characteristics Type of nonresponse Contact and refusal strategy Interviewer experience and gender Interview mode

Detailed analysis Decomposition of response process Sample Processed? Not processed Not moved? Moved Contacted? Not contacted Participated? Language problem Not able Refusal Response

Detailed analysis Why a decomposition of the response process? It gives insight into the possible causes for selective response It may reveal possible measures to reduce nonresponse It enables the incorporation of auxiliary information Compare survey topics to steps in response process Do you travel a lot? Did you recently buy a new house? How many years of education in Dutch do you have? Do you participate in surveys?

Detailed analysis Processing allocated cases Respondent characteristics: listed phone, access to web Neighbourhood characteristics: type of neighbourhood Interviewer and data collection characteristics: interviewer workload Example Interviewer district, average value of houses Proportion explained: 9.1%

Detailed analysis Processing allocated cases Respondent characteristics: listed phone, access to web Neighbourhood characteristics: type of neighbourhood Interviewer and data collection characteristics: interviewer workload Example Interviewer district, average value of houses Proportion explained: 9.1%

Detailed analysis Non-contacted cases Respondent characteristics: available auxiliary information Neighbourhood characteristics: available auxiliary information Interviewer and data collection characteristics: number and timing of contact attempts, registered phone, interviewer workload Example Registered phone, interviewer district, age, type of household Proportion explained: 15.2%

Detailed analysis Non-contacted cases N=32996P= 5.9% N=25512 P=3.8% Listed phone number No listed phone number Other Child, married, single parent, partner in unmarried couple without children Other towns Big cities Flevoland, Amsterdam N=232 P=8.2% N=1494 P=18.9% Divorced N=482 P=13.3% N=1012 P=21.5% N=1064 P=30.3% N=581 P=17.2% Districts: 8,12,15,17 Couple without children, single parent Districts Detailed analysis Non-contacted cases

Detailed analysis Language problems Respondent characteristics: available auxiliary information Neighbourhood characteristics: available auxiliary information Interviewer and data collection characteristics: ethnicity of interviewer, proxy-interviewing Example Ethnicity, i.e. origin and generation Proportion explained: 51.2%

Detailed analysis Mentally or physically not able Respondent characteristics: available auxiliary information Neighbourhood characteristics: available auxiliary information Interviewer and data collection characteristics: - Example Age, employment status Proportion explained: 20%

Detailed analysis Refusals Respondent characteristics: available auxiliary information Neighbourhood characteristics: available auxiliary information Interviewer and data collection characteristics: interviewer experience, interviewer gender and age, interview mode, follow up strategy Example Registered phone, type of household, average house value, age, interviewer district Proportion explained: <5%

Detailed analysis In summary, literature confirms most findings Sample Processed? Interviewer workload, average house value Not moved? Age,type of household, Contacted? Registered phone, interviewer workload, age, type of household Participated? Age, interviewer experience,ethnic background, interviewer experience, average house value Response

Conclusions Identify and link available auxiliary information Respondent characteristics Neighbourhood characteristics Interviewer and data collection characteristics Investigate literature on similar surveys and data collection strategies If not available plan future collection of missing crucial auxiliary information Decompose response process Make at least a distinction between contact and participation Relate survey topics to steps in response process Relate interviewer and data collection characteristics to steps in response process Identify mode characteristics