Using Modern Missing Data Analyses for effective inference about Hunters’ satisfaction towards OFW Program Muhammad Imran Khan.

Slides:

Advertisements

Similar presentations

Survey design. What is a survey?? Asking questions – questionnaires Finding out things about people Simple things – lots of people What things? What people?

Advertisements

Handling attrition and nonresponse in longitudinal data Harvey Goldstein University of Bristol.

Handling Missing Data on ALSPAC

Treatment of missing values

CountrySTAT Team-I November 2014, ECO Secretariat,Teheran.

Missing values problem in Data Mining

1crmda.KU.edu Todd D. Little University of Kansas Director, Quantitative Training Program Director, Center for Research Methods and Data Analysis Director,

Approaches for Addressing Issues of Missing Data in the Statistical Modeling of Adolescent Fertility Dudley L. Poston, Jr. Texas A&M University & Eugenia.

 Overview  Types of Missing Data  Strategies for Handling Missing Data  Software Applications and Examples.

Some birds, a cool cat and a wolf

CJT 765: Structural Equation Modeling Class 3: Data Screening: Fixing Distributional Problems, Missing Data, Measurement.

Getting Started with Large Scale Datasets Dr. Joni M. Lakin Dr. Margaret Ross Dr. Yi Han.

NLSCY – Non-response. Non-response There are various reasons why there is non-response to a survey  Some related to the survey process Timing Poor frame.

Adapting to missing data

How to Handle Missing Values in Multivariate Data By Jeff McNeal & Marlen Roberts 1.

Missing Data in Randomized Control Trials

Journal Club Alcohol, Other Drugs, and Health: Current Evidence May–June 2009.

How to deal with missing data: INTRODUCTION

Partially Missing At Random and Ignorable Inferences for Parameter Subsets with Missing Data Roderick Little Rennes

Missing Data.. What do we mean by missing data? Missing observations which were intended to be collected but: –Never collected –Lost accidently –Wrongly.

Psych 524 Andrew Ainsworth Data Screening 2. Transformation allows for the correction of non-normality caused by skewness, kurtosis, or other problems.

Statistical Methods for Missing Data Roberta Harnett MAR 550 October 30, 2007.

PEAS wprkshop 2 Non-response and what to do about it Gillian Raab Professor of Applied Statistics Napier University.

Survey Experiments. Defined Uses a survey question as its measurement device Manipulates the content, order, format, or other characteristics of the survey.

Factors that Associated with Stress in Nursing Faculty in Thailand

Biostatistics Case Studies 2014 Session 6 An Overview of Missing Data Youngju Pak Biostatistician

Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 19 Process of Quantitative Data Analysis and Interpretation.

1 Introduction to Survey Data Analysis Linda K. Owens, PhD Assistant Director for Sampling & Analysis Survey Research Laboratory University of Illinois.

Multiple Imputation (MI) Technique Using a Sequence of Regression Models OJOC Cohort 15 Veronika N. Stiles, BSDH University of Michigan September’2012.

G Lecture 11 G Session 12 Analyses with missing data What should be reported?  Hoyle and Panter  McDonald and Moon-Ho (2002)

Handling Attrition and Non- response in the 1970 British Cohort Study Tarek Mostafa Institute of Education – University of London.

Applied Epidemiologic Analysis - P8400 Fall 2002 Lab 10 Missing Data Henian Chen, M.D., Ph.D.

Imputation for Multi Care Data Naren Meadem. Introduction What is certain in life? –Death –Taxes What is certain in research? –Measurement error –Missing.

1 Introduction to Survey Data Analysis Linda K. Owens, PhD Assistant Director for Sampling & Analysis Survey Research Laboratory University of Illinois.

SW 983 Missing Data Treatment Most of the slides presented here are from the Modern Missing Data Methods, 2011, 5 day course presented by the KUCRMDA,

1crmda.KU.edu Todd D. Little University of Kansas Director, Quantitative Training Program Director, Center for Research Methods and Data Analysis Director,

© John M. Abowd 2007, all rights reserved General Methods for Missing Data John M. Abowd March 2007.

1 G Lect 13W Imputation (data augmentation) of missing data Multiple imputation Examples G Multiple Regression Week 13 (Wednesday)

The Impact of Missing Data on the Detection of Nonuniform Differential Item Functioning W. Holmes Finch.

1 G Lect 13M Why might data be missing in psychological studies? Missing data patterns Overview of statistical approaches Example G Multiple.

Missing Values Raymond Kim Pink Preechavanichwong Andrew Wendel October 27, 2015.

Simulation Study for Longitudinal Data with Nonignorable Missing Data Rong Liu, PhD Candidate Dr. Ramakrishnan, Advisor Department of Biostatistics Virginia.

Diagnostic methods for checking multiple imputation models Cattram Nguyen, Katherine Lee, John Carlin Biometrics by the Harbour, 30 Nov, 2015.

1crmda.KU.edu Todd D. Little University of Kansas Director, Quantitative Training Program Director, Center for Research Methods and Data Analysis Director,

Special Topics in Educational Data Mining HUDK5199 Spring term, 2013 March 13, 2013.

Tutorial I: Missing Value Analysis

INFO 4470/ILRLE 4470 Visualization Tools and Data Quality John M. Abowd and Lars Vilhuber March 16, 2011.

BPS - 5th Ed. Chapter 221 Two Categorical Variables: The Chi-Square Test.

Pre-Processing & Item Analysis DeShon Pre-Processing Method of Pre-processing depends on the type of measurement instrument used Method of Pre-processing.

Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 April 9, 2012.

REGRESSION MODEL FITTING & IDENTIFICATION OF PROGNOSTIC FACTORS BISMA FAROOQI.

A framework for multiple imputation & clustering -Mainly basic idea for imputation- Tokei Benkyokai 2013/10/28 T. Kawaguchi 1.

1 Survey Nonresponse Survey Research Laboratory University of Illinois at Chicago March 16, 2010.

DATA STRUCTURES AND LONGITUDINAL DATA ANALYSIS Nidhi Kohli, Ph.D. Quantitative Methods in Education (QME) Department of Educational Psychology 1.

Research and Evaluation Methodology Program College of Education A comparison of methods for imputation of missing covariate data prior to propensity score.

HANDLING MISSING DATA.

Missing data: Why you should care about it and what to do about it

Handling Attrition and Non-response in the 1970 British Cohort Study

Maximum Likelihood & Missing data

Introduction to Survey Data Analysis

Multiple Imputation Using Stata

The bane of data analysis

Peng Zhang Jinnan Liu Mei-ting Chiang Yin Liu

MEASUREMENT OF THE QUALITY OF STATISTICS

EM for Inference in MV Data

Missing Data Mechanisms

Analysis of missing responses to the sexual experience question in evaluation of an adolescent HIV risk reduction intervention Yu-li Hsieh, Barbara L.

EM for Inference in MV Data

Clinical prediction models

Missing data: Is it all the same?

Presentation transcript:

Using Modern Missing Data Analyses for effective inference about Hunters’ satisfaction towards OFW Program Muhammad Imran Khan

Motivation of Study Hunting & fishing are part of Nebraska's heritage NGPC is interested in improving hunter/angler recruitment & retention ( NGPC,2008 ) Data collected in 2013 to know about hunters’ motivations & satisfactions towards OFW lands Purpose of this study is to compare estimates using appropriate imputation methods 2

Missing Data Missingness in Surveys ( Groves et al., 2004 ) – Noncoverage – Unit Nonresponse – Item Nonresponse – Partial Nonresponse ( Brick & Kalton,1996 ) – Data Entry Error ( Anne & Andrea,2014 ) Missing data Mechanism( Buuren, 2012 ) – Missing Completely At Random (MCAR) – Missing At Random (MAR) – Missing Not At Random (MNAR) 3

How much missing data is “problematic” Researchers assign some limits: – > 5% ( Schafer,1999 ) – >10% ( Benntt,2001 ) – >20% ( Peng et al., 2006 ) – ( Widaman,2006 ) specified the following scale o 1%-2% (Negligible) o 5%-10% ( Minor) o 10%-25% (Moderate) o 25%-50% (High) o >50% (Excessive) Important problems of missingness ( Bell & Fairclough,2013 ) – decrease in precision – Increase bias in parameter estimation 4

NGPC & UNL conducted survey Sampling frame: hunters who purchased hunting license for hunting in 2012 in NE – The survey contained three parts: o Where, & what hunt; Environment Impact o Motivations(Relatedness, Competence, Autonomy) o Socio-demographic factors About collected data – Total questions = 42 (used 19 Qus. for analysis) – Sample size = 8181 – Completely filled =1555 (19%) – Unit nonresponse = 627 (8%) – Item nonresponse = 5999 (73%) o Varies from 1 to 8 missingness per respondent in all 19 Qus. 5 81%

Determining Type of Missing Data 6 M.Satisf.Rel_1Rel_2Comp.Auto. H_Days“Harvest” Educ.IncomeAge Ns %

Data used for analysis 13 Questions for motivation based on SDT 5 Questions on relatedness transformed to 2 factors 7

Data used for analysis 13 Questions for motivation based on SDT 4 Qus. on competence & autonomy transformed each to 1 factor 8

Satisfaction=Rel_1+Rel_2+Comp+Auto+ Educ+Age+Income+H_Days+Harvest Model used for the analysis 9 VariableDescription of the variable [measured on 7 point Likert scale] SatisfactionHow satisfied were you with your experience on private lands enrolled in the Open Fields and Waters (OFW)? Releatedness_1I enjoy mentoring other hunters Releatedness_2I go hunting primarily to spend time with others & people I care about CompetenceOverall, Hunting makes me feel competent in other areas of my life AutonomyHunting helps me to feel independent; self-sufficient and more control in life Education Highest level of education that you have complete (<HS;HS;S.C;C;≥ G ) Age Age (Approximately in years) Income Total annual income for your household before taxes (8 diff. levels) Hunting_Days Visiting OFW sites allowed me to increase total days I spent hunting “Harvest” If you hunted in 2012 on a OFW site, did you harvest? (Yes/No)

Deletion or non-imputing methods: o List-wise Deletion ( Pigott, 2001 ) o Pair-wise Deletion ( Bennett, 2001 ) Nonstochastic or ad-hoc methods: o Mean Imputation (Graham,2003) o Regression Imputation ( Qin et.al., 2007 ) Stochastic or Established methods: o Stochastic Regression ( Todd et al., 2013 ) o Multiple Imputation(MI) (John, et al., 2007) o Full Information Maximum Likelihood(FIML) o Expectation Maximization (EM)(Yiran & Chao-Ying, 2013) Methods for Handling Missing Data 10

Mean Imputation 11

Comparing Results 12 Fitted Model List-wise DeletionMean Imputation p-value Intercept Releatedness_ Releatedness_ Competence Autonomy Education Age Income Hunting_Days “Harvest” cases or rows are Deletedm=1, maxit=1

Multiple Imputation 13

Comparing Results 14 Fitted Model List-wise DeletionMean ImputationMultiple Imputation p-value Intercept Releatedness_ Releatedness_ Competence Autonomy Education Age Income Hunting_Days “Harvest” cases or rows are Deletedm=1, maxit=1 m=20, maxit=10

Comparing Results 15 Fitted Model List-wise Deletion Full Information Maximum Likelihood (FIML) Imputation Expectation Maximization (EM) Imputation p-value Intercept Releatedness_ Releatedness_ Competence Autonomy Education Age Income Hunting_Days “Harvest” cases or rows are Deleted EM algorithm (MLE) converges in 37 iterations

EM only shows that Releadness_2 is significant EM estimates smallest standard error for Income Comparison of Imputation Methods Summary 16 % of smaller estimations than List-wise Deletion out of 10 variables ApproachesEstimatesStd. Err.P-valueSuggestions List-wise DeletionBase Avoid to use Mean Imputation60%100%40%Careful use Multiple Imputation30%100%20%Better Full Information Maximum Likelihood 30%100%20%Better Expectation Maximization 40%90%20%Preferred if converged

Thanks for your kind attention Special Thanks to: Dr. Andrew Tyre, Uni. Of Nebraska, Lincoln Dr. Lisa Pennisi, Uni. Of Nebraska, Lincoln Dr. Allan McCutcheon, Uni. Of Nebraska, Lincoln Nebraska Game & Parks Commission

Anne-Kathrin,F. & Andrea B. (2014). The economic performance of Swiss drinking water utilities. Journal of Prod. Analysis. 41: doi /s Bell, M. L.,& Fairclough,D.L. (2013). Practical and statistical issues in missing data for longitudinal patient reported outcomes. Statistical Methods in Medical Research, 0(0), doi: / Bennett, D.A. (2001). How can I deal with missing data in my study? Australian and New Zealand Journal of Public Health, 25, Brick, J., & Kalton, J. (1996). Handling missing data in survey research. Statistical Methods in Medical Research, 5, 215–238. doi: / Buuren, S.V.(2012). Flexible imputation of missing data. Taylor & Francis, FL: CRC Press. John, W. G. & Allison E. O. & Tamika D. G.(2007). How many imputations are really needed? some practical clarifications of multiple imputation theory, Springer,8: Graham, J. W. (2003). Adding missing-data-relevant variables to FIML based structuralequation models. Structural Equation Modeling, 10,80–100. Groves, R., Fowler, F., Couper, M., Lepkowski, J., Singer, E., & Tourangeau, R. (2004). Survey methodology. Hoboken, NJ: John Wiley. Little, R.J.A. (1988). A test of missing completely at random for multivariate data with missing values. Journal of the American Statistical Association, 83, NGPC (2008). Nebraska 20 year hunter/angler recruitment, development and retention plan. Lincoln, NE. Pigott, T. D. (2001). A Review of Methods for Missing Data. Educational Research and Evaluation, 7(4), Peng, C.Y., Harwell, M., Liou, S.M., & Ehman, L.H. (2006). Advances in missing data methods and implications for educational research. In S Sawilowsky (Ed.), Real data analysis (pp.31-78), Greenwich, CT: Information Age. Qin,Y.,Zhang,S.,Zhu,X.,Zang,J.,& Zhang,C. (2007). Semi-parametric optimization for missing data imputation. Appl Intell 27, DOI /s Schafer, J.L. (1999). Multiple imputation: A primer. Statistical Methods in Medical Research. 8: Todd D. L., Terrence D. J., Kyle M. L., & Whitney M. (2013). On the joys of missing data. Journal of Pediatric Psychology, doi: /jpepsy/jst048 Yiran D. & Chao-Ying J.P.(2013). Principled missing data methods for researchers. Springer, 2:222. References 18

Contact Information: