Presentation is loading. Please wait.

Presentation is loading. Please wait.

Multiple Imputation Multiple Regression. Input From SPSS *** Mult-Imput_M-Reg.sas ***; PROC IMPORT OUT= WORK.IntroQuest DATAFILE= "C:\Users\Vati\Documents\StatData\IntroQ\IntroQ.sav"

Similar presentations


Presentation on theme: "Multiple Imputation Multiple Regression. Input From SPSS *** Mult-Imput_M-Reg.sas ***; PROC IMPORT OUT= WORK.IntroQuest DATAFILE= "C:\Users\Vati\Documents\StatData\IntroQ\IntroQ.sav""— Presentation transcript:

1 Multiple Imputation Multiple Regression

2 Input From SPSS *** Mult-Imput_M-Reg.sas ***; PROC IMPORT OUT= WORK.IntroQuest DATAFILE= "C:\Users\Vati\Documents\StatData\IntroQ\IntroQ.sav" DBMS=SPSS REPLACE; Run; Use the Import Wizard to bring the data into SAS.

3 Create Missingness Variable Data Priapus; set IntroQuest; SATM_Miss = 0; If SATM =. then SATM_Miss = 1;

4 Check For Missing Data proc means n nmiss; run;

5 Check Correlates of Missingness on SATM proc corr nosimple; var SATM_Miss; with statoph gender ideal nucoph year; run;

6 Predictorr Ideal-.017 Statoph.084 * Nucoph.007 Year.082* Gender-.057 * p <.05

7 Oh Crap ! We have a lot of missing data on SATM Missingness on SATM is associated with statophobia and year. It is not missing completely at random. Need to employ multiple imputation.

8 Create Five Imputations Proc MI seed=69301 out=MIdata; var gender ideal nucoph SATM year; run;

9 Patterns of Missingness Most frequent pattern of missing data is missing on SATM only. GroupStatophGenderIdealNucophSATMYearFreq 1 XXXXXX540 2 XXXX.X139 3 XXX.XX1 4 XX.XXX2 5 XX.X.X3 6.XXXXX3 7.XXX.X5 8.XX.XX1

10 Means By Pattern of Missingness. GroupGroup Means StatophGenderIdealNucophSATMYear 16.17121.2666670.279258.0574506.6681997.66 26.67261.2086370.237459.1870.1999.15 36.00001.0000073.0000.650.0002012.00 45.00001.50000.57.5000440.0001993.00 55.33331.33333.50.0000.1999.33 6.1.0000070.666658.3333550.0002010.00 7.1.0000067.200043.4000.2010.00 8.1.0000075.0000.730.0001991.00

11 Estimated Means & Covariances EM (Posterior Mode) Estimates _TYPE__NAME_StatophGenderIdealNucophSATMYear MEAN 6.2598241.25216170.2559758.15510507.40231998.110 COVStatoph5.252201-0.141890.6859980.893161-72.9087-3.50985 COVGender-0.141890.186693-0.91273-0.826162.167681-0.04196 COVIdeal0.685998-0.9127314.816088.297513-23.9405-1.32310 COVNucoph0.893161-0.826168.297513497.023777.921512.866324 COVSATM-72.90872.167681-23.940577.921519130.096284.1264 COVYear-3.50985-0.04196-1.323102.866324284.126479.07055

12 Analyze the Imputed Data Proc Reg outest = MRbyImput covout; Model Statoph = gender ideal nucoph SATM year / stb; By _Imputation_; run; Proc MIAnalyze; modeleffects intercept gender ideal nucoph SATM year; run; See the complete output herehere In every imputation, Gender, SATM, and Year have significant effects.

13 Proc MIAnalyze Output Pools the results from the five imputations. The variance in the scores is partitioned between that among imputations and that within imputations. Ideally, little of the variance is due to differences among imputations.

14 Variance Among/Within Imputations ParameterVariance BetweenWithinTotal intercept24.530433377.042475406.478994 gender0.0015390.0507010.052548 ideal0.000051070.0006390.000701 nucoph0.000000220.0000133990.000013669 SATM0.000000240.0000008020.000001092 year0.000007300.0000940110.000103

15 “Relative Increase in Variance” is the increase in variance due to having missing data imputed (relative to the condition where no data are missing). Low is good. “Fraction of Missing Information,” is an index of how much more precise the parameter estimate would have been if there had been no missing data. Low is good.

16 “Relative efficiency” tells you how much power you have for the number of imputations you have employed relative to what you would have if you used an uncountably large number of imputations. High is good.

17 RIV, FMI, & RE ParameterRelative Increase in Variance Fraction Missing Information Relative Efficiency intercept0.0780720.0748410.985253 gender0.0364330.0357480.992901 ideal0.0958740.0909580.982133 nucoph0.0200940.0198880.996038 SATM0.3621700.2905190.945087 year0.0931980.0885590.982596

18 ParameterEstimate 95% Confidence Limits DFMinMaxtPr > |t| intercept46.8727.294686.451762.7241.56454.5182.320.0203 gender-0.740-1.189-0.2903237-0.778-0.685-3.230.0013 ideal-0.015-0.0660.0371522.62-0.024-0.009-0.560.5747 nucoph0.0022-0.0050.0094103090.00150.00280.600.5453 SATM-0.007-0.009-0.00556.584-0.008-0.007-7.22<.0001 year-0.017-0.0370.0024550.36-0.021-0.014-1.730.0851

19 Conclusions Women report greater fear of the stats course than do men. Reported Math Aptitude is inversely correlated with fear of stats.


Download ppt "Multiple Imputation Multiple Regression. Input From SPSS *** Mult-Imput_M-Reg.sas ***; PROC IMPORT OUT= WORK.IntroQuest DATAFILE= "C:\Users\Vati\Documents\StatData\IntroQ\IntroQ.sav""

Similar presentations


Ads by Google