CJT 765: Structural Equation Modeling Class 7: fitting a model, fit indices, comparing models, statistical power.

Outline of Class
- Finishing up Identification Issues
  - Rules for Assessing Identification
  - Problems, Prevention, and Tests
  - Fixing Identification Problems
- Steps in Testing a Model
- Testing Model Fit
- Comparing Models
- Sample Size Considerations
- Statistical Power Issues

Necessary but not Sufficient Conditions for Identification: Counting Rule
- Counting rule: the number of estimated parameters cannot be greater than the number of sample variances and covariances.
- Where the number of observed variables = p, the number of sample variances and covariances is p(p + 1) / 2.
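The counting rule is simple arithmetic, so a small sketch may help. The function name and the example numbers (4 observed variables, 8 free parameters) are made up for illustration and do not come from the slides.

```python
def counting_rule(num_observed, num_free_parameters):
    """Apply the counting rule.

    Returns (observations, df, passes), where observations is the number of
    sample variances and covariances, p(p + 1) / 2.
    """
    observations = num_observed * (num_observed + 1) // 2
    df = observations - num_free_parameters
    # The rule is necessary, not sufficient: passing it does not prove identification.
    return observations, df, df >= 0


# Hypothetical model: 4 observed variables, 8 freely estimated parameters.
obs, df, ok = counting_rule(num_observed=4, num_free_parameters=8)
print(obs, df, ok)  # 10 2 True
```

A df of 0 corresponds to a just-identified model, and a negative df means the model fails the counting rule.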

Necessary but not Sufficient Conditions for Identification: Order Condition
- m = number of endogenous variables in the model
- k = number of exogenous variables in the model
- k_e = number of exogenous variables in the model that are excluded from the structural equation being tested
- m_i = number of endogenous variables in the model that are included in the equation being tested (including the one being explained on the left-hand side)
- The following requirement must be satisfied: k_e ≥ m_i − 1
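As a rough illustration, the order condition can be checked equation by equation. The sketch below assumes the usual "greater than or equal to" form of the condition; the function name, the dictionary layout, and the two-equation example model are hypothetical.

```python
def order_condition(equations, exogenous):
    """Check the order condition for each structural equation.

    equations: dict mapping each endogenous outcome to the set of
               predictors that appear in its equation.
    exogenous: set of all exogenous variables in the model.
    Returns a dict mapping each outcome to True/False.
    """
    endogenous = set(equations)
    results = {}
    for outcome, predictors in equations.items():
        predictors = set(predictors)
        # k_e: exogenous variables excluded from this equation
        k_e = len(set(exogenous) - predictors)
        # m_i: endogenous variables in this equation, counting the outcome itself
        m_i = len((predictors & endogenous) | {outcome})
        results[outcome] = k_e >= m_i - 1
    return results


# Hypothetical feedback loop: Y1 <-> Y2, each with its own exogenous cause.
model = {"Y1": {"X1", "Y2"}, "Y2": {"X2", "Y1"}}
print(order_condition(model, exogenous={"X1", "X2"}))
```

In this example each equation excludes one exogenous variable and contains two endogenous variables, so both equations meet the condition. If both X1 and X2 appeared in both equations, neither would.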

Necessary but not Sufficient Conditions for Identification: Rank Condition
- For nonrecursive models, each variable in a feedback loop must have a unique pattern of direct effects on it from variables outside the loop.
- For recursive models, an analogous condition must apply, which requires a very complex algorithm or matrix algebra.

Guiding Principles for Identification
- A fully recursive model (one in which all the variables are interconnected) is just identified.
- A model must have some scale for unmeasured variables.

Where are Identification Problems More Likely?
- Models with large numbers of coefficients relative to the number of input covariances
- Reciprocal effects and causal loops
- When the variance of a conceptual-level variable and all factor loadings linking that concept to indicators are free
- Models containing many similar concepts or many error covariances

How to Avoid Underidentification
- Use only recursive models
- Add extra constraints by adding indicators
- Fix whatever structural coefficients are expected to be 0, based on theory, especially reciprocal effects, where possible
- Fix measurement error variances based on known data collection procedures
- Given a clear time order, reciprocal effects shouldn't be estimated
- If the literature suggests the size of certain effects, one can fix the coefficient of that effect to that constant

How to Test for Underidentification
- If the ML solution repeatedly converges to the same set of final estimates given different start values, this suggests identification.
- If concerned about the identification of a particular equation or coefficient, run the model once with the coefficient free and once with it fixed at a value thought to be "minimally yet substantially different" from the estimated value. If the fit of the model is worse, this suggests identification.

What to do if a Model is Underidentified
- Simplify the model
- Add indicators
- Eliminate reciprocal effects
- Eliminate correlations among residuals

Steps in SEM
- Specify the model
- Determine identification of the model
- Select measures and collect, prepare and screen the data
- Use a computer program to estimate the model
- Re-specify the model if necessary
- Describe the analysis accurately and completely
- Replicate the results*
- Apply the results*

Model Specification
- Use theory to determine variables and relationships to test
- Fix, free, and constrain parameters as appropriate

Estimation Methods
- Maximum Likelihood: estimates maximize the likelihood that the data (observed covariances) were drawn from this population. Most forms are simultaneous. The fitting function is related to discrepancies between the observed covariances and those predicted by the model. Typically iterative, deriving an initial solution and then improving it through various calculations.
- Generalized and Unweighted Least Squares: based on a least squares criterion (rather than a discrepancy function), but estimate all parameters simultaneously.
- 2-Stage and 3-Stage Least Squares: can be used to estimate nonrecursive models, but estimate only one equation at a time. Apply multiple regression in two stages, replacing problematic variables (those correlated with disturbances) with a newly created predictor (an instrumental variable that has a direct effect on the problematic variable but not on the endogenous variable).
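For reference, the ML fitting function is commonly written as below. The notation is assumed rather than taken from the slides: S is the sample covariance matrix, Σ(θ) is the covariance matrix implied by the model parameters θ, and p is the number of observed variables.

```latex
F_{ML}(\theta) = \ln\lvert\Sigma(\theta)\rvert - \ln\lvert S\rvert
               + \operatorname{tr}\!\left(S\,\Sigma(\theta)^{-1}\right) - p
```

The function is 0 when Σ(θ) reproduces S exactly and grows as the two matrices diverge, which is the sense in which it measures discrepancy between observed and predicted covariances.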

Measures of Model Fit
- χ² = (N − 1) × minimization criterion. A just-identified model has χ² = 0 and no df. As chi-square increases, fit becomes worse, so it is a badness-of-fit index. It tests the difference in fit between a given overidentified model and a just-identified version of it.
- RMSEA: parsimony-adjusted index that corrects for model complexity. Approximates a noncentral chi-square distribution, which does not require a true null hypothesis, i.e., the model need not be perfect. The noncentrality parameter assesses the degree of falseness of the null hypothesis. Badness-of-fit index, with 0 best and higher values worse. Amount of error of approximation per model df. RMSEA > .10 suggests poor fit.
- CFI: assesses fit of the model compared to a baseline model, typically the independence or null model, which assumes zero population covariances among the observed variables.
- AIC: used to select among nonhierarchical models.
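A minimal sketch of how RMSEA and CFI are commonly computed from chi-square values follows. The formulas are the widely used textbook forms, not something stated on the slide, and the function names and example numbers are hypothetical. Software packages differ in small details, such as using N rather than N − 1.

```python
import math


def rmsea(chi2, df, n):
    """RMSEA: error of approximation per model df (0 is best)."""
    if df <= 0:
        raise ValueError("RMSEA needs a model with positive df")
    return math.sqrt(max(chi2 - df, 0.0) / (df * (n - 1)))


def cfi(chi2_model, df_model, chi2_baseline, df_baseline):
    """CFI: fit relative to a baseline (independence) model, from 0 to 1."""
    model_misfit = max(chi2_model - df_model, 0.0)
    baseline_misfit = max(chi2_baseline - df_baseline, model_misfit, 0.0)
    if baseline_misfit == 0.0:
        return 1.0  # neither model shows misfit beyond its df
    return 1.0 - model_misfit / baseline_misfit


# Hypothetical results: model chi2 = 30 on 10 df, N = 201,
# baseline chi2 = 210 on 15 df.
print(round(rmsea(30.0, 10, 201), 3))
print(round(cfi(30.0, 10, 210.0, 15), 3))
```

Both indices are driven by the excess of chi-square over its df, which is an estimate of the noncentrality parameter mentioned above.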

Comparison of Models
- Hierarchical models: χ² difference test
- Non-hierarchical models: compare model fit indices
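The χ² difference test for hierarchical (nested) models can be sketched as follows. To keep the example self-contained, it computes the chi-square tail probability with closed-form series for integer df instead of calling a statistics library. The function names and the fit values in the example are hypothetical.

```python
import math


def chi2_sf(x, df):
    """Upper-tail probability of a chi-square variate with integer df >= 1."""
    if x <= 0:
        return 1.0
    if df % 2 == 0:
        # Even df: exp(-x/2) * sum_{r=0}^{df/2 - 1} (x/2)^r / r!
        term, total = 1.0, 1.0
        for r in range(1, df // 2):
            term *= x / (2 * r)
            total += term
        return math.exp(-x / 2) * total
    # Odd df: normal tail plus a finite series in sqrt(x)
    chi = math.sqrt(x)
    term, total = chi, 0.0
    for r in range(1, (df - 1) // 2 + 1):
        if r > 1:
            term *= x / (2 * r - 1)
        total += term
    density = math.exp(-x / 2) / math.sqrt(2 * math.pi)
    return math.erfc(chi / math.sqrt(2)) + 2 * density * total


def chi2_difference_test(chi2_restricted, df_restricted, chi2_full, df_full):
    """Compare a restricted model with the fuller model it is nested in."""
    diff = chi2_restricted - chi2_full
    ddf = df_restricted - df_full
    if ddf <= 0:
        raise ValueError("the restricted model must have more df")
    return diff, ddf, chi2_sf(diff, ddf)


# Hypothetical fits: restricted chi2 = 35.2 (12 df), full chi2 = 24.1 (10 df).
diff, ddf, p = chi2_difference_test(35.2, 12, 24.1, 10)
print(round(diff, 1), ddf, round(p, 4))
```

A small p-value indicates that the added restrictions significantly worsen fit, which favors the fuller model.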

Model Respecification
- Model trimming and building
- Empirical vs. theoretical respecification
- Consider equivalent models

Sample Size Guidelines
- Small (under 100), Medium (100-200), Large (200+) [try for medium, large better]
- Models with 1-2 df may require samples of thousands for model-level power of .8.
- When df = 10, may only need n of [value missing in transcript] for model-level power of .8.
- When df > 20, may only need n of 200 for power of .8.
- 20:1 is the ideal ratio of cases to free parameters, 10:1 is ok, and less than 5:1 is almost certainly problematic.
- For regression, N > m for overall R², with m = # IVs, and N > m for individual predictors.
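The cases-to-parameters guideline can be turned into a quick check. This is only a sketch: the function name and labels are made up, and the slide does not say how to describe ratios between 5:1 and 10:1, so that band is labeled as unclassified.

```python
def cases_per_parameter(n_cases, n_free_parameters):
    """Return (ratio, label) using the 20:1 / 10:1 / 5:1 rules of thumb."""
    ratio = n_cases / n_free_parameters
    if ratio >= 20:
        label = "ideal"
    elif ratio >= 10:
        label = "ok"
    elif ratio < 5:
        label = "almost certainly problematic"
    else:
        # 5:1 up to 10:1 is not classified by the guideline itself.
        label = "unclassified (between 5:1 and 10:1)"
    return ratio, label


# Hypothetical study: 150 cases, 12 free parameters.
print(cases_per_parameter(150, 12))  # (12.5, 'ok')
```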

Statistical Power
- Use power analysis tables from Cohen to assess the power of detecting a specific path coefficient.
- Saris & Satorra: use a χ² difference test comparing the predicted covariance matrix to one with that path = 0.
- MacCallum et al. (1996): based on RMSEA and the chi-square distribution, for tests of close fit, not-close fit, and exact fit.
- A small number of computer programs calculate power for SEM at this point.
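The RMSEA-based approach can be sketched in a few lines. The sketch assumes the usual setup, which is not spelled out on the slide: the test statistic follows a noncentral chi-square distribution with noncentrality (N − 1) × df × ε², where ε is the RMSEA under the null (`eps0`) or the alternative (`eps_a`). All function names and defaults are hypothetical, and the pure-Python tail functions are for illustration only and suit moderate N and df. A statistics library would normally be used.

```python
import math


def chi2_sf(x, df):
    """Upper-tail probability of a central chi-square with integer df >= 1."""
    if x <= 0:
        return 1.0
    if df % 2 == 0:
        term, total = 1.0, 1.0
        for r in range(1, df // 2):
            term *= x / (2 * r)
            total += term
        return math.exp(-x / 2) * total
    chi = math.sqrt(x)
    term, total = chi, 0.0
    for r in range(1, (df - 1) // 2 + 1):
        if r > 1:
            term *= x / (2 * r - 1)
        total += term
    density = math.exp(-x / 2) / math.sqrt(2 * math.pi)
    return math.erfc(chi / math.sqrt(2)) + 2 * density * total


def ncx2_sf(x, df, nc):
    """Noncentral chi-square upper tail as a Poisson mixture of central ones."""
    if nc <= 0:
        return chi2_sf(x, df)
    half = nc / 2.0
    weight = math.exp(-half)  # Poisson weight for j = 0
    upper = int(half + 12 * math.sqrt(half) + 60)  # truncation point
    total = 0.0
    for j in range(upper + 1):
        total += weight * chi2_sf(x, df + 2 * j)
        weight *= half / (j + 1)
    return total


def rmsea_power(n, df, eps0=0.05, eps_a=0.08, alpha=0.05):
    """Power of an RMSEA-based test of model fit.

    eps0 = 0 gives the test of exact fit; eps_a > eps0 > 0 gives the test of
    close fit; eps_a < eps0 gives the test of not-close fit.
    """
    nc0 = (n - 1) * df * eps0 ** 2
    nca = (n - 1) * df * eps_a ** 2
    upper_tail = eps_a >= eps0
    target = alpha if upper_tail else 1.0 - alpha
    # Bisection for the critical value under the null distribution.
    lo, hi = 0.0, df + nc0 + 10.0
    while ncx2_sf(hi, df, nc0) > target:
        hi *= 2
    for _ in range(100):
        mid = (lo + hi) / 2
        if ncx2_sf(mid, df, nc0) > target:
            lo = mid
        else:
            hi = mid
    crit = (lo + hi) / 2
    tail = ncx2_sf(crit, df, nca)
    return tail if upper_tail else 1.0 - tail


# Hypothetical design: df = 10, test of close fit, two sample sizes.
print(round(rmsea_power(200, 10), 3), round(rmsea_power(400, 10), 3))
```

Power rises with both N and df in this setup, which matches the sample size guidance that models with very few df need much larger samples.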