Structural Equation Modeling

Slides:



Advertisements
Similar presentations
1 Regression as Moment Structure. 2 Regression Equation Y =  X + v Observable Variables Y z = X Moment matrix  YY  YX  =  YX  XX Moment structure.
Advertisements

Structural Equation Modeling Using Mplus Chongming Yang Research Support Center FHSS College.
Structural Equation Modeling
1 1 Chapter 5: Multiple Regression 5.1 Fitting a Multiple Regression Model 5.2 Fitting a Multiple Regression Model with Interactions 5.3 Generating and.
Structural Equation Modeling: An Overview P. Paxton.
Forecasting Using the Simple Linear Regression Model and Correlation
Soc 3306a: Path Analysis Using Multiple Regression and Path Analysis to Model Causality.
Outline 1) Objectives 2) Model representation 3) Assumptions 4) Data type requirement 5) Steps for solving problem 6) A hypothetical example Path Analysis.
A Short Introduction to Curve Fitting and Regression by Brad Morantz
Bivariate Regression Analysis
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Standard Error of the Estimate Goodness of Fit Coefficient of Determination Regression Coefficients.
1-1 Regression Models  Population Deterministic Regression Model Y i =  0 +  1 X i u Y i only depends on the value of X i and no other factor can affect.
Chapter 10 Simple Regression.
Linear Regression with One Regression
Multivariate Data Analysis Chapter 11 - Structural Equation Modeling.
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Lesson #32 Simple Linear Regression. Regression is used to model and/or predict a variable; called the dependent variable, Y; based on one or more independent.
CHAPTER 4 ECONOMETRICS x x x x x Multiple Regression = more than one explanatory variable Independent variables are X 2 and X 3. Y i = B 1 + B 2 X 2i +
“Ghost Chasing”: Demystifying Latent Variables and SEM
Structural Equation Modeling
Economics 310 Lecture 18 Simultaneous Equations There is a two-way, or simultaneous, relationship between Y and (some of) the X’s, which makes the distinction.
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Statistics 350 Lecture 17. Today Last Day: Introduction to Multiple Linear Regression Model Today: More Chapter 6.
G Lect 31 G Lecture 3 SEM Model notation Review of mediation Estimating SEM models Moderation.
Structural Equation Modeling Intro to SEM Psy 524 Ainsworth.
Simple Linear Regression Analysis
Introduction to Linear Regression and Correlation Analysis
Elements of Multiple Regression Analysis: Two Independent Variables Yong Sept
Chapter 11 Simple Regression
Soc 3306a Lecture 8: Multivariate 1 Using Multiple Regression and Path Analysis to Model Causality.
Diploma in Statistics Introduction to Regression Lecture 2.21 Introduction to Regression Lecture Review of Lecture 2.1 –Homework –Multiple regression.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
CJT 765: Structural Equation Modeling Class 7: fitting a model, fit indices, comparingmodels, statistical power.
Basics of Regression Analysis. Determination of three performance measures Estimation of the effect of each factor Explanation of the variability Forecasting.
SEM: Basics Byrne Chapter 1 Tabachnick SEM
MTH 161: Introduction To Statistics
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
Ordinary Least Squares Estimation: A Primer Projectseminar Migration and the Labour Market, Meeting May 24, 2012 The linear regression model 1. A brief.
CJT 765: Structural Equation Modeling Class 8: Confirmatory Factory Analysis.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
CJT 765: Structural Equation Modeling Class 12: Wrap Up: Latent Growth Models, Pitfalls, Critique and Future Directions for SEM.
G Lecture 81 Comparing Measurement Models across Groups Reducing Bias with Hybrid Models Setting the Scale of Latent Variables Thinking about Hybrid.
G Lecture 3 Review of mediation Moderation SEM Model notation
Structural Equation Modeling Mgmt 291 Lecture 3 – CFA and Hybrid Models Oct. 12, 2009.
Regression Analysis1. 2 INTRODUCTION TO EMPIRICAL MODELS LEAST SQUARES ESTIMATION OF THE PARAMETERS PROPERTIES OF THE LEAST SQUARES ESTIMATORS AND ESTIMATION.
CJT 765: Structural Equation Modeling Final Lecture: Multiple-Group Models, a Word about Latent Growth Models, Pitfalls, Critique and Future Directions.
LESSON 4.1. MULTIPLE LINEAR REGRESSION 1 Design and Data Analysis in Psychology II Salvador Chacón Moscoso Susana Sanduvete Chaves.
Chapter 17 STRUCTURAL EQUATION MODELING. Structural Equation Modeling (SEM)  Relatively new statistical technique used to test theoretical or causal.
Regression Overview. Definition The simple linear regression model is given by the linear equation where is the y-intercept for the population data, is.
Chapter 12 REGRESSION DIAGNOSTICS AND CANONICAL CORRELATION.
Predicting Energy Consumption in Buildings using Multiple Linear Regression Introduction Linear regression is used to model energy consumption in buildings.
Linear Regression Modelling
Kakhramon Yusupov June 15th, :30pm – 3:00pm Session 3
3.1 Examples of Demand Functions
Moderation, Mediation, and Other Issues in Regression
CJT 765: Structural Equation Modeling
Simultaneous equation system
Chapter 12: Regression Diagnostics
Fundamentals of regression analysis
Chapter 14: Correlation and Regression
BIVARIATE REGRESSION AND CORRELATION
Multiple Regression A curvilinear relationship between one variable and the values of two or more other independent variables. Y = intercept + (slope1.
Structural Equation Modeling
Simple Linear Regression
Linear Regression Summer School IFPRI
3.2. SIMPLE LINEAR REGRESSION
Path Analysis Application of multiple linear regression.
Structural Equation Modeling
Presentation transcript:

Structural Equation Modeling Mgmt 291 Lecture 2 – Path Analysis Oct. 5, 2009

Regression Modeling Yi = β 0 + β 1 X1i + β 2 X2i + β 3 X3i + εi X1 Y ε Formulation of the research question in terms of independent Xs and dependent variable Y To explain (or predict) Y by Xs (one equation) Explain the variance of Y Assumptions: measured without errors, linear, … X1 X1 Y ε X2 X3

In the real world No clear differentiation between independent variables and dependent variables How the Xs relate to each other may be important to our research Not all Xs affect Y directly Variables may be measured with error

Need to Move from Regression to SEM X1 X1 Y ε X2 X2 X3 Y X1 X3

(1) Selecting SEM Models to Fit Your Data # of models = 4 n!/2!(n-2)! # Vars # Models 2 4 3 64 4096 5 1048576 6 1073741824

(2) Estimating the Coefficients of SEM Model y = a1 + b1 * x + v z = a2 + b2 * y + u u v y z x

FOUR Elements of a Method 1) Equation (model representation and model types) 2) Estimation (identification issue and estimation options) 3) Errors (between data and assumptions) and Evaluation of SEM Models (Fit Indexes) 4) Explanation of the Results In these four areas, methods diff from each other.

4 Elements of Regression 1) Equation - Yi = β 0 + β 1 X1i + β 2 X2i + β 3 X3i + εi 2) Estimation – OLS 3) Errors (Assumptions and Diagnostics) and Evaluation (R2) 4) Using Bi, t statistics for results Explanation

Part 1: Equations and More -- The Representation of Path Models 3 ways: diagram, equation, table (matrix)

Diagram Rep of path Models Y2 X1 X1 Y X1 X2 Y1 Y X2 X2 X3 X4 D1 A picture is worth a thousand words X3

Equation Rep of Path Models SEM Y2i = β 0 + β 1 Y1i + β 2 X1i + εi Y1i = Ø 0 + Ø1 X1i + Ø 2 X2i + Ø 3 X3i + ε1i X3i = C 0 + C 2 X2i + C 3 X4i + ε2i Regression Yi = β 0 + β 1 X1i + β 2 X2i + β 3 X3i + εi

Table Rep of Path Models Called as “system matrix” X1 X2 X3 X4 Y1 1 Y2

Example - Diagram Kline 2005, p67 Fitness Exercise Illness Hardy Stress

Example - equations Fitness = β 0 + β 1 Exercise + β 2 Hardiness Stress = Ø 0 + Ø 1 Exercise + Ø 2 Hardiness + Ø 3 Fitness Illness = C 0 + C 1 Exercise + C 2 Hardiness + C 3 Fitness + C 4 Stress

Example - table Exercise Hardiness Fitness Stress 1 Illness

Building Blocks of Path Model 1) Variables – exogenous, endogenous 2) Relationships – direct, indirect, reciprocal, 3) Residual – disturbance and (measurement errors) Similar to regression

Model Complexity 1 4 X 3 X 2 X 1 = 24 X1 X2 X3 X4 X1 x1 X2 X4 x4 x2 X3

Model Complexity 2 Number of observations = v(v+1) / 2 .23 .17 .45 .56 .09 .33 Model Complexity 2 Number of observations = v(v+1) / 2 Number of model parameters <= number of observations # parameters = p + e + d (p - # paths, e - #obs for exogenous, d - #obs for disturbance) X1 X2 X4 X3

Two Types of Path Models Recursive 1) disturbances uncorrelated 2) no feedback loops Non-recursive 1) have feedback loops 2)or have correlated disturbances

Examples of Recursive Models Y1 D2 X1 X2 Y2 Y1 D1 D2 X2 Y2

Examples of Non-Recursive Models Y1 D2 X2 Y2

Part 2: Estimation Methods for Path Models Identification Issue and Estimation Options

The Identification Issue whether or not theoretically identifiable (whether or not the model parameters can be represented by observations – covariances) Or whether a set of equations can be solved. Diff Σ (yi – b0 – b1* xi)2 against b0 and b1, then solve two equations to get b0 and b1 for simple regression similar for SEM (see Duncan’s book)

Examples Under-identified x+y = 6 Just-identified x+y=6 2x+y=10 Over-identified 3x+y=12

Conditions for Identifiable Pattern of disturbance correlation Feedback loops? Conditions for identification Necessary or sufficient All possible pairwise Yes or no Parameters <= observations Order condition Rank condition Nec suf All possible pairwise within recursively related blocks Rank condition in each block Any other pattern Model-fitting program yields a solution that passes checks or uniqueness

The Order Condition Number of excluded variables >= (number of endogenous variables – 1) For each endogenous variable (Kline p306-307) D1 X1 Y1 D2 X2 Y2

The Rank Condition Use the system matrix rank >= #endogenous vars - 1 x1 y1 D1 x2 y2 D2 x1 x2 x3 y1 y2 y3 1 x3 y3 D3

Estimation Options OLS ML OLS and ML produces the identical results for just-identified recursive models, and similar results for over-identified models

About ML Maximum Likelihood Estimation Simultaneous Iterative (starting values) Most common used (default method in LISREL and many other programs)

Using OLS for Path Analysis D1 Ok for recursive models Not for non-recursive (violate the assumption that residuals uncorrelated with ind vars) X1 Y1 D2 X2 Y2

Part 3: Errors and Evaluation

The Fit Indexes Chi squares – most common used d.f. = # obs - # parameters RMR (mean squared residual) RMSEA (root mean squared error of approximation) GFI & AGFT (0 ~ 1) (Similar to R2 and adjusted R2)

More about Fit indexes Dozens available LISREL prints 15 ChiSquares/df < 3 GFI > .90 Inspect correlation residuals

The Main Assumptions Similar to multiple regression Except for 1) Correlation between endogenous vars and disturbances 2) Multivariate normality

More on assumptions Linearity & residuals normally dist. … -> affect sig. Tests Normality does not affect ML estimates much

Diagnostics Multi-collinearity Normality Linearity Homoscedasticity Missing Values Outliners

Part 4: Explanation of the Findings Path coefficients

Interpret the Findings Path coefficients (standardized, un-standardized) (direct, indirect, total effects) Significance of path coefficients <10 Small .30 medium >.50 large

Findings - Diagram .84 Fitness .39** -.26** Exercise .08 .82 .03 -.03 -.11* -.01 Illness -.07 Hardy -.22** .29** Stress .93

Findings - equations Fitness = β 0 + .11 Exercise + .39 Hardiness Stress = Ø 0 - .01 Exercise -.39 Hardiness -.04 Fitness Illness = C 0 +.32 Exercise –12.14 Hardiness –8.84 Fitness + 27.12 Stress

Findings - table Exercise Hardiness Fitness Stress 1.11**(.01) .39 (.23) -.01(.01) -.39(.09) -.04(.02) Illness .32(.48) -12.14(7.98) -8.84(1.75) 27.12(4.55)

Calculation of Indirect and Total Effects Fitness .39** -.26** .08 Exercise .03 Fitness Stress Illness Exercise Direct Effect .39** -.01 .03 Indirect via Fitness -.04* -.10** Indirect via Stress .00 Indirect via Fitness & Stress Total Effect .39 -.05 -.08 Illness -.11* -.01 .29** Stress

About the assignment # 1 The study of structural equation models can be divided into two parts: the easy part and the hard part – “talking about it” and “doing it”. (Duncan 1975, p149) The easy part is mathematical. The hard part is constructing causal models that are consistent with sound theory. (Wolfle 1985, p385)

Assignment # 1 Introduction Argument your theory, causal models represented in words Variables/Dataset (including literature, NO more than 2 pages)