Path Analysis Frühling Rijsdijk SGDP Centre Institute of Psychiatry King’s College London, UK.


[Flow diagram]
Twin Model: Hypothesised Sources of Variation, Biometrical Genetic Theory, Predicted Var/Cov from Model
Twin Data: Observed Variation, Data Preparation, Summary Statistics, Matrix Algebra, Observed Var/Cov from Data
Connecting elements: Model Equations, Path Diagrams, Path Tracing Rules, Covariance Algebra, Structural Equation Modelling (Maximum Likelihood)

Path Analysis
Path analysis was developed around 1918 by Sewall Wright.
It combines the knowledge we have about causal relations with the degree of observed correlations.
Guinea pigs: interrelationships of factors determining weight at birth and at weaning (33 days).
Wright, S. (1921). "Correlation and causation". J. Agricultural Research 20: 557–585.
[Path diagram; variables: Birth weight, Early gain, Litter size, Gestation period, Environmental conditions, Health of dam, Heredity factors]

Path Diagram

Path Analysis
Presents linear relationships between variables by means of diagrams.
Derives predictions for the variances and covariances of the variables under the specified model.
The relationships can also be represented as structural equations and covariance matrices.
All three forms are mathematically complete, so it is possible to translate from one to the other.
Structural equation modelling (SEM) represents a unified platform for path analytic and variance components models.

In SEM, expected relationships between observed variables are expressed by:
– A system of linear model equations, or
– Path diagrams, which allow the model to be represented in schematic form
Both allow derivation of the predicted variances and covariances of the variables under the specified model.
Aims of this session: derivation of predicted Var-Cov matrices using (1) Path Tracing and (2) Covariance Algebra.

Path Diagram Conventions
[Legend: Observed Variable; Latent Variable; Causal Path; Covariance Path]

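Model for an MZ PAIR
[Path diagram: latent A, C and E point to Twin 1 and to Twin 2 with paths a, c and e; the two A factors correlate 1 and the two C factors correlate 1]
Note: a, c and e are the same across twins.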

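Model for a DZ PAIR
[Path diagram: latent A, C and E point to Twin 1 and to Twin 2 with paths a, c and e; the two A factors correlate .5 and the two C factors correlate 1]
Note: a, c and e are also the same across groups.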

(1) Path Tracing
The covariance between any two variables is the sum of all legitimate chains connecting the variables.
The numerical value of a chain is the product of all traced path coefficients within the chain.
A legitimate chain is a path along arrows that follows 3 rules:

(i) Trace backward, then forward, or simply forward from one variable to another. NEVER forward then backward. Include double-headed arrows from the independent variables to themselves. These variances will be 1 for standardized variables.
[Path diagram: variables A, B, C, D; paths a, b, c, d; residuals e; variance $V_A$]
$\mathrm{Cov}_{BC} = a \cdot V_A \cdot b$, NOT $c \cdot V_D \cdot d$

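(ii) Loops are not allowed, i.e. we cannot trace twice through the same variable.
[Path diagram: variables A, B, C, D, E; paths a, b, c, d, e; residuals e; variances $V_D$, $V_E$]
$\mathrm{Cov}_{AB} = a \cdot V_C \cdot b$, NOT $a \cdot V_C \cdot c \cdot e \cdot d \cdot V_C$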

(iii) A maximum of one curved arrow per path. So, the double-headed arrow from the independent variable to itself is included, unless the chain includes another double-headed arrow (e.g. a correlation path).
[Path diagram: variables C, D, E; paths c, d, e; variances $V_D$, $V_E$]
$\mathrm{Cov}_{CD} = c \cdot V_D + d \cdot e$, NOT $d \cdot V_E \cdot e$
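
As a minimal sketch of how the rules turn into arithmetic, the snippet below hand-lists the two legitimate chains from the rule (iii) example and sums their products. It assumes the diagram has D pointing to C with path c, E pointing to C with path d, and a covariance path e between D and E. The function name `covariance_from_chains` and all numeric values are illustrative and do not come from the slides. Finding the legitimate chains is still done by eye.

```python
from math import prod

def covariance_from_chains(chains):
    """Sum, over all listed chains, the product of the coefficients in each chain."""
    return sum(prod(chain) for chain in chains)

# Hypothetical values for the rule (iii) diagram (assumed structure):
# D -> C with path c, E -> C with path d, D <-> E with covariance e, Var(D) = V_D.
c, d, e, V_D = 0.5, 0.3, 0.2, 1.0

# Legitimate chains between C and D:
#   back from C to D, closed by D's own variance:  c * V_D
#   back from C to E, across the covariance path:  d * e
cov_CD = covariance_from_chains([[c, V_D], [d, e]])
print(cov_CD)
```

The chain d * V_E * e is left out on purpose: it would use two double-headed arrows, which rule (iii) forbids.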

The Variance
Since the variance of a variable is the covariance of the variable with itself, the expected variance will be the sum of all paths from the variable to itself, which follow Wright’s rules.

Variance of Twin 1 AND Twin 2 (for MZ and DZ pairs)
[Path diagram: A, C and E point to Twin 1 with paths a, c and e]
$a \times 1 \times a = a^2$
$c \times 1 \times c = c^2$
$e \times 1 \times e = e^2$
Total Variance $= a^2 + c^2 + e^2$

Covariance Twin 1-2: MZ pairs
[Path diagram: A, C and E point to Twin 1 and to Twin 2 with paths a, c and e]
Total Covariance $= a^2 + c^2$

Covariance Twin 1-2: DZ pairs
[Path diagram: A, C and E point to Twin 1 and to Twin 2 with paths a, c and e]
Total Covariance $= 0.5a^2 + c^2$

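Predicted Var-Cov Matrices (ACE model; rows and columns ordered Tw1, Tw2)
MZ: $\begin{pmatrix} a^2 + c^2 + e^2 & a^2 + c^2 \\ a^2 + c^2 & a^2 + c^2 + e^2 \end{pmatrix}$
DZ: $\begin{pmatrix} a^2 + c^2 + e^2 & 0.5a^2 + c^2 \\ 0.5a^2 + c^2 & a^2 + c^2 + e^2 \end{pmatrix}$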
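
The entries of the predicted ACE matrices follow from the variance and covariance results traced on the preceding slides. A minimal sketch of building them numerically is given here. The function name `ace_expected_cov`, the parameter `r_a` and the path values are illustrative assumptions, not part of the original material.

```python
def ace_expected_cov(a, c, e, r_a):
    """Predicted 2x2 twin covariance matrix (order: Tw1, Tw2) under the ACE model.

    a, c, e are path coefficients; r_a is the correlation between the
    twins' A factors (1.0 for MZ pairs, 0.5 for DZ pairs).
    """
    variance = a**2 + c**2 + e**2    # identical for Twin 1 and Twin 2
    covariance = r_a * a**2 + c**2   # the C factors correlate 1 in both groups
    return [[variance, covariance],
            [covariance, variance]]

# Illustrative (made-up) path coefficients
a, c, e = 0.6, 0.5, 0.4
mz = ace_expected_cov(a, c, e, r_a=1.0)
dz = ace_expected_cov(a, c, e, r_a=0.5)
print(mz)
print(dz)
```

Only the off-diagonal element differs between the two groups, which is what lets the MZ and DZ data separate a from c.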

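ADE Model
[Path diagram: latent A, D and E point to Twin 1 and to Twin 2 with paths a, d and e; the two A factors correlate 1 (MZ) / .5 (DZ) and the two D factors correlate 1 (MZ) / 0.25 (DZ)]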

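Predicted Var-Cov Matrices (ADE model; rows and columns ordered Tw1, Tw2)
MZ: $\begin{pmatrix} a^2 + d^2 + e^2 & a^2 + d^2 \\ a^2 + d^2 & a^2 + d^2 + e^2 \end{pmatrix}$
DZ: $\begin{pmatrix} a^2 + d^2 + e^2 & 0.5a^2 + 0.25d^2 \\ 0.5a^2 + 0.25d^2 & a^2 + d^2 + e^2 \end{pmatrix}$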

ACE or ADE
$\mathrm{Cov}_{MZ} = a^2 + c^2$ or $a^2 + d^2$
$\mathrm{Cov}_{DZ} = \tfrac{1}{2}a^2 + c^2$ or $\tfrac{1}{2}a^2 + \tfrac{1}{4}d^2$
$V_P = a^2 + c^2 + e^2$ or $a^2 + d^2 + e^2$
There are 3 unknown parameters (a, c, e or a, d, e) and only 3 distinct predicted statistics ($\mathrm{Cov}_{MZ}$, $\mathrm{Cov}_{DZ}$, $V_P$), so this model is just identified.
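
To make "just identified" concrete, the three equations can be inverted by hand, giving exactly one solution for the three unknowns in each model. The following is a sketch worked out from the equations on this slide, not a result stated in the original.

```latex
% ACE: three equations, three unknowns
\begin{aligned}
a^2 &= 2\,(\mathrm{Cov}_{MZ} - \mathrm{Cov}_{DZ}) \\
c^2 &= 2\,\mathrm{Cov}_{DZ} - \mathrm{Cov}_{MZ} \\
e^2 &= V_P - \mathrm{Cov}_{MZ}
\end{aligned}

% ADE: the same statistics, solved for a^2, d^2, e^2
\begin{aligned}
a^2 &= 4\,\mathrm{Cov}_{DZ} - \mathrm{Cov}_{MZ} \\
d^2 &= 2\,\mathrm{Cov}_{MZ} - 4\,\mathrm{Cov}_{DZ} \\
e^2 &= V_P - \mathrm{Cov}_{MZ}
\end{aligned}
```

Adding a fourth parameter (c and d together) would leave more unknowns than statistics, so the two cannot be estimated at once from these data.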

The twin correlations indicate which of the two components is more likely to be present:
$\mathrm{Cor}_{MZ} = a^2 + c^2$ or $a^2 + d^2$
$\mathrm{Cor}_{DZ} = \tfrac{1}{2}a^2 + c^2$ or $\tfrac{1}{2}a^2 + \tfrac{1}{4}d^2$
ACE: if $a^2 = .40$ and $c^2 = .20$, then $r_{MZ} = 0.60$ and $r_{DZ} = 0.40$
ADE: if $a^2 = .40$ and $d^2 = .20$, then $r_{MZ} = 0.60$ and $r_{DZ} = 0.25$
Effects of C and D are confounded.
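
One way to read the two numerical cases is as a rule of thumb: a DZ correlation at or above half the MZ correlation is compatible with ACE, and one below half points to ADE. The sketch below applies that reading and solves the matching pair of equations. The function name `decompose_twin_correlations` and the threshold rule are assumptions derived from the equations on this slide; they assume standardized variables (total variance 1) and are not a procedure given in the original.

```python
def decompose_twin_correlations(r_mz, r_dz):
    """Solve the ACE or ADE equations from standardized twin correlations.

    Chooses ACE when r_dz >= r_mz / 2 (so that c2 >= 0), otherwise ADE
    (so that d2 >= 0). Returns (model_name, variance components).
    """
    if r_dz >= r_mz / 2:
        # ACE: r_mz = a2 + c2 and r_dz = a2/2 + c2
        model = "ACE"
        a2 = 2 * (r_mz - r_dz)
        other_name, other = "c2", 2 * r_dz - r_mz
    else:
        # ADE: r_mz = a2 + d2 and r_dz = a2/2 + d2/4
        model = "ADE"
        a2 = 4 * r_dz - r_mz
        other_name, other = "d2", 2 * r_mz - 4 * r_dz
    return model, {"a2": a2, other_name: other, "e2": 1 - r_mz}

# The two cases from the slide
print(decompose_twin_correlations(0.60, 0.40))
print(decompose_twin_correlations(0.60, 0.25))
```

Both cases share the same MZ correlation, so only the DZ correlation tells them apart.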

(2) Covariance Algebra
Three Fundamental Covariance Algebra Rules
$\mathrm{Cov}(aX, bY) = ab\,\mathrm{Cov}(X, Y)$
$\mathrm{Cov}(X, Y + Z) = \mathrm{Cov}(X, Y) + \mathrm{Cov}(X, Z)$
$\mathrm{Var}(X) = \mathrm{Cov}(X, X)$
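
The three rules are enough to compute the covariance of any two linear combinations term by term. As a hedged illustration, the sketch below applies them to the ACE twin model from earlier in the session, writing each twin as a·A + c·C + e·E with unit-variance sources. The helper names `cov_linear` and `twin_sources` and the path values are hypothetical.

```python
def cov_linear(x, y, source_cov):
    """Covariance of two linear combinations of source variables.

    x and y map source names to coefficients; source_cov maps a pair
    (source_i, source_j) to their covariance (the variance when i == j).
    Pairs that are absent are treated as uncorrelated (covariance 0).
    """
    return sum(cx * cy * source_cov.get((sx, sy), 0.0)
               for sx, cx in x.items()
               for sy, cy in y.items())

def twin_sources(r_a):
    """Unit-variance A, C, E per twin; A factors correlate r_a, C factors correlate 1."""
    cov = {(f"{name}{t}", f"{name}{t}"): 1.0 for name in "ACE" for t in (1, 2)}
    for name, r in (("A", r_a), ("C", 1.0)):
        cov[(f"{name}1", f"{name}2")] = r
        cov[(f"{name}2", f"{name}1")] = r
    return cov

a, c, e = 0.6, 0.5, 0.4  # illustrative values
twin1 = {"A1": a, "C1": c, "E1": e}
twin2 = {"A2": a, "C2": c, "E2": e}

var_t1 = cov_linear(twin1, twin1, twin_sources(1.0))  # a^2 + c^2 + e^2
cov_mz = cov_linear(twin1, twin2, twin_sources(1.0))  # a^2 + c^2
cov_dz = cov_linear(twin1, twin2, twin_sources(0.5))  # 0.5*a^2 + c^2
print(var_t1, cov_mz, cov_dz)
```

The results match the expressions obtained by path tracing, since both methods expand the same linear model.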

Example 1
The variance of a dependent variable (Y) caused by independent variable A is the squared regression coefficient multiplied by the variance of the independent variable.
[Path diagram: A, with variance 1, points to Y with path a]
$Y = aA$
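
The statement can be checked step by step with the covariance algebra rules. This is a short derivation, assuming as in the diagram that A has variance 1:

```latex
\begin{aligned}
\mathrm{Var}(Y) &= \mathrm{Cov}(Y, Y) = \mathrm{Cov}(aA, aA) \\
                &= a^2\,\mathrm{Cov}(A, A) = a^2\,\mathrm{Var}(A) \\
                &= a^2 \cdot 1 = a^2
\end{aligned}
```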

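Example 2
[Path diagram: two A variables, each with variance 1 and correlated .5; one points to Y with path a, the other points to Z with path a]
$Y = aA$, $Z = aA$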
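
A sketch of the corresponding derivation follows. The subscripts $A_1$ and $A_2$ are introduced here only to tell the two A variables apart, and the reading of the diagram (unit variances, covariance .5 between the two A variables) is an assumption based on the labels that survive in the transcript:

```latex
\begin{aligned}
\mathrm{Cov}(Y, Z) &= \mathrm{Cov}(aA_1, aA_2) \\
                   &= a^2\,\mathrm{Cov}(A_1, A_2) \\
                   &= a^2 \cdot 0.5 = 0.5\,a^2
\end{aligned}
```

Under that reading, this is the same $0.5a^2$ term that path tracing gives for the additive genetic part of the DZ covariance.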

Summary
Path Tracing and Covariance Algebra have the same aim: to work out the predicted variances and covariances of variables, given a specified model.
The Ultimate Goal: to fit predicted variances/covariances to observed variances/covariances of the data in order to estimate the model parameters: regression coefficients, correlations.