Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Slides:



Advertisements
Similar presentations
SEM PURPOSE Model phenomena from observed or theoretical stances
Advertisements

Bivariate analysis HGEN619 class 2007.
Fitting Bivariate Models October 21, 2014 Elizabeth Prom-Wormley & Hermine Maes
Structural Equation Modeling Using Mplus Chongming Yang Research Support Center FHSS College.
General Structural Equation (LISREL) Models
Structural Equation Modeling: An Overview P. Paxton.
Confirmatory Factor Analysis
Intro to Quantitative Genetics HGEN502, 2011 Hermine H. Maes.
Outline: 1) Basics 2) Means and Values (Ch 7): summary 3) Variance (Ch 8): summary 4) Resemblance between relatives 5) Homework (8.3)
Outline 1) Objectives 2) Model representation 3) Assumptions 4) Data type requirement 5) Steps for solving problem 6) A hypothetical example Path Analysis.
Linear Regression and Binary Variables The independent variable does not necessarily need to be continuous. If the independent variable is binary (e.g.,
Structural Equation Modeling
Causal Modelling and Path Analysis. Some Notes on Causal Modelling and Path Analysis. “Path analysis is... superior to ordinary regression analysis since.
Estimating “Heritability” using Genetic Data David Evans University of Queensland.
Multivariate Genetic Analysis: Introduction(II) Frühling Rijsdijk & Shaun Purcell Wednesday March 6, 2002.
Bivariate Model: traits A and B measured in twins cbca.
Quantitative Genetics
Summarizing Variation Michael C Neale PhD Virginia Institute for Psychiatric and Behavioral Genetics Virginia Commonwealth University.
ACDE model and estimability Why can’t we estimate (co)variances due to A, C, D and E simultaneously in a standard twin design?
“Ghost Chasing”: Demystifying Latent Variables and SEM
Univariate Analysis in Mx Boulder, Group Structure Title Type: Data/ Calculation/ Constraint Reading Data Matrices Declaration Assigning Specifications/
Reminder - Means, Variances and Covariances. Covariance Algebra.
Biometrical Genetics Pak Sham & Shaun Purcell Twin Workshop, March 2002.
David M. Evans Sarah E. Medland Developmental Models in Genetic Research Wellcome Trust Centre for Human Genetics Oxford United Kingdom Twin Workshop Boulder.
Path Analysis Frühling Rijsdijk SGDP Centre Institute of Psychiatry King’s College London, UK.
Mx Practical TC18, 2005 Dorret Boomsma, Nick Martin, Hermine H. Maes.
Introduction to Multivariate Genetic Analysis Kate Morley and Frühling Rijsdijk 21st Twin and Family Methodology Workshop, March 2008.
Path Analysis Frühling Rijsdijk. Biometrical Genetic Theory Aims of session:  Derivation of Predicted Var/Cov matrices Using: (1)Path Tracing Rules (2)Covariance.
Structural Equation Modeling Intro to SEM Psy 524 Ainsworth.
Structural Equation Modeling Continued: Lecture 2 Psy 524 Ainsworth.
Path Analysis HGEN619 class Method of Path Analysis allows us to represent linear models for the relationship between variables in diagrammatic.
Correlation and Regression
Karri Silventoinen University of Helsinki Osaka University.
Continuously moderated effects of A,C, and E in the twin design Conor V Dolan & Sanja Franić (BioPsy VU) Boulder Twin Workshop March 4, 2014 Based on PPTs.
Institute of Psychiatry King’s College London, UK
Extending Simplex model to model Ph  E transmission JANNEKE m. de kort & C.V. DolAn Contact:
Cholesky decomposition May 27th 2015 Helsinki, Finland E. Vuoksimaa.
Univariate modeling Sarah Medland. Starting at the beginning… Data preparation – The algebra style used in Mx expects 1 line per case/family – (Almost)
CJT 765: Structural Equation Modeling Class 10: Non-recursive Models.
The importance of the “Means Model” in Mx for modeling regression and association Dorret Boomsma, Nick Martin Boulder 2008.
Gene-Environment Interaction & Correlation Danielle Dick & Danielle Posthuma Leuven 2008.
Univariate Analysis Hermine Maes TC21 March 2008.
Genetic Theory Pak Sham SGDP, IoP, London, UK. Theory Model Data Inference Experiment Formulation Interpretation.
Mx modeling of methylation data: twin correlations [means, SD, correlation] ACE / ADE latent factor model regression [sex and age] genetic association.
Mx Practical TC20, 2007 Hermine H. Maes Nick Martin, Dorret Boomsma.
David M. Evans Multivariate QTL Linkage Analysis Queensland Institute of Medical Research Brisbane Australia Twin Workshop Boulder 2003.
Introduction to Genetic Theory
Introduction to Multivariate Genetic Analysis Danielle Posthuma & Meike Bartels.
March 7, 2012M. de Moor, Twin Workshop Boulder1 Copy files Go to Faculty\marleen\Boulder2012\Multivariate Copy all files to your own directory Go to Faculty\kees\Boulder2012\Multivariate.
Chapter 16 PATH ANALYSIS. Chapter 16 PATH ANALYSIS.
Multivariate Genetic Analysis (Introduction) Frühling Rijsdijk Wednesday March 8, 2006.
Extended Pedigrees HGEN619 class 2007.
Invest. Ophthalmol. Vis. Sci ;57(1): doi: /iovs Figure Legend:
Bivariate analysis HGEN619 class 2006.
Introduction to Multivariate Genetic Analysis
Re-introduction to openMx
MRC SGDP Centre, Institute of Psychiatry, Psychology & Neuroscience
Path Analysis Danielle Dick Boulder 2008
Structural Equation Modeling
Univariate modeling Sarah Medland.
ACE Problems Greg Carey BGA, 2009 Minneapolis, MN.
Pak Sham & Shaun Purcell Twin Workshop, March 2002
Sarah Medland faculty/sarah/2018/Tuesday
\nathan\RaterBias.
Linear Panel Data Models
BOULDER WORKSHOP STATISTICS REVIEWED: LIKELIHOOD MODELS
Multivariate Genetic Analysis: Introduction
Structural Equation Modeling
Rater Bias & Sibling Interaction Meike Bartels Boulder 2004
Presentation transcript:

Path Analysis Danielle Dick Boulder 2006

Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form Makes it easy to derive expectation for the variances and covariances of variables in terms of the parameters proposed by the model Is easily translated into matrix form for use in programs such as Mx

Example

Conventions of Path Analysis Squares or rectangles denote observed variables. Circles or ellipses denote latent (unmeasured) variables. (Triangle denote means, used when modeling raw data) Upper-case letters are used to denote variables. Lower-case letters (or numeric values) are used to denote covariances or path coefficients.

Conventions of Path Analysis Single-headed arrows or paths (–>) are used to represent causal relationships between variables under a particular model - where the variable at the tail is hypothesized to have a direct influence on the variable at the head. A –> B Double-headed arrows ( ) are used to represent a covariance between two variables, which may arise through common causes not represented in the model. They may also be used to represent the variance of a variable. A B

Conventions of Path Analysis Double-headed arrows may not be used for any variable which has one or more single-headed arrows pointing to it - these variables are called endogenous variables. Other variables are exogenous variables. Single-headed arrows may be drawn from exogenous to endogenous variables or from endogenous variables to other endogenous variables.

Conventions of Path Analysis Omission of a two-headed arrow between two exogenous variables implies the assumption that the covariance of those variables is zero (e.g., no genotype-environment correlation). Omission of a direct path from an exogenous (or endogenous) variable to an endogenous variable implies that there is no direct causal effect of the former on the latter variable.

Tracing Rules of Path Analysis Trace backwards, change direction at a double-headed arrow, then trace forwards. This implies that we can never trace through double- headed arrows in the same chain. The expected covariance between two variables, or the expected variance of a variable, is computed by multiplying together all the coefficients in a chain, and then summing over all possible chains.

Example EXOGENOUS VARIABLES ENDOGENOUS VARIABLES

Exercises Cov AB = Cov BC = Cov AC = Var A = Var B = Var C = Var E

Covariance between A and B Cov AB = kl + mqn + mpl

Exercises Cov AB = Cov BC = Cov AC = Var A = Var B = Var C = Var E

Expectations Cov AB = kl + mqn + mpl Cov BC = no Cov AC = mqo Var A = k 2 + m kpm Var B = l 2 + n 2 Var C = o 2 Var E = 1

Quantitative Genetic Theory Observed behavioral differences stem from two primary sources: genetic and environmental

Quantitative Genetic Theory Observed behavioral differences stem from two primary sources: genetic and environmental eg P GE PHENOTYPE 1 1 EXOGENOUS VARIABLES ENDOGENOUS VARIABLES

Quantitative Genetic Theory There are two sources of genetic influences: Additive and Dominant ed P DE PHENOTYPE 1 1 a A 1

Quantitative Genetic Theory There are two sources of environmental influences: Common (shared) and Unique (nonshared) cd P DC PHENOTYPE 1 1 a A 1 e E 1

In the preceding diagram… A, D, C, E are exogenous variables A = Additive genetic influences D = Non-additive genetic influences (i.e., dominance) C = Shared environmental influences E = Nonshared environmental influences A, D, C, E have variances of 1 Phenotype is an endogenous variable P = phenotype; the measured variable a, d, c, e are parameter estimates

Univariate Twin Path Model A1A1 D1D1 C1C1 E1E1 P 1111 adce

A1A1 D1D1 C1C1 E1E1 P Twin1 E2E2 C2C2 D2D2 A2A2 P Twin adce ecda

Univariate Twin Path Model A1A1 D1D1 C1C1 E1E1 P Twin1 E2E2 C2C2 D2D2 A2A2 P Twin2 MZ=1.0 DZ=0.5 MZ=1.0 DZ=0.25 MZ & DZ = adce ecda

Assumptions of this Model All effects are linear and additive (i.e., no genotype x environment or other multiplicative interactions) A, D, C, and E are mutually uncorrelated (i.e., there is no genotype-environment covariance/correlation) Path coefficients for Twin 1 = Twin 2 There are no reciprocal sibling effects (i.e., there are no direct paths between P 1 and P 2

Tracing Rules of Path Analysis Trace backwards, change direction at a double-headed arrow, then trace forwards. This implies that we can never trace through double- headed arrows in the same chain. The expected covariance between two variables, or the expected variance of a variable, is computed by multiplying together all the coefficients in a chain, and then summing over all possible chains.

Calculating the Variance of P 1 A1A1 D1D1 C1C1 E1E1 P Twin1 E2E2 C2C2 D2D2 A2A2 P Twin2 MZ=1.0 DZ=0.5 MZ=1.0 DZ=0.25 MZ & DZ = adce ecda

Calculating the Variance of P 1 A1A1 D1D1 C1C1 E1E1 P Twin1 E2E2 C2C2 D2D2 A2A2 P Twin2 MZ=1.0 DZ=0.5 MZ=1.0 DZ=0.25 MZ & DZ = adce ecda

Calculating the Variance of P 1 A1A1 D1D1 C1C1 E1E1 P Twin1 E2E2 C2C2 D2D2 A2A2 P Twin2 MZ=1.0 DZ=0.5 MZ=1.0 DZ=0.25 MZ & DZ = adce ecda P=a(1)a

Calculating the Variance of P 1 A1A1 D1D1 C1C1 E1E1 P Twin1 E2E2 C2C2 D2D2 A2A2 P Twin2 MZ=1.0 DZ=0.5 MZ=1.0 DZ=0.25 MZ & DZ = adce ecda P=a(1)a + d(1)d + c(1)c + e(1)e

Calculating the Variance of P 1 P1P1 P1P1 A1A1 A1A1 aa1 P1P1 P1P1 D1D1 D1D1 dd1 P1P1 P1P1 C1C1 C1C1 cc1 P1P1 P1P1 E1E1 E1E1 ee1 = = = = 1a 2 1d 2 1c 2 1e 2 Var P 1 = a 2 + d 2 + c 2 + e 2

Calculating the MZ Covariance A1A1 D1D1 C1C1 E1E1 P Twin1 E2E2 C2C2 D2D2 A2A2 P Twin2 MZ=1.0 DZ=0.5 MZ=1.0 DZ=0.25 MZ & DZ = adce ecda

Calculating the MZ Covariance A1A1 D1D1 C1C1 E1E1 P Twin1 E2E2 C2C2 D2D2 A2A2 P Twin2 MZ=1.0 DZ=0.5 MZ=1.0 DZ=0.25 MZ & DZ = adce ecda

Calculating the MZ Covariance P1P1 P2P2 A1A1 A2A2 aa1 P1P1 P2P2 D1D1 D2D2 dd1 P1P1 P2P2 C1C1 C2C2 cc1 P1P1 P2P2 E1E1 E2E2 ee = = = = 1a 2 1d 2 1c 2 Cov MZ = a 2 + d 2 + c 2

Calculating the DZ Covariance A1A1 D1D1 C1C1 E1E1 P Twin1 E2E2 C2C2 D2D2 A2A2 P Twin2 MZ=1.0 DZ=0.5 MZ=1.0 DZ=0.25 MZ & DZ = adce ecda

Calculating the DZ covariance Cov DZ = ?

Calculating the DZ covariance P1P1 P2P2 A1A1 A2A2 aa0.5 P1P1 P2P2 D1D1 D2D2 dd0.25 P1P1 P2P2 C1C1 C2C2 cc1 P1P1 P2P2 E1E1 E2E2 ee = = = = 0.5a d 2 1c 2 Cov DZ = 0.5a d 2 + c 2

Var Twin1 Cov 12 Cov 21 Var Twin2 a 2 + d 2 + c 2 + e 2 a 2 +d 2 + c 2 a 2 + d 2 + c 2 a 2 + d 2 + c 2 + e 2 Twin Variance/Covariance a 2 + d 2 + c 2 + e 2 0.5a d 2 + c 2 0.5a d 2 + c 2 a 2 + d 2 + c 2 + e 2 MZ = DZ =