Longitudinal Modeling Nathan Gillespie & Dorret Boomsma \\nathan\2008\Longitudinal neuro_f_chol.mx neuro_f_simplex.mx jepq6.dat.

Slides:



Advertisements
Similar presentations
A Conceptual Introduction to Multilevel Models as Structural Equations
Advertisements

SEM PURPOSE Model phenomena from observed or theoretical stances
Fitting Bivariate Models October 21, 2014 Elizabeth Prom-Wormley & Hermine Maes
MCUAAAR: Methods & Measurement Core Workshop: Structural Equation Models for Longitudinal Analysis of Health Disparities Data April 11th, :00 to.
Kin 304 Regression Linear Regression Least Sum of Squares
Multivariate Mx Exercise D Posthuma Files: \\danielle\Multivariate.
Inference for Regression
Growth Curve Model Using SEM
Multiple Regression [ Cross-Sectional Data ]
Associations Among Adolescent Conduct Problems and Perceived Peer and Parental Acceptance of Adolescent Alcohol Use Julia D. Grant, Kathleen K. Bucholz,
Some Terms Y =  o +  1 X Regression of Y on X Regress Y on X X called independent variable or predictor variable or covariate or factor Which factors.
Chapter 13 Multiple Regression
Correlation and Regression. Spearman's rank correlation An alternative to correlation that does not make so many assumptions Still measures the strength.
Chapter 12 Multiple Regression
Developmental models. Multivariate analysis choleski models factor models –y =  f + u genetic factor models –P j = h G j + c C j + e E j –common pathway.
David M. Evans Sarah E. Medland Developmental Models in Genetic Research Wellcome Trust Centre for Human Genetics Oxford United Kingdom Twin Workshop Boulder.
Heterogeneity II Danielle Dick, Tim York, Marleen de Moor, Dorret Boomsma Boulder Twin Workshop March 2010.
Gene x Environment Interactions Brad Verhulst (With lots of help from slides written by Hermine and Liz) September 30, 2014.
Data Analysis Statistics. Inferential statistics.
Introduction to Multivariate Genetic Analysis Kate Morley and Frühling Rijsdijk 21st Twin and Family Methodology Workshop, March 2008.
Raw data analysis S. Purcell & M. C. Neale Twin Workshop, IBG Colorado, March 2002.
Change Score Analysis David A. Kenny December 15, 2013.
Review for Final Exam Some important themes from Chapters 9-11 Final exam covers these chapters, but implicitly tests the entire course, because we use.
Introduction to Multilevel Modeling Using SPSS
Multiple Sample Models James G. Anderson, Ph.D. Purdue University.
Inference for regression - Simple linear regression
Simple Linear Regression Models
© 2002 Prentice-Hall, Inc.Chap 14-1 Introduction to Multiple Regression Model.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
L 1 Chapter 12 Correlational Designs EDUC 640 Dr. William M. Bauer.
Cholesky decomposition May 27th 2015 Helsinki, Finland E. Vuoksimaa.
Regression. Population Covariance and Correlation.
Repeated Measurements Analysis. Repeated Measures Analysis of Variance Situations in which biologists would make repeated measurements on same individual.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
The importance of the “Means Model” in Mx for modeling regression and association Dorret Boomsma, Nick Martin Boulder 2008.
Gene-Environment Interaction & Correlation Danielle Dick & Danielle Posthuma Leuven 2008.
HLM Models. General Analysis Strategy Baseline Model - No Predictors Model 1- Level 1 Predictors Model 2 – Level 2 Predictors of Group Mean Model 3 –
G Lecture 7 Confirmatory Factor Analysis
Lecture 10 Chapter 23. Inference for regression. Objectives (PSLS Chapter 23) Inference for regression (NHST Regression Inference Award)[B level award]
Genetic and Environmental Influences on Stages of Alcohol Use Across Adolescence and Into Young Adulthood Jason L. Pagan 1, Richard J. Rose 2, Richard.
Longitudinal Modeling Nathan & Lindon Template_Developmental_Twin_Continuous_Matrix.R Template_Developmental_Twin_Ordinal_Matrix.R jepq.txt GenEpiHelperFunctions.R.
Chapter 12 Inference for Linear Regression. Reminder of Linear Regression First thing you should do is examine your data… First thing you should do is.
Mx modeling of methylation data: twin correlations [means, SD, correlation] ACE / ADE latent factor model regression [sex and age] genetic association.
4 basic analytical tasks in statistics: 1)Comparing scores across groups  look for differences in means 2)Cross-tabulating categoric variables  look.
Developmental Models: Latent Growth Models Brad Verhulst & Lindon Eaves.
Multiple Regression Learning Objectives n Explain the Linear Multiple Regression Model n Interpret Linear Multiple Regression Computer Output n Test.
David M. Evans Multivariate QTL Linkage Analysis Queensland Institute of Medical Research Brisbane Australia Twin Workshop Boulder 2003.
Genetic simplex modeling of personality in adolescent Australian twins N.A. GILLESPIE 1, D. M. EVANS 1, N.G. MARTIN 1 1 Queensland Institute of Medical.
Introduction to Multivariate Genetic Analysis Danielle Posthuma & Meike Bartels.
Developmental Models/ Longitudinal Data Analysis Danielle Dick & Nathan Gillespie Boulder, March 2006.
BPS - 5th Ed. Chapter 231 Inference for Regression.
QTL Mapping Using Mx Michael C Neale Virginia Institute for Psychiatric and Behavioral Genetics Virginia Commonwealth University.
The SweSAT Vocabulary (word): understanding of words and concepts. Data Sufficiency (ds): numerical reasoning ability. Reading Comprehension (read): Swedish.
March 7, 2012M. de Moor, Twin Workshop Boulder1 Copy files Go to Faculty\marleen\Boulder2012\Multivariate Copy all files to your own directory Go to Faculty\kees\Boulder2012\Multivariate.
Multivariate Genetic Analysis (Introduction) Frühling Rijsdijk Wednesday March 8, 2006.
Inference about the slope parameter and correlation
Chapter 14 Introduction to Multiple Regression
REGRESSION G&W p
Univariate Twin Analysis
Introduction to Multivariate Genetic Analysis
Path Analysis Danielle Dick Boulder 2008
GxE and GxG interactions
LEARNING OUTCOMES After studying this chapter, you should be able to
Longitudinal Modeling
Lucía Colodro Conde Sarah Medland & Matt Keller
Special Topic: Longitudinal Mediation Analysis
Presentation transcript:

Longitudinal Modeling Nathan Gillespie & Dorret Boomsma \\nathan\2008\Longitudinal neuro_f_chol.mx neuro_f_simplex.mx jepq6.dat

Why conduct longitudinal analyses? 1. Can improve power by using multiple observations from the same individual - Cross twin cross trait correlation 2. Can examine and estimate time-dependent genetic and environmental effects - Changing magnitude of genetic & environmental influence across time - Same versus different genes across development

Methods for Longitudinal Data Analysis 1. Cholesky Models 2. Simplex Models Eaves et al., 1986 Boomsma & Molenaar, Growth Curve Models

Aims 1. Revisit the Mx trivariate Cholesky 2. Take a look at simplex or auto-regression models - Explain some of the underlying theory of this form of longitudinal modelling - Run through an Mx script 3. Compare the Cholesky and simplex models

Longitudinal modeling of adolescent personality Introduce longitudinal modeling in the context of personality change Eaves, Eysenck & Martin (1989) - Adult personality - High of genetic continuity over time - Effect stronger in Neuroticism vs Extraversion Genetic continuity in adolescents?

X 2 = , df = 45, p =.101, AIC = , RMSEA =.011 Personality Data Twin Mole and Twin MAPS projects Assess the genetic / environmental etiology of Melanocytic Naevi (common moles) in twins aged 12 & 14 years + Cognition at items JEPQ: Psychoticism (P), Extraversion (E), and Neuroticism (N)- Lie (L)

X 2 = , df = 45, p =.101, AIC = , RMSEA =.011 Personality Data Raw continuous data methods Assumptions of mean and variance homogeneity by twin order and zygosity (and necessarily by sex and age)

1. Cholesky Model

Cholesky Model Advantages: - Logical: organized such that all factors are constrained to impact later, but not earlier time points - Requires few assumptions, can predict any pattern of change Disadvantages: - Not falsifiable - Does not make predictions about what will happen in the future (as yet unmeasured time points) - Only feasible for limited number of measurements

Cholesky Model Questions you can address: - Magnitude of genetic/environmental influence at each time - Extent to which genetic/environmental influences overlap across time

Run ACE Cholesky Model neuro_f_chol.mx

Model Fit for Female Neuroticism ModelLLdf  2LL  df pBIC ACE Cholesky

Cholesky Model Fitting Results Phenotypic correlations Proportions of variance A C E

Cholesky Model Fitting Results

2. Simplex Models

Simplex or autoregressive models Simplex designs model changes in true scores (y [t] ) over time by fitting auto-regressive or Markovian chains Each true score is predicted to be causally related to the immediately preceding latent true score in a linear fashion y [t]n =  [t]n  y [t-1]n +  [t]n  [t] = linear regression of latent factor (y [t] ) on the previous latent factor (y [t-1] ),  [t] = new input, change or innovation at time [t], uncorrelated with y [t-1]

Simplex Model Latent True Scores Innovations Errors Observed phenotypes

ACE Simplex Model script neuro_f_simplex.mx

Model Fitting Results ModelLLdf  2LL  df pBIC ACE Cholesky Full Simplex AE Simplex CE Simplex E Simplex

Model Fitting Results ModelLLdf  2LL  df pBIC ACE Cholesky Full Simplex AE Simplex CE Simplex E Simplex <

Test for non-significant parameters Run confidence intervals on all parameters ! AE Simplex Structure Get neuro_f.mxs Dr ! C transmission coefficients DR 4 5 6! C innovations IN X ENd

Matrix Element Int. Estimate Lower Upper Lfail Ufail X ! AE Simplex Structure, Remove final genetic innovation Get neuro_f.mxs Dr ! C transmission coefficients DR 4 5 6! C innovations DR 3 ENd

Model Fitting Results ModelLLdf  2LL  df pBIC ACE Cholesky Full Simplex AE Simplex Dropζ a CE Simplex E Simplex <

Best Fitting Model for Female Neuroticism Degree of genetic continuity Age specific genetic effects Genetic innovation at 14 years Is it related to developmental or hormonal changes during puberty and psychosexual development?

Additional Longitudinal Models Dual Change Score (DCS) Model for Ordinal Data

Additional Longitudinal Models Bivariate Dual Change Score (DCS) Model for Ordinal Data

Best-fitting model for drinking

Methods for Longitudinal Analysis Cholesky Models Simplex Models Growth Curve Models

Simplex Model (Boomsma & Molenaar, 1987)

Genetic Simplex Model AAAA BAC2BAC6BAC4BAC3 EEEE n 21 n 32 n 43 p 21 p 32 p 43 x 11 x 22 x 33 x 44 z 11 z 22 z 33 z u 11 u 22 u 33 u 44 x and z = genetic and nonshared environmental innovations respectively n and p = genetic and nonshared environmental transmission respectively u = error variances

Simplex Model Advantages:  Makes restrictive predictions about covariance pattern  Falsifiable

Today’s example Grant et al., 1999, Behavior Genetics, 29, Australian alcohol challenge data, collected between 1979 and 1981  Mean age = 23.5 years Subjects drank 0.75 g/kg alcohol at a steady rate over a 20-minute period. Blood Alcohol Concentration (BAC) was assessed at 6 points after consumption: Minutes post- consump. Mean BAC # of individuals with data MZM (43 prs) DZM (37 prs) BAC BAC BAC BAC BAC BAC

A simplex correlation pattern… BAC 2 BAC 3 BAC 4 BAC 6 BAC BAC BAC BAC Sample correlations (the DZM twin A quadrant of an intraclass correlation matrix)

Practical - Simplex Model AAAA BAC2BAC6BAC4BAC3 EEEE n 21 n 32 n 43 p 21 p 32 p 43 x 11 x 22 x 33 x 44 z 11 z 22 z 33 z u 11 u 22 u 33 u 44 x and z = genetic and nonshared environmental innovations respectively n and p = genetic and nonshared environmental transmission respectively u = error variances

Practical - Simplex Model AAAA BAC2BAC6BAC4BAC3 EEEE n 21 n 32 n 43 p 21 p 32 p 43 x 11 x 22 x 33 x 44 z 11 z 22 z 33 z u 11 u 22 u 33 u 44 x and z = genetic and nonshared environmental innovations respectively n and p = genetic and nonshared environmental transmission respectively u = error variances

Practical - Simplex Model AAAA BAC2BAC6BAC4BAC3 EEEE ? x and z = genetic and nonshared environmental innovations respectively n and p = genetic and nonshared environmental transmission respectively u = error variances ? ? ? ???? ?? ? ?

Full Genetic Simplex Model AAAA BAC2BAC6BAC4BAC3 EEEE Basic_simplex.mxo-2*LL= , 23 est. parameters, 606 df

Sub-Models 1)Is the error variance on individual variable assessments significant? 2)Is the genetic innovations on BAC6 significant? BAC4? BAC2?

Sub-Models 1)Is the error variance on individual variable assessments significant? - drop 200 2)Is the genetic innovations on BAC6 significant? BAC4? BAC2? - drop 4, 3, 2

Simplex Model Advantages:  Makes restrictive predictions about covariance pattern  Falsifiable Disadvantages:  Makes restrictive predictions about covariance pattern (future depends on current state only)  Number of parameters increases with number of measurements

Methods for Longitudinal Analysis Cholesky Models Simplex Models Growth Curve Models

Latent Growth Curve Model (shown here as linear) Mean Level of the Trait (Intercept) Rate of Change In Trait (Slope)

Latent Growth Curve Model (shown here as linear)

Genetically Informative Latent Growth Curve Model

Genetically Informative Latent Growth Curve Model →Like a bivariate model!

Growth Model Questions What is the contribution of genetic/environmental factors to the variation of α (intercept) and  (slope)? Same or different genes influencing α (intercept) and  (slope)? Same or different environments influencing α (intercept) and  (slope)?

Practical Mx latent growth curve example (script from Submodels to test: 1.No covariance between slope and intercept 2.No genetic effect on intercept 3.No genetic effect on slope 4.No common environmental effect on intercept 5.No common environmental effect on slope 6.Best fitting model? (i.e., ACE, AE, CE, E?)

Slope A ACCEE 1.0 (0.5) 1.0 Intercept Time 1Time 2Time 3Time x 11 x 21 x 22 F 11 F 21 F 31 F 41 L L 11 L L L

Practical Mx latent growth curve example (script from Submodels to test: 1.No covariance between slope and intercept – signif decrease in fit 2.No genetic effect on intercept – signif decrease in fit 3.No genetic effect on slope – signif decrease in fit 4.No common environmental effect on intercept - - ns 5.No common environmental effect on slope -- ns 6.Best fitting model? (i.e., ACE, AE, CE, E?) -- AE

Growth Curve Model Advantages:  Very efficient: number of parameters does not increase with number of measurements  Provides prediction about behavior beyond measured timepoints Disadvantages:  Note regarding slope parameters  Can be computationally intense  Assumptions to reduce computational burden Linearity, no genetic effects on residuals, equal variance among residuals at differing timepoints

Latent Growth Curve Modeling Additional Considerations Standard approach assumes data are collected at identical set of fixed ages for all individuals (e.g., start at age 12, yearly assessments) Age heterogeneity and unequal spacing of measurements can be handled using definition variables  Mehta & West, 2000, Psychological Methods

1.00 Age lgdri161 Latent Growth Curve Model with Measured Variable

Extensions of Growth Curve Models Incorporation of measured variables (genotype, environment) Nonlinear growth  Neale, MC & McArdle, JJ (2000). A structured latent growth curves for twin data. Twin Research, 3, Slope Intercept Time 1Time 2Time 3Time LLLL

Latent Growth Curve Modeling McArdle, JJ (1986). Latent variable growth within behavior genetic models. Behavior Genetics, 16, Baker, LA et al. (1992). Biometrical analysis of individual growth curves. Behavior Genetics, 22, McArdle, JJ et al. (1998). A contemporary method for developmental -genetic analyses of age changes in intellectual abilities. Developmental Neuropsychology, 14,

Summary of Longitudinal Models Cholesky Model  Few assumptions, predict any pattern of correlations  Not falsifiable  Limited measurements Simplex Model  Falsifiable  Limited measurements Growth Curve Model  G, E influences on initial level, rate of change  Unlimited measurements  Computationally intensive, assumptions