Introduction to Multivariate Genetic Analysis

Slides:



Advertisements
Similar presentations
Multivariate Twin Analysis
Advertisements

Bivariate analysis HGEN619 class 2007.
Fitting Bivariate Models October 21, 2014 Elizabeth Prom-Wormley & Hermine Maes
Elizabeth Prom-Wormley & Hermine Maes
Univariate Model Fitting
Multivariate Mx Exercise D Posthuma Files: \\danielle\Multivariate.
Path Analysis Danielle Dick Boulder Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.
Multivariate Analysis Nick Martin, Hermine Maes TC21 March 2008 HGEN619 10/20/03.
Introduction to Multivariate Analysis Frühling Rijsdijk & Shaun Purcell Twin Workshop 2004.
Multivariate Genetic Analysis: Introduction(II) Frühling Rijsdijk & Shaun Purcell Wednesday March 6, 2002.
Bivariate Model: traits A and B measured in twins cbca.
Multivariate Analysis Hermine Maes TC19 March 2006 HGEN619 10/20/03.
Genetic Growth Curve Models Practica Boulder, March 2008 Irene Rebollo & Gitta Lubke & Mike Neale VU Amsterdam NL & Notre Dame, US & VCU, US.
Gene x Environment Interactions Brad Verhulst (With lots of help from slides written by Hermine and Liz) September 30, 2014.
Regressing SNPs on a latent variable Michel Nivard & Nick Martin.
Path Analysis Frühling Rijsdijk SGDP Centre Institute of Psychiatry King’s College London, UK.
Introduction to Multivariate Genetic Analysis Kate Morley and Frühling Rijsdijk 21st Twin and Family Methodology Workshop, March 2008.
Path Analysis Frühling Rijsdijk. Biometrical Genetic Theory Aims of session:  Derivation of Predicted Var/Cov matrices Using: (1)Path Tracing Rules (2)Covariance.
Raw data analysis S. Purcell & M. C. Neale Twin Workshop, IBG Colorado, March 2002.
Standard genetic simplex models in the classical twin design with phenotype to E transmission Conor Dolan & Janneke de Kort Biological Psychology, VU 1.
Karri Silventoinen University of Helsinki Osaka University.
Karri Silventoinen University of Helsinki Osaka University.
Continuously moderated effects of A,C, and E in the twin design Conor V Dolan & Sanja Franić (BioPsy VU) Boulder Twin Workshop March 4, 2014 Based on PPTs.
Institute of Psychiatry King’s College London, UK
Introduction to Multivariate Genetic Analysis (2) Marleen de Moor, Kees-Jan Kan & Nick Martin March 7, 20121M. de Moor, Twin Workshop Boulder.
Cholesky decomposition May 27th 2015 Helsinki, Finland E. Vuoksimaa.
Univariate modeling Sarah Medland. Starting at the beginning… Data preparation – The algebra style used in Mx expects 1 line per case/family – (Almost)
Practical SCRIPT: F:\meike\2010\Multi_prac\MultivariateTwinAnalysis_MatrixRaw.r DATA: DHBQ_bs.dat.
Gene-Environment Interaction & Correlation Danielle Dick & Danielle Posthuma Leuven 2008.
Mx modeling of methylation data: twin correlations [means, SD, correlation] ACE / ADE latent factor model regression [sex and age] genetic association.
Mx Practical TC20, 2007 Hermine H. Maes Nick Martin, Dorret Boomsma.
Introduction to Genetic Theory
Linkage in Mx & Merlin Meike Bartels Kate Morley Hermine Maes Based on Posthuma et al., Boulder & Egmond.
Introduction to Multivariate Genetic Analysis Danielle Posthuma & Meike Bartels.
Developmental Models/ Longitudinal Data Analysis Danielle Dick & Nathan Gillespie Boulder, March 2006.
March 7, 2012M. de Moor, Twin Workshop Boulder1 Copy files Go to Faculty\marleen\Boulder2012\Multivariate Copy all files to your own directory Go to Faculty\kees\Boulder2012\Multivariate.
Multivariate Genetic Analysis (Introduction) Frühling Rijsdijk Wednesday March 8, 2006.
Slides: faculty/sanja/2016/Moderating_covariances_IQ_SES/Slides/Moderating_covariances_practical.
Extended Pedigrees HGEN619 class 2007.
Regression Models for Linkage: Merlin Regress
All in the Family The Shared and Distinctive Causes of Personality and Well-Being Chris C. Martin & Corey L. M. Keyes | Department of Sociology,
Genetic simplex model: practical
Multivariate Analysis
Bivariate analysis HGEN619 class 2006.
Univariate Twin Analysis
Gene-environment interaction
Re-introduction to openMx
Heterogeneity HGEN619 class 2007.
MRC SGDP Centre, Institute of Psychiatry, Psychology & Neuroscience
Path Analysis Danielle Dick Boulder 2008
Behavior Genetics The Study of Variation and Heredity
Univariate modeling Sarah Medland.
GxE and GxG interactions
Pak Sham & Shaun Purcell Twin Workshop, March 2002
(Re)introduction to Mx Sarah Medland
Longitudinal Modeling
Sarah Medland faculty/sarah/2018/Tuesday
\nathan\RaterBias.
Getting more from multivariate data: Multiple raters (or ratings)
Lucía Colodro Conde Sarah Medland & Matt Keller
Bivariate Genetic Analysis Practical
Simplex Model Practical
BOULDER WORKSHOP STATISTICS REVIEWED: LIKELIHOOD MODELS
Multivariate Genetic Analysis
Multivariate Genetic Analysis: Introduction
Heritability of Subjective Well-Being in a Representative Sample
Rater Bias & Sibling Interaction Meike Bartels Boulder 2004
Presentation transcript:

Introduction to Multivariate Genetic Analysis Meike Bartels, Hermine Maes, Elizabeth Prom-Wormley and Michel Nivard

Aim and Rationale Aim: to examine the source of factors that make traits correlate or co-vary Rationale: Traits may be correlated due to shared genetic factors (A) or shared environmental factors (C or E) Can use information on multiple traits from twin pairs to partition covariation into genetic and environmental components

Example 1 Why do traits correlate/covary? How can we explain the association? Additive genetic factors (rG) Shared environment (rC) Non-shared environment (rE) rG rC 1 1 1 1 A1 C1 A2 C2 A112 C112 A222 C222 ADHD IQ E112 E222 1 1 E1 E2 rE

Adolescent depression Example 2 Associations between phenotypes over time Does anxiety in childhood lead to depression in adolescence? How can we explain the association? Additive genetic factors (a21) Shared environment (c21) Non-shared environment (e21) How much is not explained by prior anxiety? 1 1 1 1 A1 C1 A2 C2 c21 a11 c11 a21 a22 c22 Childhood anxiety Adolescent depression e11 e21 e22 1 1 E1 E2

Sources of Information For example: two traits measured in twin pairs Interested in: Cross-trait covariance within individuals Cross-trait covariance between twins MZ:DZ ratio of cross-trait covariance between twins

Observed Covariance Matrix Twin 1 Twin 2 Phenotype 1 Phenotype 2 Variance P1 Covariance P1-P2 P2 Within-trait Cross-trait Twin 1 Twin 2

Observed Covariance Matrix Twin 1 Twin 2 Phenotype 1 Phenotype 2 Variance P1 Covariance P1-P2 P2 Within-trait Cross-trait Within-twin covariance Twin 1 Within-twin covariance Twin 2

Observed Covariance Matrix Twin 1 Twin 2 Phenotype 1 Phenotype 2 Variance P1 Covariance P1-P2 P2 Within-trait Cross-trait Within-twin covariance Twin 1 Cross-twin covariance Within-twin covariance Twin 2

SEM: Cholesky Decomposition 1 1 1 1 A1 C1 A2 C2 a11 c11 a22 c22 Twin 1 Phenotype 1 Twin 1 Phenotype 2 e11 e22 1 1 E1 E2

SEM: Cholesky Decomposition 1 1 1 1 A1 C1 A2 C2 c21 a11 c11 a21 a22 c22 Twin 1 Phenotype 1 Twin 1 Phenotype 2 e11 e21 e22 1 1 E1 E2

SEM: Cholesky Decomposition 1/0.5 1/0.5 1 1 1 1 1 1 1 1 1 1 A1 C1 A2 C2 A1 C1 A2 C2 c21 c21 a11 c11 a21 a22 c22 a11 c11 a21 a22 c22 Twin 1 Phenotype 1 Twin 1 Phenotype 2 Twin 2 Phenotype 1 Twin 2 Phenotype 2 e11 e21 e22 e11 e21 e22 1 1 1 1 E1 E2 E1 E2

Cholesky Decomposition Path Tracing

Within-Twin Covariances (A) 1 1 A1 A2 a11 a21 a22 Twin 1 Phenotype 1 Twin 1 Phenotype 2 Twin 1 Phenotype 1 Phenotype 2 a112+c112+e112 a11a21+c11c21+e11e21 a222+a212+c222+c212+e222+e212 Twin 1

Within-Twin Covariances (A) 1 1 A1 A2 a11 a21 a22 Twin 1 Phenotype 1 Twin 1 Phenotype 2 Twin 1 Phenotype 1 Phenotype 2 a112+c112+e112 a11a21+c11c21+e11e21 a222+a212+c222+c212+e222+e212 Twin 1

Within-Twin Covariances (A) 1 1 A1 A2 a11 a21 a22 Twin 1 Phenotype 1 Twin 1 Phenotype 2 Twin 1 Phenotype 1 Phenotype 2 a112+c112+e112 a11a21+c11c21+e11e21 a222+a212+c222+c212+e222+e212 Twin 1

Within-Twin Covariances (A) 1 1 A1 A2 a11 a21 a22 Twin 1 Phenotype 1 Twin 1 Phenotype 2 Twin 1 Phenotype 1 Phenotype 2 a112+c112+e112 a11a21+c11c21+e11e21 a222+a212+c222+c212+e222+e212 Twin 1

Within-Twin Covariances (C) 1 1 C1 C2 c11 c21 c22 Twin 1 Phenotype 1 Twin 1 Phenotype 2 Twin 1 Phenotype 1 Phenotype 2 a112+c112+e112 a11a21+c11c21+e11e21 a222+a212+c222+c212+e222+e212 Twin 1

Within-Twin Covariances (E) Phenotype 1 Twin 1 Phenotype 2 e11 e21 e22 1 1 E1 E2 Twin 1 Phenotype 1 Phenotype 2 a112+c112+e112 a11a21+c11c21+e11e21 a222+a212+c222+c212+e222+e212 Twin 1

Cross-Twin Covariances (A) 1/0.5 1/0.5 1 1 1 1 A1 A2 A1 A2 a11 a21 a22 a11 a21 a22 Twin 1 Phenotype 1 Twin 1 Phenotype 2 Twin 2 Phenotype 1 Twin 2 Phenotype 2 Twin 1 Phenotype 1 Phenotype 2 +c11c21 +c222+c212 Twin 2

Cross-Twin Covariances (A) 1/0.5 1/0.5 1 1 1 1 A1 A2 A1 A2 a11 a21 a22 a11 a21 a22 Twin 1 Phenotype 1 Twin 1 Phenotype 2 Twin 2 Phenotype 1 Twin 2 Phenotype 2 Twin 1 Phenotype 1 Phenotype 2 1/0.5a112+ +c11c21 +c222+c212 Twin 2

Cross-Twin Covariances (A) 1/0.5 1/0.5 1 1 1 1 A1 A2 A1 A2 a11 a21 a22 a11 a21 a22 Twin 1 Phenotype 1 Twin 1 Phenotype 2 Twin 2 Phenotype 1 Twin 2 Phenotype 2 Twin 1 Phenotype 1 Phenotype 2 1/0.5a112+c112 1/0.5a11a21+c11c21 +c222+c212 Twin 2

Cross-Twin Covariances (A) 1/0.5 1/0.5 1 1 1 1 A1 A2 A1 A2 a11 a21 a22 a11 a21 a22 Twin 1 Phenotype 1 Twin 1 Phenotype 2 Twin 2 Phenotype 1 Twin 2 Phenotype 2 Twin 1 Phenotype 1 Phenotype 2 1/0.5a112+c112 1/0.5a11a21+c11c21 1/0.5a222+1/0.5a212+c222+c212 Twin 2

Cross-Twin Covariances (C) 1 1 1 1 1 1 C1 C2 C1 C2 c21 c21 c11 c22 c11 c22 Twin 1 Phenotype 1 Twin 1 Phenotype 2 Twin 2 Phenotype 1 Twin 2 Phenotype 2 Twin 1 Phenotype 1 Phenotype 2 1/0.5a112+c112 1/0.5a11a21+c11c21 1/0.5a222+1/0.5a212+c222+c212 Twin 2

Predicted Model Twin 1 Twin 2 Twin 1 Twin 2 Within-twin covariance Phenotype 1 Phenotype 2 a112+c112+e112 a11a21+c11c21+e11e21 a222+a212+c222+c212+e222+e212 1/.5a112+c112 1/.5a11a21+ c11c21 1/.5a222+1/.5 a212+c222+c212 Within-twin covariance Twin 1 Cross-twin covariance Within-twin covariance Twin 2

Predicted Model Twin 1 Twin 2 Twin 1 Twin 2 Within-twin covariance Phenotype 1 Phenotype 2 Variance P1 Covariance P1-P2 P2 Within-trait Cross-trait Within-twin covariance Twin 1 Cross-twin covariance Within-twin covariance Twin 2

Predicted Model Twin 1 Twin 2 Twin 1 Twin 2 Within-twin covariance Phenotype 1 Phenotype 2 Variance P1 Covariance P1-P2 P2 Within-trait Cross-trait Within-twin covariance Variance of P1 and P2 the same across twins and zygosity groups Twin 1 Cross-twin covariance Within-twin covariance Twin 2

Predicted Model Twin 1 Twin 2 Twin 1 Twin 2 Within-twin covariance Phenotype 1 Phenotype 2 Variance P1 Covariance P1-P2 P2 Within-trait Cross-trait Within-twin covariance Covariance of P1 and P2 the same across twins and zygosity groups Twin 1 Cross-twin covariance Within-twin covariance Twin 2

Predicted Model Twin 1 Twin 2 Twin 1 Twin 2 Within-twin covariance Phenotype 1 Phenotype 2 Variance P1 Covariance P1-P2 P2 Within-trait Cross-trait Within-twin covariance Cross-twin covariance within each trait differs by zygosity Twin 1 Cross-twin covariance Within-twin covariance Twin 2

Predicted Model Twin 1 Twin 2 Twin 1 Twin 2 Within-twin covariance Phenotype 1 Phenotype 2 Variance P1 Covariance P1-P2 P2 Within-trait Cross-trait Within-twin covariance Cross-twin cross-trait covariance differs by zygosity Twin 1 Cross-twin covariance Within-twin covariance Twin 2

Example Covariance Matrix MZ Twin 1 Twin 2 P1 P2 1 .30 0.79 0.49 0.50 0.59 0.29 Within-twin covariance Twin 1 Cross-twin covariance Within-twin covariance Twin 2 Twin 1 Twin 2 DZ P1 P2 1 0.30 0.39 0.25 0.24 0.43 0.31 Within-twin covariance Twin 1 Cross-twin covariance Within-twin covariance Twin 2

Example Covariance Matrix MZ Twin 1 Twin 2 P1 P2 1 .30 0.79 0.49 0.50 0.59 0.29 Within-twin covariance Twin 1 Cross-twin covariance Within-twin covariance Twin 2 Twin 1 Twin 2 DZ P1 P2 1 0.30 0.39 0.25 0.24 0.43 0.31 Within-twin covariance Twin 1 Cross-twin covariance Within-twin covariance Twin 2

Example Covariance Matrix MZ Twin 1 Twin 2 P1 P2 1 .30 0.79 0.49 0.50 0.59 0.29 Within-twin covariance Twin 1 Cross-twin covariance Within-twin covariance Twin 2 Twin 1 Twin 2 DZ P1 P2 1 0.30 0.39 0.25 0.24 0.43 0.31 Within-twin covariance Twin 1 Cross-twin covariance Within-twin covariance Twin 2

Summary Within-individual cross-trait covariance implies common aetiological influences Cross-twin cross-trait covariance implies common aetiological influences are familial Whether familial influences genetic or environmental shown by MZ:DZ ratio of cross-twin cross-trait covariances

Cholesky Decomposition Bivariate Genetic analyses Specification in OpenMx

Within-Twin Covariance 1 1 Path Tracing: A1 A2 a11 a21 a22 P1 P2

Within-Twin Covariance 1 1 Path Tracing: A1 A2 a11 a21 a22 a Lower 2 x 2: a1 a2 P1 P2 P1 P2

Starting values and labels

Within-Twin Covariance 1 1 Path Tracing: A1 A2 a11 a11 a21 a21 a22 a22 a Lower 2 x 2: a1 a2 P1 P2 P1 P2

Within-Twin Covariance

Total Within-Twin Covar. Using matrix addition, the total within-twin covariance for the phenotypes is defined as:

OpenMx Matrices & Algebra

Additive Genetic Cross-Twin Covariance (DZ) 0.5 0.5 Path Tracing: Within-traits P11-P12 = 0.5a112 P21-P22 = 0.5a222+0.5a212 Cross-traits P11-P22 = 0.5a11a21 P21-P12 = 0.5a21a11 1 1 1 1 A1 A2 A1 A2 a11 a21 a22 a11 a21 a22 P11 P21 P12 P22 Twin 1 Twin 2

Additive Genetic Cross-Twin Covariance (MZ) 1 1 1 1 1 1 A1 A2 A1 A2 a11 a21 a22 a11 a21 a22 P11 P21 P12 P22 Twin 1 Twin 2

Common Environment Cross-Twin Covariance 1 1 1 1 1 1 C1 C2 C1 C2 c11 c21 c22 c11 c21 c22 P11 P21 P12 P22 Twin 1 Twin 2

Covariance Model for Twin Pairs

Obtaining Standardized Estimates

Three Important Results Variance Decomposition -> Heritability, (Shared) environmental influences Covariance Decomposition -> The influences of genes and environment on the covariance between the two variables “how much of the phenotypic correlation is accounted for by genetic and environmental influences” Genetic and Environmental correlations -> the overlap in genes and environmental effects “is there a large overlap in gene/ environmental sets”

OpenMx Output Variance Decomposition -> Heritability, (Shared) environmental influences Covariance Decomposition -> The influences of genes and environment on the covariance between the two variables A A C C E E SA SA SC SC SE SE VC 0.4885 0.3446 0.0934 0.0540 0.4340 0.0831 0.4809 0.7154 0.0919 0.1121 0.4272 0.1725 VC 0.3446 0.4984 0.0540 0.0312 0.0831 0.7180 0.7154 0.3995 0.1121 0.0250 0.1725 0.5755

Genetic correlation 1/.5 A1 A2 a11 P11 P21 a22 a21 P12 P22 Twin 1

OpenMx Output fitACE$rA [,1] [,2] [1,] 1.00000000 0.69847252 [,1] [,2] [1,] 1.00000000 0.69847252 [2,] 0.69847252 1.00000000 fitACE$rC [,1] [,2] [1,] 1.00000000 0.99999984 [2,] 0.99999984 1.00000000 fitACE$rE [,1] [,2] [1,] 1.00000000 0.14881543 [2,] 0.14881543 1.00000000

Genetic correlation & contribution to observed correlation rg If the rg = 1, the two sets of genes overlap completely A1 A2 If however a11 and a22 are near to zero, genes do not contribute to the observed correlation a11 a22 P11 P21 Twin 1

Interpreting Results High genetic correlation = large overlap in genetic effects on the two phenotypes Does it mean that the phenotypic correlation between the traits is largely due to genetic effects? No: the substantive importance of a particular rG depends the value of the correlation and the value of the A2 paths i.e. importance is also determined by the heritability of each phenotype

More Variables… a32 a33 c33 c21 c22 a21 c31 a31 c32 a11 c11 a22 e21 Twin 1 Phenotype 1 Twin 1 Phenotype 2 Twin 1 Phenotype 3 e21 e11 e31 e32 e22 e33 1 1 1 E1 E2 E3

More Variables… 1/0.5 1/0.5 1/0.5 1 1 1 a32 a33 c33 a32 a33 c33 c21 Twin 1 Phenotype 1 Twin 1 Phenotype 2 Twin 1 Phenotype 3 Twin 2 Phenotype 1 Twin 2 Phenotype 2 Twin 2 Phenotype 3 e21 e21 e11 e31 e32 e11 e31 e32 e22 e33 e22 e33 1 1 1 1 1 1 E1 E2 E3 E1 E2 E3

Expanded Matrices a Lower 3 x 3 c Lower 3 x 3 e Lower 3 x 3

OpenMx Parameter Matrices vars <- c(’varx', ’vary’, ‘varz’) nv <- 3 OpenMx

Practical SCRIPT: DATA: F:\meike\2016\Multivariate\Bivariate DHBQ_bs.dat

DATA - General Family Functioning, Subjective Happiness - T1, T2, brother, sister - missing -999 58

Observed Cross-twin Cross-trait Correlations colMeans(mzData,na.rm=TRUE) colMeans(dzData,na.rm=TRUE) cov(mzData,use="complete") cov(dzData,use="complete") cor(mzData,use="complete") cor(dzData,use="complete”) 59

Observed Cross-twin Cross-trait Correlations > cor(mzData,use="complete") family1 happy1 family2 happy2 family1 1.00 0.40 0.58 0.37 happy1 0.40 1.00 0.35 0.46 family2 0.58 0.35 1.00 0.41 happy2 0.37 0.46 0.41 1.00 > cor(dzData,use="complete") family1 happy1 family2 happy2 family1 1.00 0.47 0.30 0.06 happy1 0.47 1.00 0.30 0.21 family2 0.30 0.30 1.00 0.37 happy2 0.06 0.21 0.37 1.00 60

PRAC I. The ACE model and its estimates 1. Run the ACE model 2. What is the heritability of FAM and HAP? 3. What is the genetic influence on the covariance? 4. What is the genetic correlation? 61

OpenMx Output Variance Decomposition -> Heritability, (Shared) environmental influences Covariance Decomposition -> The influences of genes and environment on the covariance between the two variables A A C C E E SA SA SC SC SE SE VC 0.4885 0.3446 0.0934 0.0540 0.4340 0.0831 0.4809 0.7154 0.0919 0.1121 0.4272 0.1725 VC 0.3446 0.4984 0.0540 0.0312 0.0831 0.7180 0.7154 0.3995 0.1121 0.0250 0.1725 0.5755

OpenMx Output fitACE$rA [,1] [,2] [1,] 1.00000000 0.69847252 [,1] [,2] [1,] 1.00000000 0.69847252 [2,] 0.69847252 1.00000000 fitACE$rC [,1] [,2] [1,] 1.00000000 0.99999984 [2,] 0.99999984 1.00000000 fitACE$rE [,1] [,2] [1,] 1.00000000 0.14881543 [2,] 0.14881543 1.00000000

PRAC II. Trivariate Model Add a third variable (Satisfaction with Life) to the model Run model What are the parameter estimates? What is the genetic correlation? 64

Changes that had to be made # Select Variables for Analysis Vars <- c('family','happy', 'life') nv <- 3 # number of variables ntv <- nv*2 # number of total variables selVars <- paste(Vars,c(rep(1,nv),rep(2,nv)),sep="") svMe <- c(7,5,5) 65

Genetic and Environmental Influences [1] "Matrix A/V" stCovA1 stCovA2 stCovA3 family 0.4117 0.6549 0.5697 happy 0.6549 0.3297 0.3224 life 0.5697 0.3224 0.2246 [1] "Matrix C/V" stCovC1 stCovC2 stCovC3 Family 0.1517 0.1709 0.3004 happy 0.1709 0.0681 0.1275 life 0.3004 0.1275 0.1406 [1] "Matrix E/V" stCovE1 stCovE2 stCovE3 Family 0.4366 0.1742 0.1299 happy 0.1742 0.6022 0.5501 life 0.1299 0.5501 0.6348 66

Genetic and Environmental Correlations [1] "Matrix solve(sqrt(I*A)) %&% A" Family Happy Life Family 1.00 0.76 0.78 Happy 0.76 1.00 0.88 life 0.78 0.88 1.00 [1] "Matrix solve(sqrt(I*C)) %&% C" Family 1.00 0.72 0.85 happy 0.72 1.00 0.97 life 0.85 0.97 1.00 [1] "Matrix solve(sqrt(I*E)) %&% E" Family 1.00 0.14 0.10 happy 0.14 1.00 0.66 life 0.10 0.66 1.00 67

Genetic Models Common Pathway Model Independent Pathway Model Genetic and environmental factor analysis examples Common Pathway Model Independent Pathway Model 68