Sarah Medland faculty/sarah/2018/Tuesday

Slides:



Advertisements
Similar presentations
Bivariate analysis HGEN619 class 2007.
Advertisements

Fitting Bivariate Models October 21, 2014 Elizabeth Prom-Wormley & Hermine Maes
Univariate Model Fitting
Multivariate Mx Exercise D Posthuma Files: \\danielle\Multivariate.
(Re)introduction to OpenMx Sarah Medland. Starting at the beginning  Opening R Gui – double click Unix/Terminal – type R  Closing R Gui – click on the.
Path Analysis Danielle Dick Boulder Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.
(Re)introduction to Mx Sarah Medland. KiwiChinese Gooseberry.
Multivariate Genetic Analysis: Introduction(II) Frühling Rijsdijk & Shaun Purcell Wednesday March 6, 2002.
Extended sibships Danielle Posthuma Kate Morley Files: \\danielle\ExtSibs.
Quantitative Genetics
ACDE model and estimability Why can’t we estimate (co)variances due to A, C, D and E simultaneously in a standard twin design?
(Re)introduction to Mx. Starting at the beginning Data preparation Mx expects 1 line per case/family Almost limitless number of families and variables.
Univariate Analysis in Mx Boulder, Group Structure Title Type: Data/ Calculation/ Constraint Reading Data Matrices Declaration Assigning Specifications/
Genetic Dominance in Extended Pedigrees: Boulder, March 2008 Irene Rebollo Biological Psychology Department, Vrije Universiteit Netherlands Twin Register.
Path Analysis Frühling Rijsdijk SGDP Centre Institute of Psychiatry King’s College London, UK.
Mx Practical TC18, 2005 Dorret Boomsma, Nick Martin, Hermine H. Maes.
Introduction to Multivariate Genetic Analysis Kate Morley and Frühling Rijsdijk 21st Twin and Family Methodology Workshop, March 2008.
Path Analysis Frühling Rijsdijk. Biometrical Genetic Theory Aims of session:  Derivation of Predicted Var/Cov matrices Using: (1)Path Tracing Rules (2)Covariance.
Raw data analysis S. Purcell & M. C. Neale Twin Workshop, IBG Colorado, March 2002.
Linkage Analysis in Merlin
Path Analysis HGEN619 class Method of Path Analysis allows us to represent linear models for the relationship between variables in diagrammatic.
Karri Silventoinen University of Helsinki Osaka University.
Copy the folder… Faculty/Sarah/Tues_merlin to the C Drive C:/Tues_merlin.
Karri Silventoinen University of Helsinki Osaka University.
Karri Silventoinen University of Helsinki Osaka University.
Introduction to Multivariate Genetic Analysis (2) Marleen de Moor, Kees-Jan Kan & Nick Martin March 7, 20121M. de Moor, Twin Workshop Boulder.
Introduction to OpenMx Sarah Medland. What is OpenMx? Free, Open-source, full–featured SEM package Software which runs on Windows, Mac OSX, and Linux.
Cholesky decomposition May 27th 2015 Helsinki, Finland E. Vuoksimaa.
Univariate modeling Sarah Medland. Starting at the beginning… Data preparation – The algebra style used in Mx expects 1 line per case/family – (Almost)
Practical SCRIPT: F:\meike\2010\Multi_prac\MultivariateTwinAnalysis_MatrixRaw.r DATA: DHBQ_bs.dat.
Power and Sample Size Boulder 2004 Benjamin Neale Shaun Purcell.
The importance of the “Means Model” in Mx for modeling regression and association Dorret Boomsma, Nick Martin Boulder 2008.
Univariate Analysis Hermine Maes TC21 March 2008.
SEM Basics 2 Byrne Chapter 2 Kline pg 7-15, 50-51, ,
Mx modeling of methylation data: twin correlations [means, SD, correlation] ACE / ADE latent factor model regression [sex and age] genetic association.
Mx Practical TC20, 2007 Hermine H. Maes Nick Martin, Dorret Boomsma.
Model building & assumptions Matt Keller, Sarah Medland, Hermine Maes TC21 March 2008.
Introduction to Genetic Theory
Welcome  Log on using the username and password you received at registration  Copy the folder: F:/sarah/mon-morning To your H drive.
Introduction to Multivariate Genetic Analysis Danielle Posthuma & Meike Bartels.
QTL Mapping Using Mx Michael C Neale Virginia Institute for Psychiatric and Behavioral Genetics Virginia Commonwealth University.
March 7, 2012M. de Moor, Twin Workshop Boulder1 Copy files Go to Faculty\marleen\Boulder2012\Multivariate Copy all files to your own directory Go to Faculty\kees\Boulder2012\Multivariate.
Multivariate Genetic Analysis (Introduction) Frühling Rijsdijk Wednesday March 8, 2006.
Extended Pedigrees HGEN619 class 2007.
Ordinal Data Sarah Medland.
Univariate Twin Analysis
Introduction to OpenMx
Introduction to Multivariate Genetic Analysis
Fitting Univariate Models to Continuous and Categorical Data
Re-introduction to openMx
MRC SGDP Centre, Institute of Psychiatry, Psychology & Neuroscience
Path Analysis Danielle Dick Boulder 2008
Can resemblance (e.g. correlations) between sib pairs, or DZ twins, be modeled as a function of DNA marker sharing at a particular chromosomal location?
More on thresholds Sarah Medland.
I Have the Power in QTL linkage: single and multilocus analysis
Genetics of qualitative and quantitative phenotypes
Univariate modeling Sarah Medland.
Why general modeling framework?
Pak Sham & Shaun Purcell Twin Workshop, March 2002
(Re)introduction to Mx Sarah Medland
More on thresholds Sarah Medland.
Univariate Linkage in Mx
Power Calculation for QTL Association
BOULDER WORKSHOP STATISTICS REVIEWED: LIKELIHOOD MODELS
Power Calculation Practical
Power Calculation Practical
Multivariate Genetic Analysis: Introduction
Rater Bias & Sibling Interaction Meike Bartels Boulder 2004
Presentation transcript:

Sarah Medland faculty/sarah/2018/Tuesday Univariate modeling All models are wrong some are useful (George Box, 1978) Sarah Medland faculty/sarah/2018/Tuesday

Starting at the beginning… Data preparation The algebra style used in Mx expects 1 line per case/family (Almost) limitless number of families and variables Missing data Default missing code is NA No missing covariates/definition variables!

Selecting and sub-setting data Make separate data sets for the MZ and DZ Check data is numeric and behaves as expected cov (dzData, use=“complete”)

Important structural stuff openMx has a very fluid and flexible stucture Each code snippet is being saved as an object We tend to reuse the object names in our scripts This makes it very important to clear or create a new project for each series of analyses Remember the project also contains the data so these files can become very large.

Matrices are the building blocks Many types eg. type="Lower" Denoted by names eg. name="a“ Size eg. nrow=nv, ncol=nv All estimated parameters must be placed in a matrix & Mx must be told what type of matrix it is mxMatrix( type="Lower", nrow=nv, ncol=nv, free=TRUE, values=.6, label="a11", name="a" ), #X

Choosing the model Thinking about parameter space… Imagine an ACE model Solution space bounded by CIs

Choosing the model ACE vs ADE With twins alone can’t joint estimate ACDE Options Add in an extra relationship Fix one of these parameters and estimate the other 3 Accept this limitation All models are wrong some are useful (George E. P. Box) Reject the twin model, pretend genes have no influence and interpret biological inheritance as a social phenomenon No 1 size fits all solution

Choosing the model

Yesterday we ran an ADE Model 0.30 0.78 Why?

What is D again? Dominance refers to non-additive genetic effects resulting from interactions between alleles at the same locus or different loci (epistasis)

What is D again? DZ twins/full siblings share ~50% of their segregating DNA & for ~25% loci they share not only the genotype but also the parental origin of each allele

DZ twins/full siblings share ~50% of their segregating DNA & for ~25% loci they share not only the genotype but also the parental origin of each allele This is where the .5A comes from Consider a mating between mother AB x father CD: Sib2 Sib1 AC AD BC BD 2 1 This is where the .25D comes from IBD 0 : 1 : 2 = 25% : 50% : 25%

Today we will run an ACE model VE VC VA VA VC VE VE VC VA VA VC VE 1 1 1 1 MZ DZ a2+c2+e2 a2+c2 a2+c2+e2 .5a2+c2

Today we will run an ACE model Additive genetic effects Why is the coefficient for DZ pairs .5? Average genetic sharing between siblings/DZ twins Sib2 Sib1 AC AD BC BD 2 1 MZ DZ a2+c2+e2 a2+c2 a2+c2+e2 .5a2+c2

Today we will run an ACE model Common environmental effects Coefficient =1 for MZ and DZ pairs Equal environment assumption – for all the environmental influences THAT MATTER there is ON AVERAGE no differences in the degree of environmental sharing between MZ and DZ pairs MZ DZ a2+c2+e2 a2+c2 a2+c2+e2 .5a2+c2

Today we will run an ACE model MZ DZ a2+c2+e2 a2+c2 a2+c2+e2 .5a2+c2

Today we will run an ACE model

We are going to include the effect of age on the means

Covariances MZ DZ a2+c2+e2 a2+c2 a2+c2+e2 .5a2+c2

To fit a model to data, the differences between the observed covariance matrix and model-implied expected covariance matrix are minimized. Objective functions are functions for which free parameter values are chosen such that the value of the objective function is minimized. mxFitFunctionML() uses full-information maximum likelihood to provide maximum likelihood estimates of free parameters in the algebra defined by the covariance and means arguments.

What to report Summary statistics Standardized estimates Usually from a simplified ‘saturated’ model Standardized estimates Easier to conceptualise ie 40% of the phenotypic variance vs a genetic effect of 2.84 Can easily be returned to original scale if summary statistics are provided

What to report Path coefficients / Directly estimated variances Very important in multivariate analyses Gives a much clearer picture of the directionality of effects For a path coefficent model also report Variance components (squared paths) these represent the proportion of variance explained Genetic correlations

How to report it? Path diagram onyx (java based cross platform) http://onyx.brandmaier.de/ Clasic MX GUI (complex to setup in mac) http://www.vipbg.vcu.edu/vipbg/mxgui/ umx with graphviz (complex to setup in windows) omnigraffle (mac only) https://www.omnigroup.com/omnigraffle

A=77.66, C=0, E=22.34 Δ-2LL AE-ACE = -4.2523425e-09 .47 0 .88 .88 0 .47 .88 0 .47 A=77.66, C=0, E=22.34 Δ-2LL AE-ACE = -4.2523425e-09

A=77.66, C=0, E=22.34 Δ-2LL AE-ACE = -4.2523425e-09 .47 0 .88 .88 0 .47 .88 0 .47 A=77.66, C=0, E=22.34 Δ-2LL AE-ACE = -4.2523425e-09 1 .22 -.13 .90

A=77.66, C=0, E=22.34 Δ-2LL AE-ACE = -4.2523425e-09 .47 0 .88 .88 0 .47 .88 0 .47 A=77.66, C=0, E=22.34 Δ-2LL AE-ACE = -4.2523425e-09 A=77.97, C=0, E=22.03 Δ-2LL AE-ACE = 1.98

General Advice/Problem solving Scripting styles differ Check the sample description Learn to love the webpage Comments are your friends

“Remember that all models are wrong; the practical question is how wrong do they have to be to not be useful” George E P Box and Norman R Draper. 1986. Empirical Model-Building and Response Surface. John Wiley & Sons, Inc., New York, NY, USA.