13 October 2004Statistics: Yandell © 20041 Inferring Genetic Architecture of Complex Biological Processes Brian S. Yandell 12, Christina Kendziorski 13,

Slides:



Advertisements
Similar presentations
Linkage and Genetic Mapping
Advertisements

The genetic dissection of complex traits
Genetic Analysis of Genome-wide Variation in Human Gene Expression Morley M. et al. Nature 2004,430: Yen-Yi Ho.
Note that the genetic map is different for men and women Recombination frequency is higher in meiosis in women.
Experimental crosses. Inbred Strain Cross Backcross.
12 Jan 2006Hort Retreat © Brian S Yandell1 Genetics, Microarrays & Evolution: Issues as a Statistician (sans formula) Brian S. Yandell Horticulture, Statistics.
Linkage Analysis: An Introduction Pak Sham Twin Workshop 2001.
QTL Mapping R. M. Sundaram.
1 QTL mapping in mice Lecture 10, Statistics 246 February 24, 2004.
Multiple Correlated Traits Pleiotropy vs. close linkage Analysis of covariance – Regress one trait on another before QTL search Classic GxE analysis Formal.
Gene expression analysis summary Where are we now?
How to find genetic determinants of naturally varying traits?
Quantitative Genetics
MSc GBE Course: Genes: from sequence to function Genome-wide Association Studies Sven Bergmann Department of Medical Genetics University of Lausanne Rue.
Seattle Summer Institute : Systems Genetics for Experimental Crosses Brian S. Yandell, UW-Madison Elias Chaibub Neto, Sage Bionetworks
Give me your DNA and I tell you where you come from - and maybe more! Lausanne, Genopode 21 April 2010 Sven Bergmann University of Lausanne & Swiss Institute.
Linkage and LOD score Egmond, 2006 Manuel AR Ferreira Massachusetts General Hospital Harvard Medical School Boston.
Characterizing the role of miRNAs within gene regulatory networks using integrative genomics techniques Min Wenwen
Methods of Genome Mapping linkage maps, physical maps, QTL analysis The focus of the course should be on analytical (bioinformatic) tools for genome mapping,
Thursday from QTL to candidate genes Xidan Li Xiaodong Liu DJ de Koning.
Professor Brian S. Yandell joint faculty appointment across colleges: –50% Horticulture (CALS) –50% Statistics (Letters & Sciences) Biometry Program –MS.
The Complexities of Data Analysis in Human Genetics Marylyn DeRiggi Ritchie, Ph.D. Center for Human Genetics Research Vanderbilt University Nashville,
Experimental Design and Data Structure Supplement to Lecture 8 Fall
Quantitative Genetics. Continuous phenotypic variation within populations- not discrete characters Phenotypic variation due to both genetic and environmental.
Complex Traits Most neurobehavioral traits are complex Multifactorial
Quantitative Genetics
QTL Mapping in Heterogeneous Stocks Talbot et al, Nature Genetics (1999) 21: Mott et at, PNAS (2000) 97:
CK, October, 2003 A Hidden Markov Model for Microarray Time Course Data Christina Kendziorski and Ming Yuan Department of Biostatistics and Medical Informatics.
Lecture 24: Quantitative Traits IV Date: 11/14/02  Sources of genetic variation additive dominance epistatic.
BayesNCSU QTL II: Yandell © Bayesian Interval Mapping 1.what is Bayes? Bayes theorem? Bayesian QTL mapping Markov chain sampling18-25.
Yandell © June Bayesian analysis of microarray traits Arabidopsis Microarray Workshop Brian S. Yandell University of Wisconsin-Madison
Yandell © 2003JSM Dimension Reduction for Mapping mRNA Abundance as Quantitative Traits Brian S. Yandell University of Wisconsin-Madison
Intro to current collaboration project Why type 2 diabetes? Overview of glucose metabolism and OXPHOS Discussion of Mootha et. al Nature Genetics, 2003.
Genetic correlations and associative networks for CNS transcript abundance and neurobehavioral phenotypes in a recombinant inbred mapping panel Elissa.
Chapter 22 - Quantitative genetics: Traits with a continuous distribution of phenotypes are called continuous traits (e.g., height, weight, growth rate,
Yandell © 2003Wisconsin Symposium III1 QTLs & Microarrays Overview Brian S. Yandell University of Wisconsin-Madison
Why you should know about experimental crosses. To save you from embarrassment.
What is a QTL? Quantitative trait locus (loci) Region of chromosome that contributes to variation in a quantitative trait Generally used to study “complex.
Gene Mapping for Correlated Traits Brian S. Yandell University of Wisconsin-Madison Correlated TraitsUCLA Networks (c)
An atlas of genetic influences on human blood metabolites Nature Genetics 2014 Jun;46(6)
ModelNCSU QTL II: Yandell © Model Selection for Multiple QTL 1.reality of multiple QTL3-8 2.selecting a class of QTL models comparing QTL models16-24.
Bayesian Variable Selection in Semiparametric Regression Modeling with Applications to Genetic Mappping Fei Zou Department of Biostatistics University.
EQTLs.
Naomi Ziv, Mark Siegal and David Gresham
Professor Brian S. Yandell
Identifying QTLs in experimental crosses
University of Tennessee-Memphis
upstream vs. ORF binding and gene expression?
Statistical Applications in Biology and Genetics
Mapping variation in growth in response to glucose concentration
Model Selection for Multiple QTL
Multiple Traits & Microarrays
Graphical Diagnostics for Multiple QTL Investigation Brian S
Gene Hunting: Design and statistics
Gene mapping in mice Karl W Broman Department of Biostatistics
Inferring Genetic Architecture of Complex Biological Processes BioPharmaceutical Technology Center Institute (BTCI) Brian S. Yandell University of Wisconsin-Madison.
Inferring Genetic Architecture of Complex Biological Processes Brian S
4. modern high throughput biology
High Throughput Gene Mapping Brian S. Yandell
Brian S. Yandell University of Wisconsin-Madison 26 February 2003
Linking Genetic Variation to Important Phenotypes
Complex Traits Qualitative traits. Discrete phenotypes with direct Mendelian relationship to genotype. e.g. black or white, tall or short, sick or healthy.
Inferring Causal Phenotype Networks Driven by Expression Gene Mapping
Seattle SISG: Yandell © 2008
examples in detail days to flower for Brassica napus (plant) (n = 108)
Inferring Genetic Architecture of Complex Biological Processes Brian S
Seattle SISG: Yandell © 2006
In these studies, expression levels are viewed as quantitative traits, and gene expression phenotypes are mapped to particular genomic loci by combining.
Brian S. Yandell University of Wisconsin-Madison
Evan G. Williams, Johan Auwerx  Cell 
Presentation transcript:

13 October 2004Statistics: Yandell © Inferring Genetic Architecture of Complex Biological Processes Brian S. Yandell 12, Christina Kendziorski 13, Hong Lan 4, Jessica Byers 45, Elias Chaibub 1, Alan D. Attie 4 CIBM Training Program Retreat Department of Statistics 2 Department of Horticulture 3 Department of Biostatistics & Medical Informatics 4 Department of Biochemistry 5 Department of Nutritrional Sciences University of Wisconsin-Madison

13 October 2004Statistics: Yandell © Gene mapping infers the relationship between genotype and phenotype in a segregating population. We map thousands of mRNA expression phenotypes, or expression QTL, using dimension reduction methods to uncover correlated genetic architecture, including number and location of genomic regions as well as gene action and epistasis. We show a novel blending of principal components and discriminant analysis with functional information to detect multiple expression QTL that together may affect the expression of many correlated mRNA. These common patterns of gene action are largely overlooked by simple interval mapping when conducted separately for each mRNA. In our current study with 60 F2 mice from a B6-BTBR ob/ob model of diabetes and over 40,000 mRNA measured with Affymetrix chips, we find three pairs of genomic regions of particular interest associated with signal transduction, apoptosis, and lipid metabolism. We propose to join genetic architecture with graphical models of biochemical activity. Our approach is directly applicable to gene mapping for other “omic” measurements on the horizon.

13 October 2004Statistics: Yandell © glucoseinsulin (courtesy AD Attie)

13 October 2004Statistics: Yandell © studying diabetes in an F2 mouse model: segregating panel from inbred lines –B6.ob x BTBR.ob  F1  F2 –selected mice with ob/ob alleles at leptin gene (Chr 6) –sacrificed at 14 weeks, tissues preserved physiological study (Stoehr et al Diabetes) –mapped body weight, insulin, glucose at various ages gene expression studies –RT-PCR for a few mRNA on 108 F2 mice liver tissues (Lan et al Diabetes; Lan et al Genetics) –Affymetrix microarrays on 60 F2 mice liver tissues U47 A & B chips, RMA normalization design: selective phenotyping (Jin et al Genetics)

13 October 2004Statistics: Yandell © The intercross (from K Broman)

13 October 2004Statistics: Yandell © interval mapping basics observed measurements –Y = phenotypic trait –X = markers & linkage map i = individual index 1,…,n missing data –missing marker data –Q = QT genotypes alleles QQ, Qq, or qq at locus unknown quantities –M = genetic architecture – = QT locus (or loci) –  = phenotype model parameters pr(Q=q|X, ) genotype model –grounded by linkage map, experimental cross –recombination yields multinomial for Q given X f(Y|  q ) phenotype model –distribution shape (assumed normal here) –unknown parameters  (could be non-parametric) after Sen & Churchill (2001 Genetics)  M

13 October 2004Statistics: Yandell © heterogeneity: many genes can affect phenotype –different allelic combinations can yield similar phenotypes –multiple genes can affecting phenotype in subtle ways –multiple genes can interact (epistasis) genetic architecture: model for explained genetic variation –loci (genomic regions) that affect trait –genotypic effects of loci, including possible epistasis M = { 1, 2, 3, ( 1, 2 )} = 3 loci with epistasis between two  q =  0 +  1q +  2q +  3q +  (1,2)q = linear model for genotypic mean = ( 1, 2, 3 ) = loci in model M q = (q 1, q 2, q 3 )= possible genotype at loci Q = (Q 1, Q 2, Q 3 )= genotype for each individual at loci genetic architecture: heterogeneity

13 October 2004Statistics: Yandell © hetereogeneity: many genes affect each trait major QTL on linkage map major QTL minor QTL polygenes (modifiers)

13 October 2004Statistics: Yandell © SCD mRNA expression phenotype 2-D scan for QTL (R/qtl) epistasis LOD peaks joint LOD peaks

13 October 2004Statistics: Yandell © M observations 30,000 traits 60 mice

13 October 2004Statistics: Yandell © modern high throughput biology measuring the molecular dogma of biology –DNA  RNA  protein  metabolites –measured one at a time only a few years ago massive array of measurements on whole systems (“omics”) –thousands measured per individual (experimental unit) –all (or most) components of system measured simultaneously whole genome of DNA: genes, promoters, etc. all expressed RNA in a tissue or cell all proteins all metabolites systems biology: focus on network interconnections –chains of behavior in ecological community –underlying biochemical pathways genetics as one experimental tool –perturb system by creating new experimental cross –each individual is a unique mosaic

13 October 2004Statistics: Yandell © reduce 30,000 traits to 300-3,000 heritable traits probability a trait is heritable pr(H|Y,Q) = pr(Y|Q,H) pr(H|Q) / pr(Y|Q)Bayes rule pr(Y|Q) = pr(Y|Q,H) pr(H|Q) + pr(Y|Q, not H) pr(not H|Q) phenotype given genotype pr(Y|Q, not H) = f 0 (Y) = sum  f(Y|  ) pr(  )if not H pr(Y|Q, H) = f 1 (Y|Q) = product q f 0 (Y q ) if heritable Y q = {Y i | Q i =q} = trait values with genotype Q=q finding heritable traits (from Christina Kendziorski)

13 October 2004Statistics: Yandell © hierarchical model for expression phenotypes (EB arrays: Christina Kendziorski) mRNA phenotype models given genotypic mean  q common prior on  q across all mRNA (use empirical Bayes to estimate prior)

13 October 2004Statistics: Yandell © expression meta-traits: pleiotropy reduce 3,000 heritable traits to 3 meta-traits(!) what are expression meta-traits? –pleiotropy: a few genes can affect many traits transcription factors, regulators –weighted averages: Z = YW principle components, discriminant analysis infer genetic architecture of meta-traits –model selection issues are subtle missing data, non-linear search what is the best criterion for model selection? –time consuming process heavy computation load for many traits subjective judgement on what is best

13 October 2004Statistics: Yandell © PC for two correlated mRNA

13 October 2004Statistics: Yandell © PC across microarray functional groups Affy chips on 60 mice ~40,000 mRNA mRNA show DE (via EB arrays with marker regression) organized in 85 functional groups 2-35 mRNA / group which are interesting? examine PC1, PC2 circle size = # unique mRNA

13 October 2004Statistics: Yandell © PC meta-traits by functional group focus on 2 interesting groups

13 October 2004Statistics: Yandell © red lines: peak for PC meta-trait black/blue: peaks for mRNA traits arrows: cis-action?

13 October 2004Statistics: Yandell © DA meta-traits: separate pleiotropy from environmental correlation pleiotropy only both environmental correlation only Korol et al. (2001)

13 October 2004Statistics: Yandell © interaction plots for DA meta-traits DA for all pairs of markers: separate 9 genotypes based on markers (a) same locus pair found with PC meta-traits (b) Chr 2 region interesting from biochemistry (Jessica Byers) (c) Chr 5 & Chr 9 identified as important for insulin, SCD

13 October 2004Statistics: Yandell © genotypes from Chr 4/Chr 15 locus pair (circle=centroid) PC captures spread without genotype DA creates best separation by genotype comparison of PC and DA meta-traits on mRNA traits correlation of PC and DA meta-traits note better spread of circles PC ignores genotypeDA uses genotype

13 October 2004Statistics: Yandell © relating meta-traits to mRNA traits SCD trait log2 expression DA meta-trait standard units

13 October 2004Statistics: Yandell © graphical models (with Elias Chaibub) R2 D2 P2 QTL R1 D1 P1 observable trans-action unobservable meta-trait QTL RNA DNA observable cis-action? protein

13 October 2004Statistics: Yandell © building graphical models infer genetic architecture of meta-trait find mRNA traits correlated with meta-trait apply meta-trait genetic architecture to mRNA –expect subset of QTL to affect each mRNA build graphical models QTL  RNA1  RNA2 –class of possible models –find best model as putative biochemical pathway parallel biochemical investigation –candidate genes in QTL regions –laboratory experiments on pathway components