. Parametric and Non-Parametric analysis of complex diseases Lecture #6 Based on: Chapter 25 & 26 in Terwilliger and Ott’s Handbook of Human Genetic Linkage.

Slides:



Advertisements
Similar presentations
Pedigrees Who do we inherit our traits from? DO YOU LOOK LIKE YOUR AUNT OR UNCLE? DO YOU AND YOUR COUSIN SHARE TRAITS?
Advertisements

. Exact Inference in Bayesian Networks Lecture 9.
Tutorial #1 by Ma’ayan Fishelson
Gene Frequency and LINKAGE Gregory Kovriga & Alex Ratt.
Autosomal dominant inheritance All affected individuals should have an affected parent Both sexes should be equally affected Roughly 50% of the offspring.
Modes of Inheritance Jonathan Wolfe Wolfson House, room 109
Basics of Linkage Analysis
Linkage Analysis: An Introduction Pak Sham Twin Workshop 2001.
Mendel and Heredity Meiosis and Genetic Variation.
Pedigree Analysis.
Genetics Test Prep Quiz. Rules Each student starts with 100 points. You can bet as much as little as possible on each question based on what you think.
Transmission Genetics: Heritage from Mendel 2. Mendel’s Genetics Experimental tool: garden pea Outcome of genetic cross is independent of whether the.
13.1 Ecologists Study Relationships Unit 3: Genetics Chapter 7 Extending Mendelian Genetics Section 7-1 Chromosomes and Phenotype.
MMLS-C By : Laurence Bisht References : The Power to Detect Linkage in Complex Diseases Means of Simple LOD-score Analyses. By David A.,Paula Abreu and.
Simulation/theory With modest marker spacing in a human study, LOD of 3 is 9% likely to be a false positive.
Parametric and Non-Parametric analysis of complex diseases Lecture #8
. Basic Model For Genetic Linkage Analysis Lecture #3 Prepared by Dan Geiger.
CASE STUDY: Genetic Linkage Analysis via Bayesian Networks
The Ising Model in Physics and Statistical Genetics.
Linkage and LOD score Egmond, 2006 Manuel AR Ferreira Massachusetts General Hospital Harvard Medical School Boston.
Lecture 5: Segregation Analysis I Date: 9/10/02  Counting number of genotypes, mating types  Segregation analysis: dominant, codominant, estimating segregation.
Standardization of Pedigree Collection. Genetics of Alzheimer’s Disease Alzheimer’s Disease Gene 1 Gene 2 Environmental Factor 1 Environmental Factor.
. Basic Model For Genetic Linkage Analysis Lecture #5 Prepared by Dan Geiger.
1 Mendelian genetics in Humans: Autosomal and Sex- linked patterns of inheritance Obviously examining inheritance patterns of specific traits in humans.
Multifactorial Traits
Process of Genetic Epidemiology Migrant Studies Familial AggregationSegregation Association StudiesLinkage Analysis Fine Mapping Cloning Defining the Phenotype.
Pedigree Analysis.
Genetic Probabilities. Learning Objectives By the end of this class you should understand: The purpose and nature of dihybrid crosses How to calculate.
Introduction to Linkage Analysis Pak Sham Twin Workshop 2003.
Gene Mapping Quantitative Traits using IBD sharing References: Introduction to Quantitative Genetics, by D.S. Falconer and T. F.C. Mackay (1996) Longman.
Punnet Squares, Linked Genes and Pedigrees
Lecture 19: Association Studies II Date: 10/29/02  Finish case-control  TDT  Relative Risk.
Recombination and Linkage
Lecture 13: Linkage Analysis VI Date: 10/08/02  Complex models  Pedigrees  Elston-Stewart Algorithm  Lander-Green Algorithm.
Tutorial #10 by Ma’ayan Fishelson. Classical Method of Linkage Analysis The classical method was parametric linkage analysis  the Lod-score method. This.
Lecture 15: Linkage Analysis VII
1 B-b B-B B-b b-b Lecture 2 - Segregation Analysis 1/15/04 Biomath 207B / Biostat 237 / HG 207B.
Lecture 24: Quantitative Traits IV Date: 11/14/02  Sources of genetic variation additive dominance epistatic.
Lecture 21: Quantitative Traits I Date: 11/05/02  Review: covariance, regression, etc  Introduction to quantitative genetics.
An quick overview of human genetic linkage analysis
Association analysis Genetics for Computer Scientists Biomedicum & Department of Computer Science, Helsinki Päivi Onkamo.
Errors in Genetic Data Gonçalo Abecasis. Errors in Genetic Data Pedigree Errors Genotyping Errors Phenotyping Errors.
Epistasis / Multi-locus Modelling Shaun Purcell, Pak Sham SGDP, IoP, London, UK.
Pedigrees.
Practical With Merlin Gonçalo Abecasis. MERLIN Website Reference FAQ Source.
Understanding Inheritance Main Idea: The interactions among alleles, genes, and the environment determine an organism’s traits.
Powerful Regression-based Quantitative Trait Linkage Analysis of General Pedigrees Pak Sham, Shaun Purcell, Stacey Cherny, Gonçalo Abecasis.
Pedigree Analysis. Why do Pedigrees? Punnett squares and chi-square tests work well for organisms that have large numbers of offspring and controlled.
Lecture 17: Model-Free Linkage Analysis Date: 10/17/02  IBD and IBS  IBD and linkage  Fully Informative Sib Pair Analysis  Sib Pair Analysis with Missing.
Mendelian genetics in Humans: Autosomal and Sex- linked patterns of inheritance Obviously examining inheritance patterns of specific traits in humans.
Gonçalo Abecasis and Janis Wigginton University of Michigan, Ann Arbor
PEDIGREE ANALYSIS AND PROBABILITY
Chapter Seven: Extending Mendelian Genetics
Pedigrees Pedigrees study how a trait is passed from one generation to the next. Infers genotypes of family members Disorders can be carried on… Autosomes.
Recombination (Crossing Over)
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS)
California Standard and Learning Objectives
Lab: Pedigree Analysis
Week 9 Vocab Definitions
Basic Model For Genetic Linkage Analysis Lecture #3
Pedigree Analysis.
Unit 8: Mendelian Genetics
gene mapping & pedigrees
Lecture 9: QTL Mapping II: Outbred Populations
Linkage Analysis Problems
Genetic linkage analysis
11.1 Basic Patterns of Human Inheritance
Quiz #3, Problem #1 Consider the pedigree below, the allele responsible for the trait (a) is recessive to the wild-type allele (A). a. (2 pts) What is.
Polygenic Inheritance
Presentation transcript:

. Parametric and Non-Parametric analysis of complex diseases Lecture #6 Based on: Chapter 25 & 26 in Terwilliger and Ott’s Handbook of Human Genetic Linkage. Prepared by Dan Geiger.

2 Complex Diseases 1.Unknown mode of inheritance (Dominant/recessive) 2.Several interacting loci (Epistasis) 3.Unclear affected status (e.g., psychiatric disorders) 4.Genetic heterogeneity 5.Non genetic factors We start by specifying how alternative models look like using a Bayesian network model.

3 Mode of Inheritance S 23m L 21f L 21m L 23m X 21 S 23f L 22f L 22m L 23f X 22 X 23 Specify different conditional probability tables between the phenotype variables Y i and the genotypes S 13m L 11f L 11m L 13m X 11 S 13f L 12f L 12m L 13f X 12 X 13 y3y3 y2y2 y1y1 Recessive, full penetrance: P(y 1 = sick | X 11 = (a,a)) = 1 P(y 1 = sick | X 11 = (A,a)) = 0 P(y 1 = sick | X 11 = (A,A)) = 0

4 More modes of Inheritance Dominant, 60% penetrance: P(y 1 = sick | X 11 = (a,a)) = 0.6 P(y 1 = sick | X 11 = (A,a)) = 0.6 P(y 1 = sick | X 11 = (A,A)) = 0 Dominant, full penetrance: P(y 1 = sick | X 11 = (a,a)) = 1 P(y 1 = sick | X 11 = (A,a)) = 1 P(y 1 = sick | X 11 = (A,A)) = 0 Recessive, 40% penetrance, 1% penetrance for phenocopies: P(y 1 = sick | X 11 = (a,a)) = 0.4 P(y 1 = sick | X 11 = (A,a)) = 0.01 P(y 1 = sick | X 11 = (A,A)) = 0.01 Dominant, 20% penetrance, 5% penetrance for phenocopies: P(y 1 = sick | X 11 = (a,a)) = 0.2 P(y 1 = sick | X 11 = (A,a)) = 0.2 P(y 1 = sick | X 11 = (A,A)) = 0.05

5 Two or more interacting loci (epistasis) Specify different conditional probability tables between the phenotype variables Y i and the 2 or more genotypes of person i. Example: Recessive, full penetrance: P(y 11 = sick | X 11 = (a,a), X 21 = (a,a)) = 1 P(y 11 = sick | X 11 = (A,a), X 21 = (a,a)) = 0 P(y 11 = sick | X 11 = (A,A), X 21 = (a,a)) = 0 6 more zero options to specify. S 23m L 21f L 21m L 23m X 21 S 23f L 22f L 22m L 23f X 22 X 23 S 13m L 11f L 11m L 13m X 11 S 13f L 12f L 12m L 13f X 12 X 13 y3y3 y2y2 y1y1

6 Unclear affection status S 23m L 21f L 21m L 23m X 21 S 23f L 22f L 22m L 23f X 22 X 23 S 13m L 11f L 11m L 13m X 11 S 13f L 12f L 12m L 13f X 12 X 13 Specify a “confusion matrix” regarding the process that determines affected status. Y3Y3 Y2Y2 Y1Y1 P(z 1 = measured sick | y 1 = sick) = 0.9 P(z 1 = measured sick | y 1 = not sick) = 0.2 Z1Z1 Z1Z1 Z1Z1

7 Genetic Heterogeneity Non-Allelic heterogeneity: several independent loci predisposes to the disease. S i3m L i1f L i1m L i3m X i1 S i3f L i2f L i2m L i3f X i2 X i

8 Non genetic factors S 23m L 21f L 21m L 23m X 21 S 23f L 22f L 22m L 23f X 22 X 23 S 13m L 11f L 11m L 13m X 11 S 13f L 12f L 12m L 13f X 12 X 13 y3y3 y2y2 y1y1 Under liability class 1 (L 1 =1): P(y 1 = sick | X 11 = (a,a), L 1 =1) = 1 P(y 1 = sick | X 11 = (A,a), L 1 =1) = 0.05 P(y 1 = sick | X 11 = (A,A), L 1 =1) = 0.05 Liability Class L 1 Example: L i = 1 means “old” L i = 2 means “young”. L3L3 L2L2 Under L 1 =2 (“young”): the first line changes, say, to 0.3 and the other two lines to, say, 0.

9 Parametric versus Non-Parametric All analyses considered so far are “parametric” meaning that a mode of inheritance is assumed. In some cases, several options of modes of inheritance are assumed but still the analysis uses each option in turn. For complex diseases it is believed that “non-parametric” methods might work better. In our context, these are methods that do not take mode of inheritance into account. The idea is that computing linkage without assuming mode of inheritance is more robust to error in model specification. Clearly, if the model is correct, parametric methods perform better, but not so if the model is wrong as for complex traits.

10 Some Non-Parametric Methods Main idea: if affected siblings share more IBD alleles at some marker locus than randomly expected among siblings, then that locus might be near a locus of a predisposing gene. Definitions: Any two identical copies of an allele l are said to be identical by state (IBS). If these alleles are inherited from the same individual then they are also identical by descent (IBD). Clearly, IBD implies IBS but not vice versa. We will consider the following non-parametric methods: Affected Sib-Pair Analysis (ASP) Extended Affected Sib-Pair Analysis (ESPA) Affected Pedigree Member method (APM)

11 Identical By Descent (IBD) Exactly one allele IBD. 1/2 1/1 1/3 1/2 1/3 No allele is IBD. One allele is IBS. 1/1 1/2 At least one allele IBD. Expected 1.5 alleles IBD.

12 Affected Sib-Pair Analysis The idea is that any two siblings are expected to have one allele IBD by chance (and at most two IBD alleles, ofcourse). When a deviation of this pattern is detected, by examining many sib-pairs, a linkage is established between a disease gene and the marker location. This phenomena happens regardless of mode of inheritance, but its strength is different for each mode.

13 Affected Sib-Pair Analysis 1/2 1/3 1/4 3/4 There are 16 combinations of sibling marker genotypes: SON1 SON2 IBD SON1 SON2 IBD 1/3 1/3 2 1/4 1/4 2 2/3 2/3 2 2/4 2/4 2 1/3 1/4 1 1/4 1/3 1 2/3 2/4 1 2/4 2/3 1 1/3 2/3 1 1/4 2/4 1 2/3 1/3 1 2/4 1/4 1 1/3 2/4 0 1/4 2/3 0 2/3 1/4 0 2/4 1/3 0 But now assume a dominant disease coming from the father and is on the haplotype with the 1 allele. The only viable options are marked in the table. The expected IBD is thus (2*2+2*1)/4 = 1.5, which can be detected in analysis. Not surprisingly, the expected number of IBD alleles is (4*2+8*1)/16=1. For a recessive disease linked on the haplotype of 1 and 3, the only viable pair is 1/3, 1/3 with expected IBD of 2.

14 Affected Sib-Pair Analysis 1/2 1/3 1/4 3/4 Standard practice of the ASP method where pedigrees look like the above (two parents, two children, all observed), can be done even by hand. However, one can use general pedigrees, and assume some family members are not observed, and consider more distant relatives such as first-cousins, etc.

15 Extended Affected Sib-Pair Analysis (e.g, the ESPA program) ?/? 1/3 1/4 3/4 Compute the probability of alleles of every family configuration given the other typed persons in the pedigree. Based on this probabilities compute: E[IBD] = 1  Pr(1 allele IBD) + 2  Pr(2 allele IBD) (The ESPA program currently assumes no loops and at most 5 alleles at a locus.)

16 Affected Pedigree Members method (APM) Computing IBD for distant relatives is considered hard on large pedigrees so researchers used IBS instead. Consider one relative to have alleles (A 1,A 2 ) and the other to have (B 1,B 2 ). There are four possibilities to have IBS alleles. Weeks and Lang (1988) used the following statistics z ij for counting IBS status of two individuals: This measure should be compared to what is expected under no linkage. To use many pedigrees, a converstion to standard normal variables is used.

17 Taking Gene Frequencies into Account Clearly it is more surprising for affected relatives to share a rare allele than a common one. So one can use a weighted average: where