Genetic Heterogeneity Taken from: Advanced Topics in Linkage Analysis. Ch. 27 Presented by: Natalie Aizenberg Assaf Chen.

Slides:



Advertisements
Similar presentations
Statistical methods for genetic association studies
Advertisements

Chapter 10: The t Test For Two Independent Samples
Mapping genes with LOD score method
Contingency Tables Chapters Seven, Sixteen, and Eighteen Chapter Seven –Definition of Contingency Tables –Basic Statistics –SPSS program (Crosstabulation)
Chi square.  Non-parametric test that’s useful when your sample violates the assumptions about normality required by other tests ◦ All other tests we’ve.
Gene Frequency and LINKAGE Gregory Kovriga & Alex Ratt.
Chapter 15 Comparing Two Populations: Dependent samples.
Linkage genes and genetic recombination
Basics of Linkage Analysis
. Parametric and Non-Parametric analysis of complex diseases Lecture #6 Based on: Chapter 25 & 26 in Terwilliger and Ott’s Handbook of Human Genetic Linkage.
MMLS-C By : Laurence Bisht References : The Power to Detect Linkage in Complex Diseases Means of Simple LOD-score Analyses. By David A.,Paula Abreu and.
2050 VLSB. Dad phase unknown A1 A2 0.5 (total # meioses) Odds = 1/2[(1-r) n r k ]+ 1/2[(1-r) n r k ]odds ratio What single r value best explains the data?
Office hours 3-4pm Wednesdays 304A Stanley Hall More realistic situation: in dad, phase of alleles unknown A1d d D A2d A1 A2 or A1d A2D.
Inferences About Process Quality
5-3 Inference on the Means of Two Populations, Variances Unknown
Nonparametric or Distribution-free Tests
The Chi-square Statistic. Goodness of fit 0 This test is used to decide whether there is any difference between the observed (experimental) value and.
Chapter 8 Inferences Based on a Single Sample: Tests of Hypothesis.
Lecture 5: Segregation Analysis I Date: 9/10/02  Counting number of genotypes, mating types  Segregation analysis: dominant, codominant, estimating segregation.
1 Chapter 1: Introduction to Design of Experiments 1.1 Review of Basic Statistical Concepts (Optional) 1.2 Introduction to Experimental Design 1.3 Completely.
Genetic Mapping Oregon Wolfe Barley Map (Szucs et al., The Plant Genome 2, )
Non-Mendelian Genetics
1 Chapter 1: Introduction to Design of Experiments 1.1 Review of Basic Statistical Concepts (Optional) 1.2 Introduction to Experimental Design 1.3 Completely.
Lecture 19: Association Studies II Date: 10/29/02  Finish case-control  TDT  Relative Risk.
Quantitative Genetics. Continuous phenotypic variation within populations- not discrete characters Phenotypic variation due to both genetic and environmental.
Quantitative Genetics
Analysis of Variance 1 Dr. Mohammed Alahmed Ph.D. in BioStatistics (011)
Lecture 12: Linkage Analysis V Date: 10/03/02  Least squares  An EM algorithm  Simulated distribution  Marker coverage and density.
Tutorial #10 by Ma’ayan Fishelson. Classical Method of Linkage Analysis The classical method was parametric linkage analysis  the Lod-score method. This.
Lecture 15: Linkage Analysis VII
AP Statistics Section 11.1 B More on Significance Tests.
Copyright © Cengage Learning. All rights reserved. Chi-Square and F Distributions 10.
Population structure at QTL d A B C D E Q F G H a b c d e q f g h The population content at a quantitative trait locus (backcross, RIL, DH). Can be deduced.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Practical With Merlin Gonçalo Abecasis. MERLIN Website Reference FAQ Source.
One-Way Analysis of Variance Recapitulation Recapitulation 1. Comparing differences among three or more subsamples requires a different statistical test.
 Assumptions are an essential part of statistics and the process of building and testing models.  There are many different assumptions across the range.
Lecture 22: Quantitative Traits II
Lecture 23: Quantitative Traits III Date: 11/12/02  Single locus backcross regression  Single locus backcross likelihood  F2 – regression, likelihood,
1 Genetic Mapping Establishing relative positions of genes along chromosomes using recombination frequencies Enables location of important disease genes.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
The Chi-Square Distribution  Chi-square tests for ….. goodness of fit, and independence 1.
Comparing Observed Distributions A test comparing the distribution of counts for two or more groups on the same categorical variable is called a chi-square.
Association Mapping in Families Gonçalo Abecasis University of Oxford.
Lecture 17: Model-Free Linkage Analysis Date: 10/17/02  IBD and IBS  IBD and linkage  Fully Informative Sib Pair Analysis  Sib Pair Analysis with Missing.
LOGISTIC REGRESSION. Purpose  Logistical regression is regularly used when there are only two categories of the dependent variable and there is a mixture.
 Check the Random, Large Sample Size and Independent conditions before performing a chi-square test  Use a chi-square test for homogeneity to determine.
Comparing Counts Chi Square Tests Independence.
Part 2: Genetics, monohybrid vs. Dihybrid crosses, Chi Square
Testing Hypotheses about a Population Proportion
Math 4030 – 10b Inferences Concerning Variances: Hypothesis Testing
Test for Goodness of Fit
Categorical Data Aims Loglinear models Categorical data
John Loucks St. Edward’s University . SLIDES . BY.
Different Scales, Different Measures of Association
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8…
Error Checking for Linkage Analyses
Topic 10.2 Inheritance.
Chapter 10 Analyzing the Association Between Categorical Variables
Genetic Mapping Linked Genes.
Inference for Relationships
Balanced Translocation detected by FISH
Chi-Squared AP Biology.
Multivariate Linkage Continued
Linkage Analysis Problems
10.2 Inheritance Skills: Calculation of the predicted genotypic and phenotypic ratio of offspring of dihybrid crosses involving unlinked autosomal genes.
10.2 Inheritance Skills: Calculation of the predicted genotypic and phenotypic ratio of offspring of dihybrid crosses involving unlinked autosomal genes.
10.2 Inheritance Skills: Calculation of the predicted genotypic and phenotypic ratio of offspring of dihybrid crosses involving unlinked autosomal genes.
Inference for Distributions of Categorical Data
Lecture 46 Section 14.5 Wed, Apr 13, 2005
Presentation transcript:

Genetic Heterogeneity Taken from: Advanced Topics in Linkage Analysis. Ch. 27 Presented by: Natalie Aizenberg Assaf Chen

Agenda Motivation General approach to genetic heterogeneity. –Allelic heterogeneity. –Non-allelic heterogeneity. Test for homogeneity given Linkage. Test for Linkage given heterogeneity. Using the HOMOG program. Exercise.

Motivation We will try to answer two important questions: 1.Given a positive Linkage test result, is there a significant evidence for a proportion of pedigrees segregating the linked gene and another proportion segregating a putative unlinked gene for the same disease ?

Motivation (Cont.) 2.Although not having a significant evidence for Linkage (assuming homogeneity of the disease gene), does allowing a certain percentage of pedigrees segregating an unlinked gene bring a significant evidence for linkage to a disease gene in a proportion of pedigrees?

General approach to genetic heterogeneity Number of genetic causes act differently to produce an identical disease phenotype. Allelic heterogeneity: Multiple separate disease alleles at the same locus can each cause the same disease phenotype. (CF) Non-allelic heterogeneity : Disease alleles at two or more independently acting loci could each cause the same disease phenotype.

General approach to genetic heterogeneity (Cont.) Heterogeneity is much easier to detect when there is a different mode of inheritance in some pedigrees from others. When 2 forms of the disease share a common mode of inheritance- the only way to determine that more than one genetic locus is involved is by using a special form of Linkage analysis that treats the heterogeneity as an additional parameter in the analysis.

Test for homogeneity given Linkage Given a significant test result (i.e. lod-score>3), we will check whether there is a significant evidence to support the hypothesis that some of the pedigrees might be segregating a different unlinked gene for the same disease, and not the gene that is linked to a marker in question in this analysis. We will use the A-test method.

A-test Assumptions: 1.Two categories of families in the data:  = 0.5  =  (1) <  =  (1) in Proportion  of families (segregating the linked gene), and  =0.5 in proportion (1-  ). 3.No a priori assignment of any family to one class or the other. We will check H(0): {  =1} versus H(1): {  <1}

A-test (Cont.) The likelihood for any given family: L( ,  ) =  L(  ) + (1-  )L(  = 0.5) The total likelihood for n families: Testing the null hypothesis (all families are of linked type) by forming a likelihood ratio L( ,  ) / L(  =1,  ) = Finding the Chi-square :

Test for Linkage given heterogeneity The null hypothesis of no linkage against the hypothesis of linkage and heterogeneity. The ratio test: L( ,  ) / L(  = 1,  = 0.5 ) = R There are two problems in interpretation: 1.Trying to declare a linkage significant with heterogeneity as a parameter. We need a significance level equivalent to lod-score=3 in straight linkage analysis. The lod score calculates: log10 [ R ]. There is an additional free parameter in the numerator. To allow this additional degree of freedom, we add log10(2) = 0.3 to the critical value -> lod-score=3.3.

Test for Linkage given heterogeneity (Cont.) 2.Under the hypothesis  = 0.5 in all families, the parameter  disappears: L( ,  ) =  L(  ) + (1-  )L(  = 0.5), with  = 0.5 : L( ,  =0.5 ) =  L(  =0.5) + (1-  )L(  = 0.5) = = L(  =0.5). Making  = 0, causes  to disappear as a parameter: L(  =0,  ) = 0L(  ) + (1- 0 )L(  = 0.5) = L(  =0.5). We have a completely degenerate situation under the null hypothesis, where : L(  =0,  ) = L( ,  =0.5) = L(  =0.5).

Test for Linkage given heterogeneity (Cont.) Thus, there is one parameter under H(0), whereas under H(1) there are two, leading to a problem with the asymptotic distribution  further major problems. In light of all this – we would prefer not to apply any asymptotic theory to this test statistics (  ≠0,  ≠ 0.5), and use a lod-score above 3.3 to determine significance of a test result (ratio above 2000:1).

Using HOMOG to perform test Phase1: run a linkage program on the pedigrees calculating likelihood Phase 2: run LODOS on the out put file to receive lod-scores: Z(  )=log 10 L(  )- log 10 L(  =0.5) for each value of  at each pedigree Phase 3:Run HOMOG on calculated lod scores for each  and for each family

HOMOG input file Line1: Title of the problem Line2: N STEPSIZE Where: N= number of values of  at which lod-scores are given STEPSIZE = the step size in which  should be incremented (0.05 by default) Note! Values should not be given for 0.5 where Z=0

HOMOG input file (Cont.) Line3: OUT ALOW LL Where: OUT = output options (0..3) ALOW =smallest value of  to be considered LL = line length in the output file (80 char by default)

HOMOG input file (Cont.) Line4:  1,  2,…,  N Where  I the values of  for which the lod scares are going to provided below Line5: NFAM Where NFAM is the number of families for which lod scores are provided

HOMOG input file (Cont.) Line6: Z(  1 ), Z(  2 ), …, Z(  N ) in family1 Line7: Z(  1 ), Z(  2 ), …, Z(  N ) in family2 … Line(5+NFAM): Z(  1 ), Z(  2 ), …, Z(  N ) in family NFAM

An input example The pedigrees:

Input example (Cont.) Notice: the best  estimate for the entire family set together is 0.5 while fam1&3 appear to have no recombinants q= family family family family Resulting lod-scores

The Input file Ex

The output file Alpha| ln L(alpha,theta) Theta

The output file (Cont.) Table of conditional max. Ln(L) over alpha's, given theta or x theta or x alpha max Max.Ln(L) Lik. ratio x10^ This table shows the maximum natural log likelihood for each value of  over  normalized so that L(  =0.5)=0

The output file (Cont.) Estimates of Hypotheses Max.lnL Alpha Theta H2: Linkage, heterogeneity linkage & heterogeneity H1: Linkage, homogeneity (1) 99 linkage & homogeneity H0: No linkage (0) (0) (99) neither linkage nor heterogeneity Components of chi-square Sourcedf Chi-square L ratio H2 vs. H1 Heterogeneity Heterogeneity given linkage H1 vs. H0 Linkage linkage assuming homogeneity H2 vs. H0 Total linkage & heterogeneity Note:  =99 means  =0.5

The output file (Cont.) Since there is absolutely no evidence for linkage (H 1 ), the test of H 2 versus H 1 is meaningless (null hypothesis is not valid). Instead, we must consider the comparison of H 2 versus H 0 (the correct null hypothesis of no linkage)

Conclusion from HOMOG output The likelihood ratio is 16,520 for H 2 vr. H 0 (heterogeneity) which exceeds the minimum ratio (2000) for significant evidence for linkage and heterogeneity.  although there is no evidence for linkage whatsoever under the assumption of locus homogeneity, as soon as we allow for heterogeneity, we have significant evidence for linkage (in at least some of the families