GGAW - Oct, 2001M-W LIN Study Design for Linkage, Association and TDT Studies 林明薇 Ming-Wei Lin, PhD 陽明大學醫學系家庭醫學科 台北榮民總醫院教學研究部.

Slides:



Advertisements
Similar presentations
Chapter 14~ Mendel & The Gene Idea
Advertisements

Linkage and Genetic Mapping
Genetic Heterogeneity Taken from: Advanced Topics in Linkage Analysis. Ch. 27 Presented by: Natalie Aizenberg Assaf Chen.
Tutorial #1 by Ma’ayan Fishelson
Tutorial #2 by Ma’ayan Fishelson. Crossing Over Sometimes in meiosis, homologous chromosomes exchange parts in a process called crossing-over. New combinations.
Linkage and Gene Mapping. Mendel’s Laws: Chromosomes Locus = physical location of a gene on a chromosome Homologous pairs of chromosomes often contain.
Basics of Linkage Analysis
. Parametric and Non-Parametric analysis of complex diseases Lecture #6 Based on: Chapter 25 & 26 in Terwilliger and Ott’s Handbook of Human Genetic Linkage.
Linkage Analysis: An Introduction Pak Sham Twin Workshop 2001.
Eric Jorgenson Epidemiology 217 2/21/12
Human Gene Mapping & Disease Gene Identification Cont.
Genetic Analysis.
Human Genetics Genetic Epidemiology.
Association Mapping David Evans. Outline Definitions / Terminology What is (genetic) association? How do we test for association? When to use association.
Finding “the gene” for cystic fibrosis. Why is this in quotes? A.CF is not caused by a gene, it’s caused by multiple genes. B.CF is not caused by genetic.
How to find genetic determinants of naturally varying traits?
Parametric and Non-Parametric analysis of complex diseases Lecture #8
2050 VLSB. Dad phase unknown A1 A2 0.5 (total # meioses) Odds = 1/2[(1-r) n r k ]+ 1/2[(1-r) n r k ]odds ratio What single r value best explains the data?
Office hours 3-4pm Wednesdays 304A Stanley Hall More realistic situation: in dad, phase of alleles unknown A1d d D A2d A1 A2 or A1d A2D.
Thoughts about the TDT. Contribution of TDT: Finding Genes for 3 Complex Diseases PPAR-gamma in Type 2 diabetes Altshuler et al. Nat Genet 26:76-80, 2000.
Observing Patterns in Inherited Traits
Linkage and LOD score Egmond, 2006 Manuel AR Ferreira Massachusetts General Hospital Harvard Medical School Boston.
Lecture 5: Segregation Analysis I Date: 9/10/02  Counting number of genotypes, mating types  Segregation analysis: dominant, codominant, estimating segregation.
Standardization of Pedigree Collection. Genetics of Alzheimer’s Disease Alzheimer’s Disease Gene 1 Gene 2 Environmental Factor 1 Environmental Factor.
Searching Microsatellite Markers for Mapping a Disease Gene
Chapter 9 – Patterns of Inheritance
Process of Genetic Epidemiology Migrant Studies Familial AggregationSegregation Association StudiesLinkage Analysis Fine Mapping Cloning Defining the Phenotype.
Genetic Mapping Oregon Wolfe Barley Map (Szucs et al., The Plant Genome 2, )
1 Father of genetics. Studied traits in pea plants.
A gene is composed of strings of bases (A,G, C, T) held together by a sugar phosphate backbone. Reminder - nucleotides are the building blocks.
Mendel and Genetics Terms and Protocols Mendel’s Experiments Probability Modern Additions & Modifications Mendelian Genetics and Humans.
Non-Mendelian Genetics
1 Genes and MS in Tasmania, cont. Lecture 5, Statistics 246 February 3, 2004.
Introduction to Linkage Analysis Pak Sham Twin Workshop 2003.
Lecture 19: Association Studies II Date: 10/29/02  Finish case-control  TDT  Relative Risk.
 Linked Genes Learning Objective DOT Point: predict the difference in inheritance patterns if two genes are linked Sunday, June 05,
Experimental Design and Data Structure Supplement to Lecture 8 Fall
Quantitative Genetics. Continuous phenotypic variation within populations- not discrete characters Phenotypic variation due to both genetic and environmental.
Quantitative Genetics
INTRODUCTION TO ASSOCIATION MAPPING
Recombination and Linkage
Lecture 13: Linkage Analysis VI Date: 10/08/02  Complex models  Pedigrees  Elston-Stewart Algorithm  Lander-Green Algorithm.
Tutorial #10 by Ma’ayan Fishelson. Classical Method of Linkage Analysis The classical method was parametric linkage analysis  the Lod-score method. This.
1 B-b B-B B-b b-b Lecture 2 - Segregation Analysis 1/15/04 Biomath 207B / Biostat 237 / HG 207B.
Lecture 3: Statistics Review I Date: 9/3/02  Distributions  Likelihood  Hypothesis tests.
GGAW - Oct, 2001 M-W LIN Searching Microsatellite Markers for Mapping a Disease Gene 林明薇 Ming-Wei Lin, PhD 陽明大學醫學系家庭醫學科 台北榮民總醫院教學研究部.
An quick overview of human genetic linkage analysis
Association analysis Genetics for Computer Scientists Biomedicum & Department of Computer Science, Helsinki Päivi Onkamo.
Errors in Genetic Data Gonçalo Abecasis. Errors in Genetic Data Pedigree Errors Genotyping Errors Phenotyping Errors.
Chapter 14: Mendel & The Gene Idea Quantitative approach to science Pea plants Austrian Monk.
Practical With Merlin Gonçalo Abecasis. MERLIN Website Reference FAQ Source.
An quick overview of human genetic linkage analysis Terry Speed Genetics & Bioinformatics, WEHI Statistics, UCB NWO/IOP Genomics Winterschool Mathematics.
Chapter 3 Lecture Concepts of Genetics Tenth Edition Mendelian Genetics.
1 Genetic Mapping Establishing relative positions of genes along chromosomes using recombination frequencies Enables location of important disease genes.
Genetics Review 23 How many pairs of chromosomes do humans have?
1 A Tale of Two Families Modes of inheritance are the patterns in which single-gene traits and disorders occur in families Huntington disease is autosomal.
Association Mapping in Families Gonçalo Abecasis University of Oxford.
Lecture 17: Model-Free Linkage Analysis Date: 10/17/02  IBD and IBS  IBD and linkage  Fully Informative Sib Pair Analysis  Sib Pair Analysis with Missing.
Concept 14.2: The laws of probability govern Mendelian inheritance
Migrant Studies Migrant Studies: vary environment, keep genetics constant: Evaluate incidence of disorder among ethnically-similar individuals living.
Recombination (Crossing Over)
Genes may be linked or unlinked and are inherited accordingly.
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS)
Topic 10.2 Inheritance.
Balanced Translocation detected by FISH
Lecture 9: QTL Mapping II: Outbred Populations
Linkage Analysis Problems
Introduction to Genetics
10.2 Inheritance Skills: Calculation of the predicted genotypic and phenotypic ratio of offspring of dihybrid crosses involving unlinked autosomal genes.
10.2 Inheritance Skills: Calculation of the predicted genotypic and phenotypic ratio of offspring of dihybrid crosses involving unlinked autosomal genes.
Presentation transcript:

GGAW - Oct, 2001M-W LIN Study Design for Linkage, Association and TDT Studies 林明薇 Ming-Wei Lin, PhD 陽明大學醫學系家庭醫學科 台北榮民總醫院教學研究部

GGAW - Oct, 2001M-W LIN Collins FS. (1992) Nature genetics 1:3-6

GGAW - Oct, 2001M-W LIN Collins FS. (1992) Nature genetics 1:3-6

GGAW - Oct, 2001M-W LIN Linkage Mapping for Disease Genes Linkage analysis (Lod score method) Allele-sharing methods

GGAW - Oct, 2001M-W LIN Gregor Mendel The principle of segregation of alleles. The principle of independent assortment.

GGAW - Oct, 2001M-W LIN Linkage Linkage describes the phenomenon whereby allele at neighbouring loci are close to one another on the same chromosome, they will be transmitted together more frequently than chance.

GGAW - Oct, 2001M-W LIN Linkage Family

GGAW - Oct, 2001M-W LIN Linkage Analysis Family

GGAW - Oct, 2001M-W LIN Recombinant Gametes Crossing over between two neighbouring loci will produce recombinant gametes.

GGAW - Oct, 2001M-W LIN Recombination Fraction Recombination fraction (θ) = number of recombinant gametes total gametes

GGAW - Oct, 2001M-W LIN Estimation of Recombination Fraction Direct Method: count recombinants. Maximum Likelihood Method: Unknown phases Incomplete penetrance Heterogeneity

GGAW - Oct, 2001M-W LIN

GGAW - Oct, 2001M-W LIN

GGAW - Oct, 2001M-W LIN Recombination Fraction Recombination fraction is a measure of genetic distance. 1cM= 1% chance of recombination between two loci.

GGAW - Oct, 2001M-W LIN Likelihood Odds Likelihood of data if loci linked at θ Likelihood odds = Likelihood of data if loci unlinked L(θ< 0.5) = L(θ= 0.5)

GGAW - Oct, 2001M-W LIN Lod Score L(θ< 0.5) Lod score (θ) = log 10 L(θ = 0.5)

GGAW - Oct, 2001M-W LIN Linkage Analysis Methods Direct counting recombinants and non-recombinants Maximum Likelihood Estimate

GGAW - Oct, 2001M-W LIN Phase Known Family

GGAW - Oct, 2001M-W LIN Phase Known L(θ) = (θ/2) r ((1-θ)/2) n-r r:No. of recombinants n:All meiosis

GGAW - Oct, 2001M-W LIN Lod Score Phase Known L(θ) LOD = log L(θ= 0.5) (θ/2) r [(1-θ) / 2] n-r = log { } (0.25) n = log 2 n θ r (1-θ) n-r

GGAW - Oct, 2001M-W LIN Phase Unknown Family

GGAW - Oct, 2001M-W LIN Phase Unknown L(θ) = 1/2 (θ/2) r [(1-θ)/2] n-r +1/2 (θ/2) n-r [(1-θ)/2] r r:No. of recombinants n:All meiosis

GGAW - Oct, 2001M-W LIN Lod Score Phase Unknown L(θ) LOD = log L(θ= 0.5) 1/2 [(θ/2) r [(1-θ)/2] n-r +(θ/2) n-r [(1-θ)/2] r ] =log { } (0.25) n = log {2 n-1 [θ r (1-θ) n-r +θ n-r (1-θ) r ]}

GGAW - Oct, 2001M-W LIN Lod Score - Maximum Likelihood Estimate (Z) Can be calculated at any values of  between 0 and 0.5, but are conventionally reported at  =0, 0.01, 0.05, 0.1, 0.2, 0.3, and 0.4. Z max is the maximum likelihood estimate (MLE) of . Lod score can be converted to a chi- square statistic by 2(loge10)  4.6.

GGAW - Oct, 2001M-W LIN Total Lod Score Lod score obtained from individual families can be added together to calculate the total lod score.

GGAW - Oct, 2001M-W LIN Statistical Significance of the Lod Score lod score > 3: evidence of linkage 2 < lod score < 3: suggestive evidence of linkage -2 < lod score < 2: uninformative of linkage lod score < -2: exclusion of linkage

GGAW - Oct, 2001M-W LIN Is a Pedigree Useful for linkage Analysis? Are critical individuals in the pedigrees doubly heterozygous at the loci? (Informative) Can the offsprings be scored as recombinants or nonrecombinants? (Phase)

GGAW - Oct, 2001M-W LIN Parameters Assumed in Lod Score Analysis Transmission mode of disease Recombination fraction Trait allele frequencies Penetrance values for each possible disease phenotypes Marker allele frequencies.

GGAW - Oct, 2001M-W LIN Advantages of Lod Score Analysis Statistically, it is more powerful approach than any nonparametric method. Utilizes every family member’s phenotypic and genotypic information. Provides an estimate of the recombination fraction. Provides a statistical test for linkage and for genetic (locus) heterogeneity.

GGAW - Oct, 2001M-W LIN Limitations of Lod Score Method assumes single locus inheritance requires specification of disease gene frequency and penetrance has reduced power when disease model is grossly misspecified

GGAW - Oct, 2001M-W LIN Complex Diseases No clear pattern of Mendelian inheritance A mix of genetic and environmental factors Incomplete penetrance Phenocopies Oligogenic or polygenic Heterogeneity High frequency of disease-causing allele

GGAW - Oct, 2001M-W LIN Recurrence Risk (λ) Frequency in relatives of affected person λ r = Population frequency r denotes the degree of relationship

GGAW - Oct, 2001M-W LIN Recurrence Risk Genetic mapping is much easier for traits with high λ s (λ s > 10) than for those with low λ s (λ s < 2).

GGAW - Oct, 2001M-W LIN Recurrence Risk of Different Diseases

GGAW - Oct, 2001M-W LIN Allele-sharing Methods Identical by state (I.B.S.) Two alleles of the same form. Identical by descent (I.B.D.) Two alleles are descended from the same ancestral allele.

GGAW - Oct, 2001M-W LIN Allele-sharing Methods Testing whether affected relatives inherited a region IBD (or IBS) more often than expected under random Mendelian segregation.

GGAW - Oct, 2001M-W LIN IBD = 2IBD = 1 IBD = 0 ACAB BC ACBC ACAB CD ADBC

GGAW - Oct, 2001M-W LIN IBS = 2IBS = 1 IBS = 0 BC AC ABADBC

GGAW - Oct, 2001M-W LIN Affected Sib-pair Methods An affected sib-pair may share 0,1, 2 alleles identical by descent (IBD) with probabilities of 0.25, 0.5, 0.25, respectively, at any marker locus.

GGAW - Oct, 2001M-W LIN IBD = 2 ACAB BC AB AC BCAA IBD = 1 IBD = 0 25% 50% 25%

GGAW - Oct, 2001M-W LIN Affected Sib-pair Methods If the marker locus is independent of the trait locus, the probabilities of the affected sib-pairs share 0,1, 2 alleles ibd will remain as 0.25, 0.50, 0.25.

GGAW - Oct, 2001M-W LIN Affected Sib-pair Methods If the marker locus is linked to the trait locus, an excess of affected sib-pair sharing two alleles ibd will be expected.

GGAW - Oct, 2001M-W LIN Allele-sharing Methods Affected Sib-pairs Affected Pedigree Member

GGAW - Oct, 2001M-W LIN Pearson  2 statistics Comparing observed numbers of sib-pairs sharing 0, 1, 2 alleles IBD with their expectations under the null hypothesis.

GGAW - Oct, 2001M-W LIN Pearson  2 statistics Alternative hypothesis: IBD sharing012 observedn 0 n 1 n 2 N = n 0 + n 1 + n 2 Null hypothesis: IBD sharing:012 expected N/4N/2 N/4

GGAW - Oct, 2001M-W LIN Comments on Allele-Sharing Method  There is no need to specify any genetic parameters of the transmission model.  Less powerful to detect linkage compared with the lod score method if the genetic transmission model can be specified correctly.  It is poor at providing a precise location of the disease gene.

GGAW - Oct, 2001M-W LIN Thresholds for Mapping Complex Traits

GGAW - Oct, 2001M-W LIN Association Study Case-Control study Transmission disequilibrium test (TDT)

GGAW - Oct, 2001M-W LIN ○□ ○□ □ ○ ○ □ ○ □ ADAD ACAC BCACAC AB BCCDA ADAD ACAC ■●■ ●●■ ● ■ ■ ● DDACAC BDCDCD CDCD BCABADAD BDAD Case-Control study

GGAW - Oct, 2001M-W LIN Linkage Disequilibrium Linkage disequilibrium is the non-random association in a population of alleles at closely linked loci.

GGAW - Oct, 2001M-W LIN Linkage Disequilibrium A2---B1-----C2---X----D3-----E4----F2  A2---B1-----C2---X----D3-----E4  A2---B1-----C2---X----D3  B1-----C2---X----D3  C2---X----D3  C2---X N generations

GGAW - Oct, 2001M-W LIN TDT Study To examine the transmission of a particular allele at a locus from heterozygous parents to their affected offspring.

GGAW - Oct, 2001M-W LIN □○ ● □○ ■ □○ ■ □○ ● BCABBCBB ABAC ACAC BCACAC BBBC AB “Trios” for TDT study “transmitted allele“  “case” “Non-transmitted allele”  “control”

GGAW - Oct, 2001M-W LIN What does a positive association imply? Direct causal effect Linkage disequilibrium Population stratification

GGAW - Oct, 2001M-W LIN When to Use Association Study Candidate gene Positive evidence of linkage Candidate region allelic associations

GGAW - Oct, 2001M-W LIN Suitable Sample for Linkage Disequilibrium Mapping Genetically isolated populations Younger populations

GGAW - Oct, 2001M-W LIN Successful Examples of Mapping Genes by Association Studies Autoimmune diseases associated with HLA  IDDM  multiple sclerosis  ankylosing spondylitis  rheumatoid arthritis Angiotensin-converting enzyme and heart disease low-density lipoprotein receptor and heart disease insulin locus and IDDM

GGAW - Oct, 2001M-W LIN Sample Size Required Linkage for Monogenic Traits One large family at least 40 informative meioses 20 cM marker density Expected lod score > 3

GGAW - Oct, 2001M-W LIN Sample Size Required Allele-Sharing λ s = 2 at least 600 affected sib pairs narrow down the region to 1 cM

GGAW - Oct, 2001M-W LIN Sample Size Required Linkage for Complex Traits Sham, Lin et al (2000) Am J Human Genetics 66,

GGAW - Oct, 2001M-W LIN Genetic Markers A complete informative marker locus at 0 recombination fraction to the disease locus.

GGAW - Oct, 2001M-W LIN Genetic Models Kp: population risk, q: disease allele frequency f 0 : penetrance for the genotype AA; f 1 : penetrance for the genotype Aa f 2 : penetrance for the genotype aa

GGAW - Oct, 2001M-W LIN Pedigree Types

GGAW - Oct, 2001M-W LIN Number of Pedigrees Required  = , Power = 90%, Homogeneity

GGAW - Oct, 2001M-W LIN Number of Pedigrees Required  = , Power = 90%, Heterogeneity (  = 0.5)

GGAW - Oct, 2001M-W LIN Sample Size Required Case-Control Study (  = 0.05, Power = 90%)

GGAW - Oct, 2001M-W LIN Sample Size Required Case-Control Study (  = 0.05, Power = 90%)

GGAW - Oct, 2001M-W LIN Sample Size Required TDT Study (  = 0.001, Power = 80%)

GGAW - Oct, 2001M-W LIN Define phenotype Identify evidence of genetic component Extended families Define study design Sib pairsSingle affected member Family, clinical information and DNA collection Genotyping Data analysis Identify regions of interest Physical Mapping / Gene Identification

GGAW - Oct, 2001M-W LIN Successful Examples Cystic fibrosis Huntington disease Early onset breast cancer (BRCA1, BRCA2) Alzheimer disease (chr14, chr1) Maturity-onset diabetes of the young (MODY) (chr12)...