Presentation is loading. Please wait.

Presentation is loading. Please wait.

February 20, 2002 UD, Newark, DE SNPs, Haplotypes, Alleles.

Similar presentations


Presentation on theme: "February 20, 2002 UD, Newark, DE SNPs, Haplotypes, Alleles."— Presentation transcript:

1 February 20, 2002 UD, Newark, DE SNPs, Haplotypes, Alleles

2 Sequence differences Intra-specific differences: between individuals within species Inter-specific differences: between orthologous genes in different species

3 Intra-specific and inter-specific variation Mutations radiation chemicals replication errors transposable elements somatic vs. germinal Mutation frequency maize: 6.5 x 10- 9 mutations per nucleotide per year (Gaut et al 1996, PNAS 93, 1997-2001)

4 Frequency of SNPs Intra-specific diversity Humans: 1 in 1000 nt = 3,000,000 ea Maize: 1 in 60~120 nt = 35,000,000 ea Soybean: 1 in 350 nt =3,000,000 ea Melon: 1 in 700 nt = 1,400,000 Inter-specific sequence difference - dependent on evolutionary distance Humans - chimpanzees 1 in 100 nt

5 Types of sequence variation Single Nucleotide Polymorphisms (SNPs) Insertions / Deletions (Indels) Silent mutations vs. amino acid changing mutations Nonsense mutations Missense mutations Frameshifts Simple sequence repeats

6 SNPs and Indels DNA Sequence...GATATTCGTACGGATGT-TCCA......GATGTTCGTACTGATGTATCCA......GATATTCGTACGGATGT-TCCA......GATATTCGTACGGATGTATCCA......GATGTTCGTACTGATGTATCCA... SNP Individuals 1 2 3 4 5 6 Indel

7 Haploypes DNA Sequence A G - G T A A G - A G A G T A SNP Individuals 1 2 3 4 5 6 Indel AG- GTA AG- AGA GTA Haplotypes

8 1.AG- 2.GTA 3.AG- 4.AGA 5.GTA 6.GTA Haplotypes provide more information than individual SNPs Haplotype list

9 Haplotype A set of closely linked genetic markers present on one chromosome which tend to be inherited together (not easily separable by recombination). Some haplotypes may be in linkage disequilibrium (from Birgid Schlindwein's Hypermedia Glossary Of Genetic Terms) GGA C A Set of SNP polymorphisms: a SNP haplotype

10 Diploids vs Haploids Haploid cell Diploid cell Chr1 Chr2 Chr1 Chr2

11 Homo vs. Hetero Chr1 Chr2 Chr1 Chr2 Homozygous Heterozygous

12 Haploid > Diploid Haploid cell Chr1 Chr2 Haploid cell Chr1 Chr2 Chr1 Chr2 Diploid Heterozygous Example: sex cells

13 Problem of Phase Chr1 GA T C SNP1 SNP2 Observed: SNP1 G / T SNP2 A/C Possible Haplotypes: GA, TC or GC, TA

14 How to resolve the problem of phase ? GA T C SNP1 SNP2 T C SNP1 SNP2 GA Experimental solution

15 Computational solution GA T C GA T C A T GA T C A T GA T C A T GA GA T C A T GA Not all combinations occur. Need to observe several haplotypes in various combinations

16 Linkage Disequilibrium: “Non-random association of alleles ” Equilibrium 12 12 Disequilibrium 33 33 Locus 1 Locus 2 D’=0 6 6 Locus 2  D’  =1

17 Linkage Disequilibrium 1 2 Disequilibrium Single marker is enough to completely define haplotypes in this example. Second marker provides redundant information. In a general case, a subset of the markers will be sufficient to define major haplotypes

18 Linkage Disequilibrium 1 2 Disequilibrium For example, marker in the 3’-UTR will be completely predicted by the marker at the 5’-end of the gene and vice versa, if LD extends across the gene.

19 Redundant A-T AIC AIT C-T Uniquely defined haplotypes: Stearoyl-ACP desaturase

20 Linkage disequilibrium The extent of linkage disequilibrium (LD) in the germplasm is important for association mapping The LD in the population depends on population history LD is also expected to vary along the length of the genome: regions that recombine less will have more LD and vice versa.

21 Recombination of ancestral haplotypes Parents Progeny Conserved ancestral haplotypes are reduced in size by recombination Size of conserved segments depends on the history of the population and on recombination frequency of the genome segment of interest time High LD Low LD

22 DNA Sequence Sequence diversity GATATTCGTACGGAT GATGTTCGTACTGAT GATATTCGTACGGAT GATGTTCGTACTGAT SNP Individuals 1 2 3 4 5 6 Genetic Map Phenotype Resistant Sensitive Resistant Sensitive

23 DNA Sequence SNP Haplotypes GATATTCGTACGGAT GATGTTCGTACTGAT GATATTCGTACGGAT GATGTTCGTACTGAT SNP Individuals with contrasting phenotypes 1 2 3 4 5 6 Genetic Map Candidate gene Phenotype distribution

24 Allelic series and phenotypes (Hypothetical example) Haplotype information provides better resolution! 102

25 Haplotype diversity in maize elite germplasm Ada Ching, Dinakar Bhattramakki, Antoni Rafalski

26 1 2 3 4 Genetic diversity in maize breeding germplasm This sample of 32 individuals provides an excellent representation of maize elite germplasm Conserved haplotypes over several hundered bp Small number of haplotypes (2-8) Lots of polymorphisms SNP frequency: 1/61 bp Insertion / deletion frequency: 1/126 bp Stearoyl-ACP desaturase 32 inbreds - 4 haplotypes

27 Two approaches to association mapping Candidate gene approach Testing candidates for association with the trait of interest Whole genome scan approach Testing thousands of markers distributed along the genome for association with trait Suitable only for large linkage disequilibrium situations

28 Consequences of LD LD

29 Extent of linkage disequilibrium in maize elite breeding germplasm

30 18 genes 32-35 individuals

31 No decline of LD within 500 bp was detected

32 LD at Adh1: ~120kb D’ = 1.0 R 2 = 0.85 Fisher P<0.001 121,171 bp Mark Jung, Ada Ching

33 Conclusions Significant LD extending in some cases to >100 kb Whole genome scans may be possible More data is needed on LD in other regions of the genome and in other populations Haplotype association analysis adds power Work in progress (D. Bhattramakki)

34 Haplotype Organization of the Genome ACACC G ATAT TAC G C A G A T G (After Lander et al. 2001) Low recombination Conserved haplotype Low recombination Conserved haplotype High recombination

35 A.R. 1-25-2001


Download ppt "February 20, 2002 UD, Newark, DE SNPs, Haplotypes, Alleles."

Similar presentations


Ads by Google