February 20, 2002 UD, Newark, DE SNPs, Haplotypes, Alleles.

Slides:



Advertisements
Similar presentations
Evolution of genomes.
Advertisements

Single Nucleotide Polymorphism And Association Studies Stat 115 Dec 12, 2006.
Identification of markers linked to Selenium tolerance genes
SNP Applications statwww.epfl.ch/davison/teaching/Microarrays/snp.ppt.
Genetics notes For makeup. A gene is a piece of DNA that directs a cell to make a certain protein. –Homozygous describes two alleles that are the same.
Association Mapping David Evans. Outline Definitions / Terminology What is (genetic) association? How do we test for association? When to use association.
Fatchiyah, PhD Dept Biology UB Fatchiyah.lecture.ub.ac.id
MALD Mapping by Admixture Linkage Disequilibrium.
Plant of the day! Pebble plants, Lithops, dwarf xerophytes Aizoaceae
Genomics An introduction. Aims of genomics I Establishing integrated databases – being far from merely a storage Linking genomic and expressed gene sequences.
The role of variation in finding functional genetic elements Andy Clark – Cornell Dave Begun – UC Davis.
CS177 Lecture 9 SNPs and Human Genetic Variation Tom Madej
Computational Tools for Finding and Interpreting Genetic Variations Gabor T. Marth Department of Biology, Boston College
Introduction to Linkage Analysis March Stages of Genetic Mapping Are there genes influencing this trait? Epidemiological studies Where are those.
Evolutionary Genome Biology Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen, Hungary, May 2006.
Human Migrations Saeed Hassanpour Spring Introduction Population Genetics Co-evolution of genes with language and cultural. Human evolution: genetics,
CSE 291: Advanced Topics in Computational Biology Vineet Bafna/Pavel Pevzner
Something related to genetics? Dr. Lars Eijssen. Bioinformatics to understand studies in genomics – São Paulo – June Image:
Haplotype Discovery and Modeling. Identification of genes Identify the Phenotype MapClone.
Introduction Basic Genetic Mechanisms Eukaryotic Gene Regulation The Human Genome Project Test 1 Genome I - Genes Genome II – Repetitive DNA Genome III.
Population Genetics 101 CSE280Vineet Bafna. Personalized genomics April’08Bafna.
Haplotype Blocks An Overview A. Polanski Department of Statistics Rice University.
Genetic Variations Lakshmi K Matukumalli. Human – Mouse Comparison.
Genetic Mapping Oregon Wolfe Barley Map (Szucs et al., The Plant Genome 2, )
Computational research for medical discovery at Boston College Biology Gabor T. Marth Boston College Department of Biology
Doug Brutlag 2011 Genomics & Medicine Doug Brutlag Professor Emeritus of Biochemistry &
SNPs Daniel Fernandez Alejandro Quiroz Zárate. A SNP is defined as a single base change in a DNA sequence that occurs in a significant proportion (more.
National Taiwan University Department of Computer Science and Information Engineering Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
Conservation of genomic segments (haplotypes): The “HapMap” n In populations, it appears the the linear order of alleles (“haplotype”) is conserved in.
CS177 Lecture 10 SNPs and Human Genetic Variation
SNP Haplotypes as Diagnostic Markers Shrish Tiwari CCMB, Hyderabad.
SNPs and the Human Genome Prof. Sorin Istrail. A SNP is a position in a genome at which two or more different bases occur in the population, each with.
Gene Hunting: Linkage and Association
Announcements: Proposal resubmission deadline 4/23 (Thursday).
National Taiwan University Department of Computer Science and Information Engineering Pattern Identification in a Haplotype Block * Kun-Mao Chao Department.
Experimental Design and Data Structure Supplement to Lecture 8 Fall
Great Dane x Mexican Chihuahua F1 Big (Great Danes) 3 Big : 1 Small.
1 Population Genetics Basics. 2 Terminology review Allele Locus Diploid SNP.
Finnish Genome Center Monday, 16 November Genotyping & Haplotyping.
Polymorphism Haixu Tang School of Informatics. Genome variations underlie phenotypic differences cause inherited diseases.
INTRODUCTION TO ASSOCIATION MAPPING
Julia N. Chapman, Alia Kamal, Archith Ramkumar, Owen L. Astrachan Duke University, Genome Revolution Focus, Department of Computer Science Sources
Discovery of a rare arboreal forest-dwelling flying reptile (Pterosauria, Pterodactyloidea) from China Wang et al. PNAS Feb. 11, 2008.
MEME homework: probability of finding GAGTCA at a given position in the yeast genome, based on a background model of A = 0.3, T = 0.3, G = 0.2, C = 0.2.
SNPs, Haplotypes, Disease Associations Algorithmic Foundations of Computational Biology II Course 1 Prof. Sorin Istrail.
The International Consortium. The International HapMap Project.
In The Name of GOD Genetic Polymorphism M.Dianatpour MLD,PHD.
Linkage Disequilibrium and Recent Studies of Haplotypes and SNPs
Genomics of Adaptation
Computational Biology and Genomics at Boston College Biology Gabor T. Marth Department of Biology, Boston College
Evolutionary Genome Biology Gabor T. Marth, D.Sc. Department of Biology, Boston College
Inferences on human demographic history using computational Population Genetic models Gabor T. Marth Department of Biology Boston College Chestnut Hill,
GENOME ORGANIZATION AS REVEALED BY GENOME MAPPING WHY MAP GENOMES? HOW TO MAP GENOMES?
Single Nucleotide Polymorphisms (SNPs
Common variation, GWAS & PLINK
Of Sea Urchins, Birds and Men
School of Pharmacy, University of Nizwa
Gene – Expression – Mutation - polymorphism
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS)
Gene – Expression – Mutation - polymorphism
Patterns of Linkage Disequilibrium in the Human Genome
Genome-wide Association Studies
Sexual reproduction creates unique combinations of genes.
Sequential Steps in Genome Mapping
Ho Kim School of Public Health Seoul National University
Mutation Notes.
Objective: Explain the main types of mutations
Monohybrid cross - shows inheritance of one trait from two parents
Section 20.4 Mutations and Genetic Variation
Presentation transcript:

February 20, 2002 UD, Newark, DE SNPs, Haplotypes, Alleles

Sequence differences Intra-specific differences: between individuals within species Inter-specific differences: between orthologous genes in different species

Intra-specific and inter-specific variation Mutations radiation chemicals replication errors transposable elements somatic vs. germinal Mutation frequency maize: 6.5 x mutations per nucleotide per year (Gaut et al 1996, PNAS 93, )

Frequency of SNPs Intra-specific diversity Humans: 1 in 1000 nt = 3,000,000 ea Maize: 1 in 60~120 nt = 35,000,000 ea Soybean: 1 in 350 nt =3,000,000 ea Melon: 1 in 700 nt = 1,400,000 Inter-specific sequence difference - dependent on evolutionary distance Humans - chimpanzees 1 in 100 nt

Types of sequence variation Single Nucleotide Polymorphisms (SNPs) Insertions / Deletions (Indels) Silent mutations vs. amino acid changing mutations Nonsense mutations Missense mutations Frameshifts Simple sequence repeats

SNPs and Indels DNA Sequence...GATATTCGTACGGATGT-TCCA......GATGTTCGTACTGATGTATCCA......GATATTCGTACGGATGT-TCCA......GATATTCGTACGGATGTATCCA......GATGTTCGTACTGATGTATCCA... SNP Individuals Indel

Haploypes DNA Sequence A G - G T A A G - A G A G T A SNP Individuals Indel AG- GTA AG- AGA GTA Haplotypes

1.AG- 2.GTA 3.AG- 4.AGA 5.GTA 6.GTA Haplotypes provide more information than individual SNPs Haplotype list

Haplotype A set of closely linked genetic markers present on one chromosome which tend to be inherited together (not easily separable by recombination). Some haplotypes may be in linkage disequilibrium (from Birgid Schlindwein's Hypermedia Glossary Of Genetic Terms) GGA C A Set of SNP polymorphisms: a SNP haplotype

Diploids vs Haploids Haploid cell Diploid cell Chr1 Chr2 Chr1 Chr2

Homo vs. Hetero Chr1 Chr2 Chr1 Chr2 Homozygous Heterozygous

Haploid > Diploid Haploid cell Chr1 Chr2 Haploid cell Chr1 Chr2 Chr1 Chr2 Diploid Heterozygous Example: sex cells

Problem of Phase Chr1 GA T C SNP1 SNP2 Observed: SNP1 G / T SNP2 A/C Possible Haplotypes: GA, TC or GC, TA

How to resolve the problem of phase ? GA T C SNP1 SNP2 T C SNP1 SNP2 GA Experimental solution

Computational solution GA T C GA T C A T GA T C A T GA T C A T GA GA T C A T GA Not all combinations occur. Need to observe several haplotypes in various combinations

Linkage Disequilibrium: “Non-random association of alleles ” Equilibrium Disequilibrium Locus 1 Locus 2 D’=0 6 6 Locus 2  D’  =1

Linkage Disequilibrium 1 2 Disequilibrium Single marker is enough to completely define haplotypes in this example. Second marker provides redundant information. In a general case, a subset of the markers will be sufficient to define major haplotypes

Linkage Disequilibrium 1 2 Disequilibrium For example, marker in the 3’-UTR will be completely predicted by the marker at the 5’-end of the gene and vice versa, if LD extends across the gene.

Redundant A-T AIC AIT C-T Uniquely defined haplotypes: Stearoyl-ACP desaturase

Linkage disequilibrium The extent of linkage disequilibrium (LD) in the germplasm is important for association mapping The LD in the population depends on population history LD is also expected to vary along the length of the genome: regions that recombine less will have more LD and vice versa.

Recombination of ancestral haplotypes Parents Progeny Conserved ancestral haplotypes are reduced in size by recombination Size of conserved segments depends on the history of the population and on recombination frequency of the genome segment of interest time High LD Low LD

DNA Sequence Sequence diversity GATATTCGTACGGAT GATGTTCGTACTGAT GATATTCGTACGGAT GATGTTCGTACTGAT SNP Individuals Genetic Map Phenotype Resistant Sensitive Resistant Sensitive

DNA Sequence SNP Haplotypes GATATTCGTACGGAT GATGTTCGTACTGAT GATATTCGTACGGAT GATGTTCGTACTGAT SNP Individuals with contrasting phenotypes Genetic Map Candidate gene Phenotype distribution

Allelic series and phenotypes (Hypothetical example) Haplotype information provides better resolution! 102

Haplotype diversity in maize elite germplasm Ada Ching, Dinakar Bhattramakki, Antoni Rafalski

Genetic diversity in maize breeding germplasm This sample of 32 individuals provides an excellent representation of maize elite germplasm Conserved haplotypes over several hundered bp Small number of haplotypes (2-8) Lots of polymorphisms SNP frequency: 1/61 bp Insertion / deletion frequency: 1/126 bp Stearoyl-ACP desaturase 32 inbreds - 4 haplotypes

Two approaches to association mapping Candidate gene approach Testing candidates for association with the trait of interest Whole genome scan approach Testing thousands of markers distributed along the genome for association with trait Suitable only for large linkage disequilibrium situations

Consequences of LD LD

Extent of linkage disequilibrium in maize elite breeding germplasm

18 genes individuals

No decline of LD within 500 bp was detected

LD at Adh1: ~120kb D’ = 1.0 R 2 = 0.85 Fisher P< ,171 bp Mark Jung, Ada Ching

Conclusions Significant LD extending in some cases to >100 kb Whole genome scans may be possible More data is needed on LD in other regions of the genome and in other populations Haplotype association analysis adds power Work in progress (D. Bhattramakki)

Haplotype Organization of the Genome ACACC G ATAT TAC G C A G A T G (After Lander et al. 2001) Low recombination Conserved haplotype Low recombination Conserved haplotype High recombination

A.R