Single Nucleotide Polymorphism And Association Studies Stat 115 Dec 12, 2006.

Slides:



Advertisements
Similar presentations
Statistical methods for genetic association studies
Advertisements

Genetic Inheritance & Variation
applications of genome sequencing projects
EVOLUTION OF POPULATIONS
What is an association study? Define linkage disequilibrium
Review of main points from last week Medical costs escalating largely due to new technology This is an ethical/social problem with major conseq. Many new.
Alleles = A, a Genotypes = AA, Aa, aa
SNP Applications statwww.epfl.ch/davison/teaching/Microarrays/snp.ppt.
Population Genetics and Natural Selection
Single Nucleotide Polymorphism Copy Number Variations and SNP Array Xiaole Shirley Liu and Jun Liu.
Evolution and Genetic Equilibrium
Objectives Cover some of the essential concepts for GWAS that have not yet been covered Hardy-Weinberg equilibrium Meta-analysis SNP Imputation Review.
Basics of Linkage Analysis
BMI 731- Winter 2005 Chapter1: SNP Analysis Catalin Barbacioru Department of Biomedical Informatics Ohio State University.
14 Molecular Evolution and Population Genetics
Introduction to Linkage Analysis March Stages of Genetic Mapping Are there genes influencing this trait? Epidemiological studies Where are those.
CSE 291: Advanced Topics in Computational Biology Vineet Bafna/Pavel Pevzner
Something related to genetics? Dr. Lars Eijssen. Bioinformatics to understand studies in genomics – São Paulo – June Image:
Genomewide Association Studies.  1. History –Linkage vs. Association –Power/Sample Size  2. Human Genetic Variation: SNPs  3. Direct vs. Indirect Association.
 Read Chapter 6 of text  We saw in chapter 5 that a cross between two individuals heterozygous for a dominant allele produces a 3:1 ratio of individuals.
Introduction Basic Genetic Mechanisms Eukaryotic Gene Regulation The Human Genome Project Test 1 Genome I - Genes Genome II – Repetitive DNA Genome III.
Natural Selection Developed by Charles Darwin in 1859
Population Genetics Learning Objectives
Introduction to BST775: Statistical Methods for Genetic Analysis I Course master: Degui Zhi, Ph.D. Assistant professor Section on Statistical Genetics.
The evolution of populations & Hardy-Weinberg Equilibrium
Broad-Sense Heritability Index
14 Population Genetics and Evolution. Population Genetics Population genetics involves the application of genetic principles to entire populations of.
The Evolution of Populations.  Emphasizes the extensive genetic variation within populations and recognizes the importance of quantitative characteristics.
Chapter 23 Notes The Evolution of Populations. Concept 23.1 Darwin and Mendel were contemporaries of the 19 th century - at the time both were unappreciated.
SNPs Daniel Fernandez Alejandro Quiroz Zárate. A SNP is defined as a single base change in a DNA sequence that occurs in a significant proportion (more.
Genetic Linkage. Two pops may have the same allele frequencies but different chromosome frequencies.
CS177 Lecture 10 SNPs and Human Genetic Variation
Course outline HWE: What happens when Hardy- Weinberg assumptions are met Inheritance: Multiple alleles in a population; Transmission of alleles in a family.
Gene Hunting: Linkage and Association
From Genome-Wide Association Studies to Medicine Florian Schmitzberger - CS 374 – 4/28/2009 Stanford University Biomedical Informatics
Lecture 19: Association Studies II Date: 10/29/02  Finish case-control  TDT  Relative Risk.
Quantitative Genetics. Continuous phenotypic variation within populations- not discrete characters Phenotypic variation due to both genetic and environmental.
Quantitative Genetics
1 Population Genetics Basics. 2 Terminology review Allele Locus Diploid SNP.
What is a SNP?. Lecture topics What is a SNP? What use are they? SNP discovery SNP genotyping Introduction to Linkage Disequilibrium.
Genes in human populations n Population genetics: focus on allele frequencies (the “gene pool” = all the gametes in a big pot!) n Hardy-Weinberg calculations.
ABC for the AEA Basic biological concepts for genetic epidemiology Martin Kennedy Department of Pathology Christchurch School of Medicine.
1 B-b B-B B-b b-b Lecture 2 - Segregation Analysis 1/15/04 Biomath 207B / Biostat 237 / HG 207B.
Allele Frequencies: Staying Constant Chapter 14. What is Allele Frequency? How frequent any allele is in a given population: –Within one race –Within.
Evolution of Populations. The Smallest Unit of Evolution Natural selection acts on individuals, but only populations evolve – Genetic variations contribute.
Linear Reduction Method for Tag SNPs Selection Jingwu He Alex Zelikovsky.
Chapter 23 – Population Genetics The Evolution of Populations (Outline) The smallest unit of evolution It is the population, not the individual, that evolves.
The Hardy-Weinberg principle is like a Punnett square for populations, instead of individuals. A Punnett square can predict the probability of offspring's.
February 20, 2002 UD, Newark, DE SNPs, Haplotypes, Alleles.
In The Name of GOD Genetic Polymorphism M.Dianatpour MLD,PHD.
1.Stream A and Stream B are located on two isolated islands with similar characteristics. How do these two stream beds differ? 2.Suppose a fish that varies.
Evolution of populations Ch 21. I. Background  Individuals do not adapt or evolve  Populations adapt and evolve  Microevolution = change in allele.
8 and 11 April, 2005 Chapter 17 Population Genetics Genes in natural populations.
Genome-Wides Association Studies (GWAS) Veryan Codd.
Inferences on human demographic history using computational Population Genetic models Gabor T. Marth Department of Biology Boston College Chestnut Hill,
Lecture 6 Genetic drift & Mutation Sonja Kujala
Evolution and Population Genetics
Single Nucleotide Polymorphisms (SNPs
Common variation, GWAS & PLINK
Evolution of populations
HARDY-WEINBERG and GENETIC EQUILIBRIUM
Introduction to bioinformatics lecture 11 SNP by Ms.Shumaila Azam
Recombination (Crossing Over)
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS)
Lecture 4: Testing for Departures from Hardy-Weinberg Equilibrium
The Evolution of Populations
“TaqMan genotyping Assay’’
The Evolution of Populations
Linkage Analysis Problems
Presentation transcript:

Single Nucleotide Polymorphism And Association Studies Stat 115 Dec 12, 2006

2 Outline Definition and motivation SNP distribution and characteristics –Allele frequency, LD, population stratification SNP discovery (unknown) and genotyping (known)SNP discovery genotyping SNP association studies –Case control studies, and family based association studiesCase control studies, and family based association studies –Issues related to association studiesIssues related to association studies

3 Mode of inheritance

4 Polymorphism Polymorphism: sites/genes with “common” variation, less common allele frequency >= 1%, otherwise called rare variant and not polymorphic First discovered (early 1980): restriction fragment length polymorphism Some definitions: –Locus: position on chromosome where sequence or gene is located –Allele: alternative form of DNA on a locus

Fundamental rules of genetics Law of Segregation: a diploid parent is equally likely to pass along either of its two alleles P(pass copy 1) = P(pass copy 2) = ½ Law of Random Union gametes unite in a random fashion, so allele A1 is no more likely to unite with allele A1 than A2, for example P(offspring is A1A1) = P(father passes A1) × P(mother passes A1) P(offspring is A1A2) = P(father passes A1) × P(mother passes A2)+ P(mother passes A1) × P(father passes A2) 5 Slides from Karin S. Dorman

Hardy-Weinberg Equilibrium Consider a single locus where there are two alleles segregating in a diploid population. Make the Hardy- Weinberg (HW) assumptions: –No difference in genotype proportions between the sexes. –Synchronous reproduction at discrete points in time (discrete generations) –Infinite population size (so that small variabilities are erased in the average) –No mutation. –No migration –No selection –Random mating 6 Slides from Karin S. Dorman

Deriving HWE Let genotypes at generation t be P 11 (t), P 12 (t), and P 22 (t). Then, Genotype in the next generation will be And p 1 (t+1)=p 1 (t); p 2 (t+1)=p 2 (t) So in one step it returns to the equilibrium! 7 Slides from Karin S. Dorman

A simple example Consider this “population” 8 Slides from Karin S. Dorman

9

10 Slides from Karin S. Dorman

11 SNP Three classes of polymorphic markers: –Biallelic: SNPs and indels, less informative but more frequent & stable –Multiallelic: micro and mini satellites, more dynamic, high copy number loci have high mutation rate –Combination of above two Single Nucleotide Polymorphism –Occasionally short (1-3 bp) indels are considered SNPs too –Come from DNA-replication mistake individual germ line cell, then transmitted

What are Single Nucleotide Polymorphisms (SNPs)? ATGGTAAGCCTGAGCTGACTTAGCGT-AT ATGGTAAACCTGAGTTGACTTAGCGTCAT    SNP SNP indel SNPs result from replication errors and DNA damage They are a ‘polymorphic’ bit state at a nucleoside address

13 Why Should We Care Personalized Medicine –Aithal et al., 1999, Lancet –Warfarin anticoagulant drug –CYP2C9 gene metabolizes warfarin, CYP2C9*1 (wild type) has two allelic variants: CYP2C9*2 & CYP2C9*3 (both single AA change) –Patients with variant alleles are poor warfarin metabolisers, often at higher risk of bleeding Disease gene discovery –Association studies –Chromosome aberrations (copy number changes)

Disease resistant populationDisease susceptible population Resistant people all have an ‘A’ at position 4 in geneX, while susceptible people have a ‘T’ (A/T are the SNPs) Genotype all individuals for thousands of SNPs ATGATTATAG ATGTTTATAGgeneX

SNP Applications in Medicine Gene discovery and allele mapping Association-based (drug) candidate –polymorphism testing of a trait pool Diagnostics / risk profiling Drug response prediction Homogeneity testing / study design Gene function identification

Population Assignment– assessing competing hypotheses The likelihood ratio method Definition of competing hypotheses is essential 16Adapted from a slide of Steve DiFazio

17Adapted from a slide of Steve DiFazio

18

19

Hypothesis testing in statistics … Null hypothesis – assumed true unless there is an overwhelming evidence against it. –P-value – under the null hypothesis assess how “odd” a particular aspect of the data is – the probability of seeing values as extreme or more extreme than the one we saw. –Using the likelihood ratio to find an effective aspect of the data to tell the two hypotheses apart – a way to guide your choice 20

21 SNP Distribution Most common, > 1 SNP / 1KB –Balance between mutation introduction rate and polymorphism lost rate –Most mutations lost within a few generations Often more transitions (A/G, C/T) than transversions (A/T, A/C, G/T, G/C) In non-coding regions, often fewer SNPs at more conserved regions In coding regions, often more synonymous than non-synonymous SNPs

22 SNP Characteristics: Allele Frequency Distribution Most alleles are rare (minor allele frequency < 10%) Allele frequency in different genomes have a large variation –Human > 1 SNP / 600-1KB, –Fly and maize have an order of magnitude greater number of polymorphism (1 SNP / bp) Nucleotide diversity is positively correlated with recombination rate

International HapMap Project The International HapMap project is a recent, large-scale effort to facilitate GWAS studies: –Phase 1: 269 samples, 1.1 M SNPs –Phase 2: 270 samples, 3.9 M SNPs –Phase 3: 1115 samples, 1.6 M SNPs Phase 3 platforms: –Illumina Human1M (by Wellcome Trust Sanger Institute) –Affymetrix SNP 6.0 (by Broad Institute) 23