Understanding GWAS Chip Design – Linkage Disequilibrium and HapMap Peter Castaldi January 29, 2013.

Slides:



Advertisements
Similar presentations
Association Studies, Haplotype Blocks and Tagging SNPs Prof. Sorin Istrail.
Advertisements

Julia Krushkal 4/11/2017 The International HapMap Project: A Rich Resource of Genetic Information Julia Krushkal Lecture in Bioinformatics 04/15/2010.
Are you ready for the genomic age? An introduction to human genomics Jacques Fellay EPFL School of Life Sciences Swiss Institute of Bioinformatics Lausanne,
Genome-wide Association Study Focus on association between SNPs and traits Tendency – Larger and larger sample size – Use of more narrowly defined phenotypes(blood.
Objectives Cover some of the essential concepts for GWAS that have not yet been covered Hardy-Weinberg equilibrium Meta-analysis SNP Imputation Review.
Linkage Disequilibrium
Efficient Algorithms for Genome-wide TagSNP Selection across Populations via the Linkage Disequilibrium Criterion Authors: Lan Liu, Yonghui Wu, Stefano.
Association Mapping David Evans. Outline Definitions / Terminology What is (genetic) association? How do we test for association? When to use association.
MALD Mapping by Admixture Linkage Disequilibrium.
Signatures of Selection
CS177 Lecture 9 SNPs and Human Genetic Variation Tom Madej
A coalescent computational platform for tagging marker selection for clinical studies Gabor T. Marth Department of Biology, Boston College
Computational Tools for Finding and Interpreting Genetic Variations Gabor T. Marth Department of Biology, Boston College
A coalescent computational platform to predict strength of association for clinical samples Gabor T. Marth Department of Biology, Boston College
Positional Cloning LOD Sib pairs Chromosome Region Association Study Genetics Genomics Physical Mapping/ Sequencing Candidate Gene Selection/ Polymorphism.
CSE 291: Advanced Topics in Computational Biology Vineet Bafna/Pavel Pevzner
Genomewide Association Studies.  1. History –Linkage vs. Association –Power/Sample Size  2. Human Genetic Variation: SNPs  3. Direct vs. Indirect Association.
Polymorphisms – SNP, InDel, Transposon BMI/IBGP 730 Victor Jin, Ph.D. (Slides from Dr. Kun Huang) Department of Biomedical Informatics Ohio State University.
SNP Selection University of Louisville Center for Genetics and Molecular Medicine January 10, 2008 Dana Crawford, PhD Vanderbilt University Center for.
Haplotype Discovery and Modeling. Identification of genes Identify the Phenotype MapClone.
Design Considerations in Large- Scale Genetic Association Studies Michael Boehnke, Andrew Skol, Laura Scott, Cristen Willer, Gonçalo Abecasis, Anne Jackson,
Introduction Basic Genetic Mechanisms Eukaryotic Gene Regulation The Human Genome Project Test 1 Genome I - Genes Genome II – Repetitive DNA Genome III.
HapMap: application in the design and interpretation of association studies Mark J. Daly, PhD on behalf of The International HapMap Consortium.
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College
Computational research for medical discovery at Boston College Biology Gabor T. Marth Boston College Department of Biology
Doug Brutlag 2011 Genomics & Medicine Doug Brutlag Professor Emeritus of Biochemistry &
SNPs Daniel Fernandez Alejandro Quiroz Zárate. A SNP is defined as a single base change in a DNA sequence that occurs in a significant proportion (more.
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen,
Conservation of genomic segments (haplotypes): The “HapMap” n In populations, it appears the the linear order of alleles (“haplotype”) is conserved in.
Biology 101 DNA: elegant simplicity A molecule consisting of two strands that wrap around each other to form a “twisted ladder” shape, with the.
Molecular & Genetic Epi 217 Association Studies
CS177 Lecture 10 SNPs and Human Genetic Variation
Gene Hunting: Linkage and Association
Genome-Wide Association Study (GWAS)
National Taiwan University Department of Computer Science and Information Engineering Pattern Identification in a Haplotype Block * Kun-Mao Chao Department.
Large-scale recombination rate patterns are conserved among human populations David Serre McGill University and Genome Quebec Innovation Center UQAM January.
Association mapping: finding genetic variants for common traits & diseases Manuel Ferreira Queensland Institute of Medical Research Brisbane Genetic Epidemiology.
Molecular & Genetic Epi 217 Association Studies: Indirect John Witte.
Finnish Genome Center Monday, 16 November Genotyping & Haplotyping.
Polymorphism Haixu Tang School of Informatics. Genome variations underlie phenotypic differences cause inherited diseases.
Genes in human populations n Population genetics: focus on allele frequencies (the “gene pool” = all the gametes in a big pot!) n Hardy-Weinberg calculations.
1 DNA Polymorphisms: DNA markers a useful tool in biotechnology Any section of DNA that varies among individuals in a population, “many forms”. Examples.
Linear Reduction Method for Tag SNPs Selection Jingwu He Alex Zelikovsky.
Clustering and optimization in genetic data: the problem of Tag-SNPs selection Paola Bertolazzi, Serena D‘ Aguanno, Giovanni Felici *, Paola Festa** *
The International Consortium. The International HapMap Project.
Populations: defining and identifying. Two major paradigms for defining populations Ecological paradigm A group of individuals of the same species that.
Motivations to study human genetic variation
Copyright OpenHelix. No use or reproduction without express written consent1.
Linkage Disequilibrium and Recent Studies of Haplotypes and SNPs
Lectures 7 – Oct 19, 2011 CSE 527 Computational Biology, Fall 2011 Instructor: Su-In Lee TA: Christopher Miles Monday & Wednesday 12:00-1:20 Johnson Hall.
Linkage. Announcements Problem set 1 is available for download. Due April 14. class videos are available from a link on the schedule web page, and at.
Analyzing DNA using Microarray and Next Generation Sequencing (1) Background SNP Array Basic design Applications: CNV, LOH, GWAS Deep sequencing Alignment.
Global Variation in Copy Number in the Human Genome Speaker: Yao-Ting Huang Nature, Genome Research, Genome Research, 2006.
Population stratification
Inferences on human demographic history using computational Population Genetic models Gabor T. Marth Department of Biology Boston College Chestnut Hill,
Common variation, GWAS & PLINK
Population Genetics As we all have an interest in genomic epidemiology we are likely all either in the process of sampling and ananlysising genetic data.
Genetic Engineering in Medicine, Agriculture, and Law
Differences in asthma genetics between Chinese and other populations
Introduction to bioinformatics lecture 11 SNP by Ms.Shumaila Azam
Recombination (Crossing Over)
Itsik Pe’er, Yves R. Chretien, Paul I. W. de Bakker, Jeffrey C
Linkage: Statistically, genes act like beads on a string
Haplotype Reconstruction
Differences in asthma genetics between Chinese and other populations
Haplotypes at ATM Identify Coding-Sequence Variation and Indicate a Region of Extensive Linkage Disequilibrium  Penelope E. Bonnen, Michael D. Story,
Haplotypes When the presence of two or more polymorphisms on a single chromosome is statistically correlated in a population, this is a haplotype Example.
Selecting a Maximally Informative Set of Single-Nucleotide Polymorphisms for Association Analyses Using Linkage Disequilibrium  Christopher S. Carlson,
Yu Zhang, Tianhua Niu, Jun S. Liu 
Presentation transcript:

Understanding GWAS Chip Design – Linkage Disequilibrium and HapMap Peter Castaldi January 29, 2013

Objectives Introduce the concept of linkage disequilibrium (LD) Describe how the HapMap project provides publically available information on genetic variation and LD structure Review how LD enables genome-wide screens with only a subset of genome-wide SNP markers Describe the design of chip-based genotype assays

Human Genome 3 billion base pairs, 23 paired chromosomes 99.9% sequence similarity between individuals ~12 million variant sites

What are the Different Types of Genetic Variation? Single base pair change (A C GT  A T GT), aka S ingle N ucleotide P olymorphism ~12 million across the genome Insertions/Deletions (TGGT TTC TA  TGGT---TA) Can be of variable size Trinucelotide repeats (microsatellites) Highly polymorphic, less common than SNPs Responsible for certain clinic disorders (Huntington’s, Fragile X, myotonic dystrophy)

SNPs in detail SNPs can have up to four possible alleles (A,C,G,T), most have only two alleles present in human populations Each person has two SNP alleles (one for each copy of the chromosome) when both copies are the same, you’re homozygous (i.e. AA, CC, GG, TT). When they’re different (AT), your heterozygous. Each allele has a frequency in which it appears in a given population major allele (more common), minor allele (less common) they sum to 1 (or 100%)

SNPs are Used as Genetic Markers for GWAS Chips Properties of SNPs that make them good markers for GWAS densely spaced across the genome usually bi-allelic (only 2 alleles in the population, simplifies statistical tests) GWAS chips can effectively represent most common variation with just a subset of SNPs with ~500,000 SNPs, most common variation can be captured this is because there is significant correlation between neighboring SNPs

Linkage Disequilibrium Causes Correlation Between Neighboring SNPs Mendel’s laws state that genes (alleles) are independently transferred across generations (random assortment – linkage equilibrium). This is not the case when two genetic loci are physically close to each other. When two physically close genetic loci are not randomly assorted, this is called linkage disequilibrium.

Linkage Equilibrium Arises Because of Meiotic Recombination

Linkage and Recombination X Y Z x y z X y z X y z y X z X Y z Gametogenesis Paternal DNAMaternal DNA From Paternal grandfather From Paternal grandmother

Recombination Breaks Up Chromosomal Segments Over Generations recombination is not uniform across the genome ( recombination hotspots ). SNPs within the yellow region are correlated with each other and form haplotypes. Because of this correlation, one can often use a single SNP from a haplotype to represent all the SNP variation within a haplotype.

Haplotype Structure Reflects Evolutionary History The structure of haplotype blocks varies across racial groups African populations have short LD blocks, reflecting the longer evolutionary history of those populations

~500,000 SNP Markers Can Reasonably Represent Most of the Common Genetic Variation in European Genomes GWAS relies upon linkage disequilibrium and the ubiquitous nature of SNP markers to enable genome-wide surveys of the impact of common variation on disease susceptibility Pe’er et al. Nat Gen. 2006

The HapMap Project is a catalog of human variation across populations The Human Genome project provided the complete human sequence for a small number of individuals To get an accurate sense of variable sites, data from many individuals is needed HapMap has three iterations ( dense genotype data from multiple populations groups CEU – individuals of Northern and Western European ancestry from Utah YRI – Yorubans from Nigeria JPT – Japanese from Tokyo CHB – Han Chinese from Beijing

Data from the HapMap Project Enabled GWAS Chip Design Information from HapMap Used in chip design panel of potential SNPs to use in a genotype chip population specific LD structure to allow the identification of tag SNPs that effectively tag haplotypes

Using Linkage Disequilibrium to find Genes Linkage disequilibrium (LD) means that sites of genetic variation can serve as “markers” for larger chromosomal segments. Correlation between markers is quantified with r- squared and D’.

GWAS identify novel disease loci, but additional localization is often necessary

Genotype Chip Technology education.nih.gov/newsnapshots/TOC_Chips/Chips_RITN/How_Chips_Wo rk_1/how_chips_work_1.html

Kang et al. The American Journal of Human Genetics Volume 74, Issue

Summary Genetic material is transmitted across generations in blocks called haplotypes. Linkage disequilibrium and haplotype blocks allow for SNP tagging approaches that enable GWAS chips to capture common genetic variation with a subset of genetic markers. Haplotype structure varies across ancestral groups. The HapMap project catalogs human genetic variation and LD structure across populations.