The HapMap Project and Haploview

Slides:



Advertisements
Similar presentations
Association Studies, Haplotype Blocks and Tagging SNPs Prof. Sorin Istrail.
Advertisements

Julia Krushkal 4/11/2017 The International HapMap Project: A Rich Resource of Genetic Information Julia Krushkal Lecture in Bioinformatics 04/15/2010.
SNP Applications statwww.epfl.ch/davison/teaching/Microarrays/snp.ppt.
Are you ready for the genomic age? An introduction to human genomics Jacques Fellay EPFL School of Life Sciences Swiss Institute of Bioinformatics Lausanne,
Linkage Disequilibrium
Efficient Algorithms for Genome-wide TagSNP Selection across Populations via the Linkage Disequilibrium Criterion Authors: Lan Liu, Yonghui Wu, Stefano.
Single Nucleotide Polymorphism And Association Studies
Understanding GWAS Chip Design – Linkage Disequilibrium and HapMap Peter Castaldi January 29, 2013.
Association Mapping David Evans. Outline Definitions / Terminology What is (genetic) association? How do we test for association? When to use association.
Using HapMap.Org A Tutorial Lincoln Stein, Cold Spring Harbor Laboratory.
The role of variation in finding functional genetic elements Andy Clark – Cornell Dave Begun – UC Davis.
Resources at HapMap.Org Tutorial Marcela K. Tello-Ruiz Cold Spring Harbor Laboratory.
Haplotypes and imputed genotypes in diverse human populations Noah Rosenberg April 29, 2009.
SNP Resources: Finding SNPs Discovery and Databases Mark J. Rieder, PhD SeattleSNPs Workshop March 20-21, 2006.
A coalescent computational platform for tagging marker selection for clinical studies Gabor T. Marth Department of Biology, Boston College
A coalescent computational platform to predict strength of association for clinical samples Gabor T. Marth Department of Biology, Boston College
Picking SNPs Application to Association Studies Dana Crawford, PhD SeattleSNPs PGA University of Washington March 20, 2006.
A dynamic program algorithm for haplotype block partitioning Zhang, et. al. (2002) PNAS. 99, 7335.
Genomewide Association Studies.  1. History –Linkage vs. Association –Power/Sample Size  2. Human Genetic Variation: SNPs  3. Direct vs. Indirect Association.
Polymorphisms – SNP, InDel, Transposon BMI/IBGP 730 Victor Jin, Ph.D. (Slides from Dr. Kun Huang) Department of Biomedical Informatics Ohio State University.
SNPs DNA differs between humans by 0.1%, (1 in 1300 bases) This means that you can map DNA variation to around 10,000,000 sites in the genome Almost all.
SNP Selection University of Louisville Center for Genetics and Molecular Medicine January 10, 2008 Dana Crawford, PhD Vanderbilt University Center for.
Course Overview Personalized Medicine: Understanding Your Own Genome Fall 2014.
The International HapMap Project: Ethical, Social, and Cultural Issues [Names and institutions of presenters]
Haplotype Discovery and Modeling. Identification of genes Identify the Phenotype MapClone.
Design Considerations in Large- Scale Genetic Association Studies Michael Boehnke, Andrew Skol, Laura Scott, Cristen Willer, Gonçalo Abecasis, Anne Jackson,
Introduction Basic Genetic Mechanisms Eukaryotic Gene Regulation The Human Genome Project Test 1 Genome I - Genes Genome II – Repetitive DNA Genome III.
HapMap: application in the design and interpretation of association studies Mark J. Daly, PhD on behalf of The International HapMap Consortium.
Haplotype Blocks An Overview A. Polanski Department of Statistics Rice University.
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College
Introduction to Single Nucleotide Polymorphisms (SNPs) Zhongming Zhao Department of Psychiatry and Center for the Study of Biological Complexity June 28,
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen,
Conservation of genomic segments (haplotypes): The “HapMap” n In populations, it appears the the linear order of alleles (“haplotype”) is conserved in.
Biology 101 DNA: elegant simplicity A molecule consisting of two strands that wrap around each other to form a “twisted ladder” shape, with the.
Molecular & Genetic Epi 217 Association Studies
CS177 Lecture 10 SNPs and Human Genetic Variation
Informative SNP Selection Based on Multiple Linear Regression
Genome-Wide Association Study (GWAS)
A coalescent computational platform to predict strength of association for clinical samples Gabor T. Marth Department of Biology, Boston College
1 of 32 Sequence Variation in Ensembl. 2 of 32 Outline SNPs SNPs in Ensembl Haplotypes & Linkage Disequilibrium SNPs in BioMart HapMap project Strain-specific.
Some current issues in QTL identification Lon Cardon Wellcome Trust Centre for Human Genetics University of Oxford Acknowledgements:Goncalo Abecasis Stacey.
Large-scale recombination rate patterns are conserved among human populations David Serre McGill University and Genome Quebec Innovation Center UQAM January.
Association mapping: finding genetic variants for common traits & diseases Manuel Ferreira Queensland Institute of Medical Research Brisbane Genetic Epidemiology.
Molecular & Genetic Epi 217 Association Studies: Indirect John Witte.
Polymorphism Haixu Tang School of Informatics. Genome variations underlie phenotypic differences cause inherited diseases.
Linear Reduction Method for Tag SNPs Selection Jingwu He Alex Zelikovsky.
SNPs, Haplotypes, Disease Associations Algorithmic Foundations of Computational Biology II Course 1 Prof. Sorin Istrail.
The International Consortium. The International HapMap Project.
Motivations to study human genetic variation
PanMap Mapping Genomic Variation in Western Chimpanzees
Copyright OpenHelix. No use or reproduction without express written consent1.
Linkage Disequilibrium and Recent Studies of Haplotypes and SNPs
Signals of natural selection in the HapMap project data The International HapMap Consortium Gil McVean Department of Statistics, Oxford University.
A coalescent computational platform to predict strength of association for clinical samples Gabor T. Marth Department of Biology, Boston College
NCSU Summer Institute of Statistical Genetics, Raleigh 2004: Genome Science Session 3: Genomic Variation.
Association Mapping in Families Gonçalo Abecasis University of Oxford.
Inferences on human demographic history using computational Population Genetic models Gabor T. Marth Department of Biology Boston College Chestnut Hill,
Of Sea Urchins, Birds and Men
Constrained Hidden Markov Models for Population-based Haplotyping
Differences in asthma genetics between Chinese and other populations
Itsik Pe’er, Yves R. Chretien, Paul I. W. de Bakker, Jeffrey C
Power to detect QTL Association
Differences in asthma genetics between Chinese and other populations
Polycystic ovary syndrome: an ancient disorder?
A Tutorial Lincoln Stein, Cold Spring Harbor Laboratory
Genome-Wide Association Studies: Present Status and Future Directions
Haplotypes at ATM Identify Coding-Sequence Variation and Indicate a Region of Extensive Linkage Disequilibrium  Penelope E. Bonnen, Michael D. Story,
Haplotypes When the presence of two or more polymorphisms on a single chromosome is statistically correlated in a population, this is a haplotype Example.
SNPs and CNPs By: David Wendel.
Presentation transcript:

The HapMap Project and Haploview David Evans Ben Neale University of Oxford Wellcome Trust Centre for Human Genetics

Human Haplotype Map General Idea: Characterize the distribution of Linkage Disequilibrium across the genome. Why?: Infeasible to type every polymorphism in the human genome => Because of LD, type a subset of variants that captures most of common variation in genome Output: -- Raw genotype data freely available (monthly release) -- www.hapmap.org Deliverables: Sets of haplotype tagging SNPs

Human Haplotype Map - Funding - Total US $120 million Japanese Ministry of Education, Culture, Sports, Science and Technology (MEXT), Tokyo National Institutes of Health, US The Wellcome Trust, UK Genome Canada in Ottawa and Genome Quebec, Montreal Chinese Academy of Sciences, Chinese Ministry of Science and Technology, Natural Science Foundation of China, Beijing The SNP Consortium (TSC), US

Human Haplotype Map - Participants - Genotyping 25% RIKEN/Univ Tokyo (Nakamura) 24% Sanger Institute (Bentley) 16% Illumina (Chee) 10% Genome Quebec (Hudson) 10% Beijing/Shanghai/Hong Kong (Yang, Zeng, Huang, Tsui) 9% Whitehead Institute (Altshuler) 4% Baylor Coll Medicine, US (Gibbs) 2% Univ Calif San Francisco (Kwok) Ethical, Legal, Social Issues Japan (Matsuda) China (Zhang, Zeng) US (Leppert) Nigeria (Rotimi) Samples Nigeria (Yoruba; Ibadan) 30 trios  90 individuals US (CEPH) China (Han) 45 unrelateds Japan (Tokyo) Data Analysis Whitehead (Altshuler, Daly) Johns Hopkins Univ (Chakravarti, Cutler) Oxford Statistics (Donnelly, McVean) Oxford Genetics (Cardon, Weir, Abecasis) Data Coordination Cold Spring Harbor (Stein)

Human Haplotype Map Status March 2005 “Phase I” complete ~1 million SNPs typed in 270 individuals at an average spacing of 1 SNP per 5 KB Study of data accuracy across centres (1,500 markers) revealed concordance, internal consistency > 99.8% For several centres, accuracy > 99.9% “Phase II” underway Type an additional 2.25 million SNPs in the same samples (~1 SNP per 1 KB)

Encode Regions Resequence ten ~500KB regions in 16 CEPH, 16 Yoruba, 8 Japanese and 8 Chinese Genotype all dbSNPs and “new” SNPs in all 270 individuals ENCODE Regions Genotype Information Region name Chromosome band Genomic interval (NCBI ) Available SNPs Genotyped SNPs Genotyping group dbSNP New SNPs CEU HCB JPT YRI rs# no rs# ENr112 2p16.3 Chr2:51633239..52133238 1,624 1,720 1,064 937 867 900 868 879 922 McGill-GQIC, Perlegen ENr131 2q37.1 Chr2:234778639..235278638 1,787 1,233 1,179 719 923 690 925 932 704 ENr113 4q26 Chr4:118705475..119205474 1,516 1,819 1,017 1,614 878 1,589 1,597 Broad, Perlegen ENm010 7p15.2 Chr7:26699793..27199792 1,274 1,857 757 459 291 500 284 456 UCSF-WU, Perlegen ENm013 7q21.13 Chr7:89395718..89895717 1,545 1,713 927 1,382 740 1,393 748 1,391 ENm014 7q31.33 Chr7:126135436..126632577 1,354 1,562 963 1,428 794 1,417 800 1,419 ENr321 8q24.11 Chr8:118769628..119269627 1,468 1,682 936 905 726 907 713 903 Illumina, Perlegen ENr232 9q34.11 Chr9:127061347..127561346 1,494 1,646 694 707 508 702 517 689 ENr123 12q12 Chr12:38626477..39126476 1,904 1,551 859 80 78 74 BCM, Perlegen ENr213 18q12.1 Chr18:23717221..24217220 1,465 809 820 643 816 817 644 819   Total 15,357 16,248 9,205 8,971 6,450 8,914 6,451 8,915 6,470 8,900

US Caucasian vs UK Caucasian Population Recombination Rate Population

HapMap website: Haploview website: www.hapmap.org www.broad.mit.edu/mpg/haploview/index.php

Haploview .ped file 1 1 0 0 1 2 1 2 0 0 1 2 0 0 2 2 1 2 3 3 1 3 0 0 1 2 1 1 1 1 1 4 1 2 2 2 1 2 0 0 1 5 3 4 2 2 1 1 1 1 1 6 3 4 1 2 2 2 1 3 .info file rs1474567 38362947 rs2179083 38364233

Exercise F:\davide\Boulder2005\hapmap af1.dat African dataset af1.ped cauc1.dat cauc1.ped Caucasian dataset How many “blocks” are there in the Caucasian dataset? Do the number and position of blocks vary according to whether the Gabriel et al or four gamete block definition is employed? Choose a set of tagging SNPs for the Caucasian dataset to summarize the genotype data efficiently. Do LD patterns vary between the Caucasian and African datasets? Why?