Biostatistics-Lecture 19 Linkage Disequilibrium and SNP detection

Slides:



Advertisements
Similar presentations
Imputation for GWAS 6 December 2012.
Advertisements

Julia Krushkal 4/11/2017 The International HapMap Project: A Rich Resource of Genetic Information Julia Krushkal Lecture in Bioinformatics 04/15/2010.
Multiple Comparisons Measures of LD Jess Paulus, ScD January 29, 2013.
Objectives Cover some of the essential concepts for GWAS that have not yet been covered Hardy-Weinberg equilibrium Meta-analysis SNP Imputation Review.
High resolution detection of IBD Sharon R Browning and Brian L Browning Supported by the Marsden Fund.
Joint Linkage and Linkage Disequilibrium Mapping
Understanding GWAS Chip Design – Linkage Disequilibrium and HapMap Peter Castaldi January 29, 2013.
Published Genome-Wide Associations through ,617 published GWA at p≤5X10 -8 for 249 traits Autism marker Multiple Sclerosis Marker The GWAS Human.
MALD Mapping by Admixture Linkage Disequilibrium.
Lab 13: Association Genetics. Goals Use a Mixed Model to determine genetic associations. Understand the effect of population structure and kinship on.
University of Connecticut
The role of variation in finding functional genetic elements Andy Clark – Cornell Dave Begun – UC Davis.
Computational Challenges in Whole-Genome Association Studies Ion Mandoiu Computer Science and Engineering Department University of Connecticut.
. Basic Model For Genetic Linkage Analysis Lecture #3 Prepared by Dan Geiger.
Genotype Error Detection using Hidden Markov Models of Haplotype Diversity Justin Kennedy, Ion Mandoiu, Bogdan Pasaniuc CSE Department, University of Connecticut.
Imputation-based local ancestry inference in admixed populations Ion Mandoiu Computer Science and Engineering Department University of Connecticut Joint.
Robust and powerful sibpair test for rare variant association
Haplotype Blocks An Overview A. Polanski Department of Statistics Rice University.
SNPs Daniel Fernandez Alejandro Quiroz Zárate. A SNP is defined as a single base change in a DNA sequence that occurs in a significant proportion (more.
Imputation 2 Presenter: Ka-Kit Lam.
Molecular & Genetic Epi 217 Association Studies
National Taiwan University Department of Computer Science and Information Engineering Pattern Identification in a Haplotype Block * Kun-Mao Chao Department.
Whole genome association studies Introduction and practical Boulder, March 2009.
Lecture 19: Association Studies II Date: 10/29/02  Finish case-control  TDT  Relative Risk.
Large-scale recombination rate patterns are conserved among human populations David Serre McGill University and Genome Quebec Innovation Center UQAM January.
Methods in genome wide association studies. Norú Moreno
Lab 13: Association Genetics December 5, Goals Use Mixed Models and General Linear Models to determine genetic associations. Understand the effect.
FINE SCALE MAPPING ANDREW MORRIS Wellcome Trust Centre for Human Genetics March 7, 2003.
California Pacific Medical Center
Association analysis Genetics for Computer Scientists Biomedicum & Department of Computer Science, Helsinki Päivi Onkamo.
Imputation-based local ancestry inference in admixed populations
Practical With Merlin Gonçalo Abecasis. MERLIN Website Reference FAQ Source.
2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland.
Populations: defining and identifying. Two major paradigms for defining populations Ecological paradigm A group of individuals of the same species that.
Copyright OpenHelix. No use or reproduction without express written consent1.
Linkage Disequilibrium and Recent Studies of Haplotypes and SNPs
Lectures 7 – Oct 19, 2011 CSE 527 Computational Biology, Fall 2011 Instructor: Su-In Lee TA: Christopher Miles Monday & Wednesday 12:00-1:20 Johnson Hall.
The Haplotype Blocks Problems Wu Ling-Yun
Association Mapping in Families Gonçalo Abecasis University of Oxford.
Introduction to SNP and Haplotype Analysis
Genetic Linkage.
Gonçalo Abecasis and Janis Wigginton University of Michigan, Ann Arbor
Of Sea Urchins, Birds and Men
Constrained Hidden Markov Models for Population-based Haplotyping
Population genetics Dr Gavin Band
Population Genetics As we all have an interest in genomic epidemiology we are likely all either in the process of sampling and ananlysising genetic data.
Genetic Linkage.
Imputation-based local ancestry inference in admixed populations
Post-GWAS and Mechanistic Analyses
Patterns of Linkage Disequilibrium in the Human Genome
The ‘V’ in the Tajima D equation is:
Haplotype Reconstruction
Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
Garrett McKinney Jim Seeb Lisa Seeb
Genetic Linkage.
Polycystic ovary syndrome: an ancient disorder?
Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
Proportioning Whole-Genome Single-Nucleotide–Polymorphism Diversity for the Identification of Geographic Population Structure and Genetic Ancestry  Oscar.
10 Years of GWAS Discovery: Biology, Function, and Translation
Outline Cancer Progression Models
A Flexible Bayesian Framework for Modeling Haplotype Association with Disease, Allowing for Dominance Effects of the Underlying Causative Variants  Andrew.
Haplotypes at ATM Identify Coding-Sequence Variation and Indicate a Region of Extensive Linkage Disequilibrium  Penelope E. Bonnen, Michael D. Story,
IBD Estimation in Pedigrees
A Unified Approach to Genotype Imputation and Haplotype-Phase Inference for Large Data Sets of Trios and Unrelated Individuals  Brian L. Browning, Sharon.
X-chromosomal markers and FamLinkX
Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
Evaluating the Effects of Imputation on the Power, Coverage, and Cost Efficiency of Genome-wide SNP Platforms  Carl A. Anderson, Fredrik H. Pettersson,
Genotype-Imputation Accuracy across Worldwide Human Populations
Gonçalo R. Abecasis, Janis E. Wigginton 
Fig. 4 Neanderthal ancestry distribution in Eurasian populations.
Presentation transcript:

Biostatistics-Lecture 19 Linkage Disequilibrium and SNP detection Ruibin Xi Peking University School of Mathematical Sciences

Haplotype Freqeuncies

Linkage Equilibrium

Linkage Disequilibrium

Disequilibrium Coefficient DAB

DAB is hard to interpret Sign is arbitrary … A common convention is to set A, B to be the common allele and a, b to be the rare allele Range depends on allele Frequencies Hard to compare between markers

r2 (also called Δ2) Ranges between 0 and 1 1 when the two markers provide identical information 0 when they are in perfect equilibrium

Raw r2 data from chr22

Comparing Populations CEPH: Utah residents with ancestry from northern and western Europe (CEU)

Use LD for SNP imputation and detection fastPhase

Use LD for SNP imputation and detection fastPhase

Model for haplotypes Observed n haplotypes Each with M markers bij = 0, 1 Assume each haplotye originates from one of K clusters zi: unknown cluster of origin of bi Since clusters of origin are unknown

Local clustering of haplotype Assume zi = (zi1,…, ziM) forms a Markov chain on {1,…,K} zim denote the cluster origin for bim Initial probabilities Transition probabilities Conditional on the cluster of origin Marginal

Local clustering of genotype data We have genotype data gim: genotype at marker m of individual i Take values 0, 1, 2 Initial probabilities ( unordered cluster of origins) Transition probabilities

Local clustering of genotype data Genotype probabilities conditional on cluster of origins Joint likelihood

Algorithms for genotype imputation fastPhase BEAGLE IMPUTE PLINK MaCH

Algorithms for genotype imputation fastPhase BEAGLE IMPUTE PLINK MaCH Picture taken from IMPUTE v2

SNP detection with LD information MaCH: (G: genotye, S: cluster)

SNP detection with LD information For sequencing data G is not observed Coverage of base A, B are observed, we have the HMM

SNP detection with LD information Nielsen et al. 2011 Nature Review Genetics