Biostatistics-Lecture 19 Linkage Disequilibrium and SNP detection

Slides:

Advertisements

Similar presentations

Imputation for GWAS 6 December 2012.

Advertisements

Julia Krushkal 4/11/2017 The International HapMap Project: A Rich Resource of Genetic Information Julia Krushkal Lecture in Bioinformatics 04/15/2010.

Multiple Comparisons Measures of LD Jess Paulus, ScD January 29, 2013.

Objectives Cover some of the essential concepts for GWAS that have not yet been covered Hardy-Weinberg equilibrium Meta-analysis SNP Imputation Review.

High resolution detection of IBD Sharon R Browning and Brian L Browning Supported by the Marsden Fund.

Joint Linkage and Linkage Disequilibrium Mapping

Understanding GWAS Chip Design – Linkage Disequilibrium and HapMap Peter Castaldi January 29, 2013.

Published Genome-Wide Associations through ,617 published GWA at p≤5X10 -8 for 249 traits Autism marker Multiple Sclerosis Marker The GWAS Human.

MALD Mapping by Admixture Linkage Disequilibrium.

Lab 13: Association Genetics. Goals Use a Mixed Model to determine genetic associations. Understand the effect of population structure and kinship on.

University of Connecticut

The role of variation in finding functional genetic elements Andy Clark – Cornell Dave Begun – UC Davis.

Computational Challenges in Whole-Genome Association Studies Ion Mandoiu Computer Science and Engineering Department University of Connecticut.

. Basic Model For Genetic Linkage Analysis Lecture #3 Prepared by Dan Geiger.

Genotype Error Detection using Hidden Markov Models of Haplotype Diversity Justin Kennedy, Ion Mandoiu, Bogdan Pasaniuc CSE Department, University of Connecticut.

Imputation-based local ancestry inference in admixed populations Ion Mandoiu Computer Science and Engineering Department University of Connecticut Joint.

Robust and powerful sibpair test for rare variant association

Haplotype Blocks An Overview A. Polanski Department of Statistics Rice University.

SNPs Daniel Fernandez Alejandro Quiroz Zárate. A SNP is defined as a single base change in a DNA sequence that occurs in a significant proportion (more.

Imputation 2 Presenter: Ka-Kit Lam.

Molecular & Genetic Epi 217 Association Studies

National Taiwan University Department of Computer Science and Information Engineering Pattern Identification in a Haplotype Block * Kun-Mao Chao Department.

Whole genome association studies Introduction and practical Boulder, March 2009.

Lecture 19: Association Studies II Date: 10/29/02  Finish case-control  TDT  Relative Risk.

Large-scale recombination rate patterns are conserved among human populations David Serre McGill University and Genome Quebec Innovation Center UQAM January.

Methods in genome wide association studies. Norú Moreno

Lab 13: Association Genetics December 5, Goals Use Mixed Models and General Linear Models to determine genetic associations. Understand the effect.

FINE SCALE MAPPING ANDREW MORRIS Wellcome Trust Centre for Human Genetics March 7, 2003.

California Pacific Medical Center

Association analysis Genetics for Computer Scientists Biomedicum & Department of Computer Science, Helsinki Päivi Onkamo.

Imputation-based local ancestry inference in admixed populations

Practical With Merlin Gonçalo Abecasis. MERLIN Website Reference FAQ Source.

2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland.

Populations: defining and identifying. Two major paradigms for defining populations Ecological paradigm A group of individuals of the same species that.

Copyright OpenHelix. No use or reproduction without express written consent1.

Linkage Disequilibrium and Recent Studies of Haplotypes and SNPs

Lectures 7 – Oct 19, 2011 CSE 527 Computational Biology, Fall 2011 Instructor: Su-In Lee TA: Christopher Miles Monday & Wednesday 12:00-1:20 Johnson Hall.

The Haplotype Blocks Problems Wu Ling-Yun

Association Mapping in Families Gonçalo Abecasis University of Oxford.

Introduction to SNP and Haplotype Analysis

Genetic Linkage.

Gonçalo Abecasis and Janis Wigginton University of Michigan, Ann Arbor

Of Sea Urchins, Birds and Men

Constrained Hidden Markov Models for Population-based Haplotyping

Population genetics Dr Gavin Band

Population Genetics As we all have an interest in genomic epidemiology we are likely all either in the process of sampling and ananlysising genetic data.

Genetic Linkage.

Imputation-based local ancestry inference in admixed populations

Post-GWAS and Mechanistic Analyses

Patterns of Linkage Disequilibrium in the Human Genome

The ‘V’ in the Tajima D equation is:

Haplotype Reconstruction

Haplotype Inference Yao-Ting Huang Kun-Mao Chao.

Garrett McKinney Jim Seeb Lisa Seeb

Genetic Linkage.

Polycystic ovary syndrome: an ancient disorder?

Haplotype Inference Yao-Ting Huang Kun-Mao Chao.

Proportioning Whole-Genome Single-Nucleotide–Polymorphism Diversity for the Identification of Geographic Population Structure and Genetic Ancestry Oscar.

10 Years of GWAS Discovery: Biology, Function, and Translation

Outline Cancer Progression Models

A Flexible Bayesian Framework for Modeling Haplotype Association with Disease, Allowing for Dominance Effects of the Underlying Causative Variants Andrew.

Haplotypes at ATM Identify Coding-Sequence Variation and Indicate a Region of Extensive Linkage Disequilibrium Penelope E. Bonnen, Michael D. Story,

IBD Estimation in Pedigrees

A Unified Approach to Genotype Imputation and Haplotype-Phase Inference for Large Data Sets of Trios and Unrelated Individuals Brian L. Browning, Sharon.

X-chromosomal markers and FamLinkX

Haplotype Inference Yao-Ting Huang Kun-Mao Chao.

Evaluating the Effects of Imputation on the Power, Coverage, and Cost Efficiency of Genome-wide SNP Platforms Carl A. Anderson, Fredrik H. Pettersson,

Genotype-Imputation Accuracy across Worldwide Human Populations

Gonçalo R. Abecasis, Janis E. Wigginton

Fig. 4 Neanderthal ancestry distribution in Eurasian populations.

Presentation transcript:

Biostatistics-Lecture 19 Linkage Disequilibrium and SNP detection Ruibin Xi Peking University School of Mathematical Sciences

Haplotype Freqeuncies

Linkage Equilibrium

Linkage Disequilibrium

Disequilibrium Coefficient DAB

DAB is hard to interpret Sign is arbitrary … A common convention is to set A, B to be the common allele and a, b to be the rare allele Range depends on allele Frequencies Hard to compare between markers

r2 (also called Δ2) Ranges between 0 and 1 1 when the two markers provide identical information 0 when they are in perfect equilibrium

Raw r2 data from chr22

Comparing Populations CEPH: Utah residents with ancestry from northern and western Europe (CEU)

Use LD for SNP imputation and detection fastPhase

Use LD for SNP imputation and detection fastPhase

Model for haplotypes Observed n haplotypes Each with M markers bij = 0, 1 Assume each haplotye originates from one of K clusters zi: unknown cluster of origin of bi Since clusters of origin are unknown

Local clustering of haplotype Assume zi = (zi1,…, ziM) forms a Markov chain on {1,…,K} zim denote the cluster origin for bim Initial probabilities Transition probabilities Conditional on the cluster of origin Marginal

Local clustering of genotype data We have genotype data gim: genotype at marker m of individual i Take values 0, 1, 2 Initial probabilities ( unordered cluster of origins) Transition probabilities

Local clustering of genotype data Genotype probabilities conditional on cluster of origins Joint likelihood

Algorithms for genotype imputation fastPhase BEAGLE IMPUTE PLINK MaCH

Algorithms for genotype imputation fastPhase BEAGLE IMPUTE PLINK MaCH Picture taken from IMPUTE v2

SNP detection with LD information MaCH: (G: genotye, S: cluster)

SNP detection with LD information For sequencing data G is not observed Coverage of base A, B are observed, we have the HMM

SNP detection with LD information Nielsen et al. 2011 Nature Review Genetics