Genome-Wides Association Studies (GWAS) Veryan Codd.

Slides:



Advertisements
Similar presentations
Statistical methods for genetic association studies
Advertisements

Genetic Approaches to Thinking, Moving and Feeling
Analysis of imputed rare variants
What is an association study? Define linkage disequilibrium
Review of main points from last week Medical costs escalating largely due to new technology This is an ethical/social problem with major conseq. Many new.
Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant.
Which Phenotypes Can be Predicted from a Genome Wide Scan of Single Nucleotide Polymorphisms (SNPs): Ethnicity vs. Breast Cancer Mohsen Hajiloo, Russell.
Single Nucleotide Polymorphism And Association Studies Stat 115 Dec 12, 2006.
SNP Applications statwww.epfl.ch/davison/teaching/Microarrays/snp.ppt.
Genetic Analysis in Human Disease
Perspectives from Human Studies and Low Density Chip Jeffrey R. O’Connell University of Maryland School of Medicine October 28, 2008.
Understanding GWAS Chip Design – Linkage Disequilibrium and HapMap Peter Castaldi January 29, 2013.
Published Genome-Wide Associations through ,617 published GWA at p≤5X10 -8 for 249 traits Autism marker Multiple Sclerosis Marker The GWAS Human.
Ferdinand van ’t Hooft Cardiovascular Genetics and Genomics Group Karolinska Institutet, Stockholm, Sweden Genome-Wide Association Study GWAS
Association Mapping David Evans. Outline Definitions / Terminology What is (genetic) association? How do we test for association? When to use association.
What is Mendelian Randomisation? Frank Dudbridge.
Office hours Wednesday 3-4pm 304A Stanley Hall. Fig Association mapping (qualitative)
CS177 Lecture 9 SNPs and Human Genetic Variation Tom Madej
1 FSTL4 and SEMA5A are associated with alcohol dependence: meta- analysis of two genome-wide association studies Kesheng Wang, PhD Department of Biostatistics.
Computational Tools for Finding and Interpreting Genetic Variations Gabor T. Marth Department of Biology, Boston College
MSc GBE Course: Genes: from sequence to function Genome-wide Association Studies Sven Bergmann Department of Medical Genetics University of Lausanne Rue.
Genome-Wide Association Studies Xiaole Shirley Liu Stat 115/215.
Genomewide Association Studies.  1. History –Linkage vs. Association –Power/Sample Size  2. Human Genetic Variation: SNPs  3. Direct vs. Indirect Association.
Resolving membership in a study in shared aggregate genetics data David W. Craig, Ph.D. Investigator & Associate Director Neurogenomics Division
Polymorphisms – SNP, InDel, Transposon BMI/IBGP 730 Victor Jin, Ph.D. (Slides from Dr. Kun Huang) Department of Biomedical Informatics Ohio State University.
Give me your DNA and I tell you where you come from - and maybe more! Lausanne, Genopode 21 April 2010 Sven Bergmann University of Lausanne & Swiss Institute.
Single nucleotide polymorphisms and applications Usman Roshan BNFO 601.
Genome Variations & GWAS
Introduction Basic Genetic Mechanisms Eukaryotic Gene Regulation The Human Genome Project Test 1 Genome I - Genes Genome II – Repetitive DNA Genome III.
Genetic Analysis in Human Disease. Learning Objectives Describe the differences between a linkage analysis and an association analysis Identify potentially.
Rare and common variants: twenty arguments G.Gibson Homework 3 Mylène Champs Marine Flechet Mathieu Stifkens 1 Bioinformatics - GBIO K.Van Steen.
Introduction to BST775: Statistical Methods for Genetic Analysis I Course master: Degui Zhi, Ph.D. Assistant professor Section on Statistical Genetics.
Computational research for medical discovery at Boston College Biology Gabor T. Marth Boston College Department of Biology
Doug Brutlag 2011 Genomics & Medicine Doug Brutlag Professor Emeritus of Biochemistry &
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen,
What host factors are at play? Paul de Bakker Division of Genetics, Brigham and Women’s Hospital Broad Institute of MIT and Harvard
Gene Hunting: Linkage and Association
A basic review of genetics Dr. Danny Chan Associate Professor Assistant Dean (Faculty of Medicine) Department of Biochemistry Department of Biochemistry.
C Reactive Protein Coronary Heart Disease Genetics Collaboration BMJ 2011;342:d548.
Genome-Wide Association Study (GWAS)
Jianfeng Xu, M.D., Dr.PH Professor of Public Health and Cancer Biology Director, Program for Genetic and Molecular Epidemiology of Cancer Associate Director,
Lab 13: Association Genetics December 5, Goals Use Mixed Models and General Linear Models to determine genetic associations. Understand the effect.
POLYMORPHISM AND VARIANT ANALYSIS Saurabh Sinha, University of Illinois.
Linear Reduction Method for Tag SNPs Selection Jingwu He Alex Zelikovsky.
A PPROACHING THE G ENOME - G ENETIC M ARKERS, L INKAGE AND A SSOCIATION G ENETICS 202 Jon Bernstein Department of Pediatrics October 8, 2015.
In The Name of GOD Genetic Polymorphism M.Dianatpour MLD,PHD.
1 Data Quality Control in Genome-wide Genetic Analyses Wei V. Chen July 2, 2008.
Familial coronary artery disease Paul Brennan Clinical Director Northern Genetics Service Newcastle Hospitals NHS Foundation Trust North East and North.
Linkage. Announcements Problem set 1 is available for download. Due April 14. class videos are available from a link on the schedule web page, and at.
Linkage. Announcements Problem set 1 is available for download. Due April 14. class videos are available from a link on the schedule web page, and at.
Analyzing DNA using Microarray and Next Generation Sequencing (1) Background SNP Array Basic design Applications: CNV, LOH, GWAS Deep sequencing Alignment.
Schematic of the single variant polymorphism (SNP) genotyping assay.
Analysis of Next Generation Sequence Data BIOST /06/2015.
NCSU Summer Institute of Statistical Genetics, Raleigh 2004: Genome Science Session 3: Genomic Variation.
Increasing Power in Association Studies by using Linkage Disequilibrium Structure and Molecular Function as Prior Information Eleazar Eskin UCLA.
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
Brendan Burke and Kyle Steffen. Important New Tool in Genomic Medicine GWAS is used to estimate disease risk and test SNPs( the most common type of genetic.
Power Calculations for GWAS
SNPs and complex traits: where is the hidden heritability?
Common variation, GWAS & PLINK
upstream vs. ORF binding and gene expression?
Marker heritability Biases, confounding factors, current methods, and best practices Luke Evans, Matthew Keller.
Introduction to bioinformatics lecture 11 SNP by Ms.Shumaila Azam
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS)
Epidemiology 101 Epidemiology is the study of the distribution and determinants of health-related states in populations Study design is a key component.
Beyond GWAS Erik Fransen.
Exercise: Effect of the IL6R gene on IL-6R concentration
Medical genomics BI420 Department of Biology, Boston College
Medical genomics BI420 Department of Biology, Boston College
Association Design Begins with KNOWN polymorphism theoretically expected to be associated with the trait (e.g., DRD2 and schizophrenia). Genotypes.
Presentation transcript:

Genome-Wides Association Studies (GWAS) Veryan Codd

Why? Identify genetic variants that are associated with a common heritable traits / diseases Understand disease mechanism – identify new genes/pathways involved Personalised medicine – Risk analysis – Early intervention – Inform medication choice

Which genetic changes are studied? SNPs – single nucleotide polymorphisms CNVs – copy number variations Indels - insertions / deletions

Two alleles of a SNP – A and a – Individuals AA, Aa or aa – One allele often less commonly found in population – minor allele (AF<0.5) – Eg rs (RS = reference SNP) MAF=0.27 CC = 0.53, CT=0.40, TT=0.07 SNPs

Linkage Tendency of genetic markers to be inherited together during meiosis. A B C

Linkage – r 2 and D’ D’ is a measure of how frequently two markers are co- inherited R 2 additionally takes into account allele frequencies, is a measure of how highly correlated 2 SNPs are Eg 1 – 2 SNPs, A and B, both have MAF ~50% A B A B A B A B A B A B A B A B A B A B D’=1.0, r 2 =1.0 A would be a “proxy” SNP for B and vice verse – one genotype could fully predict the other

Eg2 2 SNPs, A and B, A has high MAF, B has low MAF A A B A A A A A A A A A A A A A A A A A A D’=1.0 as B can completely predict A r 2 is low - A cannot completely predict B

Genotype individuals on an array – UK Biobank Axiom Array - 820,967 SNP and indel – Imputation to predict additional markers based on known linkage relationships Can test ~10million genetic variants New release of 1000Genomes – 78million SNPs Methods

Analysis…..basics Test the hypothesis that allele frequencies differ between groups e.g. cases vs controls Multiple testing – millions of tests being performed Pragmatic threshold of P=5x10 -8 More recently studies looking at FDR (false discovery rate)

“missing heritability” Even with large scale analysis only ~11% of heritability of CAD explained More small effect common variant to be discovered? Structural variations? Rare variants with large effects? – Exome arrays/ sequencing have been disappointing – Family based studies

Mendelian randomisation Association is NOT causation Genotype assigned randomly from parents to offspring at meiosis Genetics not affected by lifestyle etc which could also impact disease risk Can be affected by – Linkage disequilibrium – Genetic heterogeneity – Pleiotropy – Population stratification

Reasons for an association of a biomarker with a disease Association of TL with CAD TL is involved in disease pathogenesis TL shortened by CAD process Another factor affects both TL and CAD Eg Increased inflammation increases risk of CAD and independently increases TL attrition leading to shorter telomeres CAUSATION REVERSE CAUSATION CONFOUNDING

Mendelian randomisation as a means of establishing causality Schunkert H & Samani NJ. NEJM 2008 Confounding factor (Lifestyle) TL CAD Genetic variant Reverse causation Causal