P REDICTION M ODELS U SING G ENOMIC P ROFILING H. Zhang E. Warner D. Zhao.

Slides:



Advertisements
Similar presentations
Regulation of Consumer Tests in California AAAS Meeting June 1-2, 2009 Beatrice OKeefe Acting Chief, Laboratory Field Services California Department of.
Advertisements

Role of pathogen-driven selection in shaping the predisposition to IBD: identification of disease susceptibility alleles Mario (Mago) Clerici, M.D. Chair.
II. Potential Errors In Epidemiologic Studies Random Error Dr. Sherine Shawky.
Review of main points from last week Medical costs escalating largely due to new technology This is an ethical/social problem with major conseq. Many new.
Sample size estimation
Study Designs in Epidemiologic
METHODS FOR HAPLOTYPE RECONSTRUCTION
Genetic Analysis in Human Disease
Multiple Comparisons Measures of LD Jess Paulus, ScD January 29, 2013.
1 Case-Control Study Design Two groups are selected, one of people with the disease (cases), and the other of people with the same general characteristics.
Objectives Cover some of the essential concepts for GWAS that have not yet been covered Hardy-Weinberg equilibrium Meta-analysis SNP Imputation Review.
QTL Mapping R. M. Sundaram.
1 FSTL4 and SEMA5A are associated with alcohol dependence: meta- analysis of two genome-wide association studies Kesheng Wang, PhD Department of Biostatistics.
DATA ANALYSIS Module Code: CA660 Lecture Block 2.
MSc GBE Course: Genes: from sequence to function Genome-wide Association Studies Sven Bergmann Department of Medical Genetics University of Lausanne Rue.
Introduction of Cancer Molecular Epidemiology Zuo-Feng Zhang, MD, PhD University of California Los Angeles.
Genome-Wide Association Studies
Give me your DNA and I tell you where you come from - and maybe more! Lausanne, Genopode 21 April 2010 Sven Bergmann University of Lausanne & Swiss Institute.
Quantitative Genetics
Seminar Assignment HT Seminar assignment Handed out at course start Group work / individual assessment Research based aspects, requires independent.
Introduction to Molecular Epidemiology Jan Dorman, PhD University of Pittsburgh School of Nursing
Standard error of estimate & Confidence interval.
Screening and Early Detection Epidemiological Basis for Disease Control – Fall 2001 Joel L. Weissfeld, M.D. M.P.H.
Robust and powerful sibpair test for rare variant association
Genetic Analysis in Human Disease. Learning Objectives Describe the differences between a linkage analysis and an association analysis Identify potentially.
Rare and common variants: twenty arguments G.Gibson Homework 3 Mylène Champs Marine Flechet Mathieu Stifkens 1 Bioinformatics - GBIO K.Van Steen.
Chuanyu Sun Paul VanRaden National Association of Animal breeders, USA Animal Improvement Programs Laboratory, USA Increasing long term response by selecting.
Study Design. Study Designs Descriptive Studies Record events, observations or activities,documentaries No comparison group or intervention Describe.
Epidemiology The Basics Only… Adapted with permission from a class presentation developed by Dr. Charles Lynch – University of Iowa, Iowa City.
ConceptS and Connections
CHP400: Community Health Program- lI Research Methodology STUDY DESIGNS Observational / Analytical Studies Case Control Studies Present: Disease Past:
Web of Causation; Exposure and Disease Outcomes Thomas Songer, PhD Basic Epidemiology South Asian Cardiovascular Research Methodology Workshop.
Multifactor Dimensionality Reduction Laura Mustavich Introduction to Data Mining Final Project Presentation April 26, 2007.
Genetic-Environmental Interaction: Implications for Osteoporosis Prevention Strategies.
Case(Control)-Free Multi-SNP Combinations in Case-Control Studies Dumitru Brinza and Alexander Zelikovsky Combinatorial Search (CS) for Disease-Association:
Population Genetics: Chapter 3 Epidemiology 217 January 16, 2011.
The Complexities of Data Analysis in Human Genetics Marylyn DeRiggi Ritchie, Ph.D. Center for Human Genetics Research Vanderbilt University Nashville,
What host factors are at play? Paul de Bakker Division of Genetics, Brigham and Women’s Hospital Broad Institute of MIT and Harvard
From Genome-Wide Association Studies to Medicine Florian Schmitzberger - CS 374 – 4/28/2009 Stanford University Biomedical Informatics
Lecture 19: Association Studies II Date: 10/29/02  Finish case-control  TDT  Relative Risk.
Jianfeng Xu, M.D., Dr.PH Professor of Public Health and Cancer Biology Director, Program for Genetic and Molecular Epidemiology of Cancer Associate Director,
Genes in human populations n Population genetics: focus on allele frequencies (the “gene pool” = all the gametes in a big pot!) n Hardy-Weinberg calculations.
The Problem with Individual Risk Beverly Rockhill, Ph.D. Department of Epidemiology University of North Carolina, Chapel Hill.
1 Risk Assessment Tests Marina Kondratovich, Ph.D. OIVD/CDRH/FDA March 9, 2011 Molecular and Clinical Genetics Panel for Direct-to-Consumer (DTC) Genetic.
Risk Prediction of Complex Disease David Evans. Genetic Testing and Personalized Medicine Is this possible also in complex diseases? Predictive testing.
1 B-b B-B B-b b-b Lecture 2 - Segregation Analysis 1/15/04 Biomath 207B / Biostat 237 / HG 207B.
Diagnostic Tests Afshin Ostovar Bushehr University of Medical Sciences Bushehr, /7/20151.
Lecture 24: Quantitative Traits IV Date: 11/14/02  Sources of genetic variation additive dominance epistatic.
Paolo Vineis University of Torino and ISI Foundation, Torino, Italy address: GENE-ENVIRONMENT INTERACTIONS IN CANCER.
Organization of statistical research. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and.
10 May Understanding diagnostic tests Evan Sergeant AusVet Animal Health Services.
Prediction research Introduction and examples A. Cecile J.W. Janssens, PhD Professor of Translational Epidemiology.
BIOSTATISTICS Lecture 2. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and creating methods.
CASE CONTROL STUDY. Learning Objectives Identify the principles of case control design State the advantages and limitations of case control study Calculate.
24.1 Quantitative Characteristics Vary Continuously and Many Are Influenced by Alleles at Multiple Loci The Relationship Between Genotype and Phenotype.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Inferences on human demographic history using computational Population Genetic models Gabor T. Marth Department of Biology Boston College Chestnut Hill,
Power and Meta-Analysis Dr Geraldine M. Clarke Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015 Africa Centre for.
SNPs and complex traits: where is the hidden heritability?
Quantitative genetics
Migrant Studies Migrant Studies: vary environment, keep genetics constant: Evaluate incidence of disorder among ethnically-similar individuals living.
Epidemiology 101 Epidemiology is the study of the distribution and determinants of health-related states in populations Study design is a key component.
Beyond GWAS Erik Fransen.
Chapter 7 Multifactorial Traits
Osteoarthritis year 2012 in review: genetics and genomics
A Critical Appraisal of the Scientific Basis of Commercial Genomic Profiles Used to Assess Health Risks and Personalize Health Interventions  A. Cecile.
Osteoarthritis year 2012 in review: genetics and genomics
Quanhe Yang, W. Dana Flanders, Ramal Moonesinghe, John P. A
Presentation transcript:

P REDICTION M ODELS U SING G ENOMIC P ROFILING H. Zhang E. Warner D. Zhao

GENOMIC PROFILING Testing genes at multiple loci simultaneously Complex diseases-complex causal pathway High number of weak predictors Single genes–of limited predictive value

QUESTIONS OF INTEREST Is testing low-risk genes at multiple loci useful clinically -- discriminative accuracy? Can we predict individual genetic risk from GWAS? Value for assessing susceptibility to common diseases and targeting interventions?

P REDICTIVE TESTING FOR COMPLEX DISEASES USING MULTIPLE GENES : F ACT OR FICTION ? Cecile J. W. Janssens, Yurii S. Aulchenko, Stefano Elefante, Gerard J J. M Borsboom, Ewout W. Steyerberg, Cornelia M van Dujuin Genet Med 2006: 8(7):

METHODS Simulation test: 100,000 subjects up to 400 genes Each gene: two alleles Hardy-Weinberg Equilibrium Disease risk associated with genetic profiles: Bayes’ theorem Multiplicative model No LD between genes All genes predictive of the disease are known

METHODS ( CONTINUED ) Part I: genes had the same risk allele frequencies and the same effect on the disease (same ORs) Part II: genes with varying ORs and allele frequencies

DISCRIMINATIVE ACCURACY AUC (Area Under the ROC Curve) Probability that the test correctly identifies the diseased subject

E XAMPLES OF AUC

FIG 1. DISCRIMINATIVE ACCURACY OF GENETIC PROFILING (CONSTANT OR S )

FIG 2. DISCRIMINATIVE ACCURACY OF GENETIC PROFILING (VARYING OR S )

FIG. 3. RELATIONSHIP BETWEEN HERITABILITY AND DISCRIMINATIVE ACCURACY

CONCLUSIONS Discriminative accuracy depends on Number of genes Frequency of risk alleles Risk associated with the genotypes Heritability (few strong predictors or large number of common susceptibility genes) Level of discriminative accuracy required for clinical application depends on the goal of testing, burden of disease, cost, treatment availability etc.

P REDICTION OF INDIVIDUAL G ENETIC RISK TO DISEASE FROM GENOME - WIDE ASSOCIATION STUDIES Naomi R. Wray, Michael E. Goddard and Peter M Visscher Genome Res :

P URPOSE Research Question – Can we identify high risk genetic profiles consisting of multiple risk alleles with small effects at any given locus? Aims – Investigate the relationship between the RR of genetic loci and the number of loci that contribute to disease risk – Investigate the number of loci underlying complex disease of a given disease prevalence and heritability – Simulate a case control study to investigate the prediction of genetic risk of disease from multiple loci in a genome wise association study (GWAS) – Use SNPs selected from the simulation to see how accurately they predict the risk in a random sample

M ETHODS Repeated simulations Parameters – Disease prevalence: 0.05 or 0.10 – Heritability: 0.1 or 0.2 – Allele frequency distribution: uniform (common disease-common variant) or U- shaped (neutral allele hypothesis) – GWAS 500,000 SNPs Number of disease risk loci: 10, 20, 50, 100, 300, or 10,000 cases and controls

R ESULTS : # OF L OCI AND A VERAGE R ELATIVE R ISK

R ESULTS : R ISK A LLELE MODELS

R ESULTS : SELECTED SNPS

R ESULTS : A CCURACY

A SSUMPTIONS AND L IMITATIONS True causal SNPs were always included in GWAS All genetic variance was attributable to variants of frequency 0.01 to 0.99 No population stratification All genotypes are in Hardy-Weinberg equilibrium No LD between SNPs Did not consider gene-gene or gene-environment interactions

C ONCLUSIONS Prediction of genetic risk is possible, even if there are hundreds of risk variants, each of small effect Genomic profiling may not be appropriate for rare diseases Implementation of these procedures doesn’t require knowledge of causal mechanism

A N EPIDEMIOLOGIC ASSESSMENT OF GENOMIC PROFILING FOR MEASURING SUSCEPTIBILITY TO COMMON DISEASES AND TARGETING INTERVENTIONS Muin J. Khoury, Quanhe Yang, Marta Gwinn, Julian Little, W. Dana Flanders Genet Med 2004:6(1):38-47

P URPOSE Normal epidemiological methods already provide information about important exposure-disease associations that can be used to reduce disease burden What value does genomic profiling/genetic testing to predict susceptibility add to usual epidemiological methods?

T WO ASPECTS OF “ VALUE ” Clinical value – individual level Clinical validity – can genetic testing help predict future disease positive and negatives? Clinical utility – can genetic testing help lower disease risks for people with a “positive” genetic test See the task force on genetic testing and the Secretary's Advisory Committee on Genetic Testing – references in paper Public health value – population level Public health utility – how does reduction of disease burden in population based on genetic profiling compare to population-wide interventions

G ENERAL M ETHODS Model: Risk = Baseline + Gene1 + Gene2 + Gene3 + Modifiable exposure Posited hypothetical but “likely” data by varying the following parameters: Lifetime risk of a disease Number of loci in genetic test Frequency of genotypes Strength of association between these loci and the disease Strength of association between exposure and the disease Calculated value for these hypothetical data by calculating impact of targeted intervention on the exposure

CLINICAL VALIDITY AND UTILITY Technical validity is assumed

CLINICAL VALIDITY AND UTILITY Technical validity is assumed

P UBLIC HEALTH UTILITY Calculate the ratio of PAFt – reduction of disease burden due to targeted intervention – to PAF

P UBLIC HEALTH UTILITY Calculate the ratio of PAFt – reduction of disease burden due to targeted intervention – to PAF

I MPLICATIONS There are other parameters that could be varied: Higher synergy will lead to higher predictive values and population impact Epistasis will lead to higher predictive values Tension between targeted and population interventions Screening may be a good compromise: population- wide intervention of education and awareness + targeted intervention Genetic testing has different added values under different conditions and epidemiological methods can be used to determine the extent of its added value

S UMMARY Genetic tests should involve multiple loci Discriminative accuracy improves with higher heritability Number of loci needed to accurately identify association Function of heritability, prevalence and RR Complicated relationships between accuracy, allele effect sizes and allele frequency

S UMMARY The accuracy of the predictive models in the presence of gene- gene/environment interactions may be overstated Genetic testing must be applied to all subjects and can be resource-intensive