Genetic Meta-Analysis and Mendelian Randomization George Davey Smith MRC Centre for Causal Analyses in Translational Epidemiology, University of Bristol.

Slides:



Advertisements
Similar presentations
Agenda Control in Denmark Gary W. Cox, Mathew D. McCubbins, and Asbjorn Skjaeveland.
Advertisements

The prevalence of use of beta- blockers in secondary prevention of myocardial infarctions in patients hospitalized 1 Institute of Epidemiology and biostatistics,
Understanding heterogeneity in systematic reviews and met-analysis meta-analysis generates a single best estimate of effectmeta-analysis generates a single.
Progress Against Bladder Cancer. Pre-1970 Progress Against Bladder Cancer Pre : Tool offers view inside bladder for first time.
First cases of AIDS identified.
School of SOCIAL AND COMMUNITY MEDICINE University of BRISTOL ALSPAC AND CROSS-COHORT STUDIES Causal Effects of Breastfeeding on Child Health Outcomes.
Critical appraisal of research Sarah Lawson
Through the years… JUNIATA COLLEGE FIELD HOCKEY. SeasonCoachRecord 1973Unknown Jo Reilly Alexa Fultz Alexa Fultz Nancy Harden-Latimore4-4-1.
Nurses Health Study (n=121,700) Health Professionals Follow-up Study (n=52,000) Nurses Health Study II (n=116,000) Investigators: Frank Hu, Frank Speizer,
From Delft into Almere Prof.dr.ir.Taeke M. de Jong.
Prenatal Nutrition and IQ: A causal analysis using a Mendelian randomization approach Sarah Lewis.
A World Leading Supplier of Propellers and Sterngear.
1 Use of Cochrane review results in designing new studies Nicola Cooper Centre for Biostatistics and Genetic Epidemiology, University of Leicester UK
Sexually Transmitted Disease Surveillance 2012 Division of STD Prevention.
Rotary Club of Space Center August 6, 1964 – August 6, 2014 Presidents and District Governors.
Complete Discography – The Soundhouse Tapes.
CHANGING WORLDS The Impact of University Research.
Copyright © 2004 South-Western 35 The Short-Run Tradeoff between Inflation and Unemployment.
Collaborative meta-analysis of randomised trials of antiplatelet therapy for prevention of death, myocardial infarction, and stroke in high risk.
The Aging Population Source: U.S. Census Bureau Percent Growth in U.S. Population, by Age Bracket.
Who is Poor? What is Poverty?
Coronary heart disease (CHD) event rates in secondary prevention and acute coronary syndrome trials A. Kumar, C.P. Cannon. Arch Med Sci 2007;3:S115-S125.
What is Mendelian Randomisation? Frank Dudbridge.
P H Y S I C I A N S ’ A C A D E M Y F O R C A R D I O V A S C U L A R E D U C A T I O N Oral drugs for type 2 diabetes and all cause mortality in General.
Mendelian Randomization: Genes as Instrumental Variables David Evans University of Queensland.
Quantitative Synthesis I Prepared for: The Agency for Healthcare Research and Quality (AHRQ) Training Modules for Systematic Reviews Methods Guide
Systematic Review Module 9: Quantitative Synthesis I Joseph Lau, MD Thomas Trikalinos, MD, PhD Tufts EPC.
Data Analysis in Systematic Reviews
Grant Stephen: Chair of the MBC Life Science Informatics Group & CEO, Tessella Inc: Creating Insight & Understanding from Scientific.
ALCOHOL AND HEALTH Morten Grønbæk National Institute of Public Health Copenhagen, Denmark.
Impact of early surgery vs conventional treatment for infective endocarditis on mortality and embolic events: data from EASE trial Prospective RCT ( );
WOSCOPS: West Of Scotland Coronary Prevention Study Purpose To determine whether pravastatin reduces combined incidence of nonfatal MI and death due to.
Date of download: 5/27/2016 Copyright © The American College of Cardiology. All rights reserved. From: Causal Assessment of Serum Urate Levels in Cardiometabolic.
THE USE OF GENETIC VARIANTS AS TOOLS FOR EPIDEMIOLOGISTS George Davey Smith MRC Integrative Epidemiology Unit University of Bristol.
Date of download: 6/2/2016 Copyright © 2016 American Medical Association. All rights reserved. From: Long-term Use of Inhaled Corticosteroids and the Risk.
Date of download: 9/18/2016 Copyright © The American College of Cardiology. All rights reserved. From: Nonfasting Glucose, Ischemic Heart Disease, and.
Mendelian Randomization
Arch Intern Med. 2004;164(13): doi: /archinte Figure Legend:
Flow of Individuals Through the Vitamin E and Vitamin C Components of the Physicians’ Health Study II Howard D. Sesso et al. JAMA 2008;300:
Benefits and Pitfalls of Systematic Reviews and Meta-Analyses
LEADER trial: Primary Outcome
Benefits of minimally invasive aortic valve replacement (MIS-AVR)
A Meta-Analysis of Randomized Controlled Trials and Prospective Cohort Studies of Eicosapentaenoic and Docosahexaenoic Long-Chain Omega-3 Fatty Acids.
Basic statistical methods
Heterogeneity and sources of bias
A Meta-Analysis of Randomized Controlled Trials and Prospective Cohort Studies of Eicosapentaenoic and Docosahexaenoic Long-Chain Omega-3 Fatty Acids.
Association between risk-of-bias assessments and results of randomized trials in Cochrane reviews: the ROBES study Jelena Savović1, Becky Turner2, David.
Figure 1 Mendelian randomization study
مقدمه ای بر مهارت های زندگی
Outcome Relative risk with beta blockers 95% CI Stroke –1.30
Clinical Outcomes with β-Blockers for Myocardial Infarction: A Meta-analysis of Randomized Trials  Sripal Bangalore, MD, MHA, Harikrishna Makani, MD,
Elliott P, et al. JAMA 2009;302:37-48.
Factors independently associated with odds of fractures
David J.A. Jenkins et al. JACC 2018;71:
Mendelian Randomization (Using genes to tell us about the environment)
Berger JS, et al. JAMA 2009;301:
Flow diagram for exclusions of trials identified RCT indicates randomized controlled trial Hulten E, et al. Arch Intern Med 2006;166:
(A) Meta-analysis of repeat revascularisation in randomised trials.
Study Flow Diagram Thompson A, et al. JAMA 2008;299:
Alpha-2 adrenergic agonists to prevent perioperative cardiovascular complications:  Duminda N Wijeysundera, MD, Jennifer S Naik, MD, W Scott Beattie, MD,
Adjunctive Therapies in the Treatment of Acute Coronary Syndromes
Forest plots of randomised controlled trials and propensity score matched studies examining outcomes between patients taking proton pump inhibitor (PPIs)
Flow chart of search strategy
Lifetime heart failure risk*
Mendelian Randomization (Using genes to tell us about the environment)
PRISMA flow diagram for selection of RCTs from leading sports medicine journals for the publication years 2005 and 2015. PRISMA, Preferred Reporting Items.
Risk of cardiovascular disease mortality by cardiorespiratory fitness and body mass index categories, 2316 men with type 2 diabetes at baseline, 179 deaths.
Alcohol, Other Drugs, and Health: Current Evidence May–June 2019
Mendelian Randomization: Genes as Instrumental Variables
Presentation transcript:

Genetic Meta-Analysis and Mendelian Randomization George Davey Smith MRC Centre for Causal Analyses in Translational Epidemiology, University of Bristol

RCT vs Observational Meta- Analysis: fundamental difference in assumptions In meta-analysis of observational studies confounding, residual confounding and bias: –May introduce heterogeneity –May lead to misleading (albeit very precise) estimates

Relative risk (95% confidence interval) Trial (Year) Barber (1967) Reynolds (1972) Wilhelmsson (1974) Ahlmark (1974) Multicentre International (1975) Yusuf (1979) Andersen (1979) Rehnqvist (1980) Baber (1980) Wilcox Atenolol (1980) Wilcox Propanolol (1980) Hjalmarson (1981) Norwegian Multicentre (1981) Hansteen (1982) Julian (1982) BHAT (1982) Taylor (1982) Manger Cats (1983) Rehnqvist (1983) Australian-Swedish (1983) Mazur (1984) EIS (1984) Salathia (1985) Roque (1987) LIT 91987) Kaul (1988) Boissel (1990) Schwartz low risk (1992) Schwartz high risk (1992) SSSD (1993) Darasz (1995) Basu (1997) Aronow (1997) Overall (95% CI) 0.80 ( ) Mortality results from 33 trials of beta-blockers in secondary prevention after myocardial infarction Adapted from Freemantle et al BMJ 1999

Results from 29 studies examining the association between intact foreskin and the risk of HIV infection in men Adapted from Van Howe Int J STD AIDS 1999

Vitamin E supplement use and risk of Coronary Heart Disease Stampfer et al NEJM 1993; 328: 144-9; Rimm et al NEJM 1993; 328: ; Eidelman et al Arch Intern Med 2004; 164:

Genetic meta-analysis, while of observational data, may be analogous to RCT meta-analysis NOT conventional observational meta-analysis

Clustered environments and randomised genes (93 phenotypes, 23 SNPs) Phenotype / phenotype 4278 pairwise associations Phenotype / genotype 2139 pairwise combinations Genotype / genotype 253 pairwise combinations 43% significant at p< significant at p<0.01 vs 21 expected 4 / 253 significant at p<0.01 vs 3 expected Davey Smith et al. PLoS Medicine 2007 in press

WTCCC: blood donors versus 1958 birth cohort controls

A leading epidemiologist speaks … Forget what you learnt at the London School of Hygiene and Tropical Medicine …. just get as many cases as possible and a bunch of controls from wherever you can.. Paul McKeigue, Nov 2002

Or the polite version … This approach allows geneticists to focus on collecting large numbers of cases and controls at low cost, without the strict population-based sampling protocols that are required to minimize selection bias in case-control studies of environmental exposures Am J Human Genetics 2003;72:

If not confounding or selection bias, why have genetic association studies such a poor history of replication?

Are genetic association studies replicable? Hirschhorn et al reviewed 166 putative associations for which there were 3 or more published studies and found that only 6 had been consistently replicated (defined as achieving statistically significant findings in 75% or more of published studies) Hirschhorn JN et al. Genetics in Medicine 2002;4:45-61

Reasons for inconsistent genotype – disease associations True variation Variation of allelic association between subpopulations Effect modification by other genetic or environmental factors that vary between populations Spurious variation Misclassification of phenotype Confounding by population structure Lack of power Chance Publication bias Colhoun et al, Lancet 2003;361:865-72

True variation in genotype and health outcome between populations Disease-causing allele is in LD with a different allele at the marker locus in different groups Allelic heterogeneity (different variants within the same gene) between ethnic groups More likely when disease-causing variant is rare or has been subject to selection pressure The association is modified by other genetic or environmental factors that vary between the groups studied Effect modification by genes unlikely to account for failure to replicate studies in similar populations. Modification by environmental factors more likely, especially when absolute risk of disease varies

Differential misclassification of genotypesAvoided by appropriate laboratory procedures Differential misclassification of outcome: possible if genotype is known when outcome is classified Unlikely, because outcome is usually confirmed in advance of genotyping Biases vary between studies

Population is divided into strata that vary by disease risk and by allele frequencies at the marker locus Unlikely to be a serious problem in most studies: when confounding is a problem, it can be controlled in study design by restriction or use of family-based controls, or in the analysis by quantifying and controlling for substructure Confounding by population substructure

Case mix heterogeneity in an apparently homogenous outcome between populations studied: for instance in a study of stroke, mix of haemorrhagic and thrombotic subtypes may vary between populations Unlikely to be an explanation for failure to replicate studies in similar populations with similar case sampling strategies Case-mix heterogeneity

Failure to consider that the initial effect size reported is an inflation of the true effect size Replication studies should be powered to detect effect sizes that are smaller than the initial effect size reported, especially when the initial study had low power Absence of power leading to false-negative results and failure to replicate

The Beavis effect If the location of a variant and its phenotypic effect size are estimated from the same data sets, the effect size will be over-estimated, in many cases substantially. Statistical significance and the estimated magnitude of the parameter are highly correlated. H Göring et al. Am J Hum Genetics 2001;69:

Multiple testing and publication bias: multiple loci are assessed in each study, many statistical tests are done, and multiple studies are undertaken but only positive results are reported Perhaps the most likely reason for failure to replicate? False positive results by chance in initial positive studies

What is being associated in genetic association studies? Estimates of 15M SNPs in human genome (rare allele frequency >1% in at least one population) Large number of outcomes (diseases and subcategories of particular disease outcomes) Large number of potential subgroups Multiple possible genetic contrasts

Polymorphism really is associated with disease Polymorphism is not associated with disease Total Result of experiment Association declared to exist Association not declared to exist Total What percentage of associations that are studied actually exist? … 1 in 10? (at 80% power, 5% significance level) Oakes 1986; Davey Smith 1998; Sterne & Davey Smith 2001

Power of study (% of time we reject null hypothesis if it is false) P=0.05P=0.01P= Percentage of significant results that are false positives if 10% of studied associations actually exist Sterne & Davey Smith BMJ 2001;322:

Power of study (% of time we reject null hypothesis if it is false) P=0.05P=0.01P= Percentage of significant results that are false positives if 1% of studied associations actually exist Sterne & Davey Smith. BMJ 2001;322:

P values often misinterpreted in both genetic and conventional epidemiology Low prior probability major issue in genetic epidemiology; meaningless (but real) associations a major issue in conventional epidemiology

Why has replication proved to be so difficult? LOW STATISTICAL POWER A consistent feature of almost all analyses Fundamental to many of the explanations or the approach needed to correct for them If we need 5,000 cases to test for a given aetiological effect with a power of 80%, and with a critical p-value of , how much power would there be for a study with 500 cases?

Why has replication proved to be so difficult? LOW STATISTICAL POWER!! A key feature of almost all proffered explanations, and/or of the approach needed to correct for them If we need 5,000 cases to test for a given aetiological effect with a power of 80%, and with a critical p-value of , how much power would there be for a study with 500 cases? 0.008

Deducing true numerical ratios requires the greatest possible number of individual values; and the greater the number of these the more effectively will mere chance be eliminated. Gregor Mendel 1865/6

Association of GNB3 and Hypertension Bagos et al, J Hypertens March Studies Cases = 14,094 Controls = 17,760 Total = 21,654

¿ | α β γ | A B C | a b c | ?

Are genetic associations studies replicable: take two? Joel Hirschhorns group selected 25 of the 166 genetic associations that they had studied and performed formal meta-analysis, claiming that 8 of these (one third) were robust. One third claim widely welcomed! Lohmueller KE et al. Nature Genetics 2003;33:

Replicable Studies ABCC8, type 2 diabetes2.28 ( ) COL1A1, fracture1.59 ( ) CTLA4, type 1 diabetes1.27 ( ) DRD3, schizophrenia1.12 ( ) GSTM1, head/neck cancer1.20 ( ) HTR2A, schizophrenia1.07 ( ) PPARG, type 2 diabetes1.22 ( ) SLC2A1, type 2 diabetes1.76 ( )

Are genetic associations studies replicable: take two? Low hanging fruit and a best-case scenario. Effect size estimates not so widely welcomed..

Science, June 1, 2007 All Studies Combined 14,585 cases 17,968 controls TCF7

Nature, June 7, 2007 Distribution of ORs for 70 Common Disease Variants Odds Ratio %

for exposures with small effect sizes it is very difficult to exclude confounding and bias in conventional epidemiology, and level of statistical significance does not help statistical deviation from the null more important in genetic epidemiology

Mendel on Mendelian randomization the behaviour of each pair of differentiating characteristics in hybrid union is independent of the other differences between the two original plants, and, further, the hybrid produces just so many kinds of egg and pollen cells as there are possible constant combination forms (Sometimes called Mendels second law – the law of independent assortment) Gregor Mendel, Mendel in 1862

Mendelian randomization Genotypes can proxy for some modifiable environmental factors, and there should be no confounding of genotype by behavioural, socioeconomic or physiological factors (excepting those influenced by alleles at closely proximate loci or due to population stratification), no bias due to reverse causation, and lifetime exposure patterns can be captured

Mendelian randomisation and RCTs RANDOMISATION METHOD RANDOMISED CONTROLLED TRIAL CONFOUNDERS EQUAL BETWEEN GROUPS MENDELIAN RANDOMISATION RANDOM SEGREGATION OF ALLELES CONFOUNDERS EQUAL BETWEEN GROUPS EXPOSED: FUNCTIONAL ALLELLES EXPOSED: INTERVENTION CONTROL: NULL ALLELLES CONTROL: NO INTERVENTION OUTCOMES COMPARED BETWEEN GROUPS