Presentation is loading. Please wait.

Presentation is loading. Please wait.

Causal inference: emulating a target trial when a randomized trial is not available Miguel Hernán DEPARTMENTS OF EPIDEMIOLOGY AND BIOSTATISTICS.

Similar presentations


Presentation on theme: "Causal inference: emulating a target trial when a randomized trial is not available Miguel Hernán DEPARTMENTS OF EPIDEMIOLOGY AND BIOSTATISTICS."— Presentation transcript:

1 Causal inference: emulating a target trial when a randomized trial is not available Miguel Hernán DEPARTMENTS OF EPIDEMIOLOGY AND BIOSTATISTICS

2 The situation: We need to make decisions NOW Treat with A or with B? Treat now or later? When to switch to C?  A relevant randomized trial would, in principle, answer each comparative effectiveness and safety question  Interference/scaling up issues aside Hernán - Target trial2

3 But we rarely have randomized trials expensive, untimely, unethical, impractical  And deferring decisions is not an option no decision is a decision: “Keep status quo”  Question: What do we do? Hernán - Target trial3

4 Answer: We analyze observational data (pre-existing or collected for research)  Epidemiologic studies  Electronic medical records  Administrative claims databases  National registers  Disease registries  Other Hernán - Target trial4

5 We analyze observational data but only because we cannot conduct a randomized trial  Observational studies are not our preferred choice For each observational study, we can imagine a hypothetical randomized trial that we would prefer to conduct  If only it were possible Hernán - Target trial5

6 The Target Trial  An analysis of observational data (e.g., large health care database) can be viewed as an attempt to emulate a hypothetical pragmatic randomized trial That would answer our question  If the observational study succeeds at emulating the target trial, both studies would yield identical effect estimates except for random variability Hernán - Target trial6

7 Procedure to answer clinical/policy questions  Step #1 Describe the protocol of the target trial  Step #2 Option A: Conduct the target trial Option B  Use observational data to explicitly emulate the target trial  Apply appropriate causal inference analytics to estimate the effects of interest Hernán - Target trial7

8 Key elements of the protocol of the target trial  Eligibility criteria  Treatment strategies randomly assigned at start of follow-up  Randomized assignment  Start/End of follow-up  Outcomes  Causal contrast(s) of interest  Analysis plan Hernán - Target trial8

9 The observational study needs to emulate  Eligibility criteria  Treatment strategies randomly assigned at start of follow-up  Randomized assignment  Start/End of follow-up  Outcomes  Causal contrast(s) of interest  Analysis plan Hernán - Target trial9

10 Example  Suppose we use observational data a large health care claims database  to emulate a target trial of hormone therapy and heart disease  First we need to outline the protocol of the target trial Hernán - Target trial10

11 Hernán - Target trial11 Target trial: Hormone therapy and coronary heart disease Protocol summary Eligibility criteria Postmenopausal women within 5 years of menopause between the years 2005 and 2010, and with no history of cancer and no use of hormone therapy in the last 2 years. Treatment strategies 1.Initiate estrogen plus progestin hormone therapy at baseline and remain on it during the follow-up, unless deep vein thrombosis, pulmonary embolism, myocardial infarction, or cancer are diagnosed 2.Refrain from taking hormone therapy during the follow-up Assignment procedures Participants will be randomly assigned to either strategy at baseline, and will be aware of the strategy they have been assigned to. Follow-up period Starts at randomization and ends at diagnosis of coronary heart disease, death, loss to follow-up, or 5 years after baseline, whichever occurs earlier. OutcomeCoronary heart disease diagnosed by a cardiologist Causal contrastsIntention-to-treat effect, per-protocol effect Analysis planIntention-to-treat analysis, non-naïve per-protocol analysis (to be discussed)

12 Target trial emulation is not straightforward: data experts only  When observational data were not collected for research purposes e.g., “coronary heart disease” may be recorded when a woman was diagnosed with it, or when her physician suspected it and ordered a diagnostic test  Must consult with knowledgeable data users Time-varying clinical workflows, idiosyncratic coding practices, software versions… Hernán - Target trial12

13 Besides expert knowledge of the data Validation studies to quantify data accuracy Internal consistency checks to detect problems Cross-datasets comparisons to better understand coding differences  Let’s say we have consulted with experts and done the above before attempting to emulate the target trial Hernán - Target trial13

14 Eligibility criteria Emulation  Apply same criteria as the target trial to women who at baseline have been included in the database for at least 2 years  Potential problems Insufficient data to characterize individuals eligible for the target trial Example: If target trial required baseline screening to exclude prevalent cases, emulation may be hard if database records the performance of a test (for billing purposes) but not its findings Hernán - Target trial14

15 Treatment strategies Emulation  Eligible individuals assigned to the strategy consistent with their baseline data Strategy 1: women who start estrogen plus progestin therapy Strategy 2: women who do not start hormone therapy Excluded: women who start a different hormone therapy  Target trial is typically a pragmatic trial observational data cannot be used to emulate trials with tight monitoring and enforcement of adherence to the study protocol cannot emulate a placebo-controlled trial  at most a trial with a “usual care” group Hernán - Target trial15

16 Assignment procedures Emulation of blinding  Generally impossible individuals in the dataset, and their health care workers, are usually aware of the treatment they receive  Observational data can only emulate target trials without blind assignment standard for pragmatic trials not a limitation if the goal is comparing real-world treatment strategies Hernán - Target trial16

17 Randomized assignment Emulation of randomization  Generally requires adjustment for all confounding factors via matching, stratification or regression, standardization or inverse probability (IP) weighting, g-estimation…  If insufficient information on baseline confounders or we fail to identify them, then successful emulation of the target trial’s random assignment is not possible Confounding bias Hernán - Target trial17

18 Start of follow-up When is time zero (baseline)?  In true trials the time of eligibility and randomization  In emulated trials the time of eligibility and treatment assignment  Failure to assign time zero correctly may lead to misunderstandings e.g., immortal time bias, others to be discussed Hernán - Target trial18

19 Outcome Emulation Hernán - Target trial19  Use the database to identify women with a diagnosis of coronary heart disease during the follow-up  Potential problem: observational data cannot be generally used to emulate a target trial with systematic and blind outcome ascertainment Except if outcome ascertainment cannot be affected by treatment history, e.g., if the outcome is mortality independently ascertained from a death registry

20 The observational analysis needs to emulate  Eligibility criteria  Treatment strategies randomly assigned at start of follow-up  Randomized assignment  Start/End of follow-up  Outcomes  Causal contrast(s) of interest  Analysis plan Hernán - Target trial20

21 Causal contrast Emulation  Truly randomized trials can be used to estimate different types of causal effects  We first define these effects in randomized trials, then their analogs in observational studies Hernán - Target trial21

22 Causal contrasts 1. Intention-to-treat effect  The effect of being assigned to a strategy, regardless of strategy received  In our target trial, contrast of the outcome distribution under  Assignment to initiation of estrogen plus progestin therapy at baseline vs.  Assignment to no hormone therapy at baseline Hernán - Target trial22

23 Causal contrasts 1. Intention-to-treat effect  In some trials, assignment to and initiation of the treatment strategies occur simultaneously ITT effect is effect of initiation of the treatment strategies  Effect magnitude depends on treatment decisions made after baseline discontinuation or initiation of the treatments of interest, use of concomitant therapies…  Two trials with the same protocol in similar settings may have different ITT effect estimates with neither of them being biased Hernán - Target trial23

24 Causal contrasts 2. Per-protocol effect  The effect of receiving the treatment strategies specified in the study protocol  In our target trial, contrast of the outcome distribution under  receiving estrogen plus progestin therapy continuously (unless toxicity or contraindications arise) vs.  receiving no hormone therapy between baseline and study end Hernán - Target trial24

25 Causal contrasts 2. Per-protocol effect  Often the implicit target of inference When investigators say that the ITT effect estimate is biased, they mean biased for the per-protocol effect  Cannot be biased by treatment changes that are consistent with the protocol If protocol lets physicians make their own treatment decisions when toxicity or contraindications arise, then treatment discontinuation because of toxicity is not a deviation from protocol and should not be adjusted for Hernán - Target trial25

26 Causal contrasts 3. Other  The effect of receiving strategies other than the ones specified in the study protocol The per-protocol effect in an alternative trial  In our target trial, contrast of the outcome distribution under  receiving statin and estrogen plus progestin therapy continuously (unless toxicity or contraindications arise) vs.  receiving neither statin nor hormone therapy between baseline and study end Hernán - Target trial26

27 Observational analog Intention-to-treat effect  Contrast of the outcome distribution under  Initiation of estrogen plus progestin hormone therapy vs.  No initiation of hormone therapy at time zero  This contrast preserves the key feature of the intention-to-treat analysis: groups are defined solely by baseline information When using prescription data, analog of ITT effect in a target trial When using dispensing data, analog of ITT effect in a target trial in which everybody initiates their assigned strategy Hernán - Target trial27

28 Observational analog Per-protocol effect  Defined identically as that for the target trial  Contrast of the outcome distribution under  receiving estrogen plus progestin therapy continuously (unless toxicity or contraindications arise) vs.  receiving no hormone therapy between time zero and study end Hernán - Target trial28

29 The observational analysis needs to emulate  Eligibility criteria  Treatment strategies randomly assigned at start of follow-up  Start/End of follow-up  Outcomes  Causal contrast(s) of interest  Analysis plan Hernán - Target trial29

30 Analysis plan  Identical for true and emulated trials Except that no adjustment for baseline confounding is expected in intention-to-treat analysis of true trials  Both true and emulated trials require adjustment for post-baseline confounding and selection bias Possibly using Robins’s g-methods Which means longitudinal data on treatment, confounders, and outcomes are required Hernán - Target trial30

31 The target trial will be a compromise between the ideal trial we would really like to conduct and the trial we may reasonably emulate using the available data  The drafting of the protocol of the target trial is typically an iterative process That requires detailed knowledge of the database Hernán - Target trial31

32 Examples of trial emulation using Big Data 1.Classic cohort study Nurses’ Health Study Postmenopausal hormone therapy and coronary heart disease 2.Electronic medical records - THIN Statins and coronary heart disease Static strategies  treat vs.no treat 3.Claims database - USRDS Medicare (not today) Epoetin and mortality Static and dynamic strategies  intervention depends on response to previous intervention Hernán - Target trial32

33 EXAMPLE #1 Hormone therapy and heart disease  Question What is the intention-to-treat effect of hormone therapy on the risk of coronary heart disease in postmenopausal women? Hernán - Target trial33

34 Hernán - Target trial34 Answers (shocking discrepancy)  Observational studies >30% lower risk in current users compared with never users  e.g., HR 0.68 in Nurses Health Study Grodstein et al. J Women’s Health 2006  Randomized trial >20% higher risk in initiators compared with noninitiators  HR 1.24 in Women’s Health Initiative Manson et al. NEJM 2003

35 Hernán - Target trial35 The WHI randomized trial Manson et al, NEJM 2003  Double-blind  Placebo-controlled  Large >16,000 U.S. women aged 50-79 yrs  Randomly assigned to estrogen plus progestin therapy or placebo  Women followed approximately every year like in many large observational studies No intervention after baseline

36 Hernán - Target trial36 WHI: Effect estimates Intention-to-treat hazard ratio (95% CI) of CHD  Overall 1.23 (0.99, 1.53)  Years of follow-up 0-2 1.51 (1.06, 2.14) >2-5 1.31 (0.93, 1.83) >5 0.67 (0.41, 1.09)  Years since menopause <10 0.89 (0.54, 1.44) 10-20 1.24 (0.86, 1.80) >20 1.65 (1.14, 2.40)

37 Why did observational studies get it “wrong”?  Popular theory: residual confounding insufficient adjustment for lifestyle and socioeconomic indicators Corollary: causal inference from observational data is a hopeless undertaking  An alternative theory: Observational and randomized studies asked different questions Hernán - Target trial37

38 Randomized trial estimated the intention-to-treat effect  What is the CHD risk in women who initiate hormone therapy compared with women who do not?  Design and analysis: Women randomly assigned to initiation of hormone therapy or placebo Analytic approach  Compare risk between incident users and nonusers of hormone therapy Hernán - Target trial38

39 Observational studies did not estimate intention-to-treat effect  What is the CHD risk in women who are currently taking hormone therapy compared with women who are not?  Design and analysis: Women are asked about therapy use Analytic approach  Compare risk between prevalent users and nonusers of hormone therapy (current users vs. never users) Hernán - Target trial39

40 “Current vs. never users” contrast is not clinically relevant  Consider a woman wondering whether to start hormone therapy The current vs. never contrast does not provide the information she needs  Consider a woman wondering whether to stop hormone therapy The current vs. never contrast does not provide the information she needs Hernán - Target trial40

41 Hernán - Target trial41 What if we re-analyze the observational study…  … to compare the risk in incident users vs. nonusers?  That is, what if we use the observational data to answer same question as randomized trial? estimate the observational analog of the intention-to- treat effect  Hernán et al. Biometrics 2005  Hernán et al. Epidemiology 2008

42 Hernán - Target trial42 Effect estimates (ITT hazard ratios) Randomized Observational Women’s Health Initiative Nurses’ Health Study  Overall 1.23 (0.99, 1.53) 1.05 (0.82, 1.34)  Years of follow-up 0-2 1.51 (1.06, 2.14) 1.43 (0.92, 2.23) >2 1.07 (0.81, 1.41) 0.91 (0.72, 1.16)  Years since menopause <10 0.89 (0.54, 1.44) 0.88 (0.63, 1.21) 10-20 1.24 (0.86, 1.80) 1.13 (0.85, 1.49) >20 1.65 (1.14, 2.40) --

43 When same question is asked  No shocking observational-randomized discrepancies for ITT estimates though wide CIs in both studies  What about the popular hypothesis? Any residual confounding? Probably, but insufficient to explain the original discrepancy Hernán - Target trial43

44 Aside: Analysis can/should be extended in two ways  Causal contrast Estimate per-protocol effect  Rather than intention-to-treat effect only  Effect measure Estimate survival (or cumulative risk) curves  Rather than hazard ratios only Hernán - Target trial44

45 ITT effect is problematic Hernán, Hernández-Díaz. Clinical Trials 2012  Depends on adherence patterns Substantial non-adherence in both randomized trial and observational study  Inappropriate for safety outcomes  Not patient-centered  We also estimated per-protocol effect via IP weighting (more later) Again no randomized-observational discrepancies  Toh et al. Ann Intern Med 2010 Hernán - Target trial45

46 Hazard ratios are problematic Hernán. Epidemiology 2010  Overall hazard ratio is a weighted average of the time-varying hazard ratios Effect size depends on follow-up length  Hazard ratio may be vary over time because of built-in selection bias Hernán - Target trial46

47 End of aside Hernán - Target trial47

48 Conclusion: CER based on epidemiologic studies is possible  If high-quality observational data on treatment, outcome, and confounders are available e.g., the Nurses Health Study  But most observational CER relies on large databases (big, pre-existing data) Health claims, electronic medical records  Can emulation of a target trial work in that setting? See next case studies Hernán - Target trial48

49 EXAMPLE #2 Statins and coronary heart disease  Question What is the effect of statin therapy on the risk of coronary heart disease?  Extreme example of confounding Hernán - Target trial49

50 Hernán - Target trial50 Target trial: Statin therapy and coronary heart disease Protocol summary Eligibility criteria Individuals aged 55–84 in the years 2000-2006 with no prior history of CHD, stroke, peripheral vascular disease, heart failure, cancer, schizophrenia or dementia, no symptoms of subclinical CHD, and no use of statin therapy in the last 2 years. Treatment strategies 1.Initiate statin therapy at baseline and remain on it during the follow-up, unless contraindications arise 2.Refrain from taking statin therapy during the follow-up Assignment procedures Participants will be randomly assigned to either strategy at baseline, and will be aware of the strategy they have been assigned to. Follow-up period Starts at randomization and ends at diagnosis of coronary heart disease, death, loss to follow-up, or January 2007, whichever occurs earlier. OutcomeCoronary heart disease diagnosed by a cardiologist Causal contrastsIntention-to-treat effect, per-protocol effect Analysis planIntention-to-treat analysis, non-naïve per-protocol analysis

51 Observational data The Health Improvement Network  THIN is a database of electronic medical records 6.2 million individuals from 350 general practices in the UK (2009)  For each individual demographic and socioeconomic characteristics symptoms, signs and diagnoses, referrals, laboratory test results some lifestyle information Vital status and cause of death data Hernán - Target trial51

52 Target trial emulation  Use observational data from THIN to emulate the components of the target trial  Eligible individuals classified into Strategy 1 if they initiated statin treatment Strategy 2 if they did not initiate statin treatment during the baseline month  Baseline month January 2000 3178 individuals met all the eligibility criteria 18 were initiators 1 initiator developed CHD Hernán - Target trial52

53 Sequence of target trials Emulation  Emulate a target trial starting each calendar month between January 2000 and November 2006 83 target trials with a 1-month enrollment period  For each trial Follow-up starts at the trial-specific baseline and ends at diagnosis of CHD, death, lost to follow-up, or January 2007 Eligibility criteria applied at each baseline Hernán - Target trial53

54 Sequence of target trials Emulation  74 806 individuals eligible for at least 1 trial  On average each eligible individual participated in the emulation of 11 trials many non-initiators in the January 2000 trial still met all eligibility criteria in February All 18 initiators in January 2000 were ineligible in February 2000 because they received treatment during the washout period for the February 2000 trial, and so on  Danaei et al. Statistical Methods in Medical Research 2013 Hernán - Target trial54

55 CONSORT flowchart of emulated trials Hernán - Target trial55

56 Adherence to treatment Probability of continuing initial treatment Months of follow-up ___ Non-initiators ___ Initiators 56Hernán - Target trial

57 Intention-to-treat effect Emulation  Compare CHD incidence in initiators vs. no- initiators at baseline of each emulated trial Regardless of their subsequent treatment  Pool data across emulated trials to obtain a more precise effect estimate Robust variance because of within-subject correlation Hernán - Target trial57

58 Hazard ratio (95% CI) of CHD THIN trials 2000-2006 Intention-to-treat analysisPer-protocol analysis Unique cases635488 Unique persons74,806 Cases6,3354,849 Person-trials844,800 Adjusted for age and sex1.29 (1.06, 1.56)1.54 (1.09, 2.18) Adjusted for all covariates0.89 (0.73, 1.09)0.84 (0.54, 1.30) Hernán - Target trial58

59 What if we had compared prevalent (not incident) users vs. nonusers?  Current users HR: 1.42 (1.16, 1.73)  Persistent (1 yr) current users HR: 1.05  Persistent (2 yrs) current users HR: 0.77 (0.51, 1.18)  We can get any result we want by changing the definition of current user! Confounding-selection bias tradeoff Hernán - Target trial59

60 Mortality hazard ratio for statins in CHD secondary prevention studies  RCTs: 0.84 (0.77, 0.91)  Observational studies Incident users: 0.77 (0.65,0.91) Prevalent-incident mix: 0.70 (0.64, 0.78) Prevalent users: 0.54 (0.45, 0.66)  Danaei et al. Am J Epidemiol 2012 Hernán - Target trial60

61 Other static comparisons: same analytic approach  Head-to-head comparisons: Example: Lipophilic statins (atorvastatin, simvastatin) vs other statins Danaei et al. Diabetes Care 2013  Joint interventions Example: statins plus antihypertensives vs standard care Danaei et al. J Clin Epidemiol 2016 (in press) Hernán - Target trial61

62 Advantages of the target trial approach (I)  Provides ready access to the application of formal counterfactual theory and concepts to Big Data without the need for technical jargon,  Organizing principle for causal inference methods which implicitly rely on counterfactual reasoning e.g., new user design, active comparators, outcome controls Hernán - Target trial62

63 Advantages of the target trial approach (I)  Provides ready access to the application of formal counterfactual theory and concepts to Big Data without the need for technical jargon,  Organizing principle for causal inference methods which implicitly rely on counterfactual reasoning e.g., new user design, active comparators, outcome controls Hernán - Target trial63

64 Advantages of the target trial approach (II)  Facilitates the comparison of complex strategies that are sustained over time and may depend on a patient’s evolving characteristics Dynamic treatment strategies Not “treat vs no treat” but rather “when to treat, when to switch, when to monitor” depending on time-varying factors Hernán - Target trial64

65 Advantages of the target trial approach (III)  Establishes a link between methods for the analysis and reporting of randomized trials and Big Data analytics Observational studies analyzed like randomized trials, and vice versa  Provides a scaffolding to organize discussions about which data are required/missing Hernán - Target trial65

66 Advantages of the target trial approach (IV)  Naturally leads to analytic approaches that prevent apparent paradoxes and common biases Selection bias related to prevalent users Immortal time bias Birth weight paradox, obesity paradox Etc. Hernán - Target trial66

67 Advantages of the target trial approach (V)  Facilitates a systematic methodologic evaluation of observational studies  which components of the target trial we weren’t able to mimic approximately?  which components of the target trial would be problematic even if we were able to conduct a truly randomized trial? An approach adopted by the Cochrane Collaboration Risk of Bias Tool for Nonrandomised Studies and the IOM Report on the Safety of Approved Drugs Hernán - Target trial67

68 Advantages of the target trial approach (VI)  Helps understand why estimates differ across studies assess the sensitivity of estimates to different design choices for the target trial focus research efforts on “sensitive” choices Hernán - Target trial68

69 Advantages of the target trial approach (last)  If we can influence how data are recorded the target trial approach helps record them  If we are using data as they exist the target trial approach guides the validation studies and the development and evolution of the Data Model  The target trial approach allows you to systematically articulate the tradeoffs that you are willing to accept regarding eligibility criteria, interventions confounders, outcomes Hernán - Target trial69

70 A common misinterpretation  You are saying that observational studies are as good as RCTs? “This is a cohort study that tries to turn itself into a clinical trial. This involves a series of assumptions and manoeuvres which lack credibility.”  Anonymous JAMA reviewer, April 2014  No, the point is not that observational studies can turn themselves into randomized experiments They can’t Hernán - Target trial70

71 The point is that we can do better  by using observational data to explicitly emulate randomized trials  The limitations of observational studies (e.g., confounding, mismeasurement) remain, but we do not compound them with additional problems Hernán - Target trial71

72 Remember  Observational studies are what we do when we cannot conduct a randomized trial In the absence of practical and ethical constraints, sane people will always prefer a randomized trial  No alternative to observational studies So we better keep improving them  because people will keep using (Big and Small) observational data to guide their decisions Hernán - Target trial72


Download ppt "Causal inference: emulating a target trial when a randomized trial is not available Miguel Hernán DEPARTMENTS OF EPIDEMIOLOGY AND BIOSTATISTICS."

Similar presentations


Ads by Google