Modelling partial compliance through copulas in a principal stratification framework Bartolucci F. and Grilli L. (2011) To appear in JASA. Leonardo Grilli.

Slides:

Advertisements

Similar presentations

1 Superior Safety in Noninferiority Trials David R. Bristol To appear in Biometrical Journal, 2005.

Advertisements

The Application of Propensity Score Analysis to Non-randomized Medical Device Clinical Studies: A Regulatory Perspective Lilly Yue, Ph.D.* CDRH, FDA,

1 ESTIMATION IN THE PRESENCE OF TAX DATA IN BUSINESS SURVEYS David Haziza, Gordon Kuromi and Joana Bérubé Université de Montréal & Statistics Canada ICESIII.

Jörg Drechsler (Institute for Employment Research, Germany) NTTS 2009 Brussels, 20. February 2009 Disclosure Control in Business Data Experiences with.

C82MST Statistical Methods 2 - Lecture 2 1 Overview of Lecture Variability and Averages The Normal Distribution Comparing Population Variances Experimental.

MCMC estimation in MlwiN

Analysis of multivariate transformations. Transformation of the response in regression The normalized power transformation is: is the geometric mean of.

Assumptions underlying regression analysis

Multilevel Event History Modelling of Birth Intervals

Chapter 4: Basic Estimation Techniques

Tests of Static Asset Pricing Models

Inferential Statistics and t - tests

6. Statistical Inference: Example: Anorexia study Weight measured before and after period of treatment y i = weight at end – weight at beginning For n=17.

Chapter 4 Inference About Process Quality

Statistical Analysis SC504/HS927 Spring Term 2008

Comparing Two Population Parameters

7/20/03Copyright Ed Lipinski and Mesa Community College, All rights reserved. 1 Research Methods Summer 2009 Using Between Subjects and Within.

Writing up results from Structural Equation Models

CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu Lecture 27 – Overview of probability concepts 1.

Chapter 18: The Chi-Square Statistic

Nonparametric estimation of non- response distribution in the Israeli Social Survey Yury Gubman Dmitri Romanov JSM 2009 Washington DC 4/8/2009.

Simple Linear Regression Analysis

Week 13 November Three Mini-Lectures QMM 510 Fall 2014.

Multiple Regression and Model Building

Bayesian inference “Very much lies in the posterior distribution” Bayesian definition of sufficiency: A statistic T (x 1, …, x n ) is sufficient for 

Estimating the Effects of Treatment on Outcomes with Confidence Sebastian Galiani Washington University in St. Louis.

Likelihood Ratio, Wald, and Lagrange Multiplier (Score) Tests

Improving health worldwide George B. Ploubidis The role of sensitivity analysis in the estimation of causal pathways from observational.

6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.

Experimental Design making causal inferences. Causal and Effect The IV precedes the DV in time The IV precedes the DV in time The IV and DV are correlated.

L18: CAPM1 Lecture 18: Testing CAPM The following topics will be covered: Time Series Tests –Sharpe (1964)/Litner (1965) version –Black (1972) version.

Goodness of Fit of a Joint Model for Event Time and Nonignorable Missing Longitudinal Quality of Life Data – A Study by Sneh Gulati* *with Jean-Francois.

4. Multiple Regression Analysis: Estimation -Most econometric regressions are motivated by a question -ie: Do Canadian Heritage commercials have a positive.

Statistical Inference and Regression Analysis: GB Professor William Greene Stern School of Business IOMS Department Department of Economics.

Chapter 11 Multiple Regression.

Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 11 th Edition.

Simple Linear Regression and Correlation

CHAPTER 19: Two-Sample Problems

Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.

Review for Final Exam Some important themes from Chapters 9-11 Final exam covers these chapters, but implicitly tests the entire course, because we use.

Multiple imputation using ICE: A simulation study on a binary response Jochen Hardt Kai Görgen 6 th German Stata Meeting, Berlin June, 27 th 2008 Göteborg.

Lesson Comparing Two Means.

Chapter 10 Comparing Two Means Target Goal: I can use two-sample t procedures to compare two means. 10.2a h.w: pg. 626: 29 – 32, pg. 652: 35, 37, 57.

Lecture 8: Generalized Linear Models for Longitudinal Data.

Experimental Design making causal inferences Richard Lambert, Ph.D.

Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.

© Michael Lechner, 2006, p. 1 (Non-bayesian) Discussion (translation) of Principal Stratification for Causal Inference with Extended Partial Complience.

Assumptions of value-added models for estimating school effects sean f reardon stephen w raudenbush april, 2008.

By: Amani Albraikan.  Pearson r  Spearman rho  Linearity  Range restrictions  Outliers  Beware of spurious correlations….take care in interpretation.

HSRP 734: Advanced Statistical Methods July 17, 2008.

Empirical Efficiency Maximization: Locally Efficient Covariate Adjustment in Randomized Experiments Daniel B. Rubin Joint work with Mark J. van der Laan.

통계적 추론 (Statistical Inference) 삼성생명과학연구소 통계지원팀 김선우 1.

Copyright ©2011 Brooks/Cole, Cengage Learning Inference about Simple Regression Chapter 14 1.

Lesson Comparing Two Means. Knowledge Objectives Describe the three conditions necessary for doing inference involving two population means. Clarify.

CIA Annual Meeting LOOKING BACK…focused on the future.

Multiple Logistic Regression STAT E-150 Statistical Methods.

Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.

Simulation Study for Longitudinal Data with Nonignorable Missing Data Rong Liu, PhD Candidate Dr. Ramakrishnan, Advisor Department of Biostatistics Virginia.

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 10 Comparing Two Groups Section 10.1 Categorical Response: Comparing Two Proportions.

Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 10 th Edition.

REGRESSION MODEL FITTING & IDENTIFICATION OF PROGNOSTIC FACTORS BISMA FAROOQI.

Estimating standard error using bootstrap

Missing data: Why you should care about it and what to do about it

Chapter 8: Inference for Proportions

Lesson Comparing Two Means.

EM for Inference in MV Data

Which of the popular two drugs – Lipitor or Pravachol – helps lower.

EM for Inference in MV Data

Björn Bornkamp, Georgina Bermann

Presentation transcript:

Modelling partial compliance through copulas in a principal stratification framework Bartolucci F. and Grilli L. (2011) To appear in JASA. Leonardo Grilli University of Florence Bristol, 29 th June 2011 Research seminar

2 Some history EFRON & FELDMAN (1991): analysis of a randomized trial with partial non-compliance FRANGAKIS & RUBIN (2002): principal stratification general framework to deal with non-compliance but earliest applications are for all-or-none compliance ( discrete strata) JIN & RUBIN (2008): new analysis of Efron & Feldman data using continuous principal strata BARTOLUCCI & GRILLI (2011): new analysis of Efron & Feldman data following the approach of Jin & Rubin but with a different modeling strategy

3 Efron-Feldman data /1 EF: Efron B. & Feldman D. (1991) Compliance as an Explanatory Variable in Clinical Trials, JASA 86, Subset of data from LRC-CPPT, a placebo-controlled double-blinded randomized clinical trial designed to study the effectiveness of cholestyramine for lowering cholesterol levels data on 335 men: 164 active pills of the drug 171 placebo pills Visits at two-month intervals (average length of 7.3 years) Variables: binary indicator for treatment assignment (1 = active pills) compliance (proportion of assigned packets not returned, averaged over all visits – rounded to the second decimal) continuous outcome variable: average decrease in the cholesterol level during the study

4 Efron-Feldman data /2 Drug group Placebo group The effect of the observed compliance to drug is likely the combination of a genuine effect (chemical action of the drug) and a collateral effect (correlation between the degree of compliance and some unobserved characteristics of the patients which affect the cholesterol level, such as the propensity to eat healthy food or to exercise)

5 imputed observed imputed Efron-Feldman data /3 Q-Q plot observed compliance to placebo larger than observed compliance to drug (adverse side-effects of the drug) EF imputed the missing compliances using the percentiles (equipercentile equating assumption) Maybe a restrictive assumption: how to relax it?

6 Preliminary analysis Treatment indicator Z i (1=drug, 0=placebo) Potential outcomes Compliance: d i placebo, D i drug Outcome: Y i (0) placebo, Y i (1) drug Preliminary analysis: separate regressions for the two treatment arms SELECTED MODELS: no quadratic terms, but heteroschedasticity for Z i =1

7 Modelling strategy Principal stratification approach of JR: Jin and Rubin (2008) Principal Stratification for Causal Inference With Extended Partial Compliance, JASA 103, Principal stratification (d i,D i ) principal strata (continuous) E(Y i (1) - Y i (0) | d i,D i ) principal causal effect (PCE) Regression models for the outcomes Y i (0) on d i and D i Y i (1) on d i and D i Like JR we adopt principal stratification and focus on PCE In the regressions we allow for nonlinearities (like JR) but also for interactions and heteroschedasticity Key point: we relax one of the assumptions of JR

Assumptions /1 1.SUTVA 2.Ignorable treatment assignment 3.Strong access monotonicity (drug compliance is null for patients assigned to placebo and viceversa) 8 We rely on assumptions 1 to 3 like JR and EF These assumptions are untestable but reasonable in carefully designed randomized trials such as the cholestyramine study

Assumptions /2 An advantage of our approach is that we do not invoke assumptions about the relationship between drug and placebo compliances In contrast EF assumed equipercentile equating of the compliances (a deterministic relationship) JR assumed negative side-effect monotonicity, i.e. the drug compliance is no larger than the placebo compliance (a stochastic relationship with D i d i ) 9 Negative side-effect monotonicity seems plausible in placebo-controlled experiments where the drug has some negative side effects However, it is violated if there exists a subset of patients having positive side effects from reported cholesterol reductions after periodic blood tests

10 Joint distribution of compliances The compliances d i and D i are never jointly observed … … but empirical evidence on their correlation is induced by the equations for the outcomes where both compliances enter as regressors For the joint distribution of the compliances, JR assumed that d i has distribution Beta( ) and (conditional on d i ) the ratio D i /d i has distribution Beta( ), which implies D i d i (Negative side-effect monotonicity) The specification of JR allows the compliances to be correlated, although in a particular way that should not be viewed as the only possible way How sensitive is the inference on the causal effect to the model on the compliances?

11 Copulas To model the joint distribution of the compliances (d i,D i ) we use a copula, which is a flexible way to define a joint distribution from the marginals Definition: let X and Y be two random variables with distribution functions F X (x) and F Y (y), then a single-parameter copula (with parameter ) is a function C (, ) such that C (F X (x),F Y (y)) is a joint distribution function The copula has the merit of allowing us to study the association between the compliances without specifying a model for their marginal distributions, which are estimated by their empirical distribution functions

12 Plackett copula We use a Plackett copula which has a single association parameter negative association independence positive association Advantages of using a copula instead of a parametric density: no constraints on the marginal distributions association captured by a single parameter (to be estimated or used in a sensitivity analysis)

13 ML estimation via EM 1.Compute the univariate empirical distribution functions of d i and D i 2.For a set of values of the association param. i.Compute the joint distribution function of (d i,D i ) using the copula ii.Compute the likelihood iii.Maximize the likelihood via EM 3.Plot the profile likelihood for This allows us to see how the different values of are supported by the data check for local maxima we wrote a MATLAB routine

14 Model selection /1 We begin with a general form with quadratic terms, interactions and heteroschedasticity INITIAL SPECIFICATION: 1. Regression models for the means E(Y i (0) | d i,D i ) = … E(Y i (1) | d i,D i ) = … 2. Regression models for the log-variances logVar(Y i (0) | d i,D i ) = … logVar(Y i (1) | d i,D i )= … 3. Plackett copula for the joint distribution of (d i,D i ) with association parameter 9 parameters 6 parameters 1 parameter We test several restrictions using the LR test

15 Model selection /2 FINAL SPECIFICATION: 1. Regression models for the means E(Y i (0) | d i,D i ) = d i E(Y i (1) | d i,D i )d iD i(d iD i ) 2. Regression models for the log-variances logVar(Y i (0) | d i,D i ) logVar(Y i (1) | d i,D i ) D i 3. Plackett copula for the joint distribution of (d i,D i ) with association parameter 4 parameters 2 parameters 1 parameter

16 Profile log-likelihood of log Point estimate of is Independence between d i and D i (i.e.=1) is rejected (p- value<0.001) Pearson correlation between d i and D i is a rather flat profile

Type of inference The PCE depends on the regression coefficients, which depend on the value of the Plackett association parameter which is estimated with low precision because of scarce empirical support (flat profile log-likelihood) identified thanks to the regression equations (which cannot be tested separately since they depend on both compliances) Thus it is not advisable to base the inference exclusively on the model with at its point estimate First, I will illustrate the inference drawn with at its ML estimate Next, I will show the results of a sensitivity analysis on the estimated PCE by letting vary on a suitable interval 17

18 Association of compliances [ at MLE] We relax the negative side-effect monotonicity (i.e. D i d i ) 21.6% of the points go beyond the bisectrix, corresponding to individuals with D i > d i (they would take more drug than placebo) Random draws from the bivariate distribution of the compliances Jin and Rubin (2008) Bartolucci and Grilli (2011)

19 Models for the means and PCE [ at MLE] E(Y i (0) |d i,D i )d i E(Y i (1) |d i,D i )d i D i (d iD i ) The estimated Principal Causal Effect is PCE(d i,D i )d iD i The PCE depends on the dose of the drugD i and the slope is positive, except when d i <0.298 (but this is rare: 12.3% of the subjects in the placebo arm) steeper at higher levels of the placebo compliance d i

20 Estimated PCE surface [ at MLE] Good agreement with the estimates of Jin and Rubin At the median point (d i =0.89, D i =0.70) our ML estimate of PCE is 30, compared with 24 of JR (Bayesian posterior median)

Confidence intervals for PCE [ at MLE] Confidence intervals for the PCE are easily obtained via the nonparametric bootstrap (1000 samples) There is a confidence interval for each couple of values of the compliances, for example: at the medians (d i =0.89, D i =0.70) interval (22.5, 39.2) significant effect at first quartiles (d i =0.59, D i =0.27) interval (3.1, 9.9) non-significant effect at first quartile for placebo compliance and at third quartile for drug compliance (d i =0.59, D i =0.95) interval (10.8, 34.7) non-significant effect 21 Remark: the third case refers to an unlikely couple of compliance levels (rarely D i is much larger than d i ) very large interval

22 Sensitivity analysis We perform a sensitivity analysis to assess how the PCE depends on (we let it vary in its profile likelihood interval): at the median point (d i =0.89, D i =0.70): PCE (27.4, 34.8) at the Q1-Q3 point (d i =0.59, D i =0.95): PCE (14.0, 29.5) Principal Causal Effects are reliably estimated at drug and placebo compliance levels near the sample medians, while inference at unlikely compliance levels appears to be unduly affected by model assumptions Remark: this case (corresponding to an unlikely couple of compliance levels, i.e. far from the bulk of the data) has an high sensitivity (in addition to a large sampling variance – recall the bootstrap CI)

Model checks The estimates of the PCE may critically depend on some modelling choices such as the normality of the conditional distributions of the potential outcomes the type of copula We check the normality assumption using a Box-Cox transformation of the outcomes the LR test does not reject the hypothesis of an identity transformation (i.e. no evidence against normality) the estimates of the PCE are quite stable (except for unlikely couples of compliance levels – again!) We check the role of the copula by replacing the Plackett copula with a Gaussian copula: the estimates of the PCE are very stable (even for unlikely couples of compliance levels ) 23

Warnings Be aware of hidden extrapolation: the model yields estimates of the PCE at any couple of compliance levels, but for unlikely couples (i.e. far from the bulk of the data) the estimates are highly sensitive to model assumptions (beyond having a large sampling variance) We are not estimating a dose-response function: for a fixed value of d i, we can draw PCE(d i,D i ) as a function of D i : however, this is not interpretable as a dose-response function since the dose of the drug is not randomly assigned but chosen by the patients the effect of the dose of the drug is mixed with the effect of the unobserved features associated with the degree of compliance (the interpretation in terms of a dose–response function would require further problematic assumptions – see JRs section 4) 24

25 Final remarks Our ML estimates of the causal effects for the Efron- Feldman data are similar to the Bayesian estimates of Jin and Rubin (2008) … … but we offer an alternative modelling strategy yielding a different interpretation due to interaction between the compliances in the principal causal effect flexible specification of the joint distribution of the compliances through a copula ( doubts on the negative side-effect assumption) Merits of our approach: The joint distribution of the compliances is modelled in a flexible way (relaxing assumptions) Sensitivity analysis is straightforward ML estimation via EM is computationally simple

Thanks for your attention!