Considerations for the use of multiple imputation in a noninferiority trial setting Kimberly Walters, Jie Zhou, Janet Wittes, Lisa Weissfeld Joint Statistical.

Slides:



Advertisements
Similar presentations
1 Superior Safety in Noninferiority Trials David R. Bristol To appear in Biometrical Journal, 2005.
Advertisements

Hypothesis Testing Goal: Make statement(s) regarding unknown population parameter values based on sample data Elements of a hypothesis test: Null hypothesis.
Controlling for Time Dependent Confounding Using Marginal Structural Models in the Case of a Continuous Treatment O Wang 1, T McMullan 2 1 Amgen, Thousand.
CJT 765: Structural Equation Modeling Class 3: Data Screening: Fixing Distributional Problems, Missing Data, Measurement.
BCOR 1020 Business Statistics
Clustered or Multilevel Data
BS704 Class 7 Hypothesis Testing Procedures
Today Concepts underlying inferential statistics
1 BA 555 Practical Business Analysis Review of Statistics Confidence Interval Estimation Hypothesis Testing Linear Regression Analysis Introduction Case.
Statistical Methods for Missing Data Roberta Harnett MAR 550 October 30, 2007.
Chapter 14 Inferential Data Analysis
Sample Size Determination Ziad Taib March 7, 2014.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Multiple imputation using ICE: A simulation study on a binary response Jochen Hardt Kai Görgen 6 th German Stata Meeting, Berlin June, 27 th 2008 Göteborg.
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
Review of Chapters 1- 5 We review some important themes from the first 5 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
1 Multiple Imputation : Handling Interactions Michael Spratt.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
April 6 Logistic Regression –Estimating probability based on logistic model –Testing differences among multiple groups –Assumptions for model.
Review of Chapters 1- 6 We review some important themes from the first 6 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
Applied Epidemiologic Analysis - P8400 Fall 2002 Lab 10 Missing Data Henian Chen, M.D., Ph.D.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
Part 2: Model and Inference 2-1/49 Regression Models Professor William Greene Stern School of Business IOMS Department Department of Economics.
What is a non-inferiority trial, and what particular challenges do such trials present? Andrew Nunn MRC Clinical Trials Unit 20th February 2012.
SW 983 Missing Data Treatment Most of the slides presented here are from the Modern Missing Data Methods, 2011, 5 day course presented by the KUCRMDA,
1 Updates on Regulatory Requirements for Missing Data Ferran Torres, MD, PhD Hospital Clinic Barcelona Universitat Autònoma de Barcelona.
Sampling Fundamentals 2 Sampling Process Identify Target Population Select Sampling Procedure Determine Sampling Frame Determine Sample Size.
Special Topics in Educational Data Mining HUDK5199 Spring term, 2013 March 13, 2013.
29 th TRF 2003, Denver July 14 th, Jenny H. Qin and Mike Singleton Kentucky CODES Kentucky Injury Prevention & Research Center University.
Tutorial I: Missing Value Analysis
Multiple Imputation using SAS Don Miller 812 Oswald Tower
1 Probability and Statistics Confidence Intervals.
Chapter 13 Understanding research results: statistical inference.
Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 April 9, 2012.
Introduction to Biostatistics, Harvard Extension School, Fall, 2005 © Scott Evans, Ph.D.1 Sample Size and Power Considerations.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Research and Evaluation Methodology Program College of Education A comparison of methods for imputation of missing covariate data prior to propensity score.
Missing data: Why you should care about it and what to do about it
More on Inference.
Hypothesis Testing Start with a question:
Sample Size Determination
Biostatistics Case Studies 2007
Hypothesis Testing: One Sample Cases
Multiple Imputation using SOLAS for Missing Data Analysis
MISSING DATA AND DROPOUT
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
The Centre for Longitudinal Studies Missing Data Strategy
Sample Size Estimation
Multiple Imputation Using Stata
How to handle missing data values
Aligning Estimands and Estimators – A Case Study Sept 13, 2018 Elena Polverejan Vladimir Dragalin Quantitative Sciences Janssen R&D, Johnson & Johnson.
Chapter 9 Hypothesis Testing.
More on Inference.
Data Analysis and Statistical Software I ( ) Quarter: Autumn 02/03
9 Tests of Hypotheses for a Single Sample CHAPTER OUTLINE
Chapter 9 Hypothesis Testing.
Discrete Event Simulation - 4
When You See (This), You Think (That)
Additional notes on random variables
Additional notes on random variables
Missing Data Mechanisms
What Do We Know About Estimators for the Treatment Policy Estimand
Handling Missing Not at Random Data for Safety Endpoint in the Multiple Dose Titration Clinical Pharmacology Trial Li Fan*, Tian Zhao, Patrick Larson Merck.
Clinical prediction models
Stanford University School of Medicine
Yu Du, PhD Research Scientist Eli Lilly and Company
How Should We Select and Define Trial Estimands
Imputation Strategies When a Continuous Outcome is to be Dichotomized for Responder Analysis: A Simulation Study Lysbeth Floden, PhD1 Melanie Bell, PhD2.
2019 Joint Statistical Meetings at Denver
Presentation transcript:

Considerations for the use of multiple imputation in a noninferiority trial setting Kimberly Walters, Jie Zhou, Janet Wittes, Lisa Weissfeld Joint Statistical Meetings Denver, Colorado July 30, 2019

Outline Inference in clinical trials: noninferiority versus superiority choice of estimand conservative (favoring null) assumptions for missing data Multiple imputation (MI) of missing data fully conditional specification (FCS) methods imputation size, i.e., number of imputed datasets choice of covariates Simulation study to assess MI choices: the trade-off bias, variance, power computational burden

A word about estimands Wait! What’s an estimand? From ICH E9 (R1) addendum on estimands: The population The response variable (Y) Rules to handle observations after intercurrent events Population-level summary of treatment effect, e.g., θT − θC Examples: Treatment policy, While on treatment, Composite, Hypothetical, Principal stratum

A word about estimands From ICH E9 (R1) addendum on estimands: “The choice of estimands for studies with objectives to demonstrate non-inferiority or equivalence requires careful reflection.” “[S]uch trials are not conservative in nature and the importance of minimizing the number of protocol violations and deviations, non-adherence and withdrawals is indicated.”

Superiority Null Alternative Comparator Investigational drug measure of efficacy time

Noninferiority Null Alternative Comparator Investigational drug measure of efficacy <NIM >NIM time NIM=noninferiority margin (depends on known efficacy of comparator)

Control-based assumptions about missing data measure of efficacy Comparator Investigational drug SUPERIORITY more conservative assumptions NONINFERIORITY less conservative assumptions time

Single vs. multiple imputation (MI) Single imputation each missing value is replaced by a single value examples: LOCF, WOCF, group-mean can underestimate the variability of the outcome Multiple imputation each missing value is replaced by multiple (M) values M complete datasets generated more accurately reflects the variability of the outcome estimates from each dataset are “rolled up” using Rubin’s rules

Multiple imputation design Model for generating multiple values? How many and which covariates? Imputation size M?

Multiple imputation model

Multiple imputation model SAS PROC MI and PROC MIANALYZE Fully conditional specification (FCS) SAS: “With an FCS statement, the variables are imputed sequentially in the order specified in the VAR statement.” conditions on previous variables (observed and imputed values) Predictive mean matching (PMM) Default FCS models SAS: “By default, a regression method is used for a continuous variable, and a discriminant function method is used for a classification variable.”

Covariates for MI Choice of variables Substantive model – response (Y) and predictors (X) Auxiliary variables (Z) highly correlated to substantive model variables predict the pattern of missingness Caution from Hardt, et al. 2012: “inclusion of too many variables leads to a considerable increase in computation time or even causes programs to fail…” They recommend this rule of thumb: “the number of cases with complete data should be at least three times the number of variables”

Imputation size Hardt et al., 2012: “There was a recommendation by Little and Rubin that creating three to five data sets is usually enough. In more recent articles, probably due to increasing computational power, this recommendation has changed, suggesting the use of a greater number of imputed datasets in real data analyses, i.e. 20 – 100” O’Kelly and Ratitch, 2014: “In our experience, using the number of imputations even on the order of several thousand does not present any practical challenges with the current technology and produces results that are quite stable with respect to different random seeds used, as well as close to the results of the maximum likelihood analysis with an equivalent model.” The MI Procedure (SAS): “For other statistics (such as standard error and p-value) to be reliable, the rule of thumb is to use the percentages of cases with missing values as the number of imputations.”

Simulation study: design parameters Response features of interventions in population θ = true treatment effect (unknown) σ = SD(Y) = standard deviation of endpoint measure Y Study design n = number of subjects total (1:1) choice of estimand noninferiority margin (NIM) Data quality p = percentage of missing data (varies by variable) mechanism of missing data – MAR or MNAR ρ = correlation between covariates and outcome -0.2, 0 1, 1.5 300, 1000 change score, treatment policy, difference in means -0.25, -0.5 10%, 40% MAR 0.2, 0.8

Simulation study: MI parameters M = number of imputations K = number of covariates N = number of simulations mix of covariate types all continuous 60% continuous and 40% binary 10, 50, 100, 500 10, 20, 30, 40, 50 10, 100, 1000

Simulation study: assessment criteria Statistical operating characteristics Bias and variance of estimate Bias and variance of lower bound power to reject null (inferiority) nearness to asymptote (as M increases) invariance to random number seed Computational burden run time for multiple imputation (including analysis and rollup) number of burn-in iterations fixed (nbiter at 100; default=20)

Results I: imputation size N=100; n=1000; K=7 (normal); SD=1.5; rho=0.8; p=40%; delta=-0.2 Computation time (seconds) M Mean Std Dev Minimum Maximum 10 1 0.2 2 50 5 0.4 4 6 100 0.5 9 500 49 2.1 47 61

Results I: imputation size N=100; n=1000; K=7 (normal); SD=1.5; rho=0.8; p=40%; delta=-0.2 M Statistics Mean Std Dev Min Median Max >-0.5 >-0.25 10 ESTIMATE -0.223 0.096 -0.393 -0.228 -0.013 - STDERR 0.099 0.004 0.089 0.110 LCLMEAN -0.418 -0.580 -0.425 -0.209 77% 4% 500 -0.224 0.394 0.231 -0.006 0.098 0.002 0.092 0.105   -0.585 -0.426 -0.198

Results II: number of covariates N=10; n=1000; M=10; SD=1; rho=0.2; p=10%; delta=1; *switch 20 to binary Computation time (minutes) K Mean Std Dev Minimum Maximum 10  0.3 0.0 0.4 20  2.4 1.0 1.2 3.7 30  3.5 0.9 2.7 4.9 40  8.0 4.8 4.1 17.7 50  9.0 3.1 6.4 16.2 50* 26.5 7.1 15.9 33.2

Results II: number of covariates N=10; n=1000; M=10; SD=1; rho=0.2; p=10%; delta=1 M Statistics Mean Std Dev Min Median Max 20 ESTIMATE 0.980 0.125 0.856 0.944 1.213 STDERR 0.090 0.010 0.074 0.105 LCLMEAN 0.797 0.124 0.666 0.761 1.019   30 1.016 0.069 0.901 1.023 1.135 0.095 0.009 0.082 0.114 0.823 0.078 0.703 0.831 0.940 40 1.001 0.089 0.854 1.025 1.126 0.094 0.013 0.072 0.096 0.108 0.811 0.086 0.633 0.826 0.954 50 1.029 0.087 0.923 1.012 1.194 0.100 0.012 0.097 0.116 0.825 0.073 0.723 0.796 0.960

Results III: “realistic” scenario N=10; n=1000; K=72 (mix); SD=1.5; rho=0.8 (Y) and 0.2 (X); p=10%; delta=0 Computation time (hours) M Mean Std Dev Minimum Maximum 10 0.6 0.2 0.5 0.9 50 4.2 1.6 2.8 7.9 100 7.5 6.2 9.2

Results III: “realistic” scenario N=10; n=1000; K=72 (mix); SD=1.5; rho=0.8 (Y) and 0.2 (X); p=10%; delta=0 M Statistics Mean Std Dev Min Median Max 10 ESTIMATE -0.014 0.097 -0.117 -0.044 0.162 STDERR 0.112 0.002 0.107 0.111 0.115 LCLMEAN -0.233 0.100 -0.334 -0.265 -0.054 50 -0.001 0.114 -0.180 -0.007 0.159 0.110 -0.217 0.117 -0.397 -0.226 -0.055 100 -0.181 0.158 -0.218 0.118 -0.398 -0.057

Conclusions Imputation size depends on percentage missing adds to computational burden use M=50 Imputation covariates well chosen covariates can reduce bias and approach MAR assumptions too many covariates can make analysis impossible to run type of covariates and correlation to response K~50

Author contact information kimberly.walters@statcollab.com jie.zhou@statcollab.com janet.wittes@statcollab.com lisa.weissfeld@statcollab.com