Biostatistics Case Studies 2007 Peter D. Christenson Biostatistician Session 3: Incomplete Data in Longitudinal Studies.

Slides:



Advertisements
Similar presentations
A Spreadsheet for Analysis of Straightforward Controlled Trials
Advertisements

Randomized Complete Block and Repeated Measures (Each Subject Receives Each Treatment) Designs KNNL – Chapters 21,
1-Way Analysis of Variance
MCUAAAR: Methods & Measurement Core Workshop: Structural Equation Models for Longitudinal Analysis of Health Disparities Data April 11th, :00 to.
Topic 12 – Further Topics in ANOVA
3-Dimensional Gait Measurement Really expensive and fancy measurement system with lots of cameras and computers Produces graphs of kinematics (joint.
1 QOL in oncology clinical trials: Now that we have the data what do we do?
CCEB Modeling Quality of Life Data with Missing Values Andrea B. Troxel, Sc.D. Assistant Professor of Biostatistics Center for Clinical Epidemiology and.
Dynamic Treatment Regimes, STAR*D & Voting D. Lizotte, E. Laber & S. Murphy LSU ---- Geaux Tigers! April 2009.

Common Problems in Writing Statistical Plan of Clinical Trial Protocol Liying XU CCTER CUHK.
Dynamic Treatment Regimes, STAR*D & Voting D. Lizotte, E. Laber & S. Murphy ENAR March 2009.
Clustered or Multilevel Data
Dynamic Treatment Regimes, STAR*D & Voting D. Lizotte, E. Laber & S. Murphy Psychiatric Biostatistics Symposium May 2009.
Analysis of Variance & Multivariate Analysis of Variance
Longitudinal Data Analysis: Why and How to Do it With Multi-Level Modeling (MLM)? Oi-man Kwok Texas A & M University.
Regression Approach To ANOVA
Professor of Epidemiology and Biostatistics
Repeated measures: Approaches to Analysis Peter T. Donnan Professor of Epidemiology and Biostatistics.
GEE and Generalized Linear Mixed Models
Introduction to Multilevel Modeling Using SPSS
Analysis of Variance. ANOVA Probably the most popular analysis in psychology Why? Ease of implementation Allows for analysis of several groups at once.
Inference for regression - Simple linear regression
Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 4: Taking Risks and Playing the Odds: OR vs.
Inference for Linear Regression Conditions for Regression Inference: Suppose we have n observations on an explanatory variable x and a response variable.
Biostatistics Case Studies Peter D. Christenson Biostatistician Session 5: Analysis Issues in Large Observational Studies.
Statistical Bootstrapping Peter D. Christenson Biostatistician January 20, 2005.
Some terms Parametric data assumptions(more rigorous, so can make a better judgment) – Randomly drawn samples from normally distributed population – Homogenous.
Biostatistics Case Studies 2008 Peter D. Christenson Biostatistician Session 3: Replicates.
Biostatistics in Practice Peter D. Christenson Biostatistician LABioMed.org /Biostat Session 6: Case Study.
Biostatistics Case Studies 2015 Youngju Pak, PhD. Biostatistician Session 1: Sample Size & Power for Inequality and Equivalence Studies.
Biostatistics Case Studies 2008 Peter D. Christenson Biostatistician Session 5: Choices for Longitudinal Data Analysis.
Repeated Measurements Analysis. Repeated Measures Analysis of Variance Situations in which biologists would make repeated measurements on same individual.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 2: Using the Bootstrap: Throw Out Those Messy.
1 THE ROLE OF COVARIATES IN CLINICAL TRIALS ANALYSES Ralph B. D’Agostino, Sr., PhD Boston University FDA ODAC March 13, 2006.
BUSI 6480 Lecture 8 Repeated Measures.
Biostatistics in Practice Peter D. Christenson Biostatistician Session 6: Case Study.
PSYC 3030 Review Session April 19, Housekeeping Exam: –April 26, 2004 (Monday) –RN 203 –Use pencil, bring calculator & eraser –Make use of your.
1 Updates on Regulatory Requirements for Missing Data Ferran Torres, MD, PhD Hospital Clinic Barcelona Universitat Autònoma de Barcelona.
Biostatistics Case Studies 2010 Peter D. Christenson Biostatistician Session 3: Clustering and Experimental Replicates.
Biostatistics Case Studies 2006 Peter D. Christenson Biostatistician Session 4: An Alternative to Last-Observation-Carried-Forward:
Biostatistics in Practice Peter D. Christenson Biostatistician Session 3: Testing Hypotheses.
Data Analysis in Practice- Based Research Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine October.
Randomized block designs  Environmental sampling and analysis (Quinn & Keough, 2002)
Simulation Study for Longitudinal Data with Nonignorable Missing Data Rong Liu, PhD Candidate Dr. Ramakrishnan, Advisor Department of Biostatistics Virginia.
Biostatistics Case Studies 2006 Peter D. Christenson Biostatistician Session 2: Correlation of Time Courses of Simultaneous.
Biostatistics Case Studies 2006 Peter D. Christenson Biostatistician Session 3: An Alternative to Last-Observation-Carried-Forward:
Analysis of Experiments
Tutorial I: Missing Value Analysis
Session 6: Other Analysis Issues In this session, we consider various analysis issues that occur in practice: Incomplete Data: –Subjects drop-out, do not.
Biostatistics in Practice Session 6: Data and Analyses: Too Little or Too Much Youngju Pak Biostatistician
Assumptions of Multiple Regression 1. Form of Relationship: –linear vs nonlinear –Main effects vs interaction effects 2. All relevant variables present.
Biostatistics Case Studies Peter D. Christenson Biostatistician Session 3: Missing Data in Longitudinal Studies.
Statistics (cont.) Psych 231: Research Methods in Psychology.
Biostatistics in Practice Peter D. Christenson Biostatistician Session 6: Data and Analyses: Too Little or Too Much.
1 Statistics 262: Intermediate Biostatistics Mixed models; Modeling change.
Introduction to Biostatistics, Harvard Extension School, Fall, 2005 © Scott Evans, Ph.D.1 Sample Size and Power Considerations.
1 G Lect 10M Contrasting coefficients: a review ANOVA and Regression software Interactions of categorical predictors Type I, II, and III sums of.
DATA STRUCTURES AND LONGITUDINAL DATA ANALYSIS Nidhi Kohli, Ph.D. Quantitative Methods in Education (QME) Department of Educational Psychology 1.
Simulation setup Model parameters for simulations were tuned using repeated measurement data from multiple in-house completed studies and baseline data.
Repeated measures: Approaches to Analysis
Stephen W. Raudenbush University of Chicago December 11, 2006
CHAPTER 29: Multiple Regression*
An Introductory Tutorial
BY: Mohammed Hussien Feb 2019 A Seminar Presentation on Longitudinal data analysis Bahir Dar University School of Public Health Post Graduate Program.
Psych 231: Research Methods in Psychology
Fixed, Random and Mixed effects
Clinical prediction models
Presentation transcript:

Biostatistics Case Studies 2007 Peter D. Christenson Biostatistician Session 3: Incomplete Data in Longitudinal Studies

Case Study

Study Design

Study Results 1 2 3

Enrolled and Completed Subjects Completer Analysis:N= LOCF Analysis:N= =63 were imputed. MMRM Analysis:N= None were imputed. When?

General Reasoning Completer Analysis: Biased; completers may differ from randomized in Phase III study. May be preferred in Phase II. LOCF: Proposed as a neutral method to implement analysis on an intent-to-treat population. MMRM: Uses all data, unlike completer analysis, but doesn’t impute unobserved data as in LOCF. ? Reasoning for this particular study ?

Imputation with LOCF Completer 30 HAM-A Score Week 0 Ignores potential progression; conservative; usually attenuates likely changes and ↑ standard deviations. No correction for using unobserved data as if real. Individual Subjects denotes imputed: N=63/260 Use all 260 values as if observed here.

Change from Baseline Baseline Final Visit Intermediate Visit 0 Change from Baseline Intermediate Visit Final VisitBaseline 0 LOCF: Ignore Potential Progression LRCF: Maintain Expected Relative Progression Individual Subjects One Alternative: Last Rank Carried Forward

Completer vs. LOCF Analysis LOCF Analysis Δ b/w groups = 1.8 N=260: 197 actual, 63 imputed Completer Analysis Δ b/w groups = 2.5 N=197: 197 actual Δ from baseline =~ 10 Clinically relevant Δ=? (Week 8 or earlier)

Mixed Model Approach The completer analysis removes some early data. The LOCF method adds unobserved later data. E.g., remove week 0 or add week 8. Mixed Model for Repeated Measures MMRM: Performs completer analysis. Makes valid, but less preferred, comparison using data omitted from completer analysis. Combines these two results. This paper only gives p-values for results. Example: Next Slide

MMRM Example* *Brown, Applied Mixed Models in Medicine, Wiley Consider a crossover (paired) study with 6 subjects. Subject 5 missed treatment A and subject 6 missed B. Completer analysis would use IDs 1-4; trt diff=4.25. Strict LOCF analysis would impute 22,17; trt diff=2.83. LOCF Difference

MMRM Example Cont’d Δ W =4.25 Paired Δ B =5 Unpaired Mixed model gets the better* estimate of the A-B difference from the 4 completers paired mean Δ w =4.25. It gets a poorer unpaired estimate from the other 2 subjects Δ B = = 5. How are these two “sub-studies” combined? *Why better?

MMRM Example Cont’d Δ W =4.25 Paired Δ B =5 Unpaired The overall estimated Δ is a weighted average of the separate Δs, inversely weighting by their variances: Δ = [Δ W /SE 2 (Δ W ) + Δ B /SE 2 (Δ B )]/K = [4.25/ /43.1]/(1/ /43.1) = 4.32 The 4.45 and 43.1 incorporate the Ns and whether data is paired or unpaired: How are they found?

MMRM Example - SAS Output Covariance Parameter Estimates CS Residual Standard Effect trt Estimate Error DF t Value Pr > |t| Intercept trt A-B Diff Within Subjects (Paired) Among Subjects SE 2 for N=4+4 Paired Δ W =4.25: 8.90(1/4 + 1/4) = 4.45 SE 2 for N=1+1 Unpaired Δ B =43.1: ( )(1/1 + 1/1) = 43.1 Also 4.45 and 43.1 are used to get SE(Δ) = 2.01

MMRM - More General I The example was “balanced” in missing data, with information from both treatments A and B in the unpaired data. What if all missing data are at week 8, and none at week 0, as in our paper? The unpaired week 0 mean is compared with the combined paired week 0 and week 8 mean, giving an estimate of half of the week 0 to week 8 difference. It is appropriately weighted with the paired week 0 to week 8 estimate.

MMRM - More General II Can the intervening week 1 to week 6 data be used to improve further the week 0 to week 8 comparison? That information could be used to better estimate the variances and covariances, if we are willing to make assumptions, e.g., a consistency of variability at each time. Are these just “just so”, common-sense results? Mixed model estimates satisfy certain statistical optimality criteria, provided that the model assumptions hold.

MMRM - Warning Software has many options since mixed models are general and flexible. Defaults may not be appropriate. Requires specifying model structure; assumptions needed; should check assumptions. More experience needed than typical methods. Start by comparing ANOVA for a no-missing-data study with mixed model. See next slide for some modeling needed.

Some Covariance Patterns Compound Symmetry Estimated Covariance Pattern: Week (7.06) (7.06) 2 Correlation = 12.4/7.1*7.1=0.25 This model forces the SD among subjects to be the same at each week. But: Week 0 SD = 5.2 Week 8 SD = 8.8 Unstructured Estimated Covariance Pattern: Week (5.21) (8.79) 2 Correlation = 12.4/5.2*8.8=0.27 This model allows different SDs among subjects at each week.

MMRM Role in Major Analysis Methods Remaining slides put MMRM in context with other major methods. Mixed models are more general; MMRM is special case.

Big Picture: “Multiple” Data Lingo Multiple Regression: Outcome: Single value, say HAM-A at 8 weeks. Predictors: Multiple - treatment, covariates (age, baseline disease severity, other meds, etc.) Multivariate ANOVA (MANOVA): Outcome: Multiple, say (HAM-A, SDS, Dizziness) at 8 weeks, as a pattern or profile. Predictors: Single, say only treatment, or multiple. Repeated Measures: (longitudinal, as in this paper) Outcome: Single quantity, say HAM-A. Predictors: Time, and others (treatment, covariates).

Repeated Measures Studies The same subjects are measured repeatedly on the same outcome, usually at different times or body sites to be compared. Does not apply to only replicated measurements, e.g. multiple histology slices that are averaged. Time is usually relative, such as from start of treatment, or may be calendar time as in epidemiological studies. Usually have fixed time intervals, but times may be different for different subjects, e.g., retrospective series of clinic visits. Study goals will dictate type of analysis - next slide.

Goals in Repeated Measures Studies Some study objectives: Compare overall time-averaged treatment. Specific features of pattern, as in pharmacokinetic studies of AUC, peak, half- life, etc. Compare treatments at every time point. Compare treatments on rate of change over time. Compare treatments at end of study.

Mixed Models in General “Mixed” means combination of fixed effects (e.g., drugs; want info on those particular drugs) and random effects (e.g., centers or patients; not interested in the particular ones in the study). AKA multilevel models, hierarchical models. Very flexible, incorporate unequal patient variability, correlation, pairing, repeated values at multiple levels, subject clustering e.g., from the same family, and data missing at random. More specifications required than typical analyses.

Summary: Mixed Models Repeated Measures Currently one of the preferred methods for missing data. Does not resolve bias if missingness is related to treatment. Requires more model specifications than is typical. Mild deviations from assumed covariance pattern do not usually have a large influence. May be difficult to apply objectively in clinical trials where the primary analysis needs to be detailed a priori. Can be intimidating; need experience with modeling; software has many options to be general and flexible.