Biostatistics Case Studies 2008 Peter D. Christenson Biostatistician Session 5: Choices for Longitudinal Data Analysis.

Slides:



Advertisements
Similar presentations
1-Way Analysis of Variance
Advertisements

Lecture 11 (Chapter 9).
Topic 12 – Further Topics in ANOVA
1 QOL in oncology clinical trials: Now that we have the data what do we do?
Factorial and Mixed Factor ANOVA and ANCOVA
Estimation of Sample Size
Biostatistics Case Studies 2006 Peter D. Christenson Biostatistician Session 5: Reporting Subgroup Results.
From last time….. Basic Biostats Topics Summary Statistics –mean, median, mode –standard deviation, standard error Confidence Intervals Hypothesis Tests.
LINEAR REGRESSION: Evaluating Regression Models. Overview Standard Error of the Estimate Goodness of Fit Coefficient of Determination Regression Coefficients.
Longitudinal Experiments Larry V. Hedges Northwestern University Prepared for the IES Summer Research Training Institute July 28, 2010.
Clustered or Multilevel Data
Today Concepts underlying inferential statistics
Lorelei Howard and Nick Wright MfD 2008
Sample Size Determination Ziad Taib March 7, 2014.
Slides 13b: Time-Series Models; Measuring Forecast Error
Professor of Epidemiology and Biostatistics
Repeated measures: Approaches to Analysis Peter T. Donnan Professor of Epidemiology and Biostatistics.
Objectives of Multiple Regression
Issues in Experimental Design Reliability and ‘Error’
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?
Advanced Statistics for Interventional Cardiologists.
Biostatistics in Clinical Research Peter D. Christenson Biostatistician January 12, 2005IMSD U*STAR RISE.
23-1 Analysis of Covariance (Chapter 16) A procedure for comparing treatment means that incorporates information on a quantitative explanatory variable,
Slide 1 Estimating Performance Below the National Level Applying Simulation Methods to TIMSS Fourth Annual IES Research Conference Dan Sherman, Ph.D. American.
Some terms Parametric data assumptions(more rigorous, so can make a better judgment) – Randomly drawn samples from normally distributed population – Homogenous.
Statistics (cont.) Psych 231: Research Methods in Psychology.
Biostatistics Case Studies 2008 Peter D. Christenson Biostatistician Session 3: Replicates.
Biostatistics in Practice Peter D. Christenson Biostatistician LABioMed.org /Biostat Session 6: Case Study.
Biostatistics Case Studies 2007 Peter D. Christenson Biostatistician Session 3: Incomplete Data in Longitudinal Studies.
Biostatistics Case Studies 2015 Youngju Pak, PhD. Biostatistician Session 1: Sample Size & Power for Inequality and Equivalence Studies.
Statistics (cont.) Psych 231: Research Methods in Psychology.
7. Comparing Two Groups Goal: Use CI and/or significance test to compare means (quantitative variable) proportions (categorical variable) Group 1 Group.
Biostatistics in Practice Peter D. Christenson Biostatistician LABioMed.org /Biostat Session 4: Study Size and Power.
Osteoarthritis Initiative Analytic Strategies for the OAI Data December 6, 2007 Charles E. McCulloch, Division of Biostatistics, Dept of Epidemiology and.
Biostatistics in Practice Peter D. Christenson Biostatistician Session 4: Study Size and Power.
Biostatistics in Practice Peter D. Christenson Biostatistician Session 6: Case Study.
SW 983 Missing Data Treatment Most of the slides presented here are from the Modern Missing Data Methods, 2011, 5 day course presented by the KUCRMDA,
PSYC 3030 Review Session April 19, Housekeeping Exam: –April 26, 2004 (Monday) –RN 203 –Use pencil, bring calculator & eraser –Make use of your.
1 Updates on Regulatory Requirements for Missing Data Ferran Torres, MD, PhD Hospital Clinic Barcelona Universitat Autònoma de Barcelona.
Biostatistics Case Studies 2010 Peter D. Christenson Biostatistician Session 3: Clustering and Experimental Replicates.
Biostatistics Case Studies 2006 Peter D. Christenson Biostatistician Session 4: An Alternative to Last-Observation-Carried-Forward:
Biostatistics in Practice Peter D. Christenson Biostatistician Session 3: Testing Hypotheses.
Biostatistics Case Studies 2006 Peter D. Christenson Biostatistician Session 1: Demonstrating Equivalence of Active Treatments:
1 Mohamed Alosh, Ph.D. Kathleen Fritsch, Ph.D. Shiowjen Lee, Ph.D. DBIII, OB, CDER, FDA Efficacy Evaluation in Acne Clinical Trials.
Biostatistics Case Studies 2006 Peter D. Christenson Biostatistician Session 2: Correlation of Time Courses of Simultaneous.
Sample Size Determination
Biostatistics Case Studies 2006 Peter D. Christenson Biostatistician Session 3: An Alternative to Last-Observation-Carried-Forward:
Tutorial I: Missing Value Analysis
Session 6: Other Analysis Issues In this session, we consider various analysis issues that occur in practice: Incomplete Data: –Subjects drop-out, do not.
How does Biostatistics at Roche typically analyze longitudinal data
Biostatistics in Practice Session 6: Data and Analyses: Too Little or Too Much Youngju Pak Biostatistician
Biostatistics Case Studies Peter D. Christenson Biostatistician Session 3: Missing Data in Longitudinal Studies.
Statistics (cont.) Psych 231: Research Methods in Psychology.
Biostatistics in Practice Peter D. Christenson Biostatistician Session 6: Data and Analyses: Too Little or Too Much.
Biostatistics Case Studies 2006 Peter D. Christenson Biostatistician Session 1: Demonstrating Equivalence of Active Treatments:
Biostatistics Case Studies 2016 Youngju Pak, PhD. Biostatistician Session 2 Understanding Equivalence and Noninferiority testing.
Inferential Statistics Psych 231: Research Methods in Psychology.
Inferential Statistics Psych 231: Research Methods in Psychology.
Repeated measures: Approaches to Analysis
Biostatistics Case Studies 2007
Slope of the regression line:
Analyzing Intervention Studies
CHAPTER 29: Multiple Regression*
A Gentle Introduction to Linear Mixed Modeling and PROC MIXED
An Introductory Tutorial
Psych 231: Research Methods in Psychology
Inferential Statistics
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Presentation transcript:

Biostatistics Case Studies 2008 Peter D. Christenson Biostatistician Session 5: Choices for Longitudinal Data Analysis

Case Study

Study Goal - General

Specific Primary Aim The “ANCOVA” would be a t-test, if we ignored the baseline values and the different centers. The outcome is change in HAM-A. The groups are drug and placebo. The signal:noise ratio is ……

Comparison of Change Means with t-test Strength of Treatment Effect: Signal:Noise Ratio t= Observed Δ SD Δ √(1/N 1 + 1/N 2 ) Δ = Drug - Placebo Mean (Final-Base) Diff in HAM-A changes SD = Std Dev of within group HAM-A changes N 1 = N 2 = Group size | t | > ~1.96 ↔ p<0.05

Comparison of Change Means with t-test Strength of Treatment Effect: Signal:Noise Ratio t= Observed Δ SD Δ √(1/N 1 + 1/N 2 ) (-10.2) (?) √(1/ /132) = → p=0.27 = (Actually adjusted for baseline and center)

More than Two Visits How can we get one signal:noise ratio incorporating all visits? Perhaps we want to detect treatment effect at any visit.

Suppose Only Three Visits - Weeks 0, 4, 8 Two Treatment Differences in Changes: Δ 1 = D 1 - P 1 Δ 2 = D 2 - P 2 D1D1 D2D2 P1P1 P2P2 Total Effect: Δ Δ 2 2

Comparison of Change Means with ANOVA Strength of Treatment Effect: Signal:Noise Ratio F= Observed (Δ Δ 2 2 ) √V V involves SD Δ 1 and SD Δ 2 and the 1/Ns. Large F ↔ Δ Δ 2 2 too large to be random ↔ p<0.05

Repeated Measures ANOVA The previous slide is “classical” repeated measures ANOVA. Could have many groups and many time points. If the overall “total” effect is significant, then we would examine which Δs are the cause. Same conclusions if changes from baseline, not sequential changes were used. Since the signal or effect Δ 1 2 +Δ 2 2 equally weights the two Δ, we must know all changes for a subject. If we do not (missing data), then that subject is completely removed from the analysis.

Mixed Models for Repeated Measures (MMRM) “Classical” repeated measures ANOVA uses only subjects with no missing visits. MMRM overcomes that limitation by making a signal:noise ratio as the weighted average of signals or effects from sets of subjects with the same missing visit pattern. MMRM still provides the overall ratio, as in the classical ANOVA that cannot handle missing visits.

Mixed Models for Repeated Measures (MMRM) The next four slides use a simpler example to give the idea of how the weighting is done in MMRM. These four slides can be skipped to get to the bigger picture of longitudinal analyses.

MMRM Example* *Brown, Applied Mixed Models in Medicine, Wiley Consider a crossover (paired) study with 6 subjects. Subject 5 missed treatment A and subject 6 missed B. Completer analysis would use IDs 1-4; trt diff=4.25. Strict LOCF analysis would impute 22,17; trt diff=2.83. LOCF Difference

MMRM Example Cont’d Δ W =4.25 Paired Δ B =5 Unpaired Mixed model gets the better* estimate of the A-B difference from the 4 completers paired mean Δ w =4.25. It gets a poorer unpaired estimate from the other 2 subjects Δ B = = 5. How are these two “sub-studies” combined? *Why better?

MMRM Example Cont’d Δ W =4.25 Paired Δ B =5 Unpaired The overall estimated Δ is a weighted average of the separate Δs, inversely weighting by their variances: Δ = [Δ W /SE 2 (Δ W ) + Δ B /SE 2 (Δ B )]/K = [4.25/ /43.1]/(1/ /43.1) = 4.32 The 4.45 and 43.1 incorporate the Ns and whether data is paired or unpaired.

MMRM - More General I The example was “balanced” in missing data, with information from both treatments A and B in the unpaired data. What if all missing data are for A, and none for B? The unpaired A mean is compared with the combined A and B mean, giving an estimate of half of the A - B difference. It is appropriately weighted with the paired A - B estimate.

Competing Conclusions The next three slides show differences obtained by using different repeated measures approaches. These three slides can be skipped to get to other approaches for longitudinal analyses.

Competing Conclusions

Imputation with LOCF Completer 30 HAM-A Score Week 0 Ignores potential progression; conservative; usually attenuates likely changes and ↑ standard deviations. No correction for using unobserved data as if real. Individual Subjects denotes imputed: N=63/260 Use all 260 values as if observed here.

Completer vs. LOCF vs. MMRM Analysis LOCF Analysis Δ b/w groups = 1.8 N=260: 197 actual, 63 imputed Completer Analysis Δ b/w groups = 2.5 N=197: 197 actual MMRM uses all available visits for all 260. No imputation (Week 8 or earlier)

MMRM vs. Classical: Why Distinguish? Doesn’t distinguishing MMRM and classical seem to be about a minor technical point about weighting? Why make such a big deal? The MMRM is not in many basic software packages. It is not obvious how to perform it in software that does have it. So, it is not user-friendly yet. If you have missing data, ask a statistician to set it up in software correctly.

Other Approaches to Longitudinal Data So far, we have considered all sequential changes or changes from baseline. What other outcomes could be of interest?

Some Other Goals with Longitudinal Data Use one visit at a time: Compare treatments at each time separately - doesn’t look at changes in individuals. Compare treatments at end of study. Create summary over time: Compare average over time - trends unimportant. Specific pattern features, as in pharmacokinetic studies of AUC, peak, half-life, etc. Compare treatments on rate of change over time.

Average over Time - Trends Unimportant AUC Area Under the Curve (AUC), divided by total length of time, is an average outcome, weighted for time. Larger weights are given to the larger time intervals, since AUC is just a sum of trapezoids....

“Growth” Curves Parabola or line or equation based on theory describes time trend. The idea is to compare treatment groups on a parameter describing the pattern, e.g., slope.

“Growth” Curves The logic is to compare treatment groups by finding means over subjects in each group for a parameter describing the pattern, e.g., slope. Next slide for correct method.

“Growth” Curves The idea is to compare treatment groups on a parameter describing the pattern, e.g., slope. Conceptually, we could just fit a separate regression line for each subject, get the slopes, and compare mean slopes between groups with a t-test. But subjects may have different numbers of visits, and the slope might be correlated with the intercept (e.g., start off higher → smaller slope). So, another form of “mixed models” is more accurate: “random coefficient” models. They give slopes also. Like MMRM, they are not very user-friendly in software, so ask a statistician to set up.

Summary on Mixed Models Repeated Measures Currently one of the preferred methods for missing data. Does not resolve bias if missingness is related to treatment. Requires more model specifications than is typical. Mild deviations from assumed covariance pattern do not usually have a large influence. May be difficult to apply objectively in clinical trials where the primary analysis needs to be detailed a priori. Can be intimidating; need experience with modeling; software has many options to be general and flexible.