Local Control Bob Obenchain, PhD, FASA Risk Benefit Statistics LLC Yin = Dark = Evil = Risk Yang = Light = Good = Benefit.

Slides:



Advertisements
Similar presentations
THE USE OF HISTORICAL CONTROLS IN DEVICE STUDIES Vic Hasselblad Duke Clinical Research Institute.
Advertisements

Identifying Meaningful Patient Subgroups via Clustering - Sensitivity Graphics Bob Obenchain Commercial Information Sciences Outcomes Research, US Medical.
Bias Correction in Pharmaceutical Risk-Benefit Assessment Bob Obenchain, PhD, FASA Risk-Benefit Statistics LLC Yin = Dark = Evil = Risk Yang = Light =
Lecture 2 ANALYSIS OF VARIANCE: AN INTRODUCTION
Agency for Healthcare Research and Quality (AHRQ)
I OWA S TATE U NIVERSITY Department of Animal Science Using Basic Graphical and Statistical Procedures (Chapter in the 8 Little SAS Book) Animal Science.
Controlling for Time Dependent Confounding Using Marginal Structural Models in the Case of a Continuous Treatment O Wang 1, T McMullan 2 1 Amgen, Thousand.
A workshop introducing doubly robust estimation of treatment effects
Lecture 10 F-tests in MLR (continued) Coefficients of Determination BMTRY 701 Biostatistical Methods II.
Reliable Causal Inference via Genetic Matching: A new matching method jointly developed by Alexis Diamond and Jasjeet Sekhon Alexis Diamond Software by.
1 Arlene Ash QMC - Third Tuesday September 21, 2010 (as amended, Sept 23) Analyzing Observational Data: Focus on Propensity Scores.
Introduction to Propensity Score Weighting Weimiao Fan 10/10/
V.: 9/7/2007 AC Submit1 Statistical Review of the Observational Studies of Aprotinin Safety Part I: Methods, Mangano and Karkouti Studies CRDAC and DSaRM.
Observational Studies Based on Rosenbaum (2002) David Madigan Rosenbaum, P.R. (2002). Observational Studies (2 nd edition). Springer.
Method Reading Group September 22, 2008 Matching.
 Confounders are usually controlled with the “standard” response regression model.  The standard model includes confounders as covariates in the response.
Clustered or Multilevel Data
Impact Evaluation Session VII Sampling and Power Jishnu Das November 2006.
Introduction to Multilevel Modeling Using SPSS
Mean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242.
Multiple Choice Questions for discussion
Assessing Studies Based on Multiple Regression
Advanced Statistics for Interventional Cardiologists.
Introduction ANOVA Mike Tucker School of Psychology B209 Portland Square University of Plymouth Drake Circus Plymouth, PL4 8AA Tel: +44 (0)
ECON ECON Health Economic Policy Lab Kem P. Krueger, Pharm.D., Ph.D. Anne Alexander, M.S., Ph.D. University of Wyoming.
Lecture 8: Generalized Linear Models for Longitudinal Data.
Why Is It There? Getting Started with Geographic Information Systems Chapter 6.
Instrumental Variables: Problems Methods of Economic Investigation Lecture 16.
Rasch trees: A new method for detecting differential item functioning in the Rasch model Carolin Strobl Julia Kopf Achim Zeileis.
1 Background A new class of continuous factor (C-Factor) models have been proposed as a parsimonious alternative to HB for conjoint and choice modeling.
Multilevel Data in Outcomes Research Types of multilevel data common in outcomes research Random versus fixed effects Statistical Model Choices “Shrinkage.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Linear correlation and linear regression + summary of tests
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
Propensity Score Matching for Causal Inference: Possibilities, Limitations, and an Example sean f. reardon MAPSS colloquium March 6, 2007.
Public Policy Analysis ECON 3386 Anant Nyshadham.
Propensity Scoring and Beyond: Why? and How? Midwest Biopharmaceutical Statistics Workshop, 2009.
BIOST 536 Lecture 11 1 Lecture 11 – Additional topics in Logistic Regression C-statistic (“concordance statistic”)  Same as Area under the curve (AUC)
AFRICA IMPACT EVALUATION INITIATIVE, AFTRL Africa Program for Education Impact Evaluation David Evans Impact Evaluation Cluster, AFTRL Slides by Paul J.
Generalizing Observational Study Results Applying Propensity Score Methods to Complex Surveys Megan Schuler Eva DuGoff Elizabeth Stuart National Conference.
Relative Values. Statistical Terms n Mean:  the average of the data  sensitive to outlying data n Median:  the middle of the data  not sensitive to.
Limited Dependent Variables Ciaran S. Phibbs. Limited Dependent Variables 0-1, small number of options, small counts, etc. 0-1, small number of options,
Impact Evaluation Sebastian Galiani November 2006 Matching Techniques.
Todd Wagner, PhD October 2013 Propensity Scores. Outline 1. Background on assessing causation 2. Define propensity score (PS) 3. Calculate the PS 4. Use.
A Claims Database Approach to Evaluating Cardiovascular Safety of ADHD Medications A. J. Allen, M.D., Ph.D. Child Psychiatrist, Pharmacologist Global Medical.
Overview and Common Pitfalls in Statistics and How to Avoid Them
A REVIEW By Chi-Ming Kam Surajit Ray April 23, 2001 April 23, 2001.
Testing statistical significance of differences between coefficients Jane E. Miller, PhD The Chicago Guide to Writing about Multivariate Analysis, 2nd.
Using Propensity Score Matching in Observational Services Research Neal Wallace, Ph.D. Portland State University February
1 EPI235: Epi Methods in HSR April 5, 2005 L3 Evaluating Health Services using administrative data 2: Advanced Topics in Risk Adjustment (Dr. Schneeweiss)
1 Hester van Eeren Erasmus Medical Centre, Rotterdam Halsteren, August 23, 2010.
One-Way Analysis of Variance Recapitulation Recapitulation 1. Comparing differences among three or more subsamples requires a different statistical test.
Statistical inference Statistical inference Its application for health science research Bandit Thinkhamrop, Ph.D.(Statistics) Department of Biostatistics.
1 Statistical Review of the Observational Studies of Aprotinin Safety Part II: The i3 Drug Safety Study CRDAC and DSaRM Meeting September 12, 2007 P. Chris.
Randomized Assignment Difference-in-Differences
Considering model structure of covariates to estimate propensity scores Qiu Wang.
1 Bandit Thinkhamrop, PhD.(Statistics) Dept. of Biostatistics & Demography Faculty of Public Health Khon Kaen University Overview and Common Pitfalls in.
Confounding adjustment: Ideas in Action -a case study Xiaochun Li, Ph.D. Associate Professor Division of Biostatistics Indiana University School of Medicine.
Transparency in the Use of Propensity Score Methods
Todd Wagner, PhD February 2011 Propensity Scores.
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
Alexander Spermann University of Freiburg, SS 2008 Matching and DiD 1 Overview of non- experimental approaches: Matching and Difference in Difference Estimators.
Matching methods for estimating causal effects Danilo Fusco Rome, October 15, 2012.
Introduction to Marketing Research
S. Stanley Young Robert Obenchain Goran Krstic
APPROACHES TO QUANTITATIVE DATA ANALYSIS
Presenter: Wen-Ching Lan Date: 2018/03/28
Explanation of slide: Logos, to show while the audience arrive.
MGS 3100 Business Analysis Regression Feb 18, 2016
Outlines Introduction & Objectives Methodology & Workflow
Presentation transcript:

Local Control Bob Obenchain, PhD, FASA Risk Benefit Statistics LLC Yin = Dark = Evil = Risk Yang = Light = Good = Benefit

Epidemiology (case-control & cohort) studies Post-stratification and re-weighting in surveys Stratified, dynamic randomization to improve balance on predictors of outcome Matching and Sub-grouping using Propensity Scores Econometric Instrumental Variables (LATEs) Marginal Structural Models (IPW  1/PS) Unsupervised Propensity Scoring: Nested Treatment- within-Cluster ANOVA model …with LATE, LTD and Error sources of variation History of Local Control Methods for Human Studies

Epidemiology (case-control & cohort) studies Post-stratification and re-weighting in surveys Stratified, dynamic randomization to improve balance on predictors of outcome Matching and Sub-grouping using Propensity Scores Econometric Instrumental Variables (LATEs) Marginal Structural Models (IPW  1/PS) Unsupervised Propensity Scoring: Nested Treatment- within-Cluster ANOVA model …with LATE, LTD and Error sources of variation History of Local Control Methods for Human Studies

Source Degrees-of- Freedom Interpretation Clusters (Subgroups) K = Number of Clusters Cluster Means are Local Average Treatment Effects (LATEs) when X’s are Instrumental Variables (IVs) Treatment within Cluster I = Number of “Informative” Clusters Local Treatment Differences (LTDs) are of interest for All Types of X-variables Error Number of Patients  K  I Uncertainty Although a NESTED model can be (technically) WRONG, it is sufficiently versatile to almost always be USEFUL as the number of “clusters” increases. Nested ANOVA

The full DISTRIBUTION of Local Treatment Differences (LTDs) quantifies All Likely EFFECTS of Treatment. The MEAN of this LTD distribution is the MAIN Effect of Treatment.

Estimated PSs can easily “Fail-to-Factor” Joint distribution of x and t given p: Pr( x, t | p )  Pr( x | p ) Pr( t | p ) …because the sub-space of x vectors such that x  = constant is unbounded!

Constant PS Estimate  Calipers from Discrete Choice (Logit or Probit) Model x2x2 x3x3 x1x1 x  Linear Functional  constant Infinite 3-D Slab

Pr( x, t | p ) = Pr( x | p ) Pr( t | p ) The unknown true propensity score is the “most coarse” possible balancing score. The known x -vector itself is the “most detailed” balancing score… Pr( x, t ) = Pr( x ) Pr( t | x )

Conditioning upon Cluster Membership is intuitively somewhere between the two PS extremes in the limit as individual clusters become numerous, small and compact …as long as t information is not used to form clusters But LESS “detailed” than Pr( x, t ) = Pr( x ) Pr( t | x ) ? Pr( x, t | C )  Pr( x | t, C ) Pr( t | C )  Pr( x | C ) Pr( t | C ) What is LESS “coarse” than Pr( x, t | p ) = Pr( x | p ) Pr( t | p ) ?

Unsupervised  No PS Estimates Needed x2x2 x3x3 x1x1 3-D Clusters (Informative or Uninformative)

LSIM10K Numerical Example 10 simulated measurements for 10,325 patients. [1] mort6mo : Binary 6-month mortality indicator. [2] cardcost : Cumulative 6-month cardiac related charges. [3] trtm : Binary indicator (1 => treated, 0 => untreated). [4] stent : Binary indicator (1 => coronary stent deployment.) [5] height : Patient height rounded to the nearest centimeter. [6] female : Binary sex indicator (1 => yes, 0 => male.) [7] diabetic : Binary indicator (1 => diabetes mellitus, 0 => no.) [8] acutemi : Binary indicator (1 => acute myocardial infarction within the previous 7 days, 0 => no.) [9] ejecfrac : Left ejection fraction % rounded to integer. [10] ves1proc : Number of vessels involved in initial PCI.

PatsMortDiff (%) Standard Error CostDiff ($) 10, ves1proc , , , , , ( Treated – Untreated ) Differences

The Four “Tactical” Phases of Local Control Analysis 1.Does making Comparisons More-and- More LOCAL suggest that the GLOBAL Comparison is BIASED? 2.Is the Observed LTD Distribution “Salient” (Meaningfully Different from Random) ? 3.Systematic Sensitivity Analyses !!! 4.Which Patient Characteristics are Predictive of Differential Response?

Cuts of Cluster Dendrogram yield desired Number-of-Clusters

Y1LTD: Mortality within Six Months Across Cluster Average LTD Outcome Plus Two Sigma Upper Limit for Average LTD Minus Two Sigma Lower Limit for Average LTD NCreq = Number of Clusters Requested NCreqNCinfoY1 LTD Local Std Err Y1 Low Limit Y1 Upr Limit 11                      LC “Unbiasing” TRACE

Y2LTD: Six Month Cardiac Costs Across Cluster Average LTD Outcome Plus Two Sigma Upper Limit for Average LTD Minus Two Sigma Lower Limit for Average LTD NCreq = Number of Clusters Requested LC “Unbiasing” TRACE NCreqNCinfoY2 LTDLocal Std Err Y2 Low Limit Y2 Upr Limit         

Observed LTD Distributions cardcost mort6mo

Simulate the “Artificial LTD distribution” by using 900 clusters of the same size with the same treatment fractions as the observed clusters; 100 clusters will be uninformative. Randomly distribute the observed outcomes (y) within these clusters, preserving Treated (t=1) vs Untreated (t=0) status but ignoring all other observed patient X-characteristics. Repeat this random process 25 times, yielding a sample of 25 x 800 = 20,000 Artificial LTDs. Compare the resulting Artificial LTD distribution with that from the 800 Observed LTDs. LC Phase Two: Is the Observed LTD Distribution “Salient”? (Meaningfully Different from Random)

Visual Comparison of Empirical CDFs for the Observed (solid) and Artificial (dashed) LTD distributions of the mort6mo outcome.

Zoom-In on Tails: Empirical CDFs for the Observed (solid) and Artificial (dotted) LTD distributions.

The three patient baseline x  characteristics that appear to be most predictive of t  treatment choice and the mortality y  outcome are stent, acutemi and ejfract. Thus, an LC analysis using only these three patient characteristics is parsimonious. Interestingly, the resulting LC unbiasing trace for mort6mo is quite similar to that using all 7 x-characteristics, but the cardcost trace is much smoother! LC Phase Three: Systematic Sensitivity Analyses No software currently available for this sort of “batch mode” processing One Interesting Alternative Clustering

LC Phase Four: Multivariable Model predicting mort6mo LTDs from Seven Baseline Patient x  Characteristics (“effect screening” via Factorial to Degree Two) Summary of Fit Analysis of Variance SourceDFSum of Squares Mean Square F Ratio Model Error Prob > F C. Total <.0001 RSquare RSquare Adj Root Mean Square Error

LC Phase Four: Partition Regression TREE predicting mort6mo LTDs from Seven Patient x  Characteristics.

This numerical example illustrates that Local Control methods are sufficiently flexible to do a much better job of removing the effects of treatment selection bias and confounding than smooth, global multivariable models. This example also dramatically illustrates that some of the most interesting effects of treatment are NOT main-effects. Evidence of patient differential response to treatment is always important …even when it’s not completely clear which patients will display which differences. ANALYSIS SUMMARY

[i] LC analysis is easy to explain and illustrate (visually) to nontechnical audiences, [ii] Statistical professionals should find the LC approach to be both intuitively appealing and widely applicable, [iii] LC results are demonstrably more robust than traditional global, over-smooth parametric models, and [iv] LC yields realistic inferences even in the limit as “N” increases towards infinity. LC Advantages

Bang H, Robins JM. Doubly Robust Estimation in Missing Data and Causal Inference Models. Biometrics 2005; 61: Fraley C, Raftery AE. Model based clustering, discriminant analysis and density estimation. JASA 2002; 97: Imbens GW, Angrist JD. Identification and Estimation of Local Average Treatment Effects. Econometrica 1994; 62: McClellan M, McNeil BJ, Newhouse JP. Does More Intensive Treatment of Myocardial Infarction in the Elderly Reduce Mortality?: Analysis Using Instrumental Variables. JAMA 1994; 272: McEntegart D. “The Pursuit of Balance Using Stratified and Dynamic Randomization Techniques: An Overview.” Drug Information Journal 2003; 37: References

Obenchain RL. USPS package: Unsupervised and Supervised Propensity Scoring in R. Version August Obenchain RL. “The Local Control Approach using JMP,” In: Analysis of Observational Health-Care Data Using SAS. Cary, NC: SAS Press. September Robins JM, Hernan MA, Brumback B. Marginal Structural Models and Causal Inference in Epidemiology. Epidemiology 2000; 11: Rosenbaum PR, Rubin RB. The Central Role of the Propensity Score in Observational Studies for Causal Effects. Biometrika 1983; 70: Rosenbaum PR. Observational Studies, Second Edition. New York: Springer-Verlag References …concluded