Using Propensity Score Matching in Observational Services Research Neal Wallace, Ph.D. Portland State University February 2014 1.

Slides:



Advertisements
Similar presentations
REGRESSION, IV, MATCHING Treatment effect Boualem RABTA Center for World Food Studies (SOW-VU) Vrije Universiteit - Amsterdam.
Advertisements

1 Arlene Ash QMC - Third Tuesday September 21, 2010 (as amended, Sept 23) Analyzing Observational Data: Focus on Propensity Scores.
LT8: Matching Sam Marden Introduction Describe the intuition behind matching estimators. Be concise. Suppose you have a sample of.
Propensity Score Matching Lava Timsina Kristina Rabarison CPH Doctoral Seminar Fall 2012.
The World Bank Human Development Network Spanish Impact Evaluation Fund.
Analysis of frequency counts with Chi square
EXPERIMENTAL DESIGNS Criteria for Experiments
By Wendiann Sethi Spring  The second stages of using SPSS is data analysis. We will review descriptive statistics and then move onto other methods.
1 Arlene Ash QMC - Third Tuesday September 21, 2010 Analyzing Observational Data: Focus on Propensity Scores.
Clustered or Multilevel Data
Treatment Effects: What works for Whom? Spyros Konstantopoulos Michigan State University.
Bivariate & Multivariate Regression correlation vs. prediction research prediction and relationship strength interpreting regression formulas process of.
Today Concepts underlying inferential statistics
Introduction to PSM: Practical Issues, Concerns, & Strategies Shenyang Guo, Ph.D. School of Social Work University of North Carolina at Chapel Hill January.
Chapter 14 Inferential Data Analysis
Inferential statistics Hypothesis testing. Questions statistics can help us answer Is the mean score (or variance) for a given population different from.
ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?
Multiple Choice Questions for discussion
Advanced Statistics for Interventional Cardiologists.
Simple Linear Regression
Matching Methods. Matching: Overview  The ideal comparison group is selected such that matches the treatment group using either a comprehensive baseline.
Copyright © 2010 Pearson Education, Inc. Chapter 13 Experiments and Observational Studies.
Experiments and Observational Studies. Observational Studies In an observational study, researchers don’t assign choices; they simply observe them. look.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 13 Experiments and Observational Studies.
Slide 1 Estimating Performance Below the National Level Applying Simulation Methods to TIMSS Fourth Annual IES Research Conference Dan Sherman, Ph.D. American.
Slide 13-1 Copyright © 2004 Pearson Education, Inc.
Statistical Power 1. First: Effect Size The size of the distance between two means in standardized units (not inferential). A measure of the impact of.
Correlational Research Chapter Fifteen Bring Schraw et al.
Introduction Osborn. Daubert is a benchmark!!!: Daubert (1993)- Judges are the “gatekeepers” of scientific evidence. Must determine if the science is.
Article Review Cara Carty 09-Mar-06. “Confounding by indication in non-experimental evaluation of vaccine effectiveness: the example of prevention of.
Matching Estimators Methods of Economic Investigation Lecture 11.
Estimating Causal Effects from Large Data Sets Using Propensity Scores Hal V. Barron, MD TICR 5/06.
Linear correlation and linear regression + summary of tests
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Chapter 3.1.  Observational Study: involves passive data collection (observe, record or measure but don’t interfere)  Experiment: ~Involves active data.
Public Policy Analysis ECON 3386 Anant Nyshadham.
AFRICA IMPACT EVALUATION INITIATIVE, AFTRL Africa Program for Education Impact Evaluation David Evans Impact Evaluation Cluster, AFTRL Slides by Paul J.
Chapter 4: Introduction to Predictive Modeling: Regressions
Generalizing Observational Study Results Applying Propensity Score Methods to Complex Surveys Megan Schuler Eva DuGoff Elizabeth Stuart National Conference.
Latent Growth Modeling Byrne Chapter 11. Latent Growth Modeling Measuring change over repeated time measurements – Gives you more information than a repeated.
Applying impact evaluation tools A hypothetical fertilizer project.
Non-experimental methods Markus Goldstein The World Bank DECRG & AFTPM.
September 18-19, 2006 – Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development Conducting and interpreting multivariate analyses.
Master’s Essay in Epidemiology I P9419 Methods Luisa N. Borrell, DDS, PhD October 25, 2004.
Africa Program for Education Impact Evaluation Dakar, Senegal December 15-19, 2008 Experimental Methods Muna Meky Economist Africa Impact Evaluation Initiative.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
1 Chapter 4: Introduction to Predictive Modeling: Regressions 4.1 Introduction 4.2 Selecting Regression Inputs 4.3 Optimizing Regression Complexity 4.4.
Randomized Assignment Difference-in-Differences
Considering model structure of covariates to estimate propensity scores Qiu Wang.
Todd Wagner, PhD February 2011 Propensity Scores.
Nonparametric Statistics
Summary: connecting the question to the analysis(es) Jay S. Kaufman, PhD McGill University, Montreal QC 26 February :40 PM – 4:20 PM National Academy.
MATCHING Eva Hromádková, Applied Econometrics JEM007, IES Lecture 4.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Alexander Spermann University of Freiburg, SS 2008 Matching and DiD 1 Overview of non- experimental approaches: Matching and Difference in Difference Estimators.
ENDOGENEITY - SIMULTANEITY Development Workshop. What is endogeneity and why we do not like it? [REPETITION] Three causes: – X influences Y, but Y reinforces.
Choosing and using your statistic. Steps of hypothesis testing 1. Establish the null hypothesis, H 0. 2.Establish the alternate hypothesis: H 1. 3.Decide.
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
Matching methods for estimating causal effects Danilo Fusco Rome, October 15, 2012.
Looking for statistical twins
Constructing Propensity score weighted and matched Samples Stacey L
Sec 9C – Logistic Regression and Propensity scores
Matching Methods & Propensity Scores
Matching Methods & Propensity Scores
Methods of Economic Investigation Lecture 12
SPSS Propensity Score Matching: An overview
Matching Methods & Propensity Scores
Impact Evaluation Methods: Difference in difference & Matching
Chapter 11 EDPR 7521 Dr. Kakali Bhattacharya
Presentation transcript:

Using Propensity Score Matching in Observational Services Research Neal Wallace, Ph.D. Portland State University February

Overview What is Propensity Score Matching (PSM) When is PSM used How PSM works A Simple Recipe for Generalized PSM Some MH Services Research Examples Summary Overview of PSM 2

What is Propensity Score Matching Propensity Score Matching (PSM) is a technique used to create commonality among groups of observations/subjects across observed characteristics PSM is used in observational research where random assignment is not feasible Reduces or eliminates sample selection bias in making comparisons across groups (e.g. treatment/control) on observable characteristics 3

When is PSM used? Suppose you are interested in whether two clinics provide equivalent amounts of treatment to individuals But age and mental health status also determine utilization independent of clinic practice A review of utilization, age and mental health status by clinic might look like…. 4

When is PSM used? You can’t tell whether the difference in utilization is due to clinic practice, age/gender/MH status, or both 5

When is PSM used? You could use multivariate regression to “balance” the effects of potential confounding measures (age, gender, & MH status here) The regression equation would look like: Visits = B0 + B1*Clinic + B2*Age + B3*Gender + B4*MH Status + e And (hopefully) B1 would represent any difference in average individual utilization across clinics after accounting for differences in age, gender & MH status… 6

When is PSM used? But..what if a closer look at your samples revealed this: You can’t balance what doesn’t exist (no young females in clinic A) You won’t get balance on MH status using linear regression if relationship with utilization is non-linear 7

When is PSM used? You could try to “hand pick” matching subjects across clinics (e.g. basic case-control matching) But this won’t be easy/feasible if your study sample has: ▫A large number of observations ▫A large number of matching variables (“unique” case types increase exponentially) ▫Continuous matching variables (e.g. fine grained status scale) This is where PSM steps in with a statistical means of doing complex case matching on observed sample characteristics 8

How it works A propensity score is an estimated probability of an observation being in one sample group or another given a set of observed characteristics Propensity Score = Pr (D=1|X) ▫D indicates sample group (one of our clinics in example) ▫X are observed characteristics (age, gender, MH status) Typically estimated with logistic regression: Clinic(0,1) = B0 + B1*Age + B2*Gender + B3*MH Status + e

How it works A propensity score (PS) is an estimated probability of an observation being in one sample group or another given a set of observed characteristics Propensity Score = P(X) = Pr (d=1|X) ▫D indicates sample group (one or our clinics in example) ▫X are observed characteristics (age, gender, MH status) Typically estimated with logistic regression: Clinic(0,1) = B0 + B1*Age + B2*Gender + B3*MH Status + e

How it works Propensity score matching: match groups of observations on the estimated probability of being in a group (propensity score). Key assumption: Being in a group is independent of outcomes conditional on group characteristics ▫This is false if there are unobserved characteristics associated with being in a group PSM enables matching at the mean and in distribution of observed characteristics (single score represents multiple dimensions)

How it Works There are different ways to apply PSM Exact matching on propensity score ▫if all characteristics are measured as binary indicators then each PS is unique “Fuzzy” matching on propensity score ▫Find closest individual match e.g. nearest neighbor, caliper matching General matching on propensity score ▫Match propensity score distribution e.g. stratify observations per decile of PS (0-.1,.1-.2,.2-.3,etc.)

How it Works A key element of any type of PSM is support: ▫Support means there is some reasonable match for any observation(s) in both groups ▫Observations without support will create biased results if they are not eliminated

Density 0 1 Propensity score Region of common support Density of scores for Clinic B High probability of being in Clinic B given X Density of scores for clinic A

How it Works For general/distributional PSM you also need to test for balance: ▫Balance means that there are no statistical (or meaningful) differences in characteristics among observations within the same PS strata (e.g. decile) ▫If you cannot find balance you need to break PS into finer strata and test again ▫If you are still having problems you may need to add cross-products of measures in your PS logit regression and try again.

A Recipe for General PSM Run logistic regression (remember clinic equation) Generate predicted probabilities (PS) of being in one group or another Recode (group) raw PS into strata (at least quintiles) Run a frequency of PS strata by observation group (e.g. frequency of PS quintile groups by clinic) Check for any groups without support

A Recipe for General PSM Do a t-test on characteristic measures between observation groups (e.g. clinic) within each PS strata (with support) to test for balance If you don’t have balance use steps noted above and repeat If you have balance and support you can choose to: ▫Use PS or original “confounder” variables in regression to obtain comparison of outcomes (but remember linearity assumption issue) ▫Continue on to create a sample that is matched in distribution across PS groups

A Recipe for General PSM To “fully” match samples, randomly select observations from the larger observation group within each PS strata Re-do your “sample characteristics” table to show sample equivalence at mean You can now directly compare means of outcome using any statistical test you want: ▫without reference/inclusion of “confounder” variables ▫without concern for non-linear relations between confounders and outcomes

A Recipe for General PSM Results from doing PSM in our clinic example might look like this:

MH Services Examples The Use of Propensity Scores to Evaluate Outcomes for Community Clinics (Hodges & Grunwald, 2005): ▫Compare outcomes for youth in “exemplary” clinic to “typical care” ▫Use general PSM to find supported and balanced comparison group from other clinics (note: not “fully” matched but use PS as covariate) Propensity Score Matching: An Illustrative Analysis of Dose Response (Foster 2003): ▫Compare outcomes for youths grouped by amount (“dose”) of outpatient services received ▫Use general PSM to find supported and balanced groups by “dose” (note: again not “fully” matched but use PS as covariate)

Summary Overview of PSM Next best approach after randomization to account for sample selection bias: ▫But..only good for observable characteristics ▫Increasingly expected as part of high quality observational research At least checking for support can potentially save you from making bad comparisons ▫But if you have to drop observations you need to be clear about whom your sample represents (external validity) “Fully” matching (by individual or strata) has additional side benefits: ▫Don’t need to include additional covariates in analysis ▫Don’t need to worry about non-linearity between covariates and outcomes ▫Any characteristic based sub-sample will also be “matched”

Questions? Thank You Mark O. Hatfield School of Government Portland State University 22