Propensity Score Matching Lava Timsina Kristina Rabarison CPH 786-001 Doctoral Seminar Fall 2012.

Slides:



Advertisements
Similar presentations
Introduction to Propensity Score Matching
Advertisements

REGRESSION, IV, MATCHING Treatment effect Boualem RABTA Center for World Food Studies (SOW-VU) Vrije Universiteit - Amsterdam.
Prediction, Correlation, and Lack of Fit in Regression (§11. 4, 11
Logistic Regression Example: Horseshoe Crab Data
The World Bank Human Development Network Spanish Impact Evaluation Fund.
Confidence Intervals Underlying model: Unknown parameter We know how to calculate point estimates E.g. regression analysis But different data would change.
Method Reading Group September 22, 2008 Matching.
Who are the participants? Creating a Quality Sample 47:269: Research Methods I Dr. Leonard March 22, 2010.
Impact Evaluation: The case of Bogotá’s concession schools Felipe Barrera-Osorio World Bank 1 October 2010.
Lecture 9: One Way ANOVA Between Subjects
© Institute for Fiscal Studies The role of evaluation in social research: current perspectives and new developments Lorraine Dearden, Institute of Education.
Notes on Logistic Regression STAT 4330/8330. Introduction Previously, you learned about odds ratios (OR’s). We now transition and begin discussion of.
Agriregionieuropa Evaluating the CAP Reform as a multiple treatment effect Evidence from Italian farms Roberto Esposti Department of Economics, Università.
Introduction to the design (and analysis) of experiments James M. Curran Department of Statistics, University of Auckland
LEARNING PROGRAMME Hypothesis testing Intermediate Training in Quantitative Analysis Bangkok November 2007.
Advanced Statistics for Interventional Cardiologists.
Impact Evaluation in the Real World One non-experimental design for evaluating behavioral HIV prevention campaigns.
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation.
Propensity Score Matching and Variations on the Balancing Test Wang-Sheng Lee Melbourne Institute of Applied Economic and Social Research The University.
Topic 14: Inference in Multiple Regression. Outline Review multiple linear regression Inference of regression coefficients –Application to book example.
Welfare Reform and Lone Parents Employment in the UK Paul Gregg and Susan Harkness.
Testing Multiple Means and the Analysis of Variance (§8.1, 8.2, 8.6) Situations where comparing more than two means is important. The approach to testing.
Article Review Cara Carty 09-Mar-06. “Confounding by indication in non-experimental evaluation of vaccine effectiveness: the example of prevention of.
Matching Estimators Methods of Economic Investigation Lecture 11.
Estimating Causal Effects from Large Data Sets Using Propensity Scores Hal V. Barron, MD TICR 5/06.
Credit Scoring of Bank-affiliated Captive Finance Companies Gabriela Pásztorová CERGE-EI Bratislava Economic Meeting 8 June 2012.
Propensity Score Matching for Causal Inference: Possibilities, Limitations, and an Example sean f. reardon MAPSS colloquium March 6, 2007.
Public Policy Analysis ECON 3386 Anant Nyshadham.
AFRICA IMPACT EVALUATION INITIATIVE, AFTRL Africa Program for Education Impact Evaluation David Evans Impact Evaluation Cluster, AFTRL Slides by Paul J.
1 Nonparametric Statistical Techniques Chapter 17.
Generalizing Observational Study Results Applying Propensity Score Methods to Complex Surveys Megan Schuler Eva DuGoff Elizabeth Stuart National Conference.
Applying impact evaluation tools A hypothetical fertilizer project.
Non-experimental methods Markus Goldstein The World Bank DECRG & AFTPM.
New Measures of Data Utility Mi-Ja Woo National Institute of Statistical Sciences.
Evaluating Risk Adjustment Models Andy Bindman MD Department of Medicine, Epidemiology and Biostatistics.
Single-Factor Studies KNNL – Chapter 16. Single-Factor Models Independent Variable can be qualitative or quantitative If Quantitative, we typically assume.
Impact Evaluation Sebastian Galiani November 2006 Matching Techniques.
Matching STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University.
Framework of Preferred Evaluation Methodologies for TAACCCT Impact/Outcomes Analysis Random Assignment (Experimental Design) preferred – High proportion.
Lorraine Dearden Director of ADMIN Node Institute of Education
Using Propensity Score Matching in Observational Services Research Neal Wallace, Ph.D. Portland State University February
1 Hester van Eeren Erasmus Medical Centre, Rotterdam Halsteren, August 23, 2010.
Randomized Assignment Difference-in-Differences
Considering model structure of covariates to estimate propensity scores Qiu Wang.
Bilal Siddiqi Istanbul, May 12, 2015 Measuring Impact: Non-Experimental Methods.
What can a CIE tell us about the origins of negative treatment effects of a training programme Miroslav Štefánik miroslav.stefanik(at)savba.sk INCLUSIVE.
MATCHING Eva Hromádková, Applied Econometrics JEM007, IES Lecture 4.
(ARM 2004) 1 INNOVATIVE STATISTICAL APPROACHES IN HSR: BAYESIAN, MULTIPLE INFORMANTS, & PROPENSITY SCORES Thomas R. Belin, UCLA.
Alexander Spermann University of Freiburg, SS 2008 Matching and DiD 1 Overview of non- experimental approaches: Matching and Difference in Difference Estimators.
Copyright © 2015 Inter-American Development Bank. This work is licensed under a Creative Commons IGO 3.0 Attribution-Non Commercial-No Derivatives (CC-IGO.
1 Nonparametric Statistical Techniques Chapter 18.
Research and Evaluation Methodology Program College of Education A comparison of methods for imputation of missing covariate data prior to propensity score.
Matching methods for estimating causal effects Danilo Fusco Rome, October 15, 2012.
Estimating standard error using bootstrap
Constructing Propensity score weighted and matched Samples Stacey L
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
Sec 9C – Logistic Regression and Propensity scores
Date: Presenter: Ryan Chen
Impact evaluation: The quantitative methods with applications
Matching Methods & Propensity Scores
Matching Methods & Propensity Scores
Methods of Economic Investigation Lecture 12
Explanation of slide: Logos, to show while the audience arrive.
Matching Methods & Propensity Scores
Impact Evaluation Methods: Difference in difference & Matching
Evaluating Impacts: An Overview of Quantitative Methods
CHAPTER 12 More About Regression
The European Statistical Training Programme (ESTP)
Chapter: 9: Propensity scores
CHAPTER 10 Comparing Two Populations or Groups
Presentation transcript:

Propensity Score Matching Lava Timsina Kristina Rabarison CPH Doctoral Seminar Fall 2012

Introduction Program evaluation Counterfactual outcome – what would have happened to the participants in absence of treatment Statistical techniques Propensity score

Concept of PSM Identify neighborhoods that are as similar as possible to each other with respect to the probability of receiving the treatment (Chiavegatto Filho, Kawachi, & Gotlieb, 2012). The average treatment effects is then measured based on the mean difference in outcomes across these comparison and treatment groups.

Experimental Vs Non-experimental Experimental evaluation – Random assignment to treatment and control  control group can be regarded as counterfactual.

Motivation to Propensity Score Matching Non-experimental evaluation – Random assignment may not be possible in nonexperimental evaluation methods (Heinrich et al., 2010). – Assignment to treatment is often nonrandom and hence may bias participation and treatment outcomes. – Treatment units are matched with their “similar” counterparts that differ only in the treatment under study. – Extent of matching is challenging

Propensity score matching allows this matching problem to be reduced to a single dimension. Let me restate that propensity score is defined as the probability that a unit in the combined sample of treated and control units receive the treatment, given a set of observed covariates.

Assumptions for PSM PSM holds under two assumptions (Khandker, 2010; Rosenbaum & Rubin, 1983): – Conditional Independence or Unconfounded Assumption – Common Support or Overlap Condition

Conditional Independence or Unconfounded Assumption: – conditional on observable covariates, the outcomes are independent of treatment In absence of randomization, the groups may differ not only in the treatment status, but also in other covariates. Thus it is necessary to control for these covariates to avoid potential biases. There is a set of covariates observable to the researcher, such that after controlling for these covariates, the potential outcomes (r 1,r 0 ) are independent of the treatment status: (r 1,r 0 ) ⊥ Z|X

Common Support Condition Assumption – This condition ensures that treatment observations have comparison observations “nearby” in the propensity score distribution. For each value of X, there is positive probability of being both treated and untreated: 0<P(Z=1|X)<1 – Also called overlap condition

Steps of PSM (European Commission, 2009; Khandker, 2010): 1.Estimating a model of program participation 2.Defining the region of common support and balancing tests 3.Matching participants to nonparticipants 4.Estimating the Average Effect and its Standard Error

Steps of PSM (European Commission, 2009; Khandker, 2010): 1.Estimating a model of program participation i.Samples of participants and nonparticipants should be pooled, ii.Participation Z should be estimated on all the observed covariates X in the data that are likely to determine participation.  Probit or logit model of program participation  This predicted outcome represents the estimated probability of participation or propensity score for every sampled participants and non-participants.

Steps of PSM (European Commission, 2009; Khandker, 2010): 2.Defining the region of common support and balancing tests –The region of common support needs to be defined where distributions of the propensity score for treatment and comparison group overlap.

Steps of PSM (European Commission, 2009; Khandker, 2010): Some of the participant and nonparticipant observation falling outside the region of common support may have to be dropped.

Steps of PSM (European Commission, 2009; Khandker, 2010): Balancing tests can be conducted to check whether, within each quantile of the propensity score distribution, the treatment and comparison groups have similar average propensity scores and the mean of X. That is, the distributions of the treated group and the control group must be similar: p̂(X|Z=1) = p̂(X|Z=0)

Steps of PSM (European Commission, 2009; Khandker, 2010): 3.Matching participants to nonparticipants – Matching participants to nonparticipants on the basis of propensity score can be done using different matching technique.

Techniques of matching 1.Nearest-neighbor (NN) matching – Each treatment unit is matched to the comparison unit with the closest propensity score. Matching can be done with or without replacement. – In this method, both treatment and control groups are first randomly sorted. Then the first treatment unit is selected to find its closest control match based on the absolute value of the difference between the propensity score (or logit of the propensity score) of the selected treatment and that of the control under consideration. The closest control unit is selected as a match.

Techniques of matching 2.Caliper or radius matching – This method is similar to NN matching except it adds restriction. Both treatment and control units are randomly sorted and then the first treated unit is selected to find its closest control match in terms of the propensity score but only if the control’s propensity score is within a certain radius (caliper). – This avoids bad matching and ensures that the matched pairs are within a certain range of propensity scores.

Techniques of matching 3.Stratification or interval matching – Partitions the common support into different strata (or intervals) and calculates the program’s impact within each interval. – The treated and control groups are ranked on the basis of their propensity scores, and then grouped into K intervals (strata). – Then the impact for each k th stratum is evaluated. – The overall impact is the weighted average of the strata effects, with weights proportional to the number of treated units in each stratum.

Techniques of matching 4.Kernel and local linear matching – Kernel matching uses weighted averages of all individuals in the control group to construct the counterfactual outcome. – Weights depend on the distance between each individual from the control group and the participant observation for which the counterfactual is estimated. – The kernel function assigns higher weight to observations close in terms of propensity score to a treated individual and lower weight on more distant observations.

Steps of PSM (European Commission, 2009; Khandker, 2010): 4.Estimating the Average Effect and its Standard Error – compute the sample averages of the two groups and calculate the difference – Standard errors can be computed using bootstrapping methods.

Example Research question: Does homelessness affect physical health, as measured by pcs score from the SF-36?

A glimpse at the dataset proc print data=ref.help (obs=10); var id pcs mcs age female i1 homeless; run; ObsIDPCSMCSAGEFEMALEI1HOMELESS

Modeling as linear regression proc reg data=ref.help; model pcs=homeless; run; Number of Observations Read 453 Number of Observations Used 453 Parameter Estimates VariableDF Parameter Estimate Standard Errort ValuePr > |t| Intercept <.0001 HOMELESS Parameter Estimates VariableDF Parameter Estimate Standard Errort ValuePr > |t| Intercept <.0001 HOMELESS AGE <.0001 FEMALE I MCS proc reg data=ref.help; model pcs = homeless age female i1 mcs; run; quit;

Creating propensity scores proc logistic data=ref.help desc; model homeless = age female i1 mcs; output out=propen pred=propensity; run; Association of Predicted Probabilities and Observed Responses Percent Concordant 64.9 Somers' D Percent Discordant 34.7 Gamma Percent Tied 0.4 Tau-a Pairs c 0.651

Looking for the assumption of common support proc means data=propen; class homeless; var propensity; run; Analysis Variable : propensity Estimated Probability HOMELESS N ObsNMeanStd DevMinimumMaximum

proc univariate data=propen; class homeless; var propensity; histogram propensity; run; Looking for the assumption of common support

Restricting analysis to common support region Parameter Estimates VariableLabelDF Parameter Estimate Standard Errort ValuePr > |t| Intercept <.0001 HOMELESS propensity Estimated Probability proc reg data=propen; where propensity<0.8; model pcs=homeless propensity; run; quit;

Macros data prop2; set propen; if propensity<0.8; run; %include "C:\Documents and Settings\LRTI222\Desktop\vmatch.sas"; %include "C:\Documents and Settings\LRTI222\Desktop\dist.sas"; %include "C:\Documents and Settings\LRTI222\Desktop\nobs.sas"; %dist(data=prop2, group=homeless, id=id, mvars=propensity, wts=1, vmatch=Y, a=1, b=1, lilm=201, dmax=0.1, outm=mp1_b, summatch=n, printm=N, mergeout=mpropen);

Before & After Matching proc means data=propen mean; class homeless; var age female i1 mcs; run; HOMELESS N ObsVariableMean 0244 AGE FEMALE I1 MCS AGE FEMALE I1 MCS proc means data=mpropen mean; where matched; class homeless; var age female i1 mcs; run; HOMELESS N ObsVariableMean 0201 AGE FEMALE I1 MCS AGE FEMALE I1 MCS

Propensity score matched analysis proc reg data=mpropen; where matched; model pcs=homeless; run; quit; Parameter Estimates VariableDF Parameter Estimate Standard Errort ValuePr > |t| Intercept <.0001 HOMELESS