MATCHING Eva Hromádková, 14.10.2010 Applied Econometrics JEM007, IES Lecture 4.

Slides:



Advertisements
Similar presentations
Explanation of slide: Logos, to show while the audience arrive.
Advertisements

Introduction to Propensity Score Matching
REGRESSION, IV, MATCHING Treatment effect Boualem RABTA Center for World Food Studies (SOW-VU) Vrije Universiteit - Amsterdam.
Review of Identifying Causal Effects Methods of Economic Investigation Lecture 13.
Brief introduction on Logistic Regression
Reliable Causal Inference via Genetic Matching: A new matching method jointly developed by Alexis Diamond and Jasjeet Sekhon Alexis Diamond Software by.
Omitted Variable Bias Methods of Economic Investigation Lecture 7 1.
Differences-in-Differences
Propensity Score Matching Lava Timsina Kristina Rabarison CPH Doctoral Seminar Fall 2012.
The World Bank Human Development Network Spanish Impact Evaluation Fund.
Random effects estimation RANDOM EFFECTS REGRESSIONS When the observed variables of interest are constant for each individual, a fixed effects regression.
KNAW, March 29, Reducing Selection Bias via Propensity Score Approach Bo Lu, Ph.D. Division of Biostatistics College of Public Health The Ohio State.
Method Reading Group September 22, 2008 Matching.
Multiple Linear Regression Model
MACHINE LEARNING 9. Nonparametric Methods. Introduction Lecture Notes for E Alpaydın 2004 Introduction to Machine Learning © The MIT Press (V1.1) 2 
Pooled Cross Sections and Panel Data II
So far, we have considered regression models with dummy variables of independent variables. In this lecture, we will study regression models whose dependent.
Introduction to Probability and Statistics Linear Regression and Correlation.
MACHINE LEARNING 6. Multivariate Methods 1. Based on E Alpaydın 2004 Introduction to Machine Learning © The MIT Press (V1.1) 2 Motivating Example  Loan.
Today Concepts underlying inferential statistics
Introduction to PSM: Practical Issues, Concerns, & Strategies Shenyang Guo, Ph.D. School of Social Work University of North Carolina at Chapel Hill January.
1 BA 555 Practical Business Analysis Review of Statistics Confidence Interval Estimation Hypothesis Testing Linear Regression Analysis Introduction Case.
1 A MONTE CARLO EXPERIMENT In the previous slideshow, we saw that the error term is responsible for the variations of b 2 around its fixed component 
MODELS OF QUALITATIVE CHOICE by Bambang Juanda.  Models in which the dependent variable involves two ore more qualitative choices.  Valuable for the.
5-1 Introduction 5-2 Inference on the Means of Two Populations, Variances Known Assumptions.
Matching Methods. Matching: Overview  The ideal comparison group is selected such that matches the treatment group using either a comprehensive baseline.
Propensity Score Matching and Variations on the Balancing Test Wang-Sheng Lee Melbourne Institute of Applied Economic and Social Research The University.
Instrumental Variables: Problems Methods of Economic Investigation Lecture 16.
Evaluating Job Training Programs: What have we learned? Haeil Jung and Maureen Pirog School of Public and Environmental Affairs Indiana University Bloomington.
Matching Estimators Methods of Economic Investigation Lecture 11.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
Propensity Score Matching for Causal Inference: Possibilities, Limitations, and an Example sean f. reardon MAPSS colloquium March 6, 2007.
Public Policy Analysis ECON 3386 Anant Nyshadham.
AFRICA IMPACT EVALUATION INITIATIVE, AFTRL Africa Program for Education Impact Evaluation David Evans Impact Evaluation Cluster, AFTRL Slides by Paul J.
Generalizing Observational Study Results Applying Propensity Score Methods to Complex Surveys Megan Schuler Eva DuGoff Elizabeth Stuart National Conference.
Non-experimental methods Markus Goldstein The World Bank DECRG & AFTPM.
Chapter 13: Limited Dependent Vars. Zongyi ZHANG College of Economics and Business Administration.
Lecture 10: Correlation and Regression Model.
Single-Factor Studies KNNL – Chapter 16. Single-Factor Models Independent Variable can be qualitative or quantitative If Quantitative, we typically assume.
Impact Evaluation Sebastian Galiani November 2006 Matching Techniques.
Matching STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University.
Sampling Design and Analysis MTH 494 Lecture-22 Ossam Chohan Assistant Professor CIIT Abbottabad.
Using Propensity Score Matching in Observational Services Research Neal Wallace, Ph.D. Portland State University February
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 11: Models Marshall University Genomics Core Facility.
Randomized Assignment Difference-in-Differences
Bilal Siddiqi Istanbul, May 12, 2015 Measuring Impact: Non-Experimental Methods.
What can a CIE tell us about the origins of negative treatment effects of a training programme Miroslav Štefánik miroslav.stefanik(at)savba.sk INCLUSIVE.
Todd Wagner, PhD February 2011 Propensity Scores.
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
Do European Social Fund labour market interventions work? Counterfactual evidence from the Czech Republic. Vladimir Kváča, Czech Ministry of Labour and.
Alexander Spermann University of Freiburg, SS 2008 Matching and DiD 1 Overview of non- experimental approaches: Matching and Difference in Difference Estimators.
Experimental Evaluations Methods of Economic Investigation Lecture 4.
ENDOGENEITY - SIMULTANEITY Development Workshop. What is endogeneity and why we do not like it? [REPETITION] Three causes: – X influences Y, but Y reinforces.
Copyright © 2015 Inter-American Development Bank. This work is licensed under a Creative Commons IGO 3.0 Attribution-Non Commercial-No Derivatives (CC-IGO.
Instructor: R. Makoto 1richard makoto UZ Econ313 Lecture notes.
Research and Evaluation Methodology Program College of Education A comparison of methods for imputation of missing covariate data prior to propensity score.
Looking for statistical twins
Matching Procedures: Propensity Score
Constructing Propensity score weighted and matched Samples Stacey L
Sec 9C – Logistic Regression and Propensity scores
Propensity Score Matching
Impact evaluation: The quantitative methods with applications
Matching Methods & Propensity Scores
Matching Methods & Propensity Scores
Methods of Economic Investigation Lecture 12
Treatment effect: Part 2
Explanation of slide: Logos, to show while the audience arrive.
Matching Methods & Propensity Scores
Impact Evaluation Methods: Difference in difference & Matching
The European Statistical Training Programme (ESTP)
Presentation transcript:

MATCHING Eva Hromádková, Applied Econometrics JEM007, IES Lecture 4

Introduction “If I do not have experiment, how can I get control group?” Last time: Diff-in-diff  Comparison before-after between two comparable groups  Assumption: fixed differences between control and treatment group over time  How can we check / adjust assumption:  Look for trends in pre-treatment period  Selection into treatment based on temporary factors (Ashenfelter dip), or anticipation of treatment (taxes)

Matching Intuition  Counterfactuals: what would have happened to treated subjects, if the had not received treatment?  Potential (observed) outcomes x real outcomes Matching = pairing treatment and comparison units that are similar in terms of observable characteristics  Conditional on observables (X i ) we can take assignment to treatment (T i ) as “random” (unconfoundness)  Implicitly, unobservables do not play role in treatment assignment – we assume they are similar among groups

Matching Intuition II  E(Y 1 – Y 0 | T=1) = (1) E[Y 1 | X, T=1] – E[Y 0 | X, T=0] - (2) E[Y 0 | X, T=1] – E[Y 0 | X, T=0]  Part 1 is matched treatment effect  Part 2 is assumed to be zero  all selection occurs only through observed X

Matching Common support  Matching can only work if there is a region of “common support”  People with the same X values are in both the treatment and the control groups  Let S be the set of all observables X, then 0<Pr(T=1 | X)<1 for some S * subset of S  Intuition: Someone in control group has to be close enough to match to treatment unit, or we see enough overlap in the distribution of treated and untreated individuals over their characteristics

Matching Common support II

Matching methods Overview  Exact matching  Propensity score matching  Nearest neighbor  Kernel matching  Radius matching  Stratification matching

Exact matching  Each group of treated has her counterpart with exactly same characteristics  We define cells for combinations of observables E.g.: Sex x age x education x region  We compare average of treated and untreated in each cell (combination of characteristics)  Total effect: weighted average of cells (weights are frequencies of observed cells)  Example: Payne, Lissenburgh, White a Payne (1996)  Employment training, Employment Action in Great Britain  Treated: long term unemployed

Exact matching Issues Problem:  To create cells, only few X’s can be used  If we use more X’s, we will not have enough matches  Few X’s might not fully explain selection process => main assumption of matching would be violated  We need a tool that “merges” more dimensions into one  1 number – score, that would measure how much similar are treated and untreated  Solution = propensity score matching

Propensity score matching Explanation Propensity score = probability that an individual is treated based on his/her pre-treatment characteristics P(X) = P(T=1|X) = E(T|X) When can we use p(X) instead of X?  Balancing property – for given propensity score (range), distribution of characteristics of treated and untreated is the same (testable!!)  Unconfoundness - Conditional on observables (X i ) we can take assignment to treatment (T i ) as “random”

Propensity score matching General procedure 1-to-n Match  Nearest neighbor matching  Caliper matching  Nonparametric/kernel matching Run Logistic Regression: Dependent variable: T=1, if participate; T = 0, otherwise. Choose appropriate conditioning variables, X Obtain propensity score: predicted probability (p) Multivariate analysis based on new sample 1-to-1 match  Nearest neighbor matching  estimate difference in outcomes for each pair  Take average difference as treatment effect

Propensity score matching Step 1: Estimation of propensity score  Estimate logit or probit from the sample of treated and non-treated  Check balancing property (test means of X within stratas by p(X))  Choose common support

Propensity score matching Step 2: Matching algorithms A. Stratification:  Dividing range of propensity scores (PS) into intervals until we get the same average of PS for treated and untreated  In practice, this is NOT EASY  Within each intervals we compute difference in average outcome between treated and untreated  Weighting is based on number of units within a range

Propensity score matching Step 2: Matching algorithms B. Nearest neighbor method  Searching for the most similar unit between treated and control (closest propensity score)  Distance (difference of PS) between treated and control unit is not always same  All matches are weighted the same in final average effect C. Radius matching  We define distance and match with all controls within this distance – average of the effects (not weighted) D. Kernel matching  We put some type of distribution (e.g. normal) around the each treatment unit and use it to weight closer control units more and farther control units less  We can set “bandwith” - limiting the maximum distance in PS that is allowed

Propensity score matching Problems  Choice of matching algorithm – no “perfect” solution, depends on the properties of sample  Rule of thumb – if all give the same results it is ok, if not – look for problem  Standard errors: Estimated variance of treatment effect should include additional variance from estimating p  Typically people “bootstrap” which is a non-parametric form of estimating your coefficients over and over until you get a distribution of those coefficients—use the variance from that

Special topics in Propensity score matching PSM versus OLS  Why not doing simple OLS?  Common support – OLS extrapolated treatment effect also on the regions outside of common support  Implicit weighting differences: OLS is underweighting those combinations of Xs, where treatment or control group is dominant  Linear regression is imposing functional form, while PSM is nonparametric

Special topics in Propensity score matching PSM + DD  Worry that unobservables are causing selection because matching on X not sufficient  Can combine this with difference and difference estimates (Heckman’s procedure)  Obtain propensity score, construct control group J for each individual i  Estimate difference in outcome before treatment  If the groups are truly ‘as if’ random should be zero  If it’s not zero: can assume fixed differences over time and take before-after difference in treatment and control groups (DD)

Related literature Both on methods and applications: Caliendo and Kopeining (2008) – Some practical guidance for the implementation of propensity score matching Stuart (2010) – Matching methods for causal inference: A review and a look forward  Also includes Stata commands

Can non-experimental methods (DD, matching) catch-up with experiments? LaLonde (1986) – NO Data: National Support Work Demonstration (NSW)  Help disadvantaged workers lacking basic skills  Duration of programme: 9-18 months  randomized into training versus no training !!! Goal of the study was to compare econometric estimates from those obtained from the experiment.  Use PSID and CPS to obtain control groups  Compare experimental to non-experimental estimates => Humbling experience for labor economists

Can non-experimental methods (DD, matching) catch-up with experiments? Further discussion  Dehejia and Wahba (1999, 2002) – YES  Same data  Propensity score matching, respect of common support (drop almost half of controls)  Includes only those with info on pre-program earnings  Smith and Lalonde (2005) - NO  DW results are sensitive to choice of Xs  Dehejia and Wahba (2006) – YES  Again stressing importance common support

Reality check  Questionable assumption about ignorability of unobservables in participation decision  Sensitive to what X we choose  Required to have a lot of pre-treatment (labor market behavior) and post-treatment characteristics  Good in evaluating obligatory programs or if filtering is based on some clearly define observed characteristics