Matching Estimators Methods of Economic Investigation Lecture 11.

Slides:



Advertisements
Similar presentations
Introduction to Propensity Score Matching
Advertisements

REGRESSION, IV, MATCHING Treatment effect Boualem RABTA Center for World Food Studies (SOW-VU) Vrije Universiteit - Amsterdam.
Review of Identifying Causal Effects Methods of Economic Investigation Lecture 13.
Methods of Economic Investigation Lecture 2
Random Assignment Experiments
Regression Discontinuity/Event Studies
Model generalization Test error Bias, variance and complexity
Omitted Variable Bias Methods of Economic Investigation Lecture 7 1.
Differences-in-Differences
Propensity Score Matching Lava Timsina Kristina Rabarison CPH Doctoral Seminar Fall 2012.
The World Bank Human Development Network Spanish Impact Evaluation Fund.
Lab exam when: Nov 27 - Dec 1 length = 1 hour –each lab section divided in two register for the exam in your section so there is a computer reserved for.
Method Reading Group September 22, 2008 Matching.
Multiple Linear Regression Model
QUALITATIVE AND LIMITED DEPENDENT VARIABLE MODELS.
4. Multiple Regression Analysis: Estimation -Most econometric regressions are motivated by a question -ie: Do Canadian Heritage commercials have a positive.
The Basics of Regression continued
© Institute for Fiscal Studies The role of evaluation in social research: current perspectives and new developments Lorraine Dearden, Institute of Education.
Analysis of Covariance Goals: 1)Reduce error variance. 2)Remove sources of bias from experiment. 3)Obtain adjusted estimates of population means.
1 A MONTE CARLO EXPERIMENT In the previous slideshow, we saw that the error term is responsible for the variations of b 2 around its fixed component 
Introduction to the design (and analysis) of experiments James M. Curran Department of Statistics, University of Auckland
Ordinary Least Squares
ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?
Matching Methods. Matching: Overview  The ideal comparison group is selected such that matches the treatment group using either a comprehensive baseline.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
AADAPT Workshop Latin America Brasilia, November 16-20, 2009 Non-Experimental Methods Florence Kondylis.
Quasi Experimental Methods I Nethra Palaniswamy Development Strategy and Governance International Food Policy Research Institute.
Error Component Models Methods of Economic Investigation Lecture 8 1.
Section 8.1 Estimating  When  is Known In this section, we develop techniques for estimating the population mean μ using sample data. We assume that.
Propensity Score Matching and Variations on the Balancing Test Wang-Sheng Lee Melbourne Institute of Applied Economic and Social Research The University.
Instrumental Variables: Problems Methods of Economic Investigation Lecture 16.
Properties of OLS How Reliable is OLS?. Learning Objectives 1.Review of the idea that the OLS estimator is a random variable 2.How do we judge the quality.
Non-Experimental Evaluations Methods of Economic Investigation Lecture 5 1.
Maximum Likelihood Estimation Methods of Economic Investigation Lecture 17.
Public Policy Analysis ECON 3386 Anant Nyshadham.
Application 3: Estimating the Effect of Education on Earnings Methods of Economic Investigation Lecture 9 1.
AFRICA IMPACT EVALUATION INITIATIVE, AFTRL Africa Program for Education Impact Evaluation David Evans Impact Evaluation Cluster, AFTRL Slides by Paul J.
Application 2: Minnesota Domestic Violence Experiment Methods of Economic Investigation Lecture 6.
Latent Growth Modeling Byrne Chapter 11. Latent Growth Modeling Measuring change over repeated time measurements – Gives you more information than a repeated.
Non-experimental methods Markus Goldstein The World Bank DECRG & AFTPM.
Chapter 13: Limited Dependent Vars. Zongyi ZHANG College of Economics and Business Administration.
Instrumental Variables: Introduction Methods of Economic Investigation Lecture 14.
Single-Factor Studies KNNL – Chapter 16. Single-Factor Models Independent Variable can be qualitative or quantitative If Quantitative, we typically assume.
Impact Evaluation Sebastian Galiani November 2006 Matching Techniques.
Matching STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University.
Sampling Design and Analysis MTH 494 Lecture-22 Ossam Chohan Assistant Professor CIIT Abbottabad.
Lorraine Dearden Director of ADMIN Node Institute of Education
Using Propensity Score Matching in Observational Services Research Neal Wallace, Ph.D. Portland State University February
Randomized Assignment Difference-in-Differences
Bilal Siddiqi Istanbul, May 12, 2015 Measuring Impact: Non-Experimental Methods.
More on regression Petter Mostad More on indicator variables If an independent variable is an indicator variable, cases where it is 1 will.
MATCHING Eva Hromádková, Applied Econometrics JEM007, IES Lecture 4.
Alexander Spermann University of Freiburg, SS 2008 Matching and DiD 1 Overview of non- experimental approaches: Matching and Difference in Difference Estimators.
Experimental Evaluations Methods of Economic Investigation Lecture 4.
ENDOGENEITY - SIMULTANEITY Development Workshop. What is endogeneity and why we do not like it? [REPETITION] Three causes: – X influences Y, but Y reinforces.
Copyright © 2015 Inter-American Development Bank. This work is licensed under a Creative Commons IGO 3.0 Attribution-Non Commercial-No Derivatives (CC-IGO.
Estimating standard error using bootstrap
Matching Procedures: Propensity Score
Quasi Experimental Methods I
Constructing Propensity score weighted and matched Samples Stacey L
Quasi Experimental Methods I
Propensity Score Matching
Matching Methods & Propensity Scores
Matching Methods & Propensity Scores
Methods of Economic Investigation Lecture 12
Explanation of slide: Logos, to show while the audience arrive.
Matching Methods & Propensity Scores
Instrumental Variables
Impact Evaluation Methods: Difference in difference & Matching
Presentation transcript:

Matching Estimators Methods of Economic Investigation Lecture 11

Last Time  General Theme: If you don’t have an experiment, how do you get a ‘control group’  Difference in Differences How it works: compare before-after between two comparable entities Assumptions: Fixed differences over time Tests to improve credibility of assumption  Pre-treatment trends  Ashenfelter Dip

Today’s Class  Another way to get a control group: Matching Assumptions for identification Specific form of matching called “propensity score matching” Is it better than just a plain old regression?

The Counterfactual Framework  Counterfactual: what would have happened to the treated subjects, had they not received treatment?  Idea: individuals selected into treatment and nontreatment groups have potential outcomes in both states: the one in which they are observed the one in which they are not observed.

Reminder of Terms  For the treated group, we have observed mean outcome under the condition of treatment E(Y 1 |T=1) and unobserved mean outcome under the condition of nontreatment E(Y 0 |T=1).  For the control group we have both observed mean E(Y 0 |T=0) and unobserved mean E(Y 1 |T=0)

What is “matching”?  Pairing treatment and comparison units that are similar in terms of observable characteristics  Can do this in regressions (with covariates) or prior to regression to define your treatment and control samples

Matching Assumption  Conditioning on observables (X) we can take assignment to treatment ‘as if’ random, i.e.  What is the implicit statement: unobservables (stuff not in X) plays no role in treatment assignment (T)

A matched estimator  E(Y 1 – Y 0 | T=1) = E[Y 1 | X, T=1] – E[Y 0 | X, T=0] - E[Y 0 | X, T=1] – E[Y 0 | X, T=0] Key idea: all selection occurs only through observed X Matched treatment effect Assumed to be zero

Just do a regression…  Regression are flexible if you only put in a “main effect” the regression will estimate a purely linear specification Interactions and fixed effects allow different slopes and intercepts for any combination of variables Can include quadratic and higher order polynomial terms if necessary But fundamentally specify additively separable terms

Sometimes regression not feasible…  The issue is largely related to dimentionality  Each time you add an observable characteristics, you partition your data into bins.  Imagine all variables are zero-one variables Then if you have k X’s, you have 2 k potential different values Need enough observations in each value to estimate that precisely

Reducing the Dimensionality  Use of propensity score: Probability of receiving treatment, conditional on covariates  Key assumption: if and defining  If this is true, can interpret estimate of differences in outcomes conditional on X as causal effect

Why not control for X  Matching is flexible in a different way Avoid specifying a particular for the outcome equation, decision process or unobservable term Just need the “right” observables Flexible in the form of how X’s affect treatment probability but inflexible in how treatment probability affects outcome

Participation decision  Remember our 3 groups: Always takers: take the treatment if offered AND take the treatment if not offered We observe them if T=0 but R=1 Never takers: don’t take the treatment if not offered AND don’t take it even if it is offered We observe them if T=1 but R=0 Compliers: just do what they’re assigned to do T=1 & R=1 OR T=0 & R=0

Conditions for Matching to Work Take 1-sided non-compliance for ease…if not offered, can’t take it, but some people don’t take it even if offered Error term for never takers Error term for compliers If it’s zero  Perfect compliance: so conditioning on X replicates experimental setting On avg, conditional on X unobservable are the same

Common Support  Can only exist if there is a region of “common support” People with the same X values are in both the treatment and the control groups Let S be the set of all observables X, then 0<Pr(T=1 | X)<0 for some S * subset of S  Intuition: Someone in control close enough to match to treatment unit OR enough overlap in the distribution of treated and untreated individuals

Lots of common support Between red and blue line is area of common support

Not so much common support

Trimming  Define Min and Max values of X for region of overlap—drop all units not in that region  Remove Regions which do not have strictly positive propensity score in both treatment and control distributions (Petra and Todd, 2005)  Both are quite similar when used in practice but if missing sections in middle of distribution can use the second option

How do we match on p(X)  Taken literally, should match on exactly p(X i )  In practice hard to do so strategy is to match treated units to comparison units whose p-scores are sufficiently close to consider  Issues: How many times can 1 unit be a “match” How many to match to treatment unit How to “match” if using more than 1 control unit per treatment unit

Replacement  Issue: once control group person Z is a match for individual A, can she also be a match for individual B  Trade-off between bias and precision: With replacement minimizes the propensity score distance between the matched and the comparison unit Without replacement

Are we doing a one-to-one match?  If 1-to-1 match: units closely related but may not be very precise estimates  More you include in match, the more the p-score of the control group will differ from the treatment group  Trade-off between bias and precision Typically use 1-to-many match because 1-to-1 is extremely data intensive if X is multi- dimensional

Different matching algorithms-1  Can use nearest neighbor which chooses m closest comparison units implicitly weights these all the same Get fixed m but may end up with different pscores  Can use ‘caliper’—radius around a point Again implicitly weights these the same Fixed difference in p-scores, but may not be many units in radius  Stratify Break sample up into intervals Estimate treatment effect separately in each region

Different Matching Algorithms-2  Can also use some type of distribution: Kernel estimator puts some type of distribution (e.g. normal) around the each treatment unit and weights closer control units more and farther control units less Explicit weighting function can be used if you have some knowledge of how related units of certain distances are to each other

How close is close enough?  No “right” answer in these choices—will depend heavily on sample issues How deep is the common support (i.e. are there lots of people in both control and treatment group at all the p-score values  Should all be the same asymptotically but in finite samples (which is everything) may differ

Tradeoffs in different methods Source: Caliendo and Kopeinig, 2005

How to estimate a p-score  Typically use a logit Specific, useful functional form for estimating “discrete choice” models You haven’t learned these yet but you will  For now, think of running a regular OLS regression where the outcome is 1 if you got the treatment and zero if you didn’t  Take the E[T | X] and that’s your propensity score

The Treatment Effect  CIA holds and sufficient region of of common support  Difference in outcome between treated individual i and weighted comparison group J, with weight generated by the p-score distribution in the common support region N is the treatment group and |N| is the size of the treatment group J is comparison group with |J| is the number of comparison group units matched to i

1-to-n Match  Nearest neighbor matching  Caliper matching  Nonparametric/kernel matching Run Regression: Dependent variable: T=1, if participate; T = 0, otherwise. Choose appropriate conditioning variables, X Obtain propensity score: predicted probability (p) General Procedure Multivariate analysis based on new sample 1-to-1 match  estimate difference in outcomes for each pair  Take average difference as treatment effect

Standard Errors  Problem: Estimated variance of treatment effect should include additional variance from estimating p Typically people “bootstrap” which is a non- parametric form of estimating your coefficients over and over until you get a distribution of those coefficients—use the variance from that Will do this in a few weeks

Some concerns about Matching  Data intensive in propensity score estimation May reduce dimensionality of treatment effect estimation but still need enough of a sample to estimate propensity score over common support Need LOTS of X’s for this to be believable  Inflexible in how p-score is related to treatment Worry about heterogeneity Bias terms much more difficult to sign (non- linear p-score bias)

Matching + Diff-in-Diff  Worry that unobservables causing selection because matching on X not sufficient  Can combine this with difference and difference estimates Take control group J for each individual i Estimate difference before treatment If the groups are truly ‘as if’ random should be zero If it’s not zero: can assume fixed differences over time and take before after difference in treatment and control groups

Next Time  Comparing Non-Experimental Methods to the experiments they are trying to replicate  Goal: See how well these techniques work to get the estimated experimental effect