Matching STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University.

Slides:

Advertisements

Similar presentations

Inference: Fishers Exact p-values STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke.

Advertisements

Introduction to Propensity Score Matching

Rerandomization in Randomized Experiments STAT 320 Duke University Kari Lock Morgan.

Notes Sample vs distribution “m” vs “µ” and “s” vs “σ” Bias/Variance Bias: Measures how much the learnt model is wrong disregarding noise Variance: Measures.

Observational Studies and Propensity Scores STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical.

Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan Simple Linear Regression SECTION 2.6, 9.1 Least squares line Interpreting.

Inference: Neyman’s Repeated Sampling STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science.

INFERENTIAL STATISTICS. Descriptive statistics is used simply to describe what's going on in the data. Inferential statistics helps us reach conclusions.

Propensity Score Matching Lava Timsina Kristina Rabarison CPH Doctoral Seminar Fall 2012.

The World Bank Human Development Network Spanish Impact Evaluation Fund.

Regression. So far, we've been looking at classification problems, in which the y values are either 0 or 1. Now we'll briefly consider the case where.

Method Reading Group September 22, 2008 Matching.

Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review

BCOR 1020 Business Statistics

Multiple Regression Models

STAT 101 Dr. Kari Lock Morgan Exam 2 Review.

MACHINE LEARNING 6. Multivariate Methods 1. Based on E Alpaydın 2004 Introduction to Machine Learning © The MIT Press (V1.1) 2 Motivating Example  Loan.

Stat 112: Lecture 9 Notes Homework 3: Due next Thursday

Multiple Regression III 4/16/12 More on categorical variables Missing data Variable Selection Stepwise Regression Confounding variables Not in book Professor.

Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.

STA 320: Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University.

Simple Linear Regression Least squares line Interpreting coefficients Prediction Cautions The formal model Section 2.6, 9.1, 9.2 Professor Kari Lock Morgan.

Using Covariates in Experiments: Design and Analysis STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical.

Weighting STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University.

Subclassification STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University.

Randomized Experiments STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University.

Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 11/27/12 Multiple Regression SECTION 10.3 Categorical variables Variable.

ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?

Advanced Statistics for Interventional Cardiologists.

6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.

BPS - 3rd Ed. Chapter 211 Inference for Regression.

Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 11/1/12 ANOVA SECTION 8.1 Testing for a difference in means across multiple.

Multiple Regression The Basics. Multiple Regression (MR) Predicting one DV from a set of predictors, the DV should be interval/ratio or at least assumed.

Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan Multiple Regression SECTIONS 9.2, 10.1, 10.2 Multiple explanatory variables.

SUTVA, Assignment Mechanism STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University.

Variability.  Reflects the degree to which scores differ from one another  Usually in reference to the mean value  A measure of the central tendency.

Matching Estimators Methods of Economic Investigation Lecture 11.

Estimating Causal Effects from Large Data Sets Using Propensity Scores Hal V. Barron, MD TICR 5/06.

Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.

Dependencies Complex Data in Meta-analysis. Common Dependencies Independent subgroups within a study (nested in lab?) Multiple outcomes on the same people.

Propensity Score Matching for Causal Inference: Possibilities, Limitations, and an Example sean f. reardon MAPSS colloquium March 6, 2007.

AFRICA IMPACT EVALUATION INITIATIVE, AFTRL Africa Program for Education Impact Evaluation David Evans Impact Evaluation Cluster, AFTRL Slides by Paul J.

Intermediate Applied Statistics STAT 460 Lecture 17, 11/10/2004 Instructor: Aleksandra (Seša) Slavković TA: Wang Yu

Comments on Midterm Comments on HW4 Final Project Regression Example Sensitivity Analysis? Quiz STA 320 Design and Analysis of Causal Studies Dr. Kari.

Generalizing Observational Study Results Applying Propensity Score Methods to Complex Surveys Megan Schuler Eva DuGoff Elizabeth Stuart National Conference.

Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 12/6/12 Synthesis Big Picture Essential Synthesis Bayesian Inference (continued)

IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.

WS 2007/08Prof. Dr. J. Schütze, FB GW KI 1 Hypothesis testing Statistical Tests Sometimes you have to make a decision about a characteristic of a population.

Using Propensity Score Matching in Observational Services Research Neal Wallace, Ph.D. Portland State University February

Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 11/20/12 Multiple Regression SECTIONS 9.2, 10.1, 10.2 Multiple explanatory.

1 Hester van Eeren Erasmus Medical Centre, Rotterdam Halsteren, August 23, 2010.

Introduction to Machine Learning Multivariate Methods 姓名 : 李政軒.

Machine Learning 5. Parametric Methods.

Randomized Assignment Difference-in-Differences

Rerandomization to Improve Covariate Balance in Randomized Experiments Kari Lock Harvard Statistics Advisor: Don Rubin 4/28/11.

Rerandomization in Randomized Experiments Kari Lock and Don Rubin Harvard University JSM 2010.

Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 11/6/12 Simple Linear Regression SECTION 2.6 Interpreting coefficients Prediction.

MATCHING Eva Hromádková, Applied Econometrics JEM007, IES Lecture 4.

(ARM 2004) 1 INNOVATIVE STATISTICAL APPROACHES IN HSR: BAYESIAN, MULTIPLE INFORMANTS, & PROPENSITY SCORES Thomas R. Belin, UCLA.

BPS - 5th Ed. Chapter 231 Inference for Regression.

Alexander Spermann University of Freiburg, SS 2008 Matching and DiD 1 Overview of non- experimental approaches: Matching and Difference in Difference Estimators.

Research and Evaluation Methodology Program College of Education A comparison of methods for imputation of missing covariate data prior to propensity score.

Matching methods for estimating causal effects Danilo Fusco Rome, October 15, 2012.

Eastern Michigan University

Constructing Propensity score weighted and matched Samples Stacey L

Sec 9C – Logistic Regression and Propensity scores

Methods of Economic Investigation Lecture 12

Treatment effect: Part 2

Impact Evaluation Methods: Difference in difference & Matching

Rerandomization to Improve Baseline Balance in Educational Experiments

Presentation transcript:

Matching STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University

Quiz 2 > summary(Quiz2) Min. 1st Qu. Median Mean 3rd Qu. Max

Quiz 2 one-sided or two-sided p-value? (depends on question being asked) imputation: use observed control outcomes to impute missing treatment outcomes and vice versa. o class year: use observed outcomes from control sophomores to impute missing outcomes for treatment sophomores biased or unbiased

Matching Matching: Find control units to “match” the units in the treatment group Restrict the sample to matched units Analyze the difference for each match (analyze as matched pair experiment) Useful when one group (usually the control) is much larger than the other

Estimand Changes the estimand: now estimating the causal effect for the subpopulation of treated units ATE: Average treatment effect ATT: Average treatment effect for the treated ATC: Average treatment effect for the controls

Exact Matching For exact matching, covariate values must match exactly 21 year old female in treatment group must be matched with 21 year old female in control group

Inexact Matches Often, exact matching is not feasible, and matches are just as close as possible The farther apart the matches are, the more bias there will be Bias: covariate imbalance There are ways of adjusting for bias (ch 18) Can use calipers: only matches within a certain caliper are acceptable (remove units without an acceptable match)

Matching Y(1)Y(0)X (Age) observed?19 observed?22 observed?23 ?observed51 ?observed30 ?observed20 ?observed15 ?observed42 ?observed21 ?observed25

Matching Y(1)Y(0)X (Age) observed?19 observed?22 observed?23 ?observed51 ?observed30 ?observed20 ?observed15 ?observed42 ?observed21 ?observed25

Matching Y(1)Y(0)X (Age) observed?19 observed?22 observed?23 ?observed51 ?observed30 ?observed20 ?observed15 ?observed42 ?observed21 ?observed25

Matching Y(1)Y(0)X (Age) observed?19 observed?22 observed?23 ?observed51 ?observed30 ?observed20 ?observed15 ?observed42 ?observed21 ?observed25

Matching Y(1)Y(0)X (Age) observed?19 observed?22 observed?23 ?observed51 ?observed30 ?observed20 ?observed15 ?observed42 ?observed21 ?observed60

Matching Y(1)Y(0)X (Age) observed?19 observed?22 observed?23 ?observed51 ?observed30 ?observed20 ?observed15 ?observed42 ?observed21 ?observed60

Matching Y(1)Y(0)X (Age) observed?19 observed?22 observed?23 ?observed51 ?observed30 ?observed20 ?observed15 ?observed42 ?observed21 ?observed60

Matching Y(1)Y(0)X (Age) observed?19 observed?22 observed?23 ?observed51 ?observed35 ?observed20 ?observed15 ?observed42 ?observed21 ?observed60 uh oh…

Ideal Matches Ideal: minimize total (or average) covariate distance for pairs Hard to do computationally, especially for large sample sizes

“Greedy” Matching Greedy matching orders the treated units, and then sequentially chooses the closest control (ignoring effect on later matches) When doing this, helps to first match units that will be hardest to match One possibility: order by decreasing propensity score (treated units with highest propensity scores are most unlike controls)

Matching with Replacement Matching can be done with replacement Pros: o Easier computationally (ideal matches overall same as just closest for each unit) o Better matches Cons: o Variance of estimator higher (controls can be used more than once, so less information) o Variance is harder to estimate (no longer independent)

Matching with Replacement Matching with replacement is necessary if the group you want to make inferences about is the smaller group Matching with replacement also allows you to make inferences about the entire sample (find a match for every unit, from opposite group) Units more similar to those in the opposite group will be selected more

Multiple Covariates With multiple covariates, how do you know which to prioritize? 21 year old female Which is a better match: o 18 year old female o 21 year old male Want a way to measure multivariate covariate distance

Distance Metric Lots of different possible distance metrics Mahalanobis distance? Sum of squared (standardized) covariate difference in means? Difference in propensity scores? Linearized propensity score…

Linearized Propensity Score Difference between propensity scores of and 0.01 is larger than difference between propensity scores 0.1 and Better option: linearized propensity score, the log odds of propensity score: Logistic regression:

Linearized Propensity Score PSLinearized PS

Linearized Propensity Score Note: the linearized propensity score is recommended for subclassification as well, although it isn’t as important in that setting Won’t change subclasses, but will change your view of whether a subclass is small enough

Hybrid Matching In hybrid matching, match on more than one criteria Example: exact matches are required for some covariates, and other covariates are just as close as possible o Example: 21 year old female; look for closest age only within female controls Example: match on propensity score and important covariate(s)

Multiple Matches Paired matching is called 1:1 matching (1 control to 1 treated) If the control group is much bigger than the treatment group, can do 2:1 matching (2 controls to 1 treatment unit), or more to one matching Another option: caliper matching in which all controls within a certain distance (based on some metric) of a treated unit are matched with that unit

Matching Like propensity score estimation… and like subclassification…. … there are no “right” matches If the matches you choose give good covariate balance, then you did a good job!

Decisions Estimating propensity score: o What variables to include? o How many units to trim, if any? Subclassification: o How many subclasses and where to break? Matching: o with or without replacement? o 1:1, 2:1, … ? o how to weight variables / distance measure? o exact matching for any variable(s)? o calipers for which a match is “acceptable”? o…o…

Lalonde Data Analyze the causal effect of a job training program on wages Data on 185 treated (participated in job training program) and 2490 controls (did not participate in job training program) GOAL: achieve covariate balance!

To Do Read Ch 15, 18 Homework 4 (due Monday)