Regression Discontinuity 10/13/09. What is R.D.? Regression--the econometric/statistical tool social scientists use to analyze multivariate correlations.

Slides:

Advertisements

Similar presentations

Regression Discontinuity. Basic Idea Sometimes whether something happens to you or not depends on your ‘score’ on a particular variable e.g –You get a.

Advertisements

Girls’ scholarship program.  Often small/no impacts on actual learning in education research ◦ Inputs (textbooks, flipcharts) little impact on learning.

Review of Identifying Causal Effects Methods of Economic Investigation Lecture 13.

#ieGovern Impact Evaluation Workshop Istanbul, Turkey January 27-30, 2015 Measuring Impact 1 Non-experimental methods 2 Experiments Vincenzo Di Maro Development.

Random Assignment Experiments

Chapter 12: Testing hypotheses about single means (z and t) Example: Suppose you have the hypothesis that UW undergrads have higher than the average IQ.

Regression Discontinuity/Event Studies

Presented by Malte Lierl (Yale University).  How do we measure program impact when random assignment is not possible ?  e.g. universal take-up  non-excludable.

The counterfactual logic for public policy evaluation Alberto Martini hard at first, natural later 1.

Tips for Writing Free Response Questions on the AP Statistics Exam Laura Trojan Cannon School.

Objectives Use simulations and hypothesis testing to compare treatments from a randomized experiment.

Models with Discrete Dependent Variables

Regression Discontinuity 10/13/08. What is R.D.? Regression--the econometric/statistical tool social scientists use to analyze multivariate correlations.

Impact Evaluation Methods. Randomized Trials Regression Discontinuity Matching Difference in Differences.

Econometric Details -- the market model Assume that asset returns are jointly multivariate normal and independently and identically distributed through.

Agenda for January 25 th Administrative Items/Announcements Attendance Handouts: course enrollment, RPP instructions Course packs available for sale in.

1 Psych 5500/6500 The t Test for a Single Group Mean (Part 5): Outliers Fall, 2008.

Regression Discontinuity Design 1. 2 Z Pr(X i =1 | z) 0 1 Z0Z0 Fuzzy Design Sharp Design.

Sampling Distributions

C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview of Lecture Independent and Dependent Variables Between and Within Designs.

1 BA 555 Practical Business Analysis Review of Statistics Confidence Interval Estimation Hypothesis Testing Linear Regression Analysis Introduction Case.

Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan Simple Linear Regression SECTIONS 9.3 Confidence and prediction intervals.

Bootstrapping applied to t-tests

Larysa Minzyuk - Felice Russo Department of Management and Economics - University of Salento (Lecce)

Linear Regression Inference

Chapter 5 Sampling Distributions

Jan 17,  Hypothesis, Null hypothesis Research question Null is the hypothesis of “no relationship”  Normal Distribution Bell curve Standard normal.

The t Tests Independent Samples. The t Test for Independent Samples Observations in each sample are independent (not from the same population) each other.

Significance Tests: THE BASICS Could it happen by chance alone?

Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.

Psyc 235: Introduction to Statistics DON’T FORGET TO SIGN IN FOR CREDIT!

Psy B07 Chapter 4Slide 1 SAMPLING DISTRIBUTIONS AND HYPOTHESIS TESTING.

Exam Exam starts two weeks from today. Amusing Statistics Use what you know about normal distributions to evaluate this finding: The study, published.

From Theory to Practice: Inference about a Population Mean, Two Sample T Tests, Inference about a Population Proportion Chapters etc.

Part III Taking Chances for Fun and Profit Chapter 8 Are Your Curves Normal? Probability and Why it Counts.

Session III Regression discontinuity (RD) Christel Vermeersch LCSHD November 2006.

Application 3: Estimating the Effect of Education on Earnings Methods of Economic Investigation Lecture 9 1.

Lecture 7: What is Regression Analysis? BUEC 333 Summer 2009 Simon Woodcock.

Scatterplots & Regression Week 3 Lecture MG461 Dr. Meredith Rolfe.

Applying impact evaluation tools A hypothetical fertilizer project.

Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.

Assumptions 5.4 Data Screening. Assumptions Parametric tests based on the normal distribution assume: – Independence – Additivity and linearity – Normality.

FINAL MEETING – OTHER METHODS Development Workshop.

Africa Program for Education Impact Evaluation Dakar, Senegal December 15-19, 2008 Experimental Methods Muna Meky Economist Africa Impact Evaluation Initiative.

Week 6. Statistics etc. GRS LX 865 Topics in Linguistics.

Hypothesis Testing Introduction to Statistics Chapter 8 Feb 24-26, 2009 Classes #12-13.

Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.

Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.

Hypothesis Tests. An Hypothesis is a guess about a situation that can be tested, and the test outcome can be either true or false. –The Null Hypothesis.

Hypothesis Testing with z Tests Chapter 7. A quick review This section should be a review because we did a lot of these examples in class for chapter.

The inference and accuracy We learned how to estimate the probability that the percentage of some subjects in the sample would be in a given interval by.

Statistics 19 Confidence Intervals for Proportions.

Review Design of experiments, histograms, average and standard deviation, normal approximation, measurement error, and probability.

Copyright © 2009 Pearson Education, Inc. Chapter 6 The Standard Deviation as a Ruler and the Normal Model.

Differences-in-Differences

Differences-in-Differences

Tips for Writing Free Response Questions on the AP Statistics Exam

Chapter 5 Sampling Distributions

Tips for Writing Free Response Questions on the AP Statistics Exam

Chapter 5 Sampling Distributions

Methods of Economic Investigation Lecture 12

Warm-up We are going to collect some data and determine if it is “normal” Roll each pair of dice 10 times and record the SUM of the two digits in your.

Regression Discontinuity

Chapter 5 Sampling Distributions

Ch. 13. Pooled Cross Sections Across Time: Simple Panel Data.

Impact Evaluation Methods: Difference in difference & Matching

Honors Statistics The Standard Deviation as a Ruler and the Normal Model Chapter 6 Part 3.

Explanation of slide: Logos, to show while the audience arrive.

Ch. 13. Pooled Cross Sections Across Time: Simple Panel Data.

Using Big Data to Solve Economic and Social Problems

Presentation transcript:

Regression Discontinuity 10/13/09

What is R.D.? Regression--the econometric/statistical tool social scientists use to analyze multivariate correlations Where Y is some sort of dependent variable, alpha’s a constant, the X’s are a bunch of independent variables, the beta’s are coefficients, and the e is the error term.

Some sort of arbitrary jump/change thanks to a quirk in law or nature. Discontinuity

Some sort of arbitrary jump/change thanks to a quirk in law or nature. We’re interested in the ones that make very similar people get very dissimilar results. Discontinuity

Discontinuity Examples PSAT/NMSQT –Basically the top 16,000 test-takers get a scholarship. –A small difference in test score can means a discontinuous jump in scholarship amount.

Discontinuity Examples School Class Size –Maimonides’ Rule--No more than 40 kids in a class in Israel. –40 kids in school means 40 kids per class. 41 kids means two classes with 20 and 21. (Angrist & Lavy, QJE 1999)

Discontinuity Examples Union Elections –If employers want to unionize, NLRB holds election. 50% means the employer doesn’t have to recognize the union, and 50% + 1 means the employer is required to “bargain in good faith” with the union. (DiNardo & Lee, QJE 2004)

Discontinuity Examples U.S. House Elections –Incumbency advantage. If you’re first past the pole in the previous election, even by just one vote, you get a huge advantage in the next election. (David Lee, Journal of Econometrics 2007)

Discontinuity Examples Air Pollution and Home Values –The Clean Air Act’s National Ambient Air Quality Standards say if the geometric mean concentration of 5 pollutant particulates is 75 micrograms per cubic meter or greater, county is classified as “non-attainment” and are subject to much more stringent regulation. (Ken Chay, Michael Greenstone, JPE 2005)

Combine the “R” and the “D” Run a regression based on a situation where you’ve got a discontinuity. Treat above-the-cutoff and below-the- cutoff like the treatment and control groups from a randomization.

Why are we doing this? Why do we have to look for quirks like this? Can’t we just control for whatever we want using OLS or some other line-fitting tool? Just get a bunch of people’s salaries and PSAT scores. PSAT’s are X, income is Y, run a regression in Stata, and we have causal inference, right? Higher test scores cause people to earn more later in life.

No. The statistical methods we use are based on lot of assumptions. Importantly, the error terms (which is really full of things we can’t measure, the unobservables) are supposed to be uncorrelated with the X’s and normally distributed. In reality, those conditions probably hasn’t been met in any of the previous situations.

No. The statistical methods we use are based on lot of assumptions. Importantly, the error terms (which is really full of things we can’t measure, the unobservables) are supposed to be uncorrelated with the X’s and normally distributed. In reality, those conditions probably hasn’t been met in any of the previous situations. For example, class size is probably correlated with some type of neighborhood quality. Please turn to your neighbor and discuss what is probably wrong with each of the previous 5 examples (PSAT, class size, union elections, house elections, air pollution)

No. Higher PSAT kids might have higher ability. Crowded classrooms might be in poorer schools. (Or special needs students might be in small classes.) Unionized workers might work for certain types of firms. Incumbent politicians might be better. They won before, didn’t they? Pollution might be correlated to economic growth, which could increase home values.

Controlling for everything? Focus on the Israeli schools for a second. We can try and control for neighborhood poverty level. Does that solve the problem? No. If neighborhood poverty level (observables) are correlated with the X of interest (class size) why would you think it’s safe to assume that the unobservables aren’t correlated? Have you magically controlled for every single thing that’s correlated with the X of interest? Probably not.

Controlling for everything? Focus on the Israeli schools for a second. We can try and control for neighborhood poverty level. Does that solve the problem? No. If neighborhood poverty level (observables) are correlated with the X of interest (class size) why would you think it’s safe to assume that the unobservables aren’t correlated? Have you magically controlled for every single thing that’s correlated with the X of interest? Probably not. So let’s find a bandwidth in which these things are uncorrelated.

A Bandwidth of Randomness Test scores aren’t random, and neither is class size, nor air pollution. But is a kid in the 94.9th percentile really that different from the 95th percentile kid? Is a school with 40 kids that different from a school with 41? Right around the cutoff, there’s a good chance things are random.

No Sorting - Observables Don’t take my word for it. Look at the averages of the observables in your below-cutoff group, and the averages of the observables in the above-cutoff group. Are they the same? Hopefully. Do people know about this cutoff? Are they doing some endogenous sorting? When deciding where to live, did good moms look for schools where their kids would be the 41st kid? Did certain types of polluters look for counties where they’d be below the cutoff? These things can be checked to some degree--look at the average observables above and below the cutoff.

No Sorting - Clumping In addition to checking the observables on either side of the cutoff, we should check the density of the distribution. Is it unusually low/high right around the cutoff? If there’s some abnormally large portion of people right around the cutoff, it’s quite possible that you don’t have random assignment.

No Sorting - Clumping You’re totally cheating. Please stop. Emily Conover & Adriana Camacho “Manipulation of Social Program Eligibility”

GSP--Multiple Analyses “Incentives to Learn,” Ted Miguel, Michael Kremer, Rebecca Thornton Girls Scholarship Program, Busia Kenya. Randomize holding a scholarship competition across schools in Busia and Teso districts.

GSP--Multiple Analyses “Incentives to Learn,” Ted Miguel, Michael Kremer, Rebecca Thornton Girls Scholarship Program, Busia Kenya. Randomize holding a scholarship competition across schools in Busia and Teso districts. Treatment: If a girl finishes in the top 15% in her district on the end-of- year exam, she wins a two-year scholarship. Randomization Analysis: Does attending a school with the competition make you work harder/improve schooling outcomes? RD Analysis: Does winning the award improve schooling outcomes?

P-900 in Chile “The Central Role of Noise in Evaluating Interventions That Use Test Scores to Rank Schools” Kenneth Y. Chay, Patrick J. Mcewan, Miguel Urquiola, AER 2005 Mean Reversion: Sophomore Slump, SI Cover Curse, Heisman Trophy Curse, Madden curse, and in the opposite direction.

THIS IS THE MOST AMAZING THING EVER! Look at the educational outcomes of treatment schools in 1990, compared to those same schools in 1988, before the program. AMAZING! FANTABULOUS!

Oh, wait. Hmm. That’s kind of disappointing.

So how do we actually do this? 1.Draw two pretty pictures 1.Eligibility criterion (test score, income, or whatever) vs. Program Enrollment 2.Eligibility criterion vs. Outcome

So how do we actually do this? 2. Run a simple regression. (Yes, this is basically all we ever do, and STATA can run the calculation in almost any situation, but before we do it, it’s necessary to make sure the situation is appropriate and draw the graphs so that we can have confidence that our estimates are actually causal.) Outcome as a function of test score (or whatever), with a binary (1 if yes, 0 if no) variable for program enrollment.

Is it really that simple? Not quite. You could totally have a situation where the outcome is some sort of quadratic or cubic or nth polynomial function of the test score. Try controlling for that. This is going to depend on the situation and is somewhat arbitrary.

Wait, “somewhat arbitrary?” Lame, I know. But two things aren’t universally clear: 1. How wide a bandwidth around the cutoff are we looking at? We’re really only confident in our estimate for people that are close to the cutoff. This is a LOCAL AVERAGE TREATMENT EFFECT. We can confidently say that a school right around the cutoff would improve average test scores by X if they received the treatment, but we’re not so confident that already-awesome schools would get the same benefit.

Wait, “somewhat arbitrary?” 2. Without the program, what shaped function would there be naturally? What sort of function do we throw in to control for the fact that even if there was no National Merit Semifinalist scholarship, smarter kids are likely to earn more later in life? The solution: SHOW YOUR WORK

Fake Programs In addition to showing your work, another good robustness check is to test for the effects of non-existent programs.

Fake Programs

Conclusion Find a threshold Look at people just above and just below Make sure there’s no sorting It’s only a local effect