Addressing Alternative Explanations: Multiple Regression 17.871 Spring 2007.

Slides:



Advertisements
Similar presentations
Dummy Variables and Interactions. Dummy Variables What is the the relationship between the % of non-Swiss residents (IV) and discretionary social spending.
Advertisements

Sociology 601 Class 24: November 19, 2009 (partial) Review –regression results for spurious & intervening effects –care with sample sizes for comparing.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 1) Slideshow: exercise 1.16 Original citation: Dougherty, C. (2012) EC220 - Introduction.
From Anova to Regression: analyzing the effect on consumption of no. of persons in family Family consumption data family.dta E/Albert/Courses/cdas/appstat00/From.
ELASTICITIES AND DOUBLE-LOGARITHMIC MODELS
Lecture 9 Today: Ch. 3: Multiple Regression Analysis Example with two independent variables Frisch-Waugh-Lovell theorem.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 3) Slideshow: exercise 3.5 Original citation: Dougherty, C. (2012) EC220 - Introduction.
Multiple Regression Spring Gore Likeability Example Suppose: –Gore’s* likeability is a function of Clinton’s likeability and not directly.
Sociology 601 Class 25: November 24, 2009 Homework 9 Review –dummy variable example from ASR (finish) –regression results for dummy variables Quadratic.
Sociology 601 Class 28: December 8, 2009 Homework 10 Review –polynomials –interaction effects Logistic regressions –log odds as outcome –compared to linear.
1 Multiple Regression EPP 245/298 Statistical Analysis of Laboratory Data.
Regression Example Using Pop Quiz Data. Second Pop Quiz At my former school (Irvine), I gave a “pop quiz” to my econometrics students. The quiz consisted.
Introduction to Regression Analysis Straight lines, fitted values, residual values, sums of squares, relation to the analysis of variance.
1 Review of Correlation A correlation coefficient measures the strength of a linear relation between two measurement variables. The measure is based on.
So far, we have considered regression models with dummy variables of independent variables. In this lecture, we will study regression models whose dependent.
1 Michigan.do. 2. * construct new variables;. gen mi=state==26;. * michigan dummy;. gen hike=month>=33;. * treatment period dummy;. gen treatment=hike*mi;
Regression Forced March Spring Regression quantifies how one variable can be described in terms of another.
Sociology 601 Class 23: November 17, 2009 Homework #8 Review –spurious, intervening, & interactions effects –stata regression commands & output F-tests.
A trial of incentives to attend adult literacy classes Carole Torgerson, Greg Brooks, Jeremy Miles, David Torgerson Classes randomised to incentive or.
1 Zinc Data EPP 245 Statistical Analysis of Laboratory Data.
1 Regression and Calibration EPP 245 Statistical Analysis of Laboratory Data.
Sociology 601 Class 26: December 1, 2009 (partial) Review –curvilinear regression results –cubic polynomial Interaction effects –example: earnings on married.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 6) Slideshow: variable misspecification iii: consequences for diagnostics Original.
TESTING A HYPOTHESIS RELATING TO A REGRESSION COEFFICIENT This sequence describes the testing of a hypotheses relating to regression coefficients. It is.
SLOPE DUMMY VARIABLES 1 The scatter diagram shows the data for the 74 schools in Shanghai and the cost functions derived from a regression of COST on N.
EDUC 200C Section 4 – Review Melissa Kemmerle October 19, 2012.
TOBIT ANALYSIS Sometimes the dependent variable in a regression model is subject to a lower limit or an upper limit, or both. Suppose that in the absence.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 10) Slideshow: Tobit models Original citation: Dougherty, C. (2012) EC220 - Introduction.
1 INTERACTIVE EXPLANATORY VARIABLES The model shown above is linear in parameters and it may be fitted using straightforward OLS, provided that the regression.
Confidence intervals were treated at length in the Review chapter and their application to regression analysis presents no problems. We will not repeat.
EXERCISE 5.5 The Stata output shows the result of a semilogarithmic regression of earnings on highest educational qualification obtained, work experience,
Country Gini IndexCountryGini IndexCountryGini IndexCountryGini Index Albania28.2Georgia40.4Mozambique39.6Turkey38 Algeria35.3Germany28.3Nepal47.2Turkmenistan40.8.
How do Lawyers Set fees?. Learning Objectives 1.Model i.e. “Story” or question 2.Multiple regression review 3.Omitted variables (our first failure of.
Social Science Models What elements (at least two) are necessary for a “social science model”?
Addressing Alternative Explanations: Multiple Regression
MultiCollinearity. The Nature of the Problem OLS requires that the explanatory variables are independent of error term But they may not always be independent.
EDUC 200C Section 3 October 12, Goals Review correlation prediction formula Calculate z y ’ = r xy z x for a new data set Use formula to predict.
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation.
MULTIPLE REGRESSION WITH TWO EXPLANATORY VARIABLES: EXAMPLE 1 This sequence provides a geometrical interpretation of a multiple regression model with two.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 1) Slideshow: exercise 1.5 Original citation: Dougherty, C. (2012) EC220 - Introduction.
6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y is.
Lecture 3 Linear random intercept models. Example: Weight of Guinea Pigs Body weights of 48 pigs in 9 successive weeks of follow-up (Table 3.1 DLZ) The.
. reg LGEARN S WEIGHT85 Source | SS df MS Number of obs = F( 2, 537) = Model |
Christopher Dougherty EC220 - Introduction to econometrics (chapter 5) Slideshow: exercise 5.2 Original citation: Dougherty, C. (2012) EC220 - Introduction.
Panel Data. Assembling the Data insheet using marriage-data.csv, c d u "background-data", clear d u "experience-data", clear u "wage-data", clear d reshape.
Simple Linear Regression. Data available : (X,Y) Goal : To predict the response Y. (i.e. to obtain the fitted response function f(X)) Least Squares Fitting.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 4) Slideshow: exercise 4.5 Original citation: Dougherty, C. (2012) EC220 - Introduction.
(1)Combine the correlated variables. 1 In this sequence, we look at four possible indirect methods for alleviating a problem of multicollinearity. POSSIBLE.
Special topics. Importance of a variable Death penalty example. sum death bd- yv Variable | Obs Mean Std. Dev. Min Max
COST 11 DUMMY VARIABLE CLASSIFICATION WITH TWO CATEGORIES 1 This sequence explains how you can include qualitative explanatory variables in your regression.
Lecture 5. Linear Models for Correlated Data: Inference.
STAT E100 Section Week 12- Regression. Course Review - Project due Dec 17 th, your TA. - Exam 2 make-up is Dec 5 th, practice tests have been updated.
RAMSEY’S RESET TEST OF FUNCTIONAL MISSPECIFICATION 1 Ramsey’s RESET test of functional misspecification is intended to provide a simple indicator of evidence.
1 CHANGES IN THE UNITS OF MEASUREMENT Suppose that the units of measurement of Y or X are changed. How will this affect the regression results? Intuitively,
SEMILOGARITHMIC MODELS 1 This sequence introduces the semilogarithmic model and shows how it may be applied to an earnings function. The dependent variable.
GRAPHING A RELATIONSHIP IN A MULTIPLE REGRESSION MODEL The output above shows the result of regressing EARNINGS, hourly earnings in dollars, on S, years.
1 BINARY CHOICE MODELS: LINEAR PROBABILITY MODEL Economists are often interested in the factors behind the decision-making of individuals or enterprises,
1 REPARAMETERIZATION OF A MODEL AND t TEST OF A LINEAR RESTRICTION Linear restrictions can also be tested using a t test. This involves the reparameterization.
1 In the Monte Carlo experiment in the previous sequence we used the rate of unemployment, U, as an instrument for w in the price inflation equation. SIMULTANEOUS.
1 Experimental Statistics - week 12 Chapter 11: Linear Regression and Correlation Chapter 12: Multiple Regression.
VARIABLE MISSPECIFICATION II: INCLUSION OF AN IRRELEVANT VARIABLE In this sequence we will investigate the consequences of including an irrelevant variable.
QM222 Class 9 Section A1 Coefficient statistics
The slope, explained variance, residuals
QM222 Your regressions and the test
Auto Accidents: What’s responsible?
QM222 Class 15 Section D1 Review for test Multicollinearity
Simple Linear Regression
Covariance x – x > 0 x (x,y) y – y > 0 y x and y axes.
EPP 245 Statistical Analysis of Laboratory Data
Introduction to Econometrics, 5th edition
Presentation transcript:

Addressing Alternative Explanations: Multiple Regression Spring 2007

Gore Likeability Example Did Clinton hurt Gore in the 2000 election? How would you test this? What alternative explanations would you need to address? Other examples of alternative explanations based on omitted variables?

Democratic picture

Independent picture

Republican picture

Combined data picture

Combined data picture with regression Clinton thermometer

Combined data picture with “true” regression lines overlaid Clinton thermometer

Tempting yet wrong normalizations Subtract the Gore therm. from the avg. Gore therm. score Subtract the Clinton therm. from the avg. Clinton therm. score

Summary: Why we control Address alternative explanations by removing confounding effects Improve efficiency

Gore vs. Clinton Overall Within party Rep. Ind. Dem.

3D Relationship

3D Linear Relationship

The Linear Relationship between Three Variables

The Slope Coefficients

The Slope Coefficients More Simply

The Matrix form y1y1 y2y2 … ynyn 1x 1,1 x 2,1 …x k,1 1x 1,2 x 2,2 …x k,2 1………… 1x 1,n x 2,n …x k,n

Consider two regression coefficients When does ? Obviously, when

Separate regressions (1)(2)(3) Intercept Clinton Party

Why did the Clinton Coefficient change from 0.62 to corr gore clinton party,cov (obs=1745) | gore clinton party gore | clinton | party3 |

The Calculations. corr gore clinton party,cov (obs=1745) | gore clinton party gore | clinton | party3 |

Accounting for total effects (i.e., regression coefficient when we regress X 2 (as dep. var.) on X 1 (as ind. var.)

Accounting for the total effect Total effect = Direct effect + indirect effect Y X1X1 X2X2

Accounting for the total effects in the Gore thermometer example EffectTotalDirectIndirect Clinton Party

The Output. reg gore clinton party3 Source | SS df MS Number of obs = F( 2, 1742) = Model | Prob > F = Residual | R-squared = Adj R-squared = Total | Root MSE = gore | Coef. Std. Err. t P>|t| [95% Conf. Interval] clinton | party3 | _cons |

Other approaches to addressing confounding effects? Experiments Difference-in-differences designs Others? Is regression the best approach to addressing confounding effects?  Problems

Drinking and Greek Life Example Why is there a correlation between living in a fraternity/sorority house and drinking?  Greek organizations often emphasize social gatherings that have alcohol. The effect is being in the Greek organization itself, not the house.  There’s something about the House environment itself.

Dependent variable: Times Drinking in Past 30 Days

. infix age residence 16 greek 24 screen 102 timespast howmuchpast gpa studying 281 timeshs 325 howmuchhs 326 socializing 283 stwgt_ weight using da3818.dat,clear (14138 observations read). recode timespast30 timeshs (1=0) (2=1.5) (3=4) (4=7.5) (5=14.5) (6=29.5) (7=45) (timespast30: 6571 changes made) (timeshs: changes made). replace timespast30=0 if screen<=3 (4631 real changes made)

. tab timespast30 timespast30 | Freq. Percent Cum | 4, | 2, | 2, | 1, | 1, | | Total | 13,

Three Regressions Dependent variable: number of times drinking in past 30 days Live in frat/sor house4.44 (0.35) (0.38) Member of frat/sor (0.16) 2.44 (0.18) Intercept4.54 (0.56) 4.27 (0.059) 4.27 (0.059) R N13,876 Note: Corr. Between living in frat/sor house and being a member of a Greek organization is.42

The Picture Drinks per 30 day period Living in frat house Member of fraternity

Accounting for the effects of frat house living and Greek membership on drinking EffectTotalDirectIndirect Member of Greek org (85%) 0.44 (15%) Live in frat/ sor. house (51%) 2.18 (49%)