Chapter 6: Multiple Regression – Additional Topics


1 Chapter 6: Multiple Regression – Additional Topics

2 Functional Form of Regression: Logs
Logarithmic functional forms
Log-linear: $\ln(y) = a + bx$. Since $\Delta \ln y \approx \Delta y / y$, this gives $\Delta y / y \approx b \, \Delta x$.
Example: $\ln(y) = 10 + 0.3x$. A one-unit increase in x changes y by approximately 30%.
Log-log: $\ln(y) = a + b \ln(x)$, so $\Delta \ln y = b \, \Delta \ln x$, i.e. $\Delta y / y \approx b \, \Delta x / x$.
Example: $\ln(y) = 3 - 0.2 \ln(x)$. A 1% increase in x causes a 0.2% decrease in y.
Slope coefficients on logged variables are invariant to rescaling by a constant.
Original model: $\ln(y_i) = a_0 + a_1 \ln(x_i)$.
Suppose $x_i^* = x_i / c$, where $c$ is a constant. Then
$\ln(y_i) = b_0 + b_1 \ln(x_i^*) = b_0 + b_1 (\ln x_i - \ln c) = (b_0 - b_1 \ln c) + b_1 \ln x_i$.
Note: the intercept changes, but not the slope.
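The invariance claim on this slide can be checked numerically. The following is a minimal sketch on simulated data, not the deck's own example: the coefficients, the scaling constant `c`, and the helper `ols` are all assumptions chosen for illustration.

```python
import numpy as np

def ols(X, y):
    """Return OLS coefficients of y on X (X must already contain a constant column)."""
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef

# Simulated data (assumed values, loosely echoing ln(y) = 3 - 0.2 ln(x))
rng = np.random.default_rng(0)
n = 500
x = rng.uniform(1.0, 10.0, size=n)
ln_y = 3.0 - 0.2 * np.log(x) + rng.normal(scale=0.1, size=n)

c = 1000.0          # hypothetical rescaling constant, e.g. a change of units
x_star = x / c      # rescaled regressor x* = x / c

# Regress ln(y) on ln(x), then on ln(x*)
a0, a1 = ols(np.column_stack([np.ones(n), np.log(x)]), ln_y)
b0, b1 = ols(np.column_stack([np.ones(n), np.log(x_star)]), ln_y)

print("slope, original x:", a1)
print("slope, rescaled x:", b1)
print("intercepts:", a0, "vs", b0, "(shifted by b1 * ln c)")
```

The two slopes agree, while the intercepts differ by $b_1 \ln c$, matching the algebra above.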

3 Functional Form of Regression: Logs
Taking logs often eliminates or mitigates problems with outliers.
Taking logs often helps to secure normality and homoskedasticity.
Logs must not be used if variables take on zero or negative values.

4 Functional Form of Regression: Quadratic
Using quadratic functional forms
Example: Wage equation
Marginal effect of experience
Over what range of experience is the marginal effect positive? Negative?
Concave experience profile
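The slide's own equations were not preserved in the transcript. As a generic sketch, assuming a model with one regressor $x$ entering linearly and squared (the symbols $\beta_0, \beta_1, \beta_2$ are placeholders, not the deck's estimates), the marginal effect and the range over which it is positive follow directly:

```latex
\[
y = \beta_0 + \beta_1 x + \beta_2 x^2 + u
\quad\Longrightarrow\quad
\frac{\Delta y}{\Delta x} \approx \beta_1 + 2\beta_2 x .
\]
\[
\text{If } \beta_1 > 0 \text{ and } \beta_2 < 0:\quad
\frac{\Delta y}{\Delta x} > 0 \iff x < x^{*} = -\frac{\beta_1}{2\beta_2}.
\]
```

With $\beta_1 > 0$ and $\beta_2 < 0$ the profile is concave: the effect of an additional unit of $x$ shrinks as $x$ grows and turns negative beyond $x^{*}$.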

5 Functional Form of Regression: Quadratic
Example: Effects of pollution on housing prices
Nitrogen oxide in the air, distance from employment centers, average student/teacher ratio
Does this mean that, at a low number of rooms, more rooms are associated with lower prices?

6 Functional Form of Regression: Quadratic
Calculation of the turnaround point
Turnaround point:
This area can be ignored as it concerns only 1% of the observations.
Increase rooms from 5 to 6:
Increase rooms from 6 to 7:
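The numerical results for this slide did not survive in the transcript, so the sketch below uses hypothetical coefficients (`b1 = -0.4`, `b2 = 0.05`), not the deck's estimates. It only illustrates how a turnaround point and the approximate effect of one more room could be computed for a quadratic term in a log(price) equation.

```python
def marginal_effect(b1, b2, x):
    """Approximate effect of a one-unit increase in x, evaluated at x,
    for a model containing b1*x + b2*x**2."""
    return b1 + 2.0 * b2 * x

def turnaround_point(b1, b2):
    """Value of x at which the marginal effect b1 + 2*b2*x changes sign."""
    if b2 == 0:
        raise ValueError("no turnaround point without a quadratic term")
    return -b1 / (2.0 * b2)

# Hypothetical coefficients on rooms and rooms**2 (assumed, for illustration only)
b1, b2 = -0.4, 0.05

x_turn = turnaround_point(b1, b2)
effect_5_to_6 = marginal_effect(b1, b2, 5)   # approx. proportional change in price
effect_6_to_7 = marginal_effect(b1, b2, 6)

print("turnaround point:", x_turn)
print("rooms 5 -> 6:", effect_5_to_6)
print("rooms 6 -> 7:", effect_6_to_7)
```

Below the turnaround point the estimated effect of an extra room is negative; if only a small share of observations lies there, that region carries little practical weight.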

7 Functional Form of Regression: Interaction Terms
Models with interaction terms
Interaction term
Effect of number of bedrooms, but for a square footage of zero

8 Functional Form of Regression: Interaction Terms
Reparametrization of interaction effects
$y = \beta_0 + \beta_1 x_1 + \beta_2 x_2 + \beta_3 x_1 x_2 + u$
Measure $(x_1, x_2)$ as deviations from their means $\mu_1, \mu_2$:
$x_1^* = x_1 - \mu_1$, $x_2^* = x_2 - \mu_2$
Re-estimate the model with $x_1^*, x_2^*$:
$y = \delta_0 + \delta_1 x_1^* + \delta_2 x_2^* + \delta_3 x_1^* x_2^* + u$
$\Delta y / \Delta x_1^* = \delta_1 + \delta_3 x_2^* = \delta_1$ if $x_2 = \mu_2$
Advantages of reparametrization
Easy interpretation of coefficients
Standard errors for partial effects at the mean
Interactions may be centered at values other than the mean
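A short simulation can make the reparametrization concrete. This is a sketch under stated assumptions: the data-generating coefficients are invented, and sample means stand in for $\mu_1, \mu_2$. It shows that centering leaves the interaction coefficient unchanged and turns the coefficient on $x_1^*$ into the partial effect of $x_1$ at the mean of $x_2$.

```python
import numpy as np

def ols(X, y):
    """Return OLS coefficients of y on X (X must already contain a constant column)."""
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef

# Simulated data with an interaction (all numbers are assumptions)
rng = np.random.default_rng(1)
n = 400
x1 = rng.normal(5.0, 1.0, size=n)
x2 = rng.normal(2.0, 0.5, size=n)
y = 1.0 + 0.5 * x1 - 0.3 * x2 + 0.2 * x1 * x2 + rng.normal(scale=0.1, size=n)

# Original parametrization: y on 1, x1, x2, x1*x2
beta = ols(np.column_stack([np.ones(n), x1, x2, x1 * x2]), y)

# Centered parametrization, using sample means in place of mu_1, mu_2
m1, m2 = x1.mean(), x2.mean()
x1c, x2c = x1 - m1, x2 - m2
delta = ols(np.column_stack([np.ones(n), x1c, x2c, x1c * x2c]), y)

print("beta_1 (effect of x1 when x2 = 0):   ", beta[1])
print("delta_1 (effect of x1 at mean of x2):", delta[1])
print("beta_1 + beta_3 * mean(x2):          ", beta[1] + beta[3] * m2)
```

Because $\delta_1$ is estimated directly, its standard error from the centered regression is the standard error of the partial effect at the mean, with no extra calculation.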

9 Average Partial Effects
In models with quadratics, interactions, and other nonlinear functional forms, the partial effect depends on the values of one or more explanatory variables.
The average partial effect (APE) is a summary measure that describes the relationship between the dependent variable and each explanatory variable.
After computing the partial effect and plugging in the estimated parameters, average the partial effects for each unit across the sample.
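The recipe above can be sketched in a few lines. The model form, the coefficient values in `beta_hat`, and the simulated regressors are all hypothetical; the point is only the mechanics of evaluating the partial effect for every unit and then averaging.

```python
import numpy as np

def partial_effect_x1(beta, x1, x2):
    """Partial effect of x1 in the assumed model
    y = b0 + b1*x1 + b2*x1**2 + b3*x2 + b4*x1*x2 + u."""
    b0, b1, b2, b3, b4 = beta
    return b1 + 2.0 * b2 * x1 + b4 * x2

def average_partial_effect(beta, x1, x2):
    """Average the unit-level partial effects of x1 across the sample."""
    return float(np.mean(partial_effect_x1(beta, x1, x2)))

# Hypothetical estimated coefficients and sample (assumptions for illustration)
beta_hat = np.array([1.0, 0.30, -0.006, 0.5, 0.02])
rng = np.random.default_rng(2)
x1 = rng.uniform(0.0, 40.0, size=200)
x2 = rng.uniform(8.0, 18.0, size=200)

ape_x1 = average_partial_effect(beta_hat, x1, x2)
print("APE of x1:", ape_x1)
```

In this particular model the partial effect is linear in the regressors, so the APE coincides with the partial effect evaluated at the sample means; for other nonlinear forms the two generally differ.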

10 Goodness of Fit: R2
General remarks on R2
A high R-squared does not imply that there is a causal interpretation.
A low R-squared does not preclude precise estimation of partial effects.
Adjusted vs unadjusted R2
The ordinary R-squared is an estimate for the population R-squared, $1 - \sigma_u^2 / \sigma_y^2$.
The adjusted R-squared increases if, and only if, the t-statistic of a newly added regressor is greater than one in absolute value.
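The statement about the t-statistic can be verified on simulated data. The sketch below assumes the standard definition of the adjusted R-squared, $\bar{R}^2 = 1 - [SSR/(n-k-1)] / [SST/(n-1)]$, which the transcript does not show; the helper `fit_ols` and the data are illustrative.

```python
import numpy as np

def fit_ols(X, y):
    """OLS of y on X (X includes a constant column).
    Returns coefficients, standard errors, R-squared, adjusted R-squared."""
    n, p = X.shape                      # p = k + 1 parameters incl. intercept
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ coef
    ssr = float(resid @ resid)
    sst = float(((y - y.mean()) ** 2).sum())
    sigma2 = ssr / (n - p)              # unbiased error variance estimate
    se = np.sqrt(sigma2 * np.diag(np.linalg.inv(X.T @ X)))
    r2 = 1.0 - ssr / sst
    adj_r2 = 1.0 - (ssr / (n - p)) / (sst / (n - 1))
    return coef, se, r2, adj_r2

# Simulated data: z is an irrelevant regressor (assumed setup)
rng = np.random.default_rng(3)
n = 100
x = rng.normal(size=n)
z = rng.normal(size=n)
y = 1.0 + 2.0 * x + rng.normal(size=n)

X_small = np.column_stack([np.ones(n), x])
X_big = np.column_stack([np.ones(n), x, z])

_, _, r2_small, adj_small = fit_ols(X_small, y)
coef_big, se_big, r2_big, adj_big = fit_ols(X_big, y)
t_z = coef_big[-1] / se_big[-1]         # t-statistic of the added regressor

print("R2:      ", r2_small, "->", r2_big)
print("adj. R2: ", adj_small, "->", adj_big)
print("t-statistic of z:", t_z)
```

The unadjusted R-squared never falls when a regressor is added, whereas the adjusted version rises only when the added regressor's t-statistic exceeds one in absolute value.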

11 Goodness of Fit: R2
Using adjusted R-squared to choose between nonnested models
Models are nonnested if neither model is a special case of the other.
A comparison between the R-squared of both models would be unfair to the first model because the first model contains fewer parameters.
In the given example, even after adjusting for the difference in degrees of freedom, the quadratic model is preferred.

12 Goodness of Fit: R2
Comparing models with different dependent variables
R-squared or adjusted R-squared must not be used to compare models which differ in their definition of the dependent variable.
Other methods must be used (perhaps later).
Example: CEO compensation and firm performance
There is much less variation in log(salary) that needs to be explained than in salary.

13 Controlling for too many factors
In some cases, certain variables should not be held fixed.
In a regression of traffic fatalities on state beer taxes (and other factors), one should not directly control for beer consumption.
In a regression of family health expenditures on pesticide usage among farmers, one should not control for doctor visits.
Different regressions may serve different purposes.
In a regression of house prices on house characteristics, one would only include price assessments if the purpose of the regression is to study their validity; otherwise one would not include them.

14 Effect of adding variables to regression.

15 Predicting y when log(y) is the dependent variable
Under the additional assumption that the error term $u$ is independent of the explanatory variables, a prediction for y can be formed from the fitted values of log(y).
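The slide's formula was lost in the transcript. One standard approach under the independence assumption is to scale the naive prediction $\exp(\widehat{\log y})$ by an estimate of $E[\exp(u)]$, namely the sample average of the exponentiated residuals. The sketch below illustrates that approach on simulated data; it is an assumption that this is the estimator the slide had in mind, and all numbers are invented.

```python
import numpy as np

# Simulated data where log(y) is linear in x (assumed setup)
rng = np.random.default_rng(5)
n = 1000
x = rng.uniform(0.0, 2.0, size=n)
u = rng.normal(scale=0.5, size=n)
y = np.exp(1.0 + 0.8 * x + u)

# Step 1: regress log(y) on a constant and x
X = np.column_stack([np.ones(n), x])
coef, *_ = np.linalg.lstsq(X, np.log(y), rcond=None)
log_fit = X @ coef
resid = np.log(y) - log_fit

# Step 2: estimate E[exp(u)] by averaging the exponentiated residuals
alpha0_hat = float(np.mean(np.exp(resid)))

# Step 3: scale the naive prediction exp(fitted log y)
y_naive = np.exp(log_fit)
y_hat = alpha0_hat * y_naive

print("scale factor alpha0_hat:", alpha0_hat)
print("mean of y:              ", y.mean())
print("mean naive prediction:  ", y_naive.mean())
print("mean scaled prediction: ", y_hat.mean())
```

Simply exponentiating the fitted log values systematically underpredicts y, because the average of $\exp(\hat{u}_i)$ exceeds one whenever the residuals are not all zero.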

16 Comparing R-squared of a logged and an unlogged specification
These are the R-squareds for the predictions of the unlogged salary variable (squared correlation between salary and predicted salary). Both R-squareds can now be directly compared.
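A comparable R-squared for the logged specification can be obtained as the squared correlation between y and its prediction. The following sketch uses simulated data and hypothetical helper names (`ols_fitted`, `squared_corr`); it is not the deck's CEO salary example.

```python
import numpy as np

def ols_fitted(X, y):
    """Fitted values from an OLS regression of y on X (X includes a constant)."""
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return X @ coef

def squared_corr(y, y_pred):
    """Squared sample correlation between y and a prediction of y."""
    return float(np.corrcoef(y, y_pred)[0, 1] ** 2)

# Simulated data (assumed setup)
rng = np.random.default_rng(4)
n = 300
x = rng.uniform(0.0, 2.0, size=n)
y = np.exp(1.0 + 0.8 * x + rng.normal(scale=0.4, size=n))
X = np.column_stack([np.ones(n), x])

# Level specification: y on x
y_hat_level = ols_fitted(X, y)

# Logged specification: log(y) on x, converted back to a prediction of y
y_hat_log = np.exp(ols_fitted(X, np.log(y)))

r2_level = squared_corr(y, y_hat_level)
r2_log = squared_corr(y, y_hat_log)

print("R2 for y, level model: ", r2_level)
print("R2 for y, logged model:", r2_log)
```

Since a correlation is unaffected by multiplying the prediction by a positive constant, the result is the same whether or not the prediction from the logged model is rescaled.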

