Regression Extensions W&W Chapter 14. Introduction So far we have assumed that our independent variables are measured intervally. Today we will discuss.

Slides:



Advertisements
Similar presentations
Multiple Regression and Model Building
Advertisements

Forecasting Using the Simple Linear Regression Model and Correlation
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Multiple Regression Fenster Today we start on the last part of the course: multivariate analysis. Up to now we have been concerned with testing the significance.
LECTURE 3 Introduction to Linear Regression and Correlation Analysis
Chapter 10 Simple Regression.
Chapter 12 Simple Regression
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 13-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
Chapter Topics Types of Regression Models
Linear Regression and Correlation Analysis
Predictive Analysis in Marketing Research
Chapter 11 Multiple Regression.
Econ 140 Lecture 191 Heteroskedasticity Lecture 19.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Pertemua 19 Regresi Linier
CHAPTER 6 ECONOMETRICS x x x x x Dummy Variable Regression Models Dummy, or indicator, variables take on values of 0 or 1 to indicate the presence or absence.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Introduction to Regression Analysis, Chapter 13,
Simple Linear Regression Analysis
1 Simple Linear Regression 1. review of least squares procedure 2. inference for least squares lines.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Introduction to Multiple Regression Statistics for Managers.
Correlation and Linear Regression
Regression and Correlation Methods Judy Zhong Ph.D.
ECON 6012 Cost Benefit Analysis Memorial University of Newfoundland
Introduction to Linear Regression and Correlation Analysis
Inference for regression - Simple linear regression
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Multiple Regression. In the previous section, we examined simple regression, which has just one independent variable on the right side of the equation.
Quantitative Methods Heteroskedasticity.
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved Chapter 13 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
Chapter 14 Introduction to Multiple Regression
© 2003 Prentice-Hall, Inc.Chap 13-1 Basic Business Statistics (9 th Edition) Chapter 13 Simple Linear Regression.
Correlation and Linear Regression. Evaluating Relations Between Interval Level Variables Up to now you have learned to evaluate differences between the.
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
EQT 373 Chapter 3 Simple Linear Regression. EQT 373 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value.
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
Chap 14-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 14 Additional Topics in Regression Analysis Statistics for Business.
Lecture 4 Introduction to Multiple Regression
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
REGRESSION DIAGNOSTICS Fall 2013 Dec 12/13. WHY REGRESSION DIAGNOSTICS? The validity of a regression model is based on a set of assumptions. Violation.
Lecture 10: Correlation and Regression Model.
28. Multiple regression The Practice of Statistics in the Life Sciences Second Edition.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 10 th Edition.
ANOVA, Regression and Multiple Regression March
Statistics for Managers Using Microsoft® Excel 5th Edition
Introduction to Multiple Regression Lecture 11. The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & 2 or more.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 14-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
Lecturer: Ing. Martina Hanová, PhD.. Regression analysis Regression analysis is a tool for analyzing relationships between financial variables:  Identify.
Chapter 13 Simple Linear Regression
Linear Regression.
Statistics for Managers using Microsoft Excel 3rd Edition
CHAPTER 29: Multiple Regression*
Prepared by Lee Revere and John Large
BEC 30325: MANAGERIAL ECONOMICS
Regression and Categorical Predictors
Chapter 9 Dummy Variables Undergraduated Econometrics Page 1
BEC 30325: MANAGERIAL ECONOMICS
Presentation transcript:

Regression Extensions W&W Chapter 14

Introduction So far we have assumed that our independent variables are measured intervally. Today we will discuss how to interpret dummy variables in regression. Recall that a dummy variable takes on two possible values, 0 or 1.

The Regression Model Suppose we want to estimate the following model: Y =  +  1 D +  2 X +  Y = # of militarized disputes a state gets involved in per year X = amount of annual military spending D = 1 if the state is democratic D = 0 if the state is non-democratic

The Results We want to see if there is a difference between democratic and non-democratic regimes in terms of how many militarized disputes they get involved in. Suppose we estimate the following regression model: Y p = -6.4 – 2.02D X

The Results We can compare the two groups as follows: For D=0, Y p = -6.4 –2.02(0) X = X For D=1, Y p = -6.4 –2.02(1) X = X The coefficient for D (  1 ) is the change in Y that accompanies a unit change in D, which is either zero or one.

The Impact of a Dummy Variable We can see that the dummy variable changes the value of the Y-intercept (  ). If we were to plot these two regression lines, they would be parallel lines with slope ( ), and Y- intercepts –6.4 or –8.42. We can conclude that democracies get involved in about 2 fewer conflicts per year compared to non-democracies.

Summary More generally, if D is a 0-1 dummy variable in a regression model, Y p =a + b 1 D + b 2 X Then the regression line where D=1 is parallel and b 1 units higher than the line where D=0.

Multiple categories Suppose we want to expand our measure of democracy to three levels: democracy, anocracy, and autocracy. We could create two dummy variables. D 1 = 1 if anocracy, 0 otherwise D 2 = 1 if democracy, 0 otherwise

Multiple categories We leave the third category (autocracy) out as a reference group. We always have one less dummy than there are categories because if we included all three, there would be perfect multicollinearity. Our new model is: Y =  +  1 D 1 +  2 D 2 +  3 X + 

Estimating the model Suppose we estimate our new model and obtain the following results: Y p = -5.1 –1.4D 1 – 3.6D X For autocracies (D 1 =0, D 2 =0), Y p = X For anocracies (D 1 =1, D 2 =0), Y p = X For democracies (D 1 =0, D 2 =1), Y p = X

Summary In this case we have three parallel regression lines. We can see that democracies are involved in 3 ½ fewer disputes than autocracies, and around 2 fewer disputes than anocracies. We can also see that anocracies get involved in 1.4 fewer disputes than autocracies.

Example Severing the Electoral Connection: Shirking in the Contemporary Congress Lawrence S. Rothenberg and Mitchell S. Sanders American Journal of Political Science Question: If incumbents in Congress plan to retire or if they pursue higher office and face a distinct constituency, will they behave differently? In particular, will they have greater incentives to shirk?

Example Shirking: Legislative behavior differs from what would be observed given perfect monitoring and effective punishment by constituents (i.e., through elections). Two Types of Shirking: 1) Ideological Shirking Members change their votes away from the ideological position of their district. 2) Participatory shirking Members vote less frequently (casting fewer roll call votes).

Example Previous studies suggest that participatory shirking occurs, but that ideological shirking does not. Rothenberg and Sanders argue that this is largely a function of poorly specified measures of Congressional Shirking. Research Design C ompare a member's actions during the 4 th quarter of one Congress with her actions in the 4 th quarter of the next Congress. Shortly before an election, those seeking reelection know that they will be judged by the electorate; those not standing for reelection know that they are free to shirk.

Example Research Design They examine roll call votes taking place after July 1 st of an election year in consecutive Congresses between 1991 and This produces 366 cases from the 102 nd Congress, 305 from the 103 rd Congress, and 327 from the 104 th Congress.

Example Dependent Variables 1) Ideological Change =  Ideological Position i Congress k+I - Ideological Position i Congress k  2) Abstention Change =  Abstention Rate i Congress k+I - Abstention Rate i Congress k  Expectation: If departing members change their voting patterns more and abstain more, then this constitutes evidence of shirking.

Example Independent Variables 1) Retiring (Dummy Variable): equals 1 for individuals not running for reelection and not seeking other elected office (14.2% of their total 998 legislators) and 0 otherwise 2) Pursuing Statewide Office (Dummy Variable): equals 1 for individuals leaving the House to seek statewide office (3.2% of the total) and 0 otherwise 3) Seniority : years of prior service at the beginning of each Congress; do senior members change their position less and vote more often than junior members?

Example Independent Variables 4) Electoral Slack : member's vote share (proportion of the two-party vote) in the prior election; do electorally secure members have more liberty in voting? 5) District Political Change : absolute difference in the proportion of the two-party vote received by 1988 presidential candidate Michael Dukakis in the old and the new district (to reflect possible vote changes based on redistricting that occurred in 1992).

Example  There is evidence of member shirking, both in terms of ideological and participatory shirking. Retiring and Pursuing Statewide Office are significant and positive in both equations.  Impact on Ideological Change: retiring members increase their ideological change by.039, while political aspirants increase their ideological change by.033 (the scale is -1 to 1 for the ideology variable).

Example  Impact on Abstention Change: Retiring members' abstention rates increase by.11 (11%), which is substantial in an era where average abstention rates are near 5%. Political aspirants' abstention rates increase by.15 (15%).  Small R 2 values, especially for the Ideological Change model, suggest that there is substantial randomness associated with behavioral change.  Significant intercept in the Ideological Change equation indicates that legislator ideology is at least somewhat fluid for all members.

Comparing members running for reelection to those leaving Congress 1) For Ideological Change  Running for reelection (Retiring = 0 and Pursuing Statewide Office = 0) Y = (0) (0) *District Political Change *Electoral Slack *Seniority Y = *District Political Change *Electoral Slack *Seniority

Comparing members running for reelection to those leaving Congress  Not running for reelection (Retiring = 1 and Pursuing Statewide Office = 1) Y = (1) (1) *District Political Change *Electoral Slack *Seniority Y = *District Political Change *Electoral Slack *Seniority This means that members that are leaving Congress are 7.2% more likely to change their ideological position ( ) compared to members staying.

Comparing members running for reelection to those leaving Congress 1) For Abstention Change  Running for reelection (Retiring = 0 and Pursuing Statewide Office = 0) Y = (0) (0) *District Political Change *Electoral Slack *Seniority Y = *District Political Change *Electoral Slack *Seniority

Comparing members running for reelection to those leaving Congress  Not running for reelection (Retiring = 1 and Pursuing Statewide Office = 1) Y = (1) (1) *District Political Change *Electoral Slack *Seniority Y = *District Political Change *Electoral Slack *Seniority This means that members that are leaving Congress have changes in abstention rates that are 26% higher than members staying in Congress.

Interactive dummy variables We can also interact our dummy variables with other independent variables if our theory implies such a relationship. For example, we might posit the following model: Y =  +  1 D +  2 X +  3 DX + 

Interactive dummy variables In our previous example, this would be warranted if we believed that military spending in democracies had a different effect on their propensity to use force than military spending in non-democracies. We might argue that democracies use increased military spending for defense, not for aggression, and thus that their rates of dispute involvement might vary (this is just a hypothetical argument).

Interactive dummy variables For D=0 (non-democracies), the model reduces to: Y =  +  2 X +  For D=1 (democracies), the model reduces to: Y =  +  1 (1) +  2 X +  3 (1)X +  Y = (  +  1 ) + (  2 +  3 )X +  We can see that an interactive dummy changes both the intercept (by  1 ) and the slope (by  3 ).

Examining the Residuals of a Regression Model We make several assumptions about the error term in the regression model, . For example, we assume that the errors are normally distributed:  i  N[0,  2 ] A violation of non-constant variance is called heteroskedasticity (errors are not the same across the range of X values).

Heteroskedasticity A simple way to detect heteroskedasticity is to plot the residuals against one or more of the independent variables. One common pattern that may indicate heteroskedasticity is a fanning out residual pattern.

Autocorrelation We also assume no auto-correlation, or that Cov[  i,  j ] = 0 if i  j. A typical violation occurs with time series data where the errors are related over time. To detect autocorrelation, it is useful to plot the residuals over time. We should observe a pattern in the residuals such that a high value is followed by another high value, and a low value is followed by another low value, etc.