Multiple Regression Analysis

Slides:



Advertisements
Similar presentations
Chapter 18 Multiple Regression.
Advertisements

Multiple Regression. Introduction In this chapter, we extend the simple linear regression model. Any number of independent variables is now allowed. We.
Lecture Unit Multiple Regression.
Chap 12-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 12 Simple Regression Statistics for Business and Economics 6.
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
1 Multiple Regression Chapter Introduction In this chapter we extend the simple linear regression model, and allow for any number of independent.
1 Multiple Regression Model Error Term Assumptions –Example 1: Locating a motor inn Goodness of Fit (R-square) Validity of estimates (t-stats & F-stats)
Lecture 9- Chapter 19 Multiple regression Introduction In this chapter we extend the simple linear regression model and allow for any number of.
LECTURE 3 Introduction to Linear Regression and Correlation Analysis
Chapter 13 Multiple Regression
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Chapter 12 Simple Regression
1 Multiple Regression Chapter Introduction In this chapter we extend the simple linear regression model, and allow for any number of independent.
Chapter 12 Multiple Regression
Lecture 22 Multiple Regression (Sections )
Chapter 13 Introduction to Linear Regression and Correlation Analysis
1 Multiple Regression. 2 Introduction In this chapter we extend the simple linear regression model, and allow for any number of independent variables.
1 Pertemuan 13 Uji Koefisien Korelasi dan Regresi Matakuliah: A0392 – Statistik Ekonomi Tahun: 2006.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 13-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
Pengujian Parameter Koefisien Korelasi Pertemuan 04 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Chapter Topics Types of Regression Models
Lecture 23 Multiple Regression (Sections )
Multiple Regression and Correlation Analysis
Linear Regression Example Data
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Simple Linear Regression Basic Business Statistics 10 th Edition.
Chapter 7 Forecasting with Simple Regression
1 8.4 Multiple Regression Lecture Unit Introduction In this section we extend simple linear regression where we had one explanatory variable,
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Introduction to Multiple Regression Statistics for Managers.
1 5. Multiple Regression II ECON 251 Research Methods.
Statistics for Business and Economics 7 th Edition Chapter 11 Simple Regression Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Chapter 13: Inference in Regression
Chapter 14 Simple Regression
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
Statistics for Business and Economics 7 th Edition Chapter 11 Simple Regression Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Chapter 14 Introduction to Multiple Regression
© 2003 Prentice-Hall, Inc.Chap 13-1 Basic Business Statistics (9 th Edition) Chapter 13 Simple Linear Regression.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 12-1 Correlation and Regression.
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Applied Quantitative Analysis and Practices LECTURE#22 By Dr. Osman Sadiq Paracha.
EQT 373 Chapter 3 Simple Linear Regression. EQT 373 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 18 Multiple Regression.
ANOVA for Regression ANOVA tests whether the regression model has any explanatory power. In the case of simple regression analysis the ANOVA test and the.
Statistics for Business and Economics 8 th Edition Chapter 11 Simple Regression Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Discussion of time series and panel models
Economics 173 Business Statistics Lecture 19 Fall, 2001© Professor J. Petry
Statistics for Business and Economics 8 th Edition Chapter 11 Simple Regression Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Lecture 10: Correlation and Regression Model.
Applied Quantitative Analysis and Practices LECTURE#25 By Dr. Osman Sadiq Paracha.
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 14-1 Chapter 14 Introduction to Multiple Regression Statistics for Managers using Microsoft.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 10 th Edition.
Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights.
Statistics for Managers Using Microsoft® Excel 5th Edition
Introduction to Multiple Regression Lecture 11. The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & 2 or more.
Copyright © 2009 Cengage Learning 17.1 Chapter 19 Multiple Regression.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 14-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Conceptual Foundations © 2008 Pearson Education Australia Lecture slides for this course are based on teaching materials provided/referred by: (1) Statistics.
Economics 173 Business Statistics Lecture 18 Fall, 2001 Professor J. Petry
Multiple Regression Reference: Chapter 18 of Statistics for Management and Economics, 7 th Edition, Gerald Keller. 1.
INTRODUCTION TO MULTIPLE REGRESSION MULTIPLE REGRESSION MODEL 11.2 MULTIPLE COEFFICIENT OF DETERMINATION 11.3 MODEL ASSUMPTIONS 11.4 TEST OF SIGNIFICANCE.
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Simple Linear Regression and Correlation
Introduction to Regression
Correlation and Simple Linear Regression
Presentation transcript:

Multiple Regression Analysis Multiple regression uses many independent variables to predict or explain the variation in a dependent variables The basic multiple regression model is a first-order model, containing each predictor but no nonlinear terms such as squared values. In this model, each slope should be interpreted as a partial slope, the predicted effect of a one- change in a variable, holding all other variables constant.

Estimating Equation Describing Relationship among Three Variables Multiple Regression Analysis Estimating Equation Describing Relationship among Three Variables Estimating Equation Describing Relationship among n Variables

Multiple Regression Analysis – Example File: PPT_MultRegr The department is interested to know whether the amount of field audits and computer hours spent on tracking have yielded any results. Further the department has introduced the reward system for tracking the culprits. The data on actual unpaid taxes for ten cases is considered for analysis. Initially the regression of Actual Unpaid Taxes(Y) on Field Audits(X1) and Computer Hours(X2) was carried out and as a next step the Reward to Informants(X3) was also considered as a variable and analyzed. The analysis yielded the following SPSS outputs.

Regression Statistics Multiple Regression and Correlation Analysis – Excel Output SUMMARY OUTPUT Regression Statistics Multiple R 0.85377 R Square 0.72892322 Adjusted R Square 0.65147271 Standard Error 1.07063884 Observations 10 ANOVA   df SS MS F Significance F Regression 2 21.57612732 10.78806 9.411471 0.010371059 Residual 7 8.023872679 1.146268 Total 9 29.6 Coefficients t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0% Intercept -13.819629 13.3232999 -1.03725 0.334115 -45.32422668 17.68497 -45.324227 17.68496939 X Variable 1 0.56366048 0.303273881 1.858586 0.10543 -0.153468296 1.280789 -0.1534683 1.280789251 X Variable 2 1.0994695 0.313139013 3.511123 0.009844 0.359013391 1.839926 0.35901339 1.839925601

Multiple Regression and Correlation Analysis – SPSS Output

Regression Statistics Multiple Regression and Correlation Analysis – Excel Output for 3 Independent Variables SUMMARY OUTPUT Regression Statistics Multiple R 0.9916677 R Square 0.9834049 Adjusted R Square 0.9751073 Standard Error 0.2861281 Observations 10 ANOVA   df SS MS F Significance F Regression 3 29.1087841 9.702928 118.51727 9.93505E-06 Residual 6 0.491215904 0.081869 Total 9 29.6 Coefficients t Stat P-value Lower 95% Upper 95% Intercept -45.796348 4.877650787 -9.38902 8.288E-05 -57.73152916 --3.8612 X Variable 1 0.5969718 0.081124288 7.358731 0.0003225 0.398467818 0.795476 X Variable 2 1.1768377 0.084074178 13.99761 8.29E-06 0.971115623 1.38256 X Variable 3 0.4051086 0.042233591 9.592096 7.341E-05 0.301766772 0.508451

Multiple Regression and Correlation Analysis – SPSS Output for 3 Independent Variables

Using two independent variables : Multiple Regression and Correlation Analysis Using two independent variables : Using three independent variables:

Coefficient of Determination From the output, R2 = 0.983 98.3% of the variation in actual unpaid taxes is explained by the three independent variables. 1.7% remains unexplained.

Making inferences about Population Parameters Multiple Regression and Correlation Analysis Making inferences about Population Parameters 1. Inferences about an individual slope or whether a variable is significant 2. Regression as a whole

Inferences about the Regression as a Whole Multiple Regression and Correlation Analysis Inferences about the Regression as a Whole

Value of the test statistic: Multiple Regression and Correlation Analysis Test Statistic Value of the test statistic:

Multiple Regression and Correlation Analysis

Test of whether a variable is significant. Multiple Regression and Correlation Analysis Test of whether a variable is significant. Test whether reward to informants is a significant explanatory variable.

Test statistic, with n-2 degrees of freedom: Multiple Regression and Correlation Analysis Test statistic, with n-2 degrees of freedom: Rejection Region

Value of the test statistic: Multiple Regression and Correlation Analysis Value of the test statistic: Conclusion: The standardized regression coefficient is 9.6429 which is outside the acceptance region. Therefore we will reject the null hypothesis. The reward to informants is a significant explanatory variable.

Interpreting the Coefficients b0 = - 45.796. This is the intercept, the value of y when all the variables take the value zero. Since the data range of all the independent variables do not cover the value zero, do not interpret the intercept. b1 = 0.597. In this model, for each additional field audit, the actual unpaid taxes increases on average by .597% (assuming the other variables are held constant).

Estimating the Coefficients and Assessing the Model, Another Example Where to locate a new motor inn? La Quinta Motor Inns is planning an expansion. Management wishes to predict which sites are likely to be profitable. Several areas where predictors of profitability can be identified are: Competition Market awareness Demand generators Demographics Physical quality

Estimating the Coefficients and Assessing the Model, Example Operating Margin Profitability Market awareness Physical Competition Customers Community Rooms Nearest Office space College enrollment Income Disttwn Number of hotels/motels rooms within 3 miles from the site. Distance to the nearest La Quinta inn. Median household income. Distance to downtown.

Estimating the Coefficients and Assessing the Model, Example Data were collected from randomly selected 100 inns that belong to La Quinta, and ran for the following suggested model (Multiple Regr_Margin.sav): Margin = b0 + b1Rooms + b2Nearest + b3Office + b4College + b5Income + b6Disttwn

Regression Analysis, SPSS Output This is the sample regression equation (sometimes called the prediction equation) Margin = 38.139 - 0.008Number +1.646Nearest + 0.020Office Space +0.212Enrollment + 0.413Income - 0.225Distance

Coefficient of Determination From the printout, R2 = 0.525 52.5% of the variation in operating margin is explained by the six independent variables. 47.5% remains unexplained.

Testing the Validity of the Model We pose the question: Is there at least one independent variable linearly related to the dependent variable? To answer the question we test the hypothesis H0: B0 = B1 = B2 = … = Bk H1: At least one Bi is not equal to zero. If at least one Bi is not equal to zero, the model has some validity.

Testing the Validity of the La Quinta Inns Regression Model The hypotheses are tested by an ANOVA procedure ( the SPSS output) MSR/MSE k = n–k–1 = n-1 = SSR MSR=SSR/k MSE=SSE/(n-k-1) SSE

Testing the Validity of the La Quinta Inns Regression Model Conclusion: There is sufficient evidence to reject the null hypothesis in favor of the alternative hypothesis. At least one of the bi is not equal to zero. Thus, at least one independent variable is linearly related to y. This linear regression model is valid Fa,k,n-k-1 = F0.05,6,100-6-1=2.17 F = 17.14 > 2.17 Also, the p-value (Significance F) = 0.0000 Reject the null hypothesis.

Interpreting the Coefficients b0 = 38.139. This is the intercept, the value of y when all the variables take the value zero. Since the data range of all the independent variables do not cover the value zero, do not interpret the intercept. b1 = – 0.008. In this model, for each additional room within 3 mile of the La Quinta inn, the operating margin decreases on average by .008% (assuming the other variables are held constant).

Interpreting the Coefficients b2 = 1.646. In this model, for each additional mile that the nearest competitor is to a La Quinta inn, the operating margin increases on average by 1.646% when the other variables are held constant. b3 = 0.020. For each additional 1000 sq-ft of office space, the operating margin will increase on average by .02% when the other variables are held constant. b4 = 0.212. For each additional thousand students the operating margin increases on average by .212% when the other variables are held constant.

Interpreting the Coefficients b5 = 0.413. For additional $1000 increase in median household income, the operating margin increases on average by .413%, when the other variables remain constant. b6 = -0.225. For each additional mile to the downtown center, the operating margin decreases on average by .225% when the other variables are held constant.

Testing the Coefficients The hypothesis for each bi is SPSS printout H0: bi = 0 H1: bi ¹ 0 Test statistic d.f. = n - k -1

La Quinta Inns, Predictions Predict the average operating margin of an inn at a site with the following characteristics: 3815 rooms within 3 miles, Closet competitor .9 miles away, 476,000 sq-ft of office space, 24,500 college students, $35,000 median household income, 11.2 miles distance to downtown center. MARGIN = 38.14 - 0.008(3815) +1.65(.9) + 0.020(476) +0.212(24.5) + 0.413(35) - 0.225(11.2) = 37.1%