Chapter 15 Multiple Regression. Regression Multiple Regression Model y =  0 +  1 x 1 +  2 x 2 + … +  p x p +  Multiple Regression Equation y = 

Slides:



Advertisements
Similar presentations
Test of (µ 1 – µ 2 ),  1 =  2, Populations Normal Test Statistic and df = n 1 + n 2 – 2 2– )1– 2 ( 2 1 )1– 1 ( 2 where ] 2 – 1 [–
Advertisements

Multiple Regression and Model Building
Chap 12-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 12 Simple Regression Statistics for Business and Economics 6.
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Chapter 12 Simple Linear Regression
Chapter 13 Multiple Regression
© 2008 Prentice-Hall, Inc. Chapter 4 To accompany Quantitative Analysis for Management, Tenth Edition, by Render, Stair, and Hanna Power Point slides created.
Chapter 14 Introduction to Multiple Regression
Chapter 12 Simple Regression
Chapter 12 Multiple Regression
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 11 th Edition.
Multiple Regression and Correlation Analysis
1 1 Slide © 2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Linear Regression Example Data
Ch. 14: The Multiple Regression Model building
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Introduction to Multiple Regression Statistics for Managers.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Introduction to Linear Regression and Correlation Analysis
Non-continuous Relationships If the relationship between the dependent variable and an independent variable is non-continuous a slope dummy variable can.
Econ 3790: Business and Economics Statistics
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2003 Thomson/South-Western Chapter 13 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple Coefficient of Determination.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved OPIM 303-Lecture #9 Jose M. Cruz Assistant Professor.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved Chapter 13 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple Coefficient of Determination n Model Assumptions n Testing.
Statistics for Business and Economics 7 th Edition Chapter 11 Simple Regression Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Chapter 14 Introduction to Multiple Regression
Econ 3790: Business and Economics Statistics Instructor: Yogesh Uppal
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 15 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
Bivariate Regression Assumptions and Testing of the Model Economics 224, Notes for November 17, 2008.
EQT 373 Chapter 3 Simple Linear Regression. EQT 373 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value.
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Chap 14-1 Copyright ©2012 Pearson Education, Inc. publishing as Prentice Hall Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
ANOVA for Regression ANOVA tests whether the regression model has any explanatory power. In the case of simple regression analysis the ANOVA test and the.
Statistics for Business and Economics 8 th Edition Chapter 11 Simple Regression Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Chapter 13 Multiple Regression
Regression Analysis Relationship with one independent variable.
Regression Statistics Multiple R R Square Adjusted R Square Standard Error Observations10 ANOVA dfSSMSF Regression
Statistics for Business and Economics 8 th Edition Chapter 11 Simple Regression Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Lecture 10: Correlation and Regression Model.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 14-1 Chapter 14 Introduction to Multiple Regression Statistics for Managers using Microsoft.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice- Hall, Inc. Chap 14-1 Business Statistics: A Decision-Making Approach 6 th Edition.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 10 th Edition.
Introduction to Multiple Regression Lecture 11. The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & 2 or more.
Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
Multiple Regression Learning Objectives n Explain the Linear Multiple Regression Model n Interpret Linear Multiple Regression Computer Output n Test.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 14-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
Simple linear regression and correlation Regression analysis is the process of constructing a mathematical model or function that can be used to predict.
© 2001 Prentice-Hall, Inc.Chap 13-1 BA 201 Lecture 19 Measure of Variation in the Simple Linear Regression Model (Data)Data.
INTRODUCTION TO MULTIPLE REGRESSION MULTIPLE REGRESSION MODEL 11.2 MULTIPLE COEFFICIENT OF DETERMINATION 11.3 MODEL ASSUMPTIONS 11.4 TEST OF SIGNIFICANCE.
Chapter 14 Introduction to Multiple Regression
Basic Estimation Techniques
Essentials of Modern Business Statistics (7e)
Relationship with one independent variable
Quantitative Methods Simple Regression.
Basic Estimation Techniques
Prepared by Lee Revere and John Large
Relationship with one independent variable
Introduction to Regression
Presentation transcript:

Chapter 15 Multiple Regression

Regression Multiple Regression Model y =  0 +  1 x 1 +  2 x 2 + … +  p x p +  Multiple Regression Equation y =  0 +  1 x 1 +  2 x 2 + … +  p x p Estimated Multiple Regression Equation

Car Data MPGWeightYearCylinders Continuing on for 397 observations

Multiple Regression, Example CoefficientsStandard Errort Stat Intercept Weight R Square0.687 CoefficientsStandard Errort Stat Intercept Weight Year R Square0.807

Multiple Regression, Example CoefficientsStandard Errort Stat Intercept Weight Year Cylinders R Square0.807 Predicted MPG for car weighing 4000 lbs built in 1980 with 6 cylinders: (4000)+.76(80)-.0741(6) = =19.88

SST = SSR + SSE Sums of Squares

Multiple Coefficient of Determination The share of the variation explained by the estimated model. R 2 = SSR/SST Multiple Correlation Coefficient The correlation coefficient of the actual and predicted values

Adjusted Multiple Coefficient of Determination Regression Statistics Multiple R0.898 R Square0.807 Adjusted R Square0.805 Standard Error3.44 Observations397

F Test for Overall Significance H 0 :  1 =  2 =... =  p = 0 H a : One or more of the parameters is not equal to zero Reject H 0 if: F > F  Or Reject H 0 if: p-value <  F = MSR/MSE

ANOVA Table for Multiple Regression Model SourceSum of Squares Degrees of Freedom Mean SquaresF RegressionSSRpMSR = SSR/pF=MSR/MSE ErrorSSEn-p-1MSE = SSE/(n-p-1) TotalSSTn-1

ANOVA Example ANOVA dfSSMSF Significance F Regression E-140 Residual Total

t Test for Coefficients H 0 :  1 = 0 H a :  1 ≠ 0 Reject H 0 if: t t  Or if: p <  t = b 1 /s b1 With a t distribution of n-p-1 df

t Test Example CoefficientsStandard Errort StatP-value Intercept Weight E-37 Year E-41 Cylinders

Multicollinearity When two or more independent variables are highly correlated. When multicollinearity is severe the estimated values of coefficients will be unreliable.

Multicollinearity Two guidelines for identifying multicollinearity: If the absolute value of the correlation coefficient for two independent variables exceeds 0.7 If the correlation coefficient for an independent variable and some other independent variable is greater than the correlation with that variable and the dependent variable

Multicollinearity MPGWeightYearCylinders MPG1 Weight Year Cylinders Table of correlation coefficients:

Multicollinearity CoefficientsStandard Errort Stat Intercept Weight Year Cylinders R Square0.807 CoefficientsStandard Errort Stat Intercept Year Cylinders R Square 0.708

Qualitative Variables and Regression Quantitative variable – A variable that can be measured numerically (interval or ratio scale of measurement) Qualitative variable – A variable where labels or names are used to identify some attribute (nominal or ordinal scale of measurement)

Qualitative Variables and Regression The effect of a quantitative variable can be estimated using a dummy variable. A dummy variable can equal 0 or 1, it creates different y intercepts for groups with different attributes.

Qualitative Variables and Regression Assume we estimate a regression model for the number of sick days an employee takes per year. A dummy variable is included that equals 1 if the individual smokes and 0 if they do not. Age is also included in the model.

Qualitative Variables and Regression Estimated model: Sick days taken = -1 +(3)Smoker + (.1)Age Sick DaysSmokerAge Example of how data would be coded:

Dummy Variables Sick days taken = -1 +(3)Smoker + (.1)Age What is the y-intercept for nonsmokers? What is the y-intercept for smokers? 2 What is the predicted number of sick days for a 40-year-old smoker? 6 What is the average difference in the number of sick days taken by smokers and nonsmokers? 3

Dummy Variables If an attribute has three or more possible values you must include k-1 dummy variables in the model, where k is the number of possible values.

Dummy Variables Suppose we have three job classifications: manager, operator, and secretary Operator dummy equals 1 if the person is an operator, 0 otherwise Secretary dummy equals 1 if the person is an secretary, 0 otherwise Manager is the omitted group (choice of omitted group will not alter the predicted values)

Dummy Variables Sick days taken = -1 +(1)Operator + 1.5(Secretary) + (.1)Age What are the y-intercepts for each job classification? Managers=-1, Operators=0, Secretaries=0.5 What is the predicted number of sick days for a 40- year-old secretary? 4.5 What is the average difference in the number of sick days taken by operators and secretaries? 0.5

Dummy Variables In some cases there will be multiple sets of dummy variables, such as: Sick days taken = -1 +(3)Smoker + (1)Operator + 1.5(Secretary) + (.1)Age Note that there are now 6 different intercepts: Nonsmoker, Manager: -1 (omitted group) Smoker, Manager: 2 Nonsmoker, Operator: 0 Smoker, Operator: 3 Nonsmoker, Secretary: 0.5 Smoker, Secretary: 3.5

Dummy Variables Note that when dummy variables are used we are assuming that the coefficients of the other variables are the same for all groups. In this example the increase in sick days used from aging a year is equal to 0.1 for all of the groups. If there is reason to believe the effect of an independent variable differs by group, you may want to estimate separate equations for each group.

Nonlinear Relationships Nonlinear relationships can be modeled by including a variable that is a nonlinear function of an independent variable. For example it is usually assumed that health care expenditures increase at an increasing rate as people age.

Nonlinear Relationships In that case you might try including age squared into the model: Health expend = (5)Age + (.5)AgeSQ AgeHealth Expend

Nonlinear Relationships If the dependent variable increases at a decreasing rate as the independent variable rises you might want to include the square root of the independent variable. If you are unsure of the nature of the relationship you can use dummy variables for different ranges of values of the independent variable.

Non-continuous Relationships If the relationship between the dependent variable and an independent variable is non-continuous a slope dummy variable can be used to estimate two sets of coefficients for the independent variable. For example, if natural gas usage is not affected by temperature when the temperature rises above 60 degrees, we could have: Gas usage = b0 + b 1 (GT60) + b 2 (Temp) + b 2 (GT60)(Temp)

Non-continuous Relationships Note that at temperatures above 60 degrees the net effect of a 1 degree increase in temperature on gas usage is ( ) Coefficients Standard Errort StatP-value Intercept E-18 GT Temp E-13 (GT60)(Temp)