Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft.

Slides:



Advertisements
Similar presentations
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 14-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Advertisements

Qualitative Variables and
Chapter 13 Multiple Regression
Chapter 13 Additional Topics in Regression Analysis
Statistics for Managers Using Microsoft® Excel 5th Edition
Statistics for Managers Using Microsoft® Excel 5th Edition
Chapter 12 Multiple Regression
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 13-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 11 th Edition.
Chap 15-1 Copyright ©2012 Pearson Education, Inc. publishing as Prentice Hall Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics.
8-1 Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Chapter 8 Confidence Interval Estimation Statistics for Managers using Microsoft.
Linear Regression Example Data
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 11 th Edition.
Pertemua 19 Regresi Linier
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 15-1 Business Statistics: A Decision-Making Approach 8 th Edition Chapter 15 Multiple.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Chapter 15: Model Building
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 18-1 Chapter 18 Data Analysis Overview Statistics for Managers using Microsoft Excel.
Chapter 13 Simple Linear Regression
Copyright ©2011 Pearson Education 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft Excel 6 th Global Edition.
Statistics for Managers Using Microsoft Excel 3rd Edition
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Introduction to Multiple Regression Statistics for Managers.
Chapter 13 Simple Linear Regression
Regression and Correlation Methods Judy Zhong Ph.D.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Chapter 11 Simple Regression
Statistics for Business and Economics 8 th Edition Chapter 11 Simple Regression Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch.
© 2003 Prentice-Hall, Inc.Chap 11-1 Business Statistics: A First Course (3 rd Edition) Chapter 11 Multiple Regression.
© 2004 Prentice-Hall, Inc.Chap 15-1 Basic Business Statistics (9 th Edition) Chapter 15 Multiple Regression Model Building.
Chap 8-1 Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall Chapter 8 Confidence Interval Estimation Business Statistics: A First Course.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
Statistics for Business and Economics 7 th Edition Chapter 11 Simple Regression Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Chapter 14 Introduction to Multiple Regression
© 2003 Prentice-Hall, Inc.Chap 13-1 Basic Business Statistics (9 th Edition) Chapter 13 Simple Linear Regression.
Understanding Regression Analysis Basics. Copyright © 2014 Pearson Education, Inc Learning Objectives To understand the basic concept of prediction.
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
EQT 373 Chapter 3 Simple Linear Regression. EQT 373 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value.
Chap 14-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 14 Additional Topics in Regression Analysis Statistics for Business.
Statistics for Business and Economics 8 th Edition Chapter 12 Multiple Regression Ch Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
Chap 14-1 Copyright ©2012 Pearson Education, Inc. publishing as Prentice Hall Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics.
Chap 13-1 Copyright ©2012 Pearson Education, Inc. publishing as Prentice Hall Chap 13-1 Chapter 13 Simple Linear Regression Basic Business Statistics 12.
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
Statistics for Business and Economics 8 th Edition Chapter 11 Simple Regression Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Lecture 4 Introduction to Multiple Regression
Lecture 10: Correlation and Regression Model.
Applied Quantitative Analysis and Practices LECTURE#25 By Dr. Osman Sadiq Paracha.
1 Experimental Statistics - week 12 Chapter 12: Multiple Regression Chapter 13: Variable Selection Model Checking.
Lecture 3 Introduction to Multiple Regression Business and Economic Forecasting.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 14-1 Chapter 14 Introduction to Multiple Regression Statistics for Managers using Microsoft.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice- Hall, Inc. Chap 14-1 Business Statistics: A Decision-Making Approach 6 th Edition.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 10 th Edition.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 10 th Edition.
Chap 13-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 13 Multiple Regression and.
Statistics for Managers Using Microsoft® Excel 5th Edition
Introduction to Multiple Regression Lecture 11. The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & 2 or more.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
Chapter 12 Simple Linear Regression.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 14-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Yandell – Econ 216 Chap 15-1 Chapter 15 Multiple Regression Model Building.
Chapter 13 Simple Linear Regression
Chapter 15 Multiple Regression Model Building
Chapter 15 Multiple Regression and Model Building
Multiple Regression Analysis and Model Building
Chapter 13 Simple Linear Regression
Chapter 15 Multiple Regression Analysis and Model Building
Chapter 13 Additional Topics in Regression Analysis
Presentation transcript:

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft Excel 6 th Edition

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 15-2 Learning Objectives In this chapter, you learn: To use quadratic terms in a regression model To use transformed variables in a regression model To measure the correlation among the independent variables To build a regression model using either the stepwise or best-subsets approach To avoid the pitfalls involved in developing a multiple regression model

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 15-3 The relationship between the dependent variable and an independent variable may not be linear Can review the scatter plot to check for non- linear relationships Example: Quadratic model The second independent variable is the square of the first variable Nonlinear Relationships DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 15-4 Quadratic Regression Model where: β 0 = Y intercept β 1 = regression coefficient for linear effect of X on Y β 2 = regression coefficient for quadratic effect on Y ε i = random error in Y for observation i Model form: DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 15-5 Linear fit does not give random residuals Linear vs. Nonlinear Fit Nonlinear fit gives random residuals X residuals X Y X Y X DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 15-6 Quadratic Regression Model Quadratic models may be considered when the scatter plot takes on one of the following shapes: X1X1 Y X1X1 X1X1 YYY β 1 < 0β 1 > 0β 1 < 0β 1 > 0 β 1 = the coefficient of the linear term β 2 = the coefficient of the squared term X1X1 β 2 > 0 β 2 < 0 DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 15-7 Testing the Overall Quadratic Model Test for Overall Relationship H 0 : β 1 = β 2 = 0 (no overall relationship between X and Y) H 1 : β 1 and/or β 2 ≠ 0 (there is a relationship between X and Y) F STAT = Estimate the quadratic model to obtain the regression equation: DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 15-8 Testing for Significance: Quadratic Effect Testing the Quadratic Effect Compare quadratic regression equation with the linear regression equation DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 15-9 Testing for Significance: Quadratic Effect Testing the Quadratic Effect Consider the quadratic regression equation Hypotheses (The quadratic term does not improve the model) (The quadratic term improves the model) H 0 : β 2 = 0 H 1 : β 2  0 (continued) DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Testing for Significance: Quadratic Effect Testing the Quadratic Effect Hypotheses (The quadratic term does not improve the model) (The quadratic term improves the model) The test statistic is H 0 : β 2 = 0 H 1 : β 2  0 (continued) where: b 2 = squared term slope coefficient β 2 = hypothesized slope (zero) S b = standard error of the slope 2 DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Testing for Significance: Quadratic Effect Testing the Quadratic Effect Compare r 2 from simple regression to adjusted r 2 from the quadratic model If adj. r 2 from the quadratic model is larger than the r 2 from the simple model, then the quadratic model is likely a better model (continued) DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Example: Quadratic Model Purity increases as filter time increases: Purity Filter Time DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Example: Quadratic Model (continued) Regression Statistics R Square Adjusted R Square Standard Error Simple regression results: Y = Time Coefficients Standard Errort StatP-value Intercept Time E-10 FSignificance F E-10 ^ t statistic, F statistic, and r 2 are all high, but the residuals are not random: DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Coefficients Standard Errort StatP-value Intercept Time Time-squared E-05 Regression Statistics R Square Adjusted R Square Standard Error FSignificance F E-13 Quadratic regression results: Y = Time (Time) 2 ^ Example: Quadratic Model in Excel (continued) The quadratic term is significant and improves the model: adj. r 2 is higher and S YX is lower, residuals are now random DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Using Transformations in Regression Analysis Idea: non-linear models can often be transformed to a linear form Can be estimated by least squares if transformed transform X or Y or both to get a better fit or to deal with violations of regression assumptions Can be based on theory, logic or scatter plots DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall The Square Root Transformation The square-root transformation Used to overcome violations of the constant variance assumption fit a non-linear relationship DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Shape of original relationship X The Square Root Transformation (continued) b 1 > 0 b 1 < 0 X Y Y Y Y Relationship when transformed DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Original multiplicative model The Log Transformation Transformed multiplicative model The Multiplicative Model: Original multiplicative model Transformed exponential model The Exponential Model: DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Interpretation of coefficients For the multiplicative model: When both dependent and independent variables are logged: The coefficient of the independent variable X k can be interpreted as : a 1 percent change in X k leads to an estimated b k percentage change in the average value of Y. Therefore b k is the elasticity of Y with respect to a change in X k. DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Collinearity Collinearity: High correlation exists among two or more independent variables This means the correlated variables contribute redundant information to the multiple regression model DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Collinearity Including two highly correlated independent variables can adversely affect the regression results No new information provided Can lead to unstable coefficients (large standard error and low t-values) Coefficient signs may not match prior expectations (continued) DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Some Indications of Strong Collinearity Incorrect signs on the coefficients Large change in the value of a previous coefficient when a new variable is added to the model A previously significant variable becomes non- significant when a new independent variable is added The estimate of the standard deviation of the model increases when a variable is added to the model DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Detecting Collinearity (Variance Inflationary Factor) VIF j is used to measure collinearity: If VIF j > 5, X j is highly correlated with the other independent variables where R 2 j is the coefficient of determination of variable X j with all other X variables DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Example: Pie Sales Sales = b 0 + b 1 (Price) + b 2 (Advertising) Week Pie Sales Price ($) Advertising ($100s) Recall the multiple regression equation of chapter 14: DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Detecting Collinearity in Excel using PHStat Output for the pie sales example: Since there are only two independent variables, only one VIF is reported VIF is < 5 There is no evidence of collinearity between Price and Advertising Regression Analysis Price and all other X Regression Statistics Multiple R R Square Adjusted R Square Standard Error Observations15 VIF PHStat / regression / multiple regression … Check the “variance inflationary factor (VIF)” box DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Model Building Goal is to develop a model with the best set of independent variables Easier to interpret if unimportant variables are removed Lower probability of collinearity Stepwise regression procedure Provide evaluation of alternative models as variables are added and deleted Best-subset approach Try all combinations and select the best using the highest adjusted r 2 and lowest standard error DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Idea: develop the least squares regression equation in steps, adding one independent variable at a time and evaluating whether existing variables should remain or be removed The coefficient of partial determination is the measure of the marginal contribution of each independent variable, given that other independent variables are in the model Stepwise Regression DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Best Subsets Regression Idea: estimate all possible regression equations using all possible combinations of independent variables Choose the best fit by looking for the highest adjusted r 2 and lowest standard error Stepwise regression and best subsets regression can be performed using PHStat DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Alternative Best Subsets Criterion Calculate the value C p for each potential regression model Consider models with C p values close to or below k + 1 k is the number of independent variables in the model under consideration DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Alternative Best Subsets Criterion The C p Statistic Where k = number of independent variables included in a particular regression model T = total number of parameters to be estimated in the full regression model = coefficient of multiple determination for model with k independent variables = coefficient of multiple determination for full model with all T estimated parameters (continued) DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Steps in Model Building 1. Compile a listing of all independent variables under consideration 2. Estimate full model and check VIFs 3. Check if any VIFs > 5 If no VIF > 5, go to step 4 If one VIF > 5, remove this variable If more than one, eliminate the variable with the highest VIF and go back to step 2 4.Perform best subsets regression with remaining variables … DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Steps in Model Building 5. List all models with C p close to or less than (k + 1) 6. Choose the best model Consider parsimony Do extra variables make a significant contribution? 7.Perform complete analysis with chosen model, including residual analysis 8.Transform the model if necessary to deal with violations of linearity or other model assumptions 9.Use the model for prediction and inference (continued) DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Model Building Flowchart Choose X 1,X 2,…X k Run regression to find VIFs Remove variable with highest VIF Any VIF>5? Run subsets regression to obtain “best” models in terms of C p Do complete analysis Add quadratic and/or interaction terms or transform variables Perform predictions No More than one? Remove this X Yes No Yes DCOVA

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Pitfalls and Ethical Considerations Understand that interpretation of the estimated regression coefficients are performed holding all other independent variables constant Evaluate residual plots for each independent variable Evaluate interaction terms To avoid pitfalls and address ethical considerations:

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Additional Pitfalls and Ethical Considerations Obtain VIFs for each independent variable before determining which variables should be included in the model Examine several alternative models using best- subsets regression Use other methods when the assumptions necessary for least-squares regression have been seriously violated To avoid pitfalls and address ethical considerations: (continued)

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Chapter Summary Developed the quadratic regression model Discussed using transformations in regression models The multiplicative model The exponential model Described collinearity Discussed model building Stepwise regression Best subsets Addressed pitfalls in multiple regression and ethical considerations

Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording, or otherwise, without the prior written permission of the publisher. Printed in the United States of America.