Essentials of Statistics for Business and Economics (8e)

Slides:



Advertisements
Similar presentations
Pengujian Parameter Regresi Pertemuan 26 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Advertisements

6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Objectives (BPS chapter 24)
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Simple Linear Regression Estimates for single and mean responses.
Chapter 12 Simple Linear Regression
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Chapter 12b Testing for significance—the t-test Developing confidence intervals for estimates of β 1. Testing for significance—the f-test Using Excel’s.
1 1 Slide 統計學 Spring 2004 授課教師:統計系余清祥 日期: 2004 年 5 月 4 日 第十二週:複迴歸.
Simple Linear Regression Analysis
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 Pertemuan 13 Regresi Linear dan Korelasi Matakuliah: I0262 – Statistik Probabilitas Tahun: 2007 Versi: Revisi.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
Inference for regression - Simple linear regression
Chapter 12 Multiple Regression and Model Building.
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Statistics for Business and Economics Dr. TANG Yu Department of Mathematics Soochow University May 28, 2007.
1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide Simple Linear Regression Part A n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n.
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple Coefficient of Determination n Model Assumptions n Testing.
1 1 Slide © 2004 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide Simple Linear Regression Coefficient of Determination Chapter 14 BA 303 – Spring 2011.
Introduction to Linear Regression
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Chapter 11 Linear Regression Straight Lines, Least-Squares and More Chapter 11A Can you pick out the straight lines and find the least-square?
Introduction to Probability and Statistics Thirteenth Edition Chapter 12 Linear Regression and Correlation.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
1 1 Slide Simple Linear Regression Estimation and Residuals Chapter 14 BA 303 – Spring 2011.
1 1 Slide © 2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
© 2016 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Chapter 12 Simple Linear Regression.
Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.
732G21/732G28/732A35 Lecture 4. Variance-covariance matrix for the regression coefficients 2.
1 1 Slide © 2011 Cengage Learning Assumptions About the Error Term  1. The error  is a random variable with mean of zero. 2. The variance of , denoted.
Lecturer: Ing. Martina Hanová, PhD.. Regression analysis Regression analysis is a tool for analyzing relationships between financial variables:  Identify.
23. Inference for regression
Warm-Up The least squares slope b1 is an estimate of the true slope of the line that relates global average temperature to CO2. Since b1 = is very.
Lecture 11: Simple Linear Regression
Chapter 20 Linear and Multiple Regression
Chapter 14 Inference on the Least-Squares Regression Model and Multiple Regression.
Inference for Least Squares Lines
Pertemuan 22 Analisis Varians Untuk Regresi
Pengujian Parameter Regresi dan Korelasi Pertemuan 20
Essentials of Modern Business Statistics (7e)
Chapter 13 Created by Bethany Stubbe and Stephan Kogitz.
John Loucks St. Edward’s University . SLIDES . BY.
Simple Linear Regression
Statistics for Business and Economics (13e)
Chapter 13 Simple Linear Regression
Slides by JOHN LOUCKS St. Edward’s University.
Business Statistics Multiple Regression This lecture flows well with
Lecture Slides Elementary Statistics Thirteenth Edition
The Practice of Statistics in the Life Sciences Fourth Edition
CHAPTER 29: Multiple Regression*
Solution 9 1. a) From the matrix plot, 1) The assumption about linearity seems ok; 2).The assumption about measurement errors can not be checked at this.
Multiple Regression Chapter 14.
Chapter Fourteen McGraw-Hill/Irwin
Chapter 13 Additional Topics in Regression Analysis
Chapter Thirteen McGraw-Hill/Irwin
St. Edward’s University
Chapter 13 Simple Linear Regression
Presentation transcript:

Essentials of Statistics for Business and Economics (8e) Anderson, Sweeney, Williams, Camm, Cochran © 2018 Cengage Learning

Chapter 14, Part B Simple Linear Regression Using the Estimated Regression Equation for Estimation and Prediction Computer Solution Residual Analysis: Validating Model Assumptions Residual Analysis: Outliers and Influential Observations

Using the Estimated Regression Equation for Estimation and Prediction A confidence interval is an interval estimate of the mean value of y for a given value of x. A prediction interval is used whenever we want to predict an individual value of y for a new observation corresponding to a given value of x. The margin of error is larger for a prediction interval.

Using the Estimated Regression Equation for Estimation and Prediction Confidence Interval Estimate of E(y*) 𝑦 ∗ ± 𝑡 𝛼/2 𝑠 𝑦 ∗ Prediction Interval Estimate of y* 𝑦 ∗ ± 𝑡 𝛼/2 𝑠 𝑝𝑟𝑒𝑑 where: confidence coefficient is 1 -  and t/2 is based on a t distribution with n - 2 degrees of freedom

Point Estimation If 3 TV ads are run prior to a sale, we expect the mean number of cars sold to be: 𝑦 =10+5 3 =25 cars

Confidence Interval for E(y*) Estimate of the Standard Deviation of 𝑦 ∗ 𝑠 𝑦 ∗ =𝑠 1 𝑛 + 𝑥 ∗ − 𝑥 2 𝑥 𝑖 − 𝑥 2 𝑠 𝑦 ∗ =2.16025 1 5 + 3−2 2 1−2 2 + 3−2 2 +…+ 3−2 2 𝑠 𝑦 ∗ =2.16025 1 5 + 1 4 = 1.4491

Confidence Interval for E(y*) The 95% confidence interval estimate of the mean number of cars sold when 3 TV ads are run is: 𝑦 ∗ ± 𝑡 𝛼/2 𝑠 𝑦 ∗ 25 + 3.1824(1.4491) 25 + 4.61 20.39 to 29.61 cars

Prediction Interval for y* Estimate of the Standard Deviation of an Individual Value of y* 𝑠 𝑝𝑟𝑒𝑑 =𝑠 1+ 1 𝑛 + 𝑥 ∗ − 𝑥 2 𝑥 𝑖 − 𝑥 2 𝑠 𝑝𝑟𝑒𝑑 =2.16025 1+ 1 5 + 1 4 spred = 2.16025(1.20416) = 2.6013

Prediction Interval for y* The 95% prediction interval estimate of the number of cars sold in one particular week when 3 TV ads are run is: 𝑦 ∗ ± 𝑡 𝛼/2 𝑠 𝑝𝑟𝑒𝑑 25 + 3.1824(2.6013) 25 + 8.28 16.72 to 33.28 cars

Computer Solution Up to this point, you have seen how Excel can be used for various parts of a regression analysis. Excel also has a comprehensive tool in its Data Analysis package called Regression. The Regression tool can be used to perform a complete regression analysis.

Computer Solution Performing the regression analysis computations without the help of a computer can be quite time consuming. On the next slide we show Minitab output for the Reed Auto Sales example. Recall that the independent variable was named Ads and the dependent variable was named Cars in the example.

Minitab Output Estimated ANOVA Regression Table Equation Interval The regression equation is Cars = 10.0 + 5.00 Ads Predictor Coef SE Coef T p Constant 10.000 2.366 4.23 0.024 Ads 5.0000 1.080 4.63 0.019 S = 2.16025 R-sq = 87.7% R-sq(adj) = 83.6% Analysis of Variance SOURCE DF SS MS F Regression 1 100 21.43 Residual Err. 3 14 4.667 Total 4 114 Predicted Values for New Observations New Obs Fit SE Fit 95% C.I. 95% P.I. 25 1.45 (20.39, 29.61) (16.72, 33.28) Estimated Regression Equation ANOVA Table Interval Estimates

Minitab Output Minitab prints the estimated regression equation as Cars = 10.0 + 5.00 Ads. For each of the coefficients b0 and b1, the output shows its value, standard deviation, t value, and p-value. Minitab prints the standard error of the estimate, s, as well as information about the goodness of fit. . The standard ANOVA table is printed. Also provided are the 95% confidence interval estimate of the expected number of cars sold and the 95% prediction interval estimate of the number of cars sold for an individual weekend with 3 ads.

Using Excel’s Regression Tool Excel Output (top portion) A B C 9 10 Regression Statistics 11 Multiple R 0.936585812 12 R Square 0.877192982 13 Adjusted R Square 0.83625731 14 Standard Error 2.160246899 15 Observations 5 16

Using Excel’s Regression Tool Excel Output (middle portion) A B C D E F 16 17 ANOVA 18 df SS MS Significance F 19 Regression 1 100 21.4286 0.018986231 20 Residual 3 14 4.66667 21 Total 4 114 22

Using Excel’s Regression Tool Excel Output (bottom-left portion) Note: Columns F-I are not shown.

Using Excel’s Regression Tool Excel Output (bottom-right portion) Note: Columns C-E are hidden.

Residual Analysis If the assumptions about the error term e appear questionable, the hypothesis tests about the significance of the regression relationship and the interval estimation results may not be valid. The residuals provide the best information about e . Residual for observation i 𝑦 𝑖 − 𝑦 𝑖 Much of the residual analysis is based on an examination of graphical plots.

Residual Plot Against x If the assumption that the variance of e is the same for all values of x is valid, and the assumed regression model is an adequate representation of the relationship between the variables, then the residual plot should give an overall impression of a horizontal band of points.

Residual Plot Against x Good Pattern Residual 𝑦− 𝑦

Residual Plot Against x Residual Nonconstant Variance 𝑦− 𝑦

Model Form Not Adequate Residual Plot Against x x Residual Model Form Not Adequate 𝑦− 𝑦

Residual Plot Against x Residuals Observation Predicted Cars Sold Residuals 1 15 -1 2 25 3 20 -2 4 5

Residual Plot Against x

Standardized Residuals Standardized Residual for Observation i 𝑦 𝑖 − 𝑦 𝑖 𝑠 𝑦 𝑖 − 𝑦 𝑖 𝑠 𝑦 𝑖 − 𝑦 𝑖 =𝑠 1− ℎ 𝑖 where: ℎ 𝑖 = 1 𝑛 + 𝑥 𝑖 − 𝑥 2 𝑥 𝑖 − 𝑥 2

Standardized Residual Plot The standardized residual plot can provide insight about the assumption that the error term e has a normal distribution. If this assumption is satisfied, the distribution of the standardized residuals should appear to come from a standard normal probability distribution.

Standardized Residual Plot Standardized Residuals Predicted y Residual Standardized 1 15 -1 -0.5345 2 25 3 20 -2 -1.0690 4 1.0690 5 Observation

Standardized Residual Plot

Standardized Residual Plot All of the standardized residuals are between –1.5 and +1.5 indicating that there is no reason to question the assumption that e has a normal distribution.

Outliers and Influential Observations Detecting Outliers An outlier is an observation that is unusual in comparison with the other data. Minitab classifies an observation as an outlier if its standardized residual value is < -2 or > +2. This standardized residual rule sometimes fails to identify an unusually large observation as being an outlier. This rule’s shortcoming can be circumvented by using studentized deleted residuals. The |i th studentized deleted residual| will be larger than the |i th standardized residual|.

End of Chapter 14, Part B