Measures of Regression and Prediction Intervals

Slides:



Advertisements
Similar presentations
Lesson 10: Linear Regression and Correlation
Advertisements

Inference for Regression
Statistics Measures of Regression and Prediction Intervals.
 Coefficient of Determination Section 4.3 Alan Craig
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-4 Variation and Prediction Intervals.
Chapter 12 Simple Linear Regression
Introduction to Regression Analysis
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
Linear Regression and Correlation
SIMPLE LINEAR REGRESSION
Simple Linear Regression Analysis
Chapter 9: Correlation and Regression
Linear Regression Larson/Farber 4th ed. 1 Section 9.2.
Correlation and Regression
SIMPLE LINEAR REGRESSION
Simple Linear Regression and Correlation
Confidence Intervals for the Mean (σ Unknown) (Small Samples)
Sec 10.3 Coefficient of Determination and Standard Error of the Estimate.
SIMPLE LINEAR REGRESSION
Correlation and Regression
Introduction to Linear Regression and Correlation Analysis
Correlation and Regression
Chapter Correlation and Regression 1 of 84 9 © 2012 Pearson Education, Inc. All rights reserved.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
Section 8.2 Estimating  When  is Unknown
SECTION 6.4 Confidence Intervals for Variance and Standard Deviation Larson/Farber 4th ed 1.
Correlation and Regression
CHAPTER 14 MULTIPLE REGRESSION
© 2014 by Pearson Higher Education, Inc Upper Saddle River, New Jersey All Rights Reserved HLTH 300 Biostatistics for Public Health Practice, Raul.
Production Planning and Control. A correlation is a relationship between two variables. The data can be represented by the ordered pairs (x, y) where.
Correlation and Regression Chapter 9. § 9.3 Measures of Regression and Prediction Intervals.
Elementary Statistics Correlation and Regression.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Variation and Prediction Intervals
Copyright © 2005 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Managerial Economics Thomas Maurice eighth edition Chapter 4.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Section 9.2 Linear Regression. Section 9.2 Objectives Find the equation of a regression line Predict y-values using a regression equation.
Section 12.3 Regression Analysis HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant Systems, Inc. All.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 12.3.
Confidence Intervals for the Mean (Small Samples) 1 Larson/Farber 4th ed.
Section 6.1 Confidence Intervals for the Mean (Large Samples) Larson/Farber 4th ed.
Linear Regression 1 Section 9.2. Section 9.2 Objectives 2 Find the equation of a regression line Predict y-values using a regression equation.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Copyright © 2015, 2012, and 2009 Pearson Education, Inc. 1 Chapter Correlation and Regression 9.
Copyright © 2015, 2012, and 2009 Pearson Education, Inc. 1 Section 6.2 Confidence Intervals for the Mean (  Unknown)
Section 6.2 Confidence Intervals for the Mean (Small Samples) Larson/Farber 4th ed.
Chapter Outline 6.1 Confidence Intervals for the Mean (Large Samples) 6.2 Confidence Intervals for the Mean (Small Samples) 6.3 Confidence Intervals for.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Section 9.3 Measures of Regression and Prediction Intervals.
Correlation and Regression Elementary Statistics Larson Farber Chapter 9 Hours of Training Accidents.
Section 6.2 Confidence Intervals for the Mean (Small Samples) © 2012 Pearson Education, Inc. All rights reserved. 1 of 83.
Chapter Correlation and Regression 1 of 84 9 © 2012 Pearson Education, Inc. All rights reserved.
Correlation and Linear Regression Chapter 13 McGraw-Hill/Irwin Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved.
Correlation and Regression
10.2 Regression If the value of the correlation coefficient is significant, the next step is to determine the equation of the regression line which is.
Basic Estimation Techniques
Chapter 6 Confidence Intervals.
Section 9-3   We already know how to calculate the correlation coefficient, r. The square of this coefficient is called the coefficient of determination.
Basic Estimation Techniques
Correlation and Regression
Coefficient of Determination and Standard Error of the Estimate.
Elementary Statistics: Picturing The World
Chapter 6 Confidence Intervals.
Correlation and Regression-III
Correlation and Regression
SIMPLE LINEAR REGRESSION
SIMPLE LINEAR REGRESSION
Chapter 6 Confidence Intervals.
Section 9-3   We already know how to calculate the correlation coefficient, r. The square of this coefficient is called the coefficient of determination.
Measures of Regression Prediction Interval
Presentation transcript:

Measures of Regression and Prediction Intervals Section 9.3 Measures of Regression and Prediction Intervals

Section 9.3 Objectives Interpret the three types of variation about a regression line Find and interpret the coefficient of determination Find and interpret the standard error of the estimate for a regression line Construct and interpret a prediction interval for y

Variation About a Regression Line Three types of variation about a regression line Total variation Explained variation Unexplained variation To find the total variation, you must first calculate The total deviation The explained deviation The unexplained deviation

Variation About a Regression Line Total Deviation = Explained Deviation = Unexplained Deviation = y (xi, yi) Unexplained deviation Total deviation Explained deviation (xi, ŷi) (xi, yi) x

Variation About a Regression Line Total variation The sum of the squares of the differences between the y-value of each ordered pair and the mean of y. Explained variation The sum of the squares of the differences between each predicted y-value and the mean of y. Total variation = Explained variation =

Variation About a Regression Line Unexplained variation The sum of the squares of the differences between the y-value of each ordered pair and each corresponding predicted y-value. Unexplained variation = The sum of the explained and unexplained variation is equal to the total variation. Total variation = Explained variation + Unexplained variation

Coefficient of Determination The ratio of the explained variation to the total variation. Denoted by r2

Example: Coefficient of Determination The correlation coefficient for the advertising expenses and company sales data as calculated in Section 9.1 is r ≈ 0.913. Find the coefficient of determination. What does this tell you about the explained variation of the data about the regression line? About the unexplained variation? Solution: About 83.4% of the variation in the company sales can be explained by the variation in the advertising expenditures. About 16.9% of the variation is unexplained.

The Standard Error of Estimate The standard deviation of the observed yi -values about the predicted ŷ-value for a given xi -value. Denoted by se. The closer the observed y-values are to the predicted y- values, the smaller the standard error of estimate will be. n is the number of ordered pairs in the data set

The Standard Error of Estimate In Words In Symbols Make a table that includes the column heading shown. Use the regression equation to calculate the predicted y-values. Calculate the sum of the squares of the differences between each observed y-value and the corresponding predicted y-value. Find the standard error of estimate.

Example: Standard Error of Estimate The regression equation for the advertising expenses and company sales data as calculated in section 9.2 is ŷ = 50.729x + 104.061 Find the standard error of estimate. Solution: Use a table to calculate the sum of the squared differences of each observed y-value and the corresponding predicted y-value.

Solution: Standard Error of Estimate x y ŷ i (yi – ŷ i)2 2.4 225 225.81 (225 – 225.81)2 = 0.6561 1.6 184 185.23 (184 – 185.23)2 = 1.5129 2.0 220 205.52 (220 – 205.52)2 = 209.6704 2.6 240 235.96 (240 – 235.96)2 = 16.3216 1.4 180 175.08 (180 – 175.08)2 = 24.2064 186 (186 – 205.52)2 = 381.0304 2.2 215 215.66 (215 – 215.66)2 = 0.4356 Σ = 635.3463 unexplained variation

Solution: Standard Error of Estimate n = 8, Σ(yi – ŷ i)2 = 635.3463 The standard error of estimate of the company sales for a specific advertising expense is about $10.29.

Prediction Intervals Two variables have a bivariate normal distribution if for any fixed value of x, the corresponding values of y are normally distributed and for any fixed values of y, the corresponding x-values are normally distributed.

Prediction Intervals A prediction interval can be constructed for the true value of y. Given a linear regression equation ŷ = mx + b and x0, a specific value of x, a c-prediction interval for y is ŷ – E < y < ŷ + E where The point estimate is ŷ and the margin of error is E. The probability that the prediction interval contains y is c.

Constructing a Prediction Interval for y for a Specific Value of x In Words In Symbols Identify the number of ordered pairs in the data set n and the degrees of freedom. Use the regression equation and the given x-value to find the point estimate ŷ. Find the critical value tc that corresponds to the given level of confidence c. d.f. = n – 2 Use Table 5 in Appendix B.

Constructing a Prediction Interval for y for a Specific Value of x In Words In Symbols Find the standard error of estimate se. Find the margin of error E. Find the left and right endpoints and form the prediction interval. Left endpoint: ŷ – E Right endpoint: ŷ + E Interval: ŷ – E < y < ŷ + E

Example: Constructing a Prediction Interval Construct a 95% prediction interval for the company sales when the advertising expenses are $2100. What can you conclude? Recall, n = 8, ŷ = 50.729x + 104.061, se = 10.290 Solution: Point estimate: ŷ = 50.729(2.1) + 104.061 ≈ 210.592 Critical value: d.f. = n –2 = 8 – 2 = 6 tc = 2.447

Solution: Constructing a Prediction Interval Left Endpoint: ŷ – E Right Endpoint: ŷ + E 210.592 – 26.857 ≈ 183.735 210.592 + 26.857 ≈ 237.449 183.735 < y < 237.449 You can be 95% confident that when advertising expenses are $2100, the company sales will be between $183,735 and $237,449.

Section 9.3 Summary Interpreted the three types of variation about a regression line Found and interpreted the coefficient of determination Found and interpreted the standard error of the estimate for a regression line Constructed and interpreted a prediction interval for y