Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.

Slides:



Advertisements
Similar presentations
Forecasting Using the Simple Linear Regression Model and Correlation
Advertisements

Inference for Regression
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-4 Variation and Prediction Intervals.
Linear regression models
Correlation and Regression
Inference for Regression 1Section 13.3, Page 284.
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
The Simple Regression Model
SIMPLE LINEAR REGRESSION
Chapter Topics Types of Regression Models
Chapter 11 Multiple Regression.
Simple Linear Regression Analysis
Regression Chapter 10 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
Quantitative Business Analysis for Decision Making Simple Linear Regression.
SIMPLE LINEAR REGRESSION
Introduction to Regression Analysis, Chapter 13,
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.
Relationships Among Variables
1 PREDICTION In the previous sequence, we saw how to predict the price of a good or asset given the composition of its characteristics. In this sequence,
Correlation & Regression
Correlation and Linear Regression
SIMPLE LINEAR REGRESSION
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 12 Analyzing the Association Between Quantitative Variables: Regression Analysis Section.
Chapter 13: Inference in Regression
Linear Regression and Correlation
Correlation and Linear Regression
Linear Regression Inference
Introduction to Statistical Inferences
Copyright © Cengage Learning. All rights reserved. 12 Simple Linear Regression and Correlation.
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Confidence Interval Estimation
CPE 619 Simple Linear Regression Models Aleksandar Milenković The LaCASA Laboratory Electrical and Computer Engineering Department The University of Alabama.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
Simple Linear Regression Models
Chapter 13: Linear Correlation and Regression Analysis
CHAPTER 14 MULTIPLE REGRESSION
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Y X 0 X and Y are not perfectly correlated. However, there is on average a positive relationship between Y and X X1X1 X2X2.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 8-1 Confidence Interval Estimation.
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Copyright ©2011 Brooks/Cole, Cengage Learning Inference about Simple Regression Chapter 14 1.
Section 12.3 Regression Analysis HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant Systems, Inc. All.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 12.3.
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
Applied Quantitative Analysis and Practices LECTURE#25 By Dr. Osman Sadiq Paracha.
Chapter 8: Simple Linear Regression Yang Zhenlin.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
1 ES9 Chapter 24 ~ Linear Correlation & Regression Analysis Weight Waist Size.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics: A First Course 5 th Edition.
The “Big Picture” (from Heath 1995). Simple Linear Regression.
Slide 1 Copyright © 2004 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-1 Overview Overview 10-2 Correlation 10-3 Regression-3 Regression.
Inference about the slope parameter and correlation
Regression and Correlation
Chapter 14 Inference on the Least-Squares Regression Model and Multiple Regression.
Correlation and Regression
Chapter 10 Correlation and Regression
Descriptive Analysis and Presentation of Bivariate Data
Copyright © Cengage Learning. All rights reserved.
SIMPLE LINEAR REGRESSION
SIMPLE LINEAR REGRESSION
Presentation transcript:

Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis

Copyright © Cengage Learning. All rights reserved Confidence Intervals for Regression

3 Confidence Intervals for Regression The confidence interval for and the prediction interval for are constructed in a similar fashion, with replacing as our point estimate. If we were to randomly select several samples from the population, construct the line of best fit for each sample, calculate for a given x using each regression line, and plot the various values (they would vary because each sample would yield a slightly different regression line), we would find that the values form a normal distribution.

4 Confidence Intervals for Regression That is, the sampling distribution of is normal, just as the sampling distribution of is normal. What about the appropriate standard deviation of ? The standard deviation in both cases ( and ) is calculated by multiplying the square root of the variance of the error by an appropriate correction factor. We know that the variance of the error,, is calculated by means of formula (13.8).

5 Confidence Intervals for Regression Before we look at the correction factors for the two cases, let’s see why they are necessary. We know that the line of best fit passes through the point, the centroid. If we draw lines with slopes equal to the extremes of that confidence interval, 1.27 to 2.51, through the point, [which is (12.3, 26.9)] on the scatter diagram, we will see that the value for fluctuates considerably for different values of x (Figure 13.11). Lines Representing the Confidence Interval for Slope FIGURE 13.11

6 Confidence Intervals for Regression Therefore, we should suspect a need for a wider confidence interval as we select values of x that are farther away from x. Hence we need a correction factor to adjust for the distance between x 0 and x. This factor must also adjust for the variation of the y values about. First, let’s estimate the mean value of y at a given value of x,. The confidence interval formula is: (13.16)

7 Confidence Intervals for Regression Note The numerator of the second term under the radical sign is the square of the distance of x 0 from. The denominator is closely related to the variance of x and has a “standardizing effect” on this term. Formula (13.16) can be modified for greater ease of calculation. Here is the new form:

8 Confidence Intervals for Regression Let’s compare formula (13.16) with formula (9.1): replaces, and (13.16)

9 Confidence Intervals for Regression the estimated standard deviation of in estimating, replaces, the standard deviation of.The degrees of freedom are now n – 2 instead of n – 1 as before.

10 Example 10 – Constructing a Confidence Interval for  Y|x 0 Construct a 95% confidence interval for the mean travel time for the co-workers who travel 7 miles to work (refer to Example 5 in Section 13.3). Solution: Step 1 Parameter of interest:  y|x = 7, the mean travel time for co-workers who travel 7 miles to work Step 2 a. Assumptions: The ordered pairs form a random sample, and we will assume that the y values minutes) at each x (miles) have a normal distribution. cont’d

11 Example 10 – Solution b. Probability distribution and formula: Student’s t-distribution and formula (13.17) c. Level of confidence: 1 –  = 0.95 Step 3 Sample information: where and therefore, = (found in example 5 in section 13.3) S e = = 5.40 cont’d

12 Example 10 – Solution = x = (7) = Step 4 a. Confidence coefficient: t (13, 0.025) = 2.16 (from Table 6 in Appendix B) b. Maximum error of estimate: Using formula (13.17), we have cont’d

13 Example 10 – Solution c. Lower and upper confidence limits: Thus, to is the 95% confidence interval for  x|y = 7. That is, with 95% confidence, the mean travel time for commuters that travel 7 miles is between minutes (12 min, 26 sec) and minutes (21 min, 18 sec). cont’d

14 Example 10 – Solution This confidence interval is shown in Figure by the dark red vertical line. cont’d Confidence Belts for  Y|x 0 Figure 13.12

15 Example 10 – Solution The confidence belt showing the upper and lower boundaries of all intervals at 95% confidence is also shown in red. Notice that the boundary lines for the x values far away from become close to the two lines that represent the equations with slopes equal to the extreme values of the 95% confidence interval for the slope (see Figure 13.12). cont’d

16 Confidence Intervals for Regression The formula for the prediction interval of the value of a single randomly selected y is

17 Confidence Intervals for Regression There are three basic precautions that you need to be aware of as you work with regression analysis: 1. Remember that the regression equation is meaningful only in the domain of the x-variable studied. Estimation outside this domain is extremely dangerous; it requires that we know or assume that the relationship between x and y remains the same outside the domain of the sample data. However, although projections outside the interval may be somewhat dangerous, they may be the best predictors available.

18 Confidence Intervals for Regression 2. Don’t get caught by the common fallacy of applying the regression results inappropriately. Basically, the results of one sample should not be used to make inferences about a population other than the one from which the sample was drawn.

19 Confidence Intervals for Regression 3. Don’t jump to the conclusion that the results of the regression prove that x causes y to change. (This is perhaps the most common fallacy.) Regressions measure only movement between x and y; they never prove causation. The most common difficulty in this regard occurs because of what is called the missing variable, or third- variable, effect. That is, we observe a relationship between x and y because a third variable, one that is not in the regression, affects both x and y.