13- 1 Chapter Thirteen McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.

Slides:



Advertisements
Similar presentations
Lesson 10: Linear Regression and Correlation
Advertisements

Chapter Thirteen McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved. Linear Regression and Correlation.
Learning Objectives Copyright © 2004 John Wiley & Sons, Inc. Bivariate Correlation and Regression CHAPTER Thirteen.
Objectives (BPS chapter 24)
© The McGraw-Hill Companies, Inc., 2000 CorrelationandRegression Further Mathematics - CORE.
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Summarizing Bivariate Data Introduction to Linear Regression.
Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson11-1 Lesson 11: Linear Regression and Correlation.
1-1 Regression Models  Population Deterministic Regression Model Y i =  0 +  1 X i u Y i only depends on the value of X i and no other factor can affect.
9. SIMPLE LINEAR REGESSION AND CORRELATION
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Simple Linear Regression Basic Business Statistics 11 th Edition.
Correlation and Regression Analysis
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Linear Regression and Correlation
Ka-fu Wong © 2003 Chap Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.
SIMPLE LINEAR REGRESSION
Linear Regression and Correlation Analysis
Simple Linear Regression Analysis
Regression Chapter 10 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
SIMPLE LINEAR REGRESSION
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Correlation and Linear Regression
Correlation and Linear Regression
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
Correlation and Linear Regression Chapter 13 Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Correlation and Regression
Chapter 12 Correlation and Regression Part III: Additional Hypothesis Tests Renee R. Ha, Ph.D. James C. Ha, Ph.D Integrative Statistics for the Social.
SIMPLE LINEAR REGRESSION
Introduction to Linear Regression and Correlation Analysis
Linear Regression and Correlation
Correlation and Linear Regression
Chapter 6 & 7 Linear Regression & Correlation
© The McGraw-Hill Companies, Inc., Chapter 11 Correlation and Regression.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 12-1 Correlation and Regression.
Introduction to Linear Regression
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
Examining Relationships in Quantitative Research
Introduction to Probability and Statistics Thirteenth Edition Chapter 12 Linear Regression and Correlation.
14- 1 Chapter Fourteen McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
Chapter Thirteen Copyright © 2006 John Wiley & Sons, Inc. Bivariate Correlation and Regression.
CHAPTER 5 CORRELATION & LINEAR REGRESSION. GOAL : Understand and interpret the terms dependent variable and independent variable. Draw a scatter diagram.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Linear Regression and Correlation Chapter GOALS 1. Understand and interpret the terms dependent and independent variable. 2. Calculate and interpret.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Regression Analysis Deterministic model No chance of an error in calculating y for a given x Probabilistic model chance of an error First order linear.
Chapter 12 Simple Linear Regression.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Linear Regression and Correlation Chapter 13.
Correlation and Regression Elementary Statistics Larson Farber Chapter 9 Hours of Training Accidents.
Bivariate Regression. Bivariate Regression analyzes the relationship between two variables. Bivariate Regression analyzes the relationship between two.
Correlation and Linear Regression Chapter 13 McGraw-Hill/Irwin Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Linear Regression and Correlation Chapter 13.
Chapter 13 Linear Regression and Correlation. Our Objectives  Draw a scatter diagram.  Understand and interpret the terms dependent and independent.
Correlation and Linear Regression
Regression and Correlation
Chapter 13 Created by Bethany Stubbe and Stephan Kogitz.
Inferential Statistics and Probability a Holistic Approach
Correlation and Regression
SIMPLE LINEAR REGRESSION
SIMPLE LINEAR REGRESSION
Chapter Thirteen McGraw-Hill/Irwin
Linear Regression and Correlation
Chapter 13 Simple Linear Regression
Presentation transcript:

13- 1 Chapter Thirteen McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.

13- 2 Chapter Thirteen Linear Regression and Correlation GOALS When you have completed this chapter, you will be able to: ONE Draw a scatter diagram. TWO Understand and interpret the terms dependent variable and independent variable. THREE Calculate and interpret the coefficient of correlation, the coefficient of determination, and the standard error of estimate. FOUR Conduct a test of hypothesis to determine if the population coefficient of correlation is different from zero. Goals

13- 3 Chapter Thirteen continued Linear Regression and Correlation GOALS When you have completed this chapter, you will be able to: FIVE Calculate the least squares regression line and interpret the slope and intercept values. SIX Construct and interpret a confidence interval and prediction interval for the dependent variable. SEVEN Set up and interpret an ANOVA table. Goals

13- 4 Correlation Analysis Independent Variable The Independent Variable provides the basis for estimation. It is the predictor variable. Correlation Analysis Correlation Analysis is a group of statistical techniques to measure the association between two variables. Scatter Diagram A Scatter Diagram is a chart that portrays the relationship between two variables. Dependent Variable The Dependent Variable is the variable being predicted or estimated.

13- 5 The Coefficient of Correlation, r Negative values indicate an inverse relationship and positive values indicate a direct relationship. Coefficient of Correlation The Coefficient of Correlation (r) is a measure of the strength of the relationship between two variables. Also called Pearson’s r and Pearson’s product moment correlation coefficient. It requires interval or ratio- scaled data. It can range from to Values of or 1.00 indicate perfect and strong correlation. Values close to 0.0 indicate weak correlation.

13- 6 Perfect Negative Correlation X Y

X Y Perfect Positive Correlation

X Y Zero Correlation

X Y Strong Positive Correlation

Formula for r We calculate the coefficient of correlation from the following formula. r=  (X – X)(Y – Y) (n-1)s x s y

Coefficient of Determination It is the square of the coefficient of correlation. It ranges from 0 to 1. It does not give any information on the direction of the relationship between the variables. coefficient of determination The coefficient of determination (r 2 ) is the proportion of the total variation in the dependent variable (Y) that is explained or accounted for by the variation in the independent variable (X).

Example 1 Dan Ireland, the student body president at Toledo State University, is concerned about the cost to students of textbooks. He believes there is a relationship between the number of pages in the text and the selling price of the book. To provide insight into the problem he selects a sample of eight textbooks currently on sale in the bookstore. Draw a scatter diagram. Compute the correlation coefficient.

Example 1 continued BookPage Price($) Introduction to History Basic Algebra Introduction to Psychology Introduction to Sociology Business Management Introduction to Biology Fundamentals of Jazz Principles of Nursing800 93

Example 1 continued

EXCELEXCEL Example 1 continued

 (X – X)(Y – Y) Example 1 continued

 (X – X)(Y – Y) (n-1)s x s y r = = (138.87)(12.21) =.657 r 2 = =.432 The correlation between the number of pages and the selling price of the book is This indicates a moderate association between the variable. Example 1 continued

Did a computed r come from a population of paired observations with zero correlation? t test for the coefficient of correlation t = r  n- 2  1- r 2 With n-2 d.f. H o :  = 0 (The correlation in the population is zero.) H 1 :  ≠ 0 (The correlation in the population is different from zero. T-test of significance of r

Step 4 H 0 is rejected if t>3.143 or if t< or if p <.02. There are 6 degrees of freedom, found by n – 2 = 8 – 2 = 6. Step 3 The statistic to use follows the t distribution. Step 1 H 0 : the correlation in the population is zero. H 1 :The correlation in the population is not zero. Computed r =.657. Test the hypothesis that there is no correlation in the population. Use a.02 significance level. Step 2 Significance level is.02.

Example 1 continued H 0 is not rejected. We cannot reject the hypothesis that there is no correlation in the population. The amount of association could be due to chance. t = r  n- 2  1- r 2 =.657  8 – 2  = p(t > 2.135) =.077 Step 5 Find the value of the test statistic.

Regression Analysis The least squares criterion is used to determine the equation. That is the term  (Y – Y’) 2 is minimized. Regression Analysis In Regression Analysis we use the independent variable (X) to estimate the dependent variable (Y). The relationship between the variables is linear. Both variables must be at least interval scale.

Regression Analysis The regression equation is Y’= a + bX where Y’ is the average predicted value of Y for any X. a is the Y-intercept. It is the estimated Y value when X=0 b is the slope of the line, or the average change in Y’ for each change of one unit in X The least squares principle is used to obtain a and b.

Regression Analysis b = r sysxsysx a = Y – bX The least squares principle is used to obtain a and b. The equations to determine a and b are:

Example 1 revisited b = r sysxsysx = (.657) =.0578 a = Y – bX = *625 = Develop a regression equation for the information given in example 1 that can be used to estimate the selling price based on the number of pages.

Example 1 revisited The regression equation is: Y’ = X The slope of the line is Each addition page costs about a nickel. The sign of the b value and the sign of r will always be the same. The equation crosses the Y-axis at $ A book with no pages would cost $43.39.

Example 1 revisited Price= $ (Number of Pages) = $ (800) = $89.61 We can use the regression equation to estimate values of Y. The estimated selling price of an 800 page book is $89.61, found by

The Standard Error of Estimate The formula that is used to compute the standard error: Standard Error of Estimate The Standard Error of Estimate measures the scatter, or dispersion, of the observed values around the line of regression

Find the standard error of estimate for the problem involving the number of pages in a book and the selling price. Example 1 revisited = 9.944

The Y values are statistically independent. This means that in the selection of a sample, the Y values chosen for a particular X value do not depend on the Y values for any other X values. For each value of X, there is a group of Y values, and these Y values are normally distributed. Assumptions Underlying Linear Regression The means of these normal distributions of Y values all lie on the straight line of regression. The standard deviations of these normal distributions are the same.

Confidence Interval Y’is the predicted value for any selected X value X is an selected value of X X is the mean of the Xs n is the number of observations S y.x is the standard error of the estimate t is the value of t at n-2 degrees of freedom The confidence interval for the mean value of Y for a given value of X is given by:

For our earlier price estimate of $89.61, the confidence interval, assuming a desired 95% confidence, is calculated as follows. Example 1

Example 1 revisited Y’ the predicted value, is $89.61 X is 800 pages X is 625, the mean of the pages n is 8, the number of observations S y.x is 9.944, the standard error of the estimate t is at 8-2 degrees of freedom and 95% confidence

Prediction Interval The prediction interval for an individual value of Y for a given value of X

Summarizing The Results  The estimated selling price for a book with 800 pages is $  The standard error of estimate is $9.94.  The 95 percent confidence interval for all books with 800 pages is $ $ This means the limits are between $75.18 and $  The 95 percent prediction interval for a particular book with 800 pages is $ $ The means the limits are between $61.32 and $  These results appear in the following Minitab and Excel outputs. Example 1 revisited

Regression Analysis The regression equation is Price = No of Pages Predictor Coef StDev T P Constant No of Pages S = R-Sq = 43.2% R-Sq(adj) = 33.7% Analysis of Variance Source DF SS MS F P Regression Error Total Fit StDev Fit 95.0% CI 95.0% PI ( 75.17, ) ( 61.31, ) Example 1 revisited MINITABMINITAB

Example 1 revisited EXCELEXCEL