Correlation and Regression

Slides:



Advertisements
Similar presentations
Lesson 10: Linear Regression and Correlation
Advertisements

13- 1 Chapter Thirteen McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Correlation and Regression
Objectives (BPS chapter 24)
© The McGraw-Hill Companies, Inc., 2000 CorrelationandRegression Further Mathematics - CORE.
Chapter 15 (Ch. 13 in 2nd Can.) Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression.
Chapter 12 Simple Linear Regression
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
Correlation and Regression
Linear Regression and Correlation
SIMPLE LINEAR REGRESSION
Chapter Topics Types of Regression Models
Correlation and Regression. Correlation What type of relationship exists between the two variables and is the correlation significant? x y Cigarettes.
Chapter 9: Correlation and Regression
SIMPLE LINEAR REGRESSION
Chapter 12 Section 1 Inference for Linear Regression.
1 1 Slide Simple Linear Regression Chapter 14 BA 303 – Spring 2011.
Correlation and Linear Regression
Sec 10.3 Coefficient of Determination and Standard Error of the Estimate.
SIMPLE LINEAR REGRESSION
Correlation and Regression
Introduction to Linear Regression and Correlation Analysis
Linear Regression and Correlation
Chapter Correlation and Regression 1 of 84 9 © 2012 Pearson Education, Inc. All rights reserved.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
Correlation and Regression
© The McGraw-Hill Companies, Inc., 2000 Business and Finance College Principles of Statistics Lecture 10 aaed EL Rabai week
EQT 272 PROBABILITY AND STATISTICS
1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
© The McGraw-Hill Companies, Inc., Chapter 11 Correlation and Regression.
Production Planning and Control. A correlation is a relationship between two variables. The data can be represented by the ordered pairs (x, y) where.
Elementary Statistics Correlation and Regression.
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
Unit 10 Correlation and Regression McGraw-Hill, Bluman, 7th ed., Chapter 10 1.
Simple Linear Regression. The term linear regression implies that  Y|x is linearly related to x by the population regression equation  Y|x =  +  x.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.
© The McGraw-Hill Companies, Inc., Chapter 10 Correlation and Regression.
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
Correlation and Regression Elementary Statistics Larson Farber Chapter 9 Hours of Training Accidents.
REGRESSION AND CORRELATION SIMPLE LINEAR REGRESSION 10.2 SCATTER DIAGRAM 10.3 GRAPHICAL METHOD FOR DETERMINING REGRESSION 10.4 LEAST SQUARE METHOD.
Chapter 13 Linear Regression and Correlation. Our Objectives  Draw a scatter diagram.  Understand and interpret the terms dependent and independent.
Correlation and Linear Regression
Correlation and Regression
Inference about the slope parameter and correlation
Correlation and Linear Regression
Regression and Correlation
Chapter 14 Inference on the Least-Squares Regression Model and Multiple Regression.
10.2 Regression If the value of the correlation coefficient is significant, the next step is to determine the equation of the regression line which is.
Correlation and Simple Linear Regression
Correlation and Regression
Simple Linear Regression
Statistics for Business and Economics (13e)
Slides by JOHN LOUCKS St. Edward’s University.
CHAPTER 10 Correlation and Regression (Objectives)
Correlation and Regression
Coefficient of Determination and Standard Error of the Estimate.
PENGOLAHAN DAN PENYAJIAN
Statistical Inference about Regression
Correlation and Regression
SIMPLE LINEAR REGRESSION
CHAPTER 12 More About Regression
Product moment correlation
SIMPLE LINEAR REGRESSION
Chapter Thirteen McGraw-Hill/Irwin
St. Edward’s University
Presentation transcript:

Correlation and Regression 11-1 Chapter 11 Correlation and Regression

Outline 11-1 Introduction 11-2 Scatter Plots 11-3 Correlation 11-4 Regression

Outline 11-3 11-5 Coefficient of Determination and Standard Error of Estimate

Objectives Draw a scatter plot for a set of ordered pairs. 11-4 Draw a scatter plot for a set of ordered pairs. Find the correlation coefficient. Test the hypothesis H0:  = 0. Find the equation of the regression line.

Objectives Find the coefficient of determination. 11-5 Find the coefficient of determination. Find the standard error of estimate. Find a prediction interval.

11-2 Scatter Plots 11-6 A scatter plot is a graph of the ordered pairs (x, y) of numbers consisting of the independent variable, x, and the dependent variable, y.

11-2 Scatter Plots - Example 11-7 Construct a scatter plot for the data obtained in a study of age and systolic blood pressure of six randomly selected subjects. The data is given on the next slide.

11-2 Scatter Plots - Example 11-8

11-2 Scatter Plots - Example 11-9 Positive Relationship

11-2 Scatter Plots - Other Examples 11-10 Negative Relationship

11-2 Scatter Plots - Other Examples 11-11 11-2 Scatter Plots - Other Examples 7 6 5 4 3 2 1 x y 7 6 5 4 3 2 1 X Y No Relationship

11-3 Correlation Coefficient 11-12 The correlation coefficient computed from the sample data measures the strength and direction of a relationship between two variables. Sample correlation coefficient, r. Population correlation coefficient, 

11-3 Range of Values for the Correlation Coefficient 11-13 Strong negative relationship No linear relationship Strong positive relationship   

11-3 Formula for the Correlation Coefficient r 11-14       n  xy   x  y r                n  x  x 2 2 n  y  2 y 2 Where n is the number of data pairs

11-3 Correlation Coefficient - Example (Verify) 11-15 Compute the correlation coefficient for the age and blood pressure data.

11-3 The Significance of the Correlation Coefficient 11-16 The population corelation coefficient, , is the correlation between all possible pairs of data values (x, y) taken from a population.

11-3 The Significance of the Correlation Coefficient 11-17 H0: = 0 H1:  0 This tests for a significant correlation between the variables in the population.

11-3 Formula for the t tests for the Correlation Coefficient 11-18  n 2  t  1 r 2   with d . f . n 2

11-3 Example 11-19 Test the significance of the correlation coefficient for the age and blood pressure data. Use  = 0.05 and r = 0.897. Step 1: State the hypotheses. H0: = 0 H1:  0

11-3 Example 11-20 Step 2: Find the critical values. Since  = 0.05 and there are 6 – 2 = 4 degrees of freedom, the critical values are t = +2.776 and t = –2.776. Step 3: Compute the test value. t = 4.059 (verify).

11-3 Example 11-21 Step 4: Make the decision. Reject the null hypothesis, since the test value falls in the critical region (4.059 > 2.776). Step 5: Summarize the results. There is a significant relationship between the variables of age and blood pressure.

11-4 Regression 11-22 The scatter plot for the age and blood pressure data displays a linear pattern. We can model this relationship with a straight line. This regression line is called the line of best fit or the regression line. The equation of the line is y  = a + bx.

11-4 Formulas for the Regression Line y  = a + bx. 11-23           y  x   2 x xy a        n  x  2 x 2       n  xy   x  y  b     n   x  2 2 x Where a is the y  intercept and b is the slope of the line.

11-4 Example 11-24 Find the equation of the regression line for the age and the blood pressure data. Substituting into the formulas give a = 81.048 and b = 0.964 (verify). Hence, y  = 81.048 + 0.964x. Note, a represents the intercept and b the slope of the line.

11-4 Example 11-25 y  = 81.048 + 0.964x

11-4 Using the Regression Line to Predict 11-26 The regression line can be used to predict a value for the dependent variable (y) for a given value of the independent variable (x). Caution: Use x values within the experimental region when predicting y values.

11-4 Example 11-27 Use the equation of the regression line to predict the blood pressure for a person who is 50 years old. Since y  = 81.048 + 0.964x, then y  = 81.048 + 0.964(50) = 129.248 129. Note that the value of 50 is within the range of x values.

11-5 Coefficient of Determination and Standard Error of Estimate 11-28 The coefficient of determination, denoted by r2, is a measure of the variation of the dependent variable that is explained by the regression line and the independent variable.

11-5 Coefficient of Determination and Standard Error of Estimate 11-29 r2 is the square of the correlation coefficient. The coefficient of nondetermination is (1 – r2). Example: If r = 0.90, then r2 = 0.81.

11-5 Coefficient of Determination and Standard Error of Estimate 11-30 The standard error of estimate, denoted by sest, is the standard deviation of the observed y values about the predicted y  values. The formula is given on the next slide.

11-5 Formula for the Standard Error of Estimate 11-31    y  y 2   s  est n 2 or  y  a  y  2 b  xy s  n  est 2

11-5 Standard Error of Estimate - Example 11-32 From the regression equation, y  = 55.57 + 8.13x and n = 6, find sest. Here, a = 55.57, b = 8.13, and n = 6. Substituting into the formula gives sest = 6.48 (verify).

11-5 Prediction Interval 11-33 A prediction interval is an interval constructed about a predicted y value, y , for a specified x value.

11-5 Prediction Interval 11-34 For given  value, we can state with (1 – )100% confidence that the interval will contain the actual mean of the y values that correspond to the given value of x.

11-5 Formula for the Prediction Interval about a Value y 11-35 2 1 n ( x - X ) y ¢ - t s 1 + + a 2 2 est n 2 ( ) n å x - å x 2 1 n ( x - X ) y ¢ + t s 1 + + a 2 est n 2 2 ( ) n å x - å x with d . f .  n  2

11-5 Prediction interval - Example 11-36 A researcher collects the data shown on the next slide and determines that there is a significant relationship between the age of a copy machine and its monthly maintenance cost. The regression equation is y  = 55.57 + 8.13x. Find the 95% prediction interval for the monthly maintenance cost of a machine that is 3 years old.

11-5 Prediction Interval - Example 11-37 A 1 $62 B 2 $78 C 3 $70 D 4 $90 E 4 $93 F 6 $103

11-5 Prediction Interval - Example 11-38 Step 1: Find x, x2 and . x = 20, x2 = 82, Step 2: Find y  for x = 3. y  = 55.57 + 8.13(3) = 79.96 Step 3: Find sest sest = 6.48 as shown in previous example.

11-5 Prediction Interval - Example 11-39 Step 4: Substitute in the formula and solve. t/2 = 2.776, d.f. = 6 – 2 = 4 for 95% 60.53 < y < 99.39 (verify) Hence, one can be 95% confident that the interval 60.53 < y < 99.39 contains the actual value of y.