Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.

Slides:



Advertisements
Similar presentations
Copyright © 2010 Pearson Education, Inc. Slide
Advertisements

Inference for Regression
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
© 2010 Pearson Prentice Hall. All rights reserved Regression Interval Estimates.
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
Hypothesis Testing Using a Single Sample
© 2010 Pearson Prentice Hall. All rights reserved Two Sample Hypothesis Testing for Means from Independent Groups.
Two Sample Hypothesis Testing for Proportions
The Multiple Regression Model Prepared by Vera Tabakova, East Carolina University.
© 2010 Pearson Prentice Hall. All rights reserved Hypothesis Testing Using a Single Sample.
© 2010 Pearson Prentice Hall. All rights reserved Two Sample Hypothesis Testing for Means from Paired or Dependent Groups.
Chapter 10 Simple Regression.
The Simple Regression Model
SIMPLE LINEAR REGRESSION
Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc. Chapter Hypothesis Tests Regarding a Parameter 10.
Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc. Chapter Hypothesis Tests Regarding a Parameter 10.
Inferences About Process Quality
SIMPLE LINEAR REGRESSION
Chapter 9 Hypothesis Testing.
5-3 Inference on the Means of Two Populations, Variances Unknown
Simple Linear Regression and Correlation
Chapter 12 Section 1 Inference for Linear Regression.
Simple Linear Regression Analysis
Lecture 5 Correlation and Regression
SIMPLE LINEAR REGRESSION
Correlation and Regression
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 12 Analyzing the Association Between Quantitative Variables: Regression Analysis Section.
Chapter 13: Inference in Regression
Statistical inference: confidence intervals and hypothesis testing.
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Chapter Correlation and Regression 1 of 84 9 © 2012 Pearson Education, Inc. All rights reserved.
Section 10.1 ~ t Distribution for Inferences about a Mean Introduction to Probability and Statistics Ms. Young.
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 2 – Slide 1 of 25 Chapter 11 Section 2 Inference about Two Means: Independent.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Comparing Three or More Means 13.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on Categorical Data 12.
QMS 6351 Statistics and Research Methods Regression Analysis: Testing for Significance Chapter 14 ( ) Chapter 15 (15.5) Prof. Vera Adamchik.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Inference about Two Means: Independent Samples 11.3.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
1 Lecture 4 Main Tasks Today 1. Review of Lecture 3 2. Accuracy of the LS estimators 3. Significance Tests of the Parameters 4. Confidence Interval 5.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Inference for Regression Chapter 14. Linear Regression We can use least squares regression to estimate the linear relationship between two quantitative.
+ Chapter 12: More About Regression Section 12.1 Inference for Linear Regression.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.3 Using Multiple Regression to Make Inferences.
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Interval Estimation and Hypothesis Testing Prepared by Vera Tabakova, East Carolina University.
Statistics for Business and Economics 8 th Edition Chapter 11 Simple Regression Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Overview.
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Applied Quantitative Analysis and Practices LECTURE#25 By Dr. Osman Sadiq Paracha.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 1 – Slide 1 of 26 Chapter 11 Section 1 Inference about Two Means: Dependent Samples.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
© 2010 Pearson Prentice Hall. All rights reserved Chapter Hypothesis Tests Regarding a Parameter 10.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 3 – Slide 1 of 27 Chapter 11 Section 3 Inference about Two Population Proportions.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Multiple Regression Chapter 14.
Lesson Testing the Significance of the Least Squares Regression Model.
Copyright © 2009 Pearson Education, Inc t LEARNING GOAL Understand when it is appropriate to use the Student t distribution rather than the normal.
© 2010 Pearson Prentice Hall. All rights reserved Chapter Hypothesis Tests Regarding a Parameter 10.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Hypothesis Tests Regarding a Parameter 10.
© 2010 Pearson Prentice Hall. All rights reserved Chapter Hypothesis Tests Regarding a Parameter 10.
Chapter 14 Inference on the Least-Squares Regression Model and Multiple Regression.
Inference for Regression (Chapter 14) A.P. Stats Review Topic #3
CHAPTER 29: Multiple Regression*
Chapter 12 Inference on the Least-squares Regression Line; ANOVA
Elementary Statistics
Statistical Inference about Regression
Presentation transcript:

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Requirement 1 for Inference on the Least-Squares Regression Model For any particular value of the explanatory variable x, the mean of the corresponding responses in the population depends linearly on x. That is, for some numbers β 0 and β 1, where μ y|x represents the population mean response when the value of the explanatory variable is x. 14-2

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Requirement 2 for Inference on the Least-Squares Regression Model The response variables are normally distributed with mean and standard deviation σ. 14-3

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. “In Other Words” When doing inference on the least-squares regression model, we require (1) for any explanatory variable, x, the mean of the response variable, y, depends on the value of x through a linear equation, and (2) the response variable, y, is normally distributed with a constant standard deviation, σ. The mean increases/decreases at a constant rate depending on the slope, while the standard deviation remains constant. 14-4

Copyright © 2013, 2010 and 2007 Pearson Education, Inc A large value of σ, the population standard deviation, indicates that the data are widely dispersed about the regression line, and a small value of σ indicates that the data lie fairly close to the regression line.

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. The least-squares regression model is given by where y i is the value of the response variable for the i th individual β 0 and β 1 are the parameters to be estimated based on sample data x i is the value of the explanatory variable for the i th individual ε i is a random error term with mean 0 an variance, the error terms are independent. i =1,…,n, where n is the sample size (number of ordered pairs in the data set) 14-6

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. The standard error of the estimate, s e, is found using the formula 14-7

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. 14-8

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. CAUTION! Be sure to divide by n – 2 when computing the standard error of the estimate. 14-9

Copyright © 2013, 2010 and 2007 Pearson Education, Inc

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Hypothesis Test Regarding the Slope Coefficient, β 1 To test whether two quantitative variables are linearly related, we use the following steps provided that 1.the sample is obtained using random sampling. 2.the residuals are normally distributed with constant error variance

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Step 1: Determine the null and alternative hypotheses. The hypotheses can be structured in one of three ways: Step 2: Select a level of significance, α, depending on the seriousness of making a Type I error. Two-tailedLeft-TailedRight-Tailed H 0 : β 1 = 0 H 1 : β 1 ≠ 0H 1 : β 1 < 0H 1 : β 1 >

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Step 3: Compute the test statistic which follows Student’s t-distribution with n – 2 degrees of freedom. Remember, when computing the test statistic, we assume the null hypothesis to be true. So, we assume that β 1 = 0. Use Table VI to determine the critical value using n – 2 degrees of freedom Classical Approach

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Step 4: Compare the critical value with the test statistic. Classical Approach 14-14

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. By Hand Step 3: Compute the test statistic which follows Student’s t-distribution with n – 2 degrees of freedom. Use Table VI to approximate the P-value P-Value Approach

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Step 4: If the P-value < α, reject the null hypothesis. P-Value Approach 14-16

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. CAUTION! Before testing H 0 : β 1 = 0, be sure to draw a residual plot to verify that a linear model is appropriate

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Parallel Example 5: Testing for a Linear Relation Test the claim that there is a linear relation between drill depth and drill time at the α = 0.05 level of significance using the drilling data

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Solution Verify the requirements: We assume that the experiment was randomized so that the data can be assumed to represent a random sample. In Parallel Example 4 we confirmed that the residuals were normally distributed by constructing a normal probability plot. To verify the requirement of constant error variance, we plot the residuals against the explanatory variable, drill depth

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. There is no discernable pattern

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Solution Step 1: We want to determine whether a linear relation exists between drill depth and drill time without regard to the sign of the slope. This is a two-tailed test with H 0 : β 1 = 0 versus H 1 : β 1 ≠ 0 Step 2: The level of significance is α = Step 3: Using technology, we obtained an estimate of β 1 in Parallel Example 2, b 1 = To determine the standard deviation of b 1, we compute. The calculations are on the next slide

Copyright © 2013, 2010 and 2007 Pearson Education, Inc

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Solution Step 3, cont’d: We have The test statistic is 14-23

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Solution: Classical Approach Step 3: cont’d Since this is a two-tailed test, we determine the critical t-values at the α = 0.05 level of significance with n – 2 = 12 – 2 = 10 degrees of freedom to be –t = –2.228 and t = Step 4: Since the value of the test statistic, 3.867, is greater than 2.228, we reject the null hypothesis

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Solution: P-Value Approach Step 3: Since this is a two-tailed test, the P-value is the sum of the area under the t-distribution with 12 – 2 = 10 degrees of freedom to the left of –t 0 = –3.867 and to the right of t 0 = Using Table VI we find that with 10 degrees of freedom, the value is between and corresponding to right-tail areas of and 0.001, respectively. Thus, the P-value is between and Step 4: Since the P-value is less than the level of significance, 0.05, we reject the null hypothesis

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Solution Step 5: There is sufficient evidence at the α = 0.05 level of significance to conclude that a linear relation exists between drill depth and drill time

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Confidence Intervals for the Slope of the Regression Line A (1 – α)100% confidence interval for the slope of the true regression line, β 1, is given by the following formulas: Lower bound: Upper bound: Here, t a/2 is computed using n – 2 degrees of freedom

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Note: The confidence interval formula for β 1 can be computed only if the data are randomly obtained, the residuals are normally distributed, and there is constant error variance

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Parallel Example 7: Constructing a Confidence Interval for the Slope of the True Regression Line Construct a 95% confidence interval for the slope of the least-squares regression line for the drilling example

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Solution The requirements for the usage of the confidence interval formula were verified in previous examples. We also determined b 1 = in previous examples

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Solution Since t = for 10 degrees of freedom, we have Lower bound = – = Upper bound = = We are 95% confident that the mean increase in the time it takes to drill 5 feet for each additional foot of depth at which the drilling begins is between and minutes

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Confidence and Prediction Intervals 14.2

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Confidence intervals for a mean response are intervals constructed about the predicted value of y, at a given level of x, that are used to measure the accuracy of the mean response of all the individuals in the population. Prediction intervals for an individual response are intervals constructed about the predicted value of y that are used to measure the accuracy of a single individual’s predicted value

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Confidence Interval for the Mean Response of y,. A (1 – α)100% confidence interval for, the mean response of y for a specified value of x, is given by Lower bound: Upper bound: where x* is the given value of the explanatory variable, n is the number of observations, and t α/2 is the critical value with n – 2 degrees of freedom

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Parallel Example 1: Constructing a Confidence Interval for a Mean Response Construct a 95% confidence interval about the predicted mean time to drill 5 feet for all drillings started at a depth of 110 feet

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Solution The least squares regression line is. To find the predicted mean time to drill 5 feet for all drillings started at 110 feet, let x * =110 in the regression equation and obtain. Recall: s e = t = for 10 degrees of freedom 14-36

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Solution Therefore, Lower bound: Upper bound: 14-37

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Solution We are 95% confident that the mean time to drill 5 feet for all drillings started at a depth of 110 feet is between 6.45 and 7.15 minutes

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Prediction Interval for an Individual Response about A (1 – α)100% prediction interval for, the individual response of y, is given by Lower bound: Upper bound: where x* is the given value of the explanatory variable, n is the number of observations, and t α/2 is the critical value with n – 2 degrees of freedom

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Parallel Example 2: Constructing a Prediction Interval for an Individual Response Construct a 95% prediction interval about the predicted time to drill 5 feet for a single drilling started at a depth of 110 feet

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Solution The least squares regression line is. To find the predicted mean time to drill 5 feet for all drillings started at 110 feet, let x * =110 in the regression equation and obtain. Recall: s e = t =2.228 for 10 degrees of freedom 14-41

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Solution Therefore, Lower bound: Upper bound: 14-42

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Solution We are 95% confident that the time to drill 5 feet for a random drilling started at a depth of 110 feet is between 5.59 and 8.01 minutes