Regression.

Slides:



Advertisements
Similar presentations
Chapter 12 Inference for Linear Regression
Advertisements

Linear Regression (LSRL)
Chapter 27 Inferences for Regression This is just for one sample We want to talk about the relation between waist size and %body fat for the complete population.
Inference for Regression
Regression Inferential Methods
CHAPTER 24: Inference for Regression
Objectives (BPS chapter 24)
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
Linear Regression and Correlation Analysis
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.
Chapter 12 Section 1 Inference for Linear Regression.
Correlation & Regression
Introduction to Linear Regression and Correlation Analysis
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Inference for regression - Simple linear regression
STA291 Statistical Methods Lecture 27. Inference for Regression.
Formulas: Hypothesis test: We would like to know if there is . The data on six-year graduation rate (%), student-related expenditure per full-time.
Inference for Regression Find your notes from last week, Put # of beers in L1 and BAC in L2, then find LinReg(ax+b)
Inferences for Regression
Confidence Intervals for the Regression Slope 12.1b Target Goal: I can perform a significance test about the slope β of a population (true) regression.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
Chapter 15 Inference for Regression
+ Chapter 12: Inference for Regression Inference for Linear Regression.
Regression. Height Weight How much would an adult female weigh if she were 5 feet tall? She could weigh varying amounts – in other words, there is a distribution.
Regression. Height Weight Suppose you took many samples of the same size from this population & calculated the LSRL for each. Using the slope from each.
Regression with Inference Notes: Page 231. Height Weight Suppose you took many samples of the same size from this population & calculated the LSRL for.
Chapter 14 Inference for Regression AP Statistics 14.1 – Inference about the Model 14.2 – Predictions and Conditions.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Inference for Regression Chapter 14. Linear Regression We can use least squares regression to estimate the linear relationship between two quantitative.
+ Chapter 12: More About Regression Section 12.1 Inference for Linear Regression.
Copyright ©2011 Brooks/Cole, Cengage Learning Inference about Simple Regression Chapter 14 1.
Lecture 10 Chapter 23. Inference for regression. Objectives (PSLS Chapter 23) Inference for regression (NHST Regression Inference Award)[B level award]
Chapter 12 Inference for Linear Regression. Reminder of Linear Regression First thing you should do is examine your data… First thing you should do is.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
Regression. Height Weight How much would an adult female weigh if she were 5 feet tall? She could weigh varying amounts – in other words, there is a distribution.
Chapter 26: Inference for Slope. Height Weight How much would an adult female weigh if she were 5 feet tall? She could weigh varying amounts – in other.
The Practice of Statistics Third Edition Chapter 15: Inference for Regression Copyright © 2008 by W. H. Freeman & Company.
BPS - 5th Ed. Chapter 231 Inference for Regression.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
Regression Inference. Height Weight How much would an adult male weigh if he were 5 feet tall? He could weigh varying amounts (in other words, there is.
Chapter 13 Lesson 13.2a Simple Linear Regression and Correlation: Inferential Methods 13.2: Inferences About the Slope of the Population Regression Line.
Chapter 15 Inference for Regression. How is this similar to what we have done in the past few chapters?  We have been using statistics to estimate parameters.
CHAPTER 12 More About Regression
CHAPTER 12 More About Regression
Inference for Regression (Chapter 14) A.P. Stats Review Topic #3
Regression.
AP Statistics Chapter 14 Section 1.
Regression Inferential Methods
Inferences for Regression
Inference for Regression
CHAPTER 12 More About Regression
Regression.
The Practice of Statistics in the Life Sciences Fourth Edition
CHAPTER 29: Multiple Regression*
Chapter 12 Regression.
Regression Inference.
Regression.
Inference for Regression
Regression.
Chapter 14 Inference for Regression
Regression Chapter 8.
Regression.
CHAPTER 12 More About Regression
Regression.
Chapter 14 Inference for Regression
Day 68 Agenda: 30 minute workday on Hypothesis Test --- you have 9 worksheets to use as practice Begin Ch 15 (last topic)
CHAPTER 12 More About Regression
Inferences for Regression
Inference for Regression
Presentation transcript:

Regression

What would you expect for other heights? Weight What would you expect for other heights? How much would an adult female weigh if she were 5 feet tall? This distribution is normally distributed. (we hope) She could weigh varying amounts – in other words, there is a distribution of weights for adult females who are 5 feet tall. What about the standard deviations of all these normal distributions? We want the standard deviations of all these normal distributions to be the same. Where would you expect the TRUE LSRL to be?

Regression Model The mean response my has a straight-line relationship with x: Where: slope b and intercept a are unknown parameters For any fixed value of x, the response y varies according to a normal distribution. Repeated responses of y are independent of each other. The standard deviation of y (sy) is the same for all values of x. (sy is also an unknown parameter)

What distribution does their weight have? Person # Ht Wt 1 64 130 10 175 15 150 19 125 21 145 40 186 47 121 60 137 63 143 68 120 70 112 78 108 83 160 Suppose we look at part of a population of adult women. These women are all 64 inches tall. What distribution does their weight have?

The slope b of the LSRL is an unbiased estimator of the true slope b. We use to estimate The slope b of the LSRL is an unbiased estimator of the true slope b. The intercept a of the LSRL is an unbiased estimator of the true intercept a. The standard error s is an unbiased estimator of the true standard deviation of y (sy). Note: df = n-2

Do sampling distribution of slopes activity & applet

Height Weight Suppose you took many samples of the same size from this population & calculated the LSRL for each. Using the slope from each of these LSRLs – we can create a sampling distribution for the slope of the true LSRL. What is the standard deviation of the sampling distribution? What is the mean of the sampling distribution equal? What shape will this distribution have? b b b b b b b mb = b

Let’s review the regression model! x & y have a linear relationship with the true LSRL going through the my sy is the same for each x-value. For a given x-value, the responses (y) are normally distributed

Assumptions for inference on slope The observations are independent Check that you have an SRS The true relationship is linear Check the scatter plot & residual plot The standard deviation of the response is constant. The responses vary normally about the true regression line. Check a histogram or boxplot of residuals

What is the slope of a horizontal line? Height Weight Suppose the LSRL has a horizontal line –would height be useful in predicting weight? A slope of zero – means that there is NO relationship between x & y!

Hypotheses Be sure to define b! H0: b = 0 1 Ha: b > 0 Ha: b < 0 This implies that there is no relationship between x & y Or that x should not be used to predict y What would the slope equal if there were a perfect relationship between x & y? H0: b = 0 Ha: b > 0 Ha: b < 0 Ha: b ≠ 0 1 Be sure to define b!

Because there are two unknowns a & b Formulas: Confidence Interval: Hypothesis test: df = n -2 Because there are two unknowns a & b

Body fat = -27.376 + 0.250 weight r = 0.697 r2 = 0.485 Example: It is difficult to accurately determine a person’s body fat percentage without immersing him or her in water. Researchers hoping to find ways to make a good estimate immersed 20 male subjects, and then measured their weights. Find the LSRL, correlation coefficient, and coefficient of determination. Body fat = -27.376 + 0.250 weight r = 0.697 r2 = 0.485

b) Explain the meaning of slope in the context of the problem. There is approximately .25% increase in body fat for every pound increase in weight. c) Explain the meaning of the coefficient of determination in context. Approximately 48.5% of the variation in body fat can be explained by the regression of body fat on weight.

a = -27.376 b = 0.25 s = 7.049 d) Estimate a, b, and s. e) Create a scatter plot and residual plot for the data. Weight Body fat Weight Residuals

f) Is there sufficient evidence that weight can be used to predict body fat? Assumptions: Have an SRS of male subjects Since the residual plot is randomly scattered, weight & body fat are linear Since the points are evenly spaced across the LSRL on the scatterplot, sy is approximately equal for all values of weight Since the boxplot of residual is approximately symmetrical, the responses are approximately normally distributed. H0: b = 0 Where b is the true slope of the LSRL of weight Ha: b ≠ 0 & body fat Since the p-value < a, I reject H0. There is sufficient evidence to suggest that weight can be used to predict body fat.

Be sure to show all graphs! g) Give a 95% confidence interval for the true slope of the LSRL. Assumptions: Have an SRS of male subjects Since the residual plot is randomly scattered, weight & body fat are linear Since the points are evenly spaced across the LSRL on the scatterplot, sy is approximately equal for all values of weight Since the boxplot of residual is approximately symmetrical, the responses are approximately normally distributed. We are 95% confident that the true slope of the LSRL of weight & body fat is between 0.12 and 0.38. Be sure to show all graphs!

What does “s” represent (in context)? h) Here is the computer-generated result from the data: Sample size: 20 R-square = 43.83% s = 7.0491323 df? What does “s” represent (in context)? Parameter Estimate Std. Err. Intercept -27.376263 11.547428 Weight 0.24987414 0.060653996 Correlation coeficient? Be sure to write as decimal first! What does this number represent? What do these numbers represent?