Presentation is loading. Please wait.

Presentation is loading. Please wait.

AP Statistics Chapter 3 Practice Problems

Similar presentations


Presentation on theme: "AP Statistics Chapter 3 Practice Problems"— Presentation transcript:

1 AP Statistics Chapter 3 Practice Problems
Linear Regression

2 Linear Regression A diver is investigating a wreck under the water and has to come up to the surface slowly. The following is a chart detailing his depth from the time he starts ascending. Time (sec) Depth (ft) 240 210 155 30 225 280 185 60 203 330 130 100 189 360 125 140 180 390 120 164

3 Linear Regression What is the explanatory variable? What is the response? We wish to perform regression on the data. What are the three conditions that we must check before we attempt to do regression? The data is quantitative The data is linear There are no outliers

4 Linear Regression Graph the scatterplot and determine if linear regression seems appropriate. No, regression doesn’t seem appropriate because there appears to be an outlier Which point is the outlier? At time 280 seconds, he was at depth of 185 feet.

5 Linear Regression Determine the equation for the Least Squares Regression Line (LSRL). Describe the association. There appears to be a strong, negative, linear association between time and depth, but there appears to be an outlier Determine the correlation. r = Eliminate the outlier. What is the new correlation? Why does it change? r = ; it’s stronger because the residuals are smaller

6 Linear Regression Determine the equation for the Least Squares Regression Line (LSRL) without the outlier. Explain the meaning of the slope of the line. For every 1 second increase in time, our model predicts an average decrease of feet in depth. Explain the meaning of the b0 in context of this problem At a time of 0 seconds, our model predicts a depth of feet. Although this makes sense, we know that the diver began to ascend at a depth of 240 ft.

7 Linear Regression Describe the relationship between time and depth using r2 to make your description more precise. Since r = , r2 = 0.960 96% of the variation in the depth can be explained by the approximate linear relationship with the time.

8 Linear Regression Using the modified model, predict the depth of the diver at each of the following times and comment on the confidence of your prediction: 2 min. 50 sec. ≈ 176 feet 5 min. ≈ 138 feet 6 min. 30 sec. What is the residual at this time? ≈ 112 feet. The residual is 120 – 112 = 8 (or if we use the calculator ≈ 8.27) 10 min. ≈ 50.4 feet

9 Linear Regression Using the following summary statistics of a statistics class, determine the LSRL (assume that IQ is the explanatory variable):

10 Linear Regression Using the following summary statistics of a statistics class, determine the LSRL (assume that IQ is the explanatory variable):

11 Linear Regression With an LSRL of: Interpret b0 Interpret b1
With an IQ of 0, our model predicts an SAT score of This make absolutely no sense. You can’t have an IQ of 0! Interpret b1 For every increase of 1 point in IQ, our model predicts an average increase of point on the SAT.

12 Linear Regression With an LSRL of: Interpret r2
Since r = 0.893, r2 = 0.797 Approximately 80% of the variation in SAT score can be explained by the approximate linear relationship with the IQ.

13 Review Question A researcher uses a regression equation to predict home heating bills (dollar cost), based on home size (square feet). The correlation between predicted bills and home size is What is the correct interpretation of this finding? 70% of the variability in home heating bills can be explained by home size. 49% of the variability in home heating bills can be explained by home size. For each added square foot of home size, heating bills increased by 70 cents. For each added square foot of home size, heating bills increased by 49 cents. None of the above. The answer is b) since the coefficient of determination measures the proportion of variation in the dependent variable that is predictable from the independent variable.

14 Review Question A national consumer magazine reported the following correlations: The correlation between car weight and car reliability is The correlation between car weight and annual maintenance cost is 0.20. Which of the following statements are true? Heavier cars tend to be less reliable. Heavier cars tend to cost more to maintain. Car weight is related more strongly to reliability than to maintenance cost. I only II only III only I and II I, II, and III The answer is e) since reliability tends to decrease as car weight increases, costs tend to increase as car weight increases, and strength increases as correlation gets closer to ±1

15 Review Question In the context of regression analysis, which of the following statements are true? When the sum of the residuals is greater than zero, the data set is nonlinear A random pattern of residuals supports a linear model. A random pattern of residuals supports a non-linear model. I only II only III only I and II I and III The answer is b) since a random pattern of residuals supports a linear model; a non-random pattern supports a non-linear model. The sum of the residuals is always zero, whether the data set is linear or nonlinear


Download ppt "AP Statistics Chapter 3 Practice Problems"

Similar presentations


Ads by Google