AP Statistics Section 3.2 C Coefficient of Determination

Slides:



Advertisements
Similar presentations
AP Statistics Section 3.2 B Residuals
Advertisements

Chapter 3: Describing Relationships
The Role of r2 in Regression Target Goal: I can use r2 to explain the variation of y that is explained by the LSRL. D4: 3.2b Hw: pg 191 – 43, 46, 48,
Definition  Regression Model  Regression Equation Y i =  0 +  1 X i ^ Given a collection of paired data, the regression equation algebraically describes.
CHAPTER 3 Describing Relationships
Regression, Residuals, and Coefficient of Determination Section 3.2.
Ch 3 – Examining Relationships YMS – 3.1
Section 3.2 Least-Squares Regression
Lesson Least-Squares Regression. Knowledge Objectives Explain what is meant by a regression line. Explain what is meant by extrapolation. Explain.
AP STATISTICS LESSON 3 – 3 LEAST – SQUARES REGRESSION.
Linear Regression Least Squares Method: the Meaning of r 2.
Objective: Understanding and using linear regression Answer the following questions: (c) If one house is larger in size than another, do you think it affects.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.2 Least-Squares.
The correlation coefficient, r, tells us about strength (scatter) and direction of the linear relationship between two quantitative variables. In addition,
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 3 Association: Contingency, Correlation, and Regression Section 3.3 Predicting the Outcome.
Residuals Recall that the vertical distances from the points to the least-squares regression line are as small as possible.  Because those vertical distances.
Simple Linear Regression The Coefficients of Correlation and Determination Two Quantitative Variables x variable – independent variable or explanatory.
The correlation coefficient, r, tells us about strength (scatter) and direction of the linear relationship between two quantitative variables. In addition,
Chapters 8 Linear Regression. Correlation and Regression Correlation = linear relationship between two variables. Summarize relationship with line. Called.
AP STATISTICS LESSON 3 – 3 (DAY 2) The role of r 2 in regression.
Describing Bivariate Relationships. Bivariate Relationships When exploring/describing a bivariate (x,y) relationship: Determine the Explanatory and Response.
CHAPTER 3 Describing Relationships
Describing Relationships
CHAPTER 3 Describing Relationships
LEAST – SQUARES REGRESSION
Statistics 101 Chapter 3 Section 3.
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Coefficient of Determination
AP Stats: 3.3 Least-Squares Regression Line
Least-Squares Regression
residual = observed y – predicted y residual = y - ŷ
AP STATISTICS LESSON 3 – 3 (DAY 2)
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Least-Squares Regression
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Least Squares Method: the Meaning of r2
Least-Squares Regression
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Least-Squares Regression
Least-Squares Regression
Chapter 3: Describing Relationships
Least-Squares Regression
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Warmup A study was done comparing the number of registered automatic weapons (in thousands) along with the murder rate (in murders per 100,000) for 8.
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Section 3.2: Least Squares Regressions
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Warm-up: Pg 197 #79-80 Get ready for homework questions
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
9/27/ A Least-Squares Regression.
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Presentation transcript:

AP Statistics Section 3.2 C Coefficient of Determination

coefficient of determination A residual plot is a graphical tool for evaluating how well a linear model fits the data. The numerical quantity that tells us how well the least-squares line (LSL) does at predicting values of the response variable y is called the __________________________ The symbol is ____. Some computer packages call it “_____”. coefficient of determination R-sq

We have seen instances where the least-squares regression line does not fit the data, and therefore, does not help predict the values of the response variable, y, as x changes. In such cases, our “best guess” for the value of y at any given value of x is simply ___, _____________________ the mean of the y values.

The idea of is this: How much better is the LSL at predictions then if we just used as our prediction each time?

Once again we consider the NEA vs Fat Gain example from section 3. 2 A Once again we consider the NEA vs Fat Gain example from section 3.2 A. The LSL and the lines have been drawn in the residual plot to the right. We would like to know which line comes closer to the actual y-values?

We know that the LSL minimizes the sum of the squared residuals We know that the LSL minimizes the sum of the squared residuals. For this data: We will call this ____, for sum of squared errors. SSE

If we use to make predictions, then our prediction errors would be the vertical distances of the points away from the horizontal line. For this data: _________ We will call this _____, for sum of squared total variation. SST

The difference SST-SSE (in this case ________ ) shows how much the LSL reduces the total variation in the responses y.

We define the coefficient of determination, r2, as the fraction of the variation in the values of y that is explained by the least-squares regression line. We can calculate r2 as follows:

For the NEA vs Fat Gain data:

We have already seen how to calculate r2 on our calculators (i. e We have already seen how to calculate r2 on our calculators (i.e. the same way we found r). Find r2 on your calculator for the NEA vs Fat Gain data.

A lot of factors, such as metabolism for example, affect the variation in the y-values. We can say _______ of the variation in fat gain is explained by the least-squares regression line relating fat gain and non-exercise activity. The other 39% is individual variation among the subjects that is not explained by the linear relationship.

Facts about Least-Squares Regression

The distinction between explanatory and response variables is essential in regression. This means we cannot reverse the roles of the two variables to make predictions. Be sure you know which variable is the explanatory.

There is a close connection between correlation and the slope of the least-squares line. We know . This equation says that along the regression line, a change in one standard deviation in x corresponds to a change of r standard deviations in y.

The least-squares regression line of y on x always passes through the point ( __, __ ).

The correlation r describes the strength of a straight-line relationship. In the regression setting, the square of the correlation, r2, is the fraction of the variation in the values of y that is explained by the least-squares regression of y on x.