Regression, Residuals, and Coefficient of Determination Section 3.2.

Slides:



Advertisements
Similar presentations
AP Statistics Section 3.2 C Coefficient of Determination
Advertisements

AP Statistics Section 3.2 B Residuals
Least Squares Regression
2nd Day: Bear Example Length (in) Weight (lb)
CHAPTER 3 Describing Relationships
Chapter 5 Regression. Chapter 51 u Objective: To quantify the linear relationship between an explanatory variable (x) and response variable (y). u We.
Ch 3 – Examining Relationships YMS – 3.1
Lesson Least-Squares Regression. Knowledge Objectives Explain what is meant by a regression line. Explain what is meant by extrapolation. Explain.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
AP STATISTICS LESSON 3 – 3 LEAST – SQUARES REGRESSION.
Least-Squares Regression: Linear Regression Section 3.2 Reference Text: The Practice of Statistics, Fourth Edition. Starnes, Yates, Moore.
Section 2.2 Correlation A numerical measure to supplement the graph. Will give us an indication of “how closely” the data points fit a particular line.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
A medical researcher wishes to determine how the dosage (in mg) of a drug affects the heart rate of the patient. DosageHeart rate
Describing Bivariate Relationships Chapter 3 Summary YMS AP Stats Chapter 3 Summary YMS AP Stats.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.2 Least-Squares.
WARM-UP Do the work on the slip of paper (handout)
Examining Bivariate Data Unit 3 – Statistics. Some Vocabulary Response aka Dependent Variable –Measures an outcome of a study Explanatory aka Independent.
CHAPTER 5 Regression BPS - 5TH ED.CHAPTER 5 1. PREDICTION VIA REGRESSION LINE NUMBER OF NEW BIRDS AND PERCENT RETURNING BPS - 5TH ED.CHAPTER 5 2.
Chapter 3-Examining Relationships Scatterplots and Correlation Least-squares Regression.
Chapter 2 Examining Relationships.  Response variable measures outcome of a study (dependent variable)  Explanatory variable explains or influences.
SWBAT: Calculate and interpret the residual plot for a line of regression Do Now: Do heavier cars really use more gasoline? In the following data set,
A P STATISTICS LESSON 3 – 3 (DAY 3) A P STATISTICS LESSON 3 – 3 (DAY 3) RISIDUALS.
Residuals Recall that the vertical distances from the points to the least-squares regression line are as small as possible.  Because those vertical distances.
LEAST-SQUARES REGRESSION 3.2 Least Squares Regression Line and Residuals.
CHAPTER 3 Describing Relationships
LEAST-SQUARES REGRESSION 3.2 Role of s and r 2 in Regression.
AP Statistics Section 3.2 A Regression Lines. Linear relationships between two quantitative variables are quite common. Correlation measures the direction.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Chapters 8 Linear Regression. Correlation and Regression Correlation = linear relationship between two variables. Summarize relationship with line. Called.
Describing Relationships. Least-Squares Regression  A method for finding a line that summarizes the relationship between two variables Only in a specific.
Describing Bivariate Relationships. Bivariate Relationships When exploring/describing a bivariate (x,y) relationship: Determine the Explanatory and Response.
CHAPTER 3 Describing Relationships
Statistics 101 Chapter 3 Section 3.
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
AP Stats: 3.3 Least-Squares Regression Line
Ice Cream Sales vs Temperature
Least-Squares Regression
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Least Squares Regression
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Least-Squares Regression
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
3.2 – Least Squares Regression
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Section 3.2: Least Squares Regressions
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
A medical researcher wishes to determine how the dosage (in mg) of a drug affects the heart rate of the patient. Find the correlation coefficient & interpret.
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Least Squares Regression Chapter 3.2
Chapter 3: Describing Relationships
9/27/ A Least-Squares Regression.
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Correlation/Regression - part 2
Chapter 3: Describing Relationships
Presentation transcript:

Regression, Residuals, and Coefficient of Determination Section 3.2

Regression Lines

Interpreting the Regression Line  The slope b = tells us that fat gained goes down by kg for each added calorie of NEA according to this linear model. Our regression equation is the predicted RATE OF CHANGE in the response y as the explanatory variable x changes.  The y-intercept a = 3.505kg is the fat gain estimated by this model if NEA does not change when a person overeats.

Prediction  We can use a regression line to predict the response y for a specific value of the explanatory variable x.

Regression Line  In most cases, no line will pass exactly through all the points in a scatter plot and different people will draw different regression lines by eye.  Because we use the line to predict y from x, the prediction errors we make are errors in y, the vertical direction in the scatter plot  A good regression line makes the vertical distances of the points from the line as small as possible  Error: Observed response - predicted response

Least-Squares Regression Line

Least-Squares Regression Line (LSRL)  The least-squares regression line of y on x is the line that makes the sum of the squared vertical distances of the data points form the line as small as possible.  See Superhero Data in Fathom.  Create a Scatterplot of the Superhero data using your calculator.  Describe the direction, form, and strength.  Determine the LSRL.  Verify the slope using the formula b = r(Sy/Sx)

Residuals  A residual is the difference between an observed value of the response variable and the value predicted by the regression line.  Residual = observed y – predicted y  The error of our predictions, or vertical distance from predicted Y to observed Y, are called residuals because they are “left-over” variation in the response.

Residuals Example  Determine the residual value for Captain America.  Determine the residual value for The Black Widow.

Residual Plot  A residual plot is a scatterplot of the regression residuals against the explanatory variable.  Residual plots help assess how well a regression line fits the data.  The sum of the least-squares residuals is always zero.  The mean of the residuals is always zero, the horizontal line at zero in the figure helps orient us. This “residual = 0” line corresponds to the regression line.  Use your calculator to create a Residual Plot for the Superhero Data.

Examining a Residual Plot  Residual plot should show no obvious pattern. A curved pattern shows that the relationship is not linear and a straight line may not be the best model.  Residuals should be relatively small in size. A regression line in a model that fits the data well should come close” to most of the points.

R squared- Coefficient of Determination o If all the points fall directly on the least-squares line, r-squared = 1. Then all the variation in y is explained by the linear relationship with x. o So, if r-squared =.606, that means that 61% of the variation in y among individual subjects is due to the influence of the other variable. The other 39% is “not explained”. o R-squared is a measure of how successful the regression was in explaining the response. o Interpret the r-squared value for the Superhero data.