1. Analyzing patterns in scatterplots 2. Correlation and linearity 3. Least-squares regression line 4. Residual plots, outliers, and influential points.

Slides:



Advertisements
Similar presentations
Chapter 3 Bivariate Data
Advertisements

2nd Day: Bear Example Length (in) Weight (lb)
AP Statistics Chapters 3 & 4 Measuring Relationships Between 2 Variables.
LSRL Least Squares Regression Line
CHAPTER 3 Describing Relationships
Relationship of two variables
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
AP Statistics Chapter 8 & 9 Day 3
Chapter 3 Section 3.1 Examining Relationships. Continue to ask the preliminary questions familiar from Chapter 1 and 2 What individuals do the data describe?
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Regression Regression relationship = trend + scatter
Chapter 5 Residuals, Residual Plots, Coefficient of determination, & Influential points.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.2 Least-Squares.
Examining Bivariate Data Unit 3 – Statistics. Some Vocabulary Response aka Dependent Variable –Measures an outcome of a study Explanatory aka Independent.
CHAPTER 5 Regression BPS - 5TH ED.CHAPTER 5 1. PREDICTION VIA REGRESSION LINE NUMBER OF NEW BIRDS AND PERCENT RETURNING BPS - 5TH ED.CHAPTER 5 2.
Chapter 3-Examining Relationships Scatterplots and Correlation Least-squares Regression.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 3: Describing Relationships Section 3.2 Least-Squares Regression.
 Chapter 3! 1. UNIT 7 VOCABULARY – CHAPTERS 3 & 14 2.
LEAST-SQUARES REGRESSION 3.2 Least Squares Regression Line and Residuals.
CHAPTER 3 Describing Relationships
Unit 4 Lesson 3 (5.3) Summarizing Bivariate Data 5.3: LSRL.
Chapter 7 Linear Regression. Bivariate data x – variable: is the independent or explanatory variable y- variable: is the dependent or response variable.
Chapter 5 Lesson 5.2 Summarizing Bivariate Data 5.2: LSRL.
Unit 3 Correlation. Homework Assignment For the A: 1, 5, 7,11, 13, , 21, , 35, 37, 39, 41, 43, 45, 47 – 51, 55, 58, 59, 61, 63, 65, 69,
Describing Relationships. Least-Squares Regression  A method for finding a line that summarizes the relationship between two variables Only in a specific.
Chapter 3 LSRL. Bivariate data x – variable: is the independent or explanatory variable y- variable: is the dependent or response variable Use x to predict.
Chapter 5 LSRL. Bivariate data x – variable: is the independent or explanatory variable y- variable: is the dependent or response variable Use x to predict.
Warm-up Get a sheet of computer paper/construction paper from the front of the room, and create your very own paper airplane. Try to create planes with.
Unit 4 LSRL.
LSRL.
Least Squares Regression Line.
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Chapter 5 LSRL.
LSRL Least Squares Regression Line
Chapter 3.2 LSRL.
Least Squares Regression Line LSRL Chapter 7-continued
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Residuals, Residual Plots, & Influential points
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 5 LSRL.
Chapter 5 LSRL.
Chapter 5 LSRL.
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Presentation transcript:

1. Analyzing patterns in scatterplots 2. Correlation and linearity 3. Least-squares regression line 4. Residual plots, outliers, and influential points

 Often computer output and graphs are provided with bivariate data analysis questions, but you cannot be sure that these will be provided.  You need to know how to use your calculator to make a scatter plot, find the equation of a least-squares regression line, and make a residual plot.

 Comment on the direction, (positive or negative), shape (linear or curved), and strength of the relationship.  As always, IN CONTEXT.  Also, comment on unusual features (such as outliers).

 The Least Squares Regression Line (LSRL) passes through the point and has slope  When you write the equation of a LSRL, be sure to include the “hat” on the y-variable.  Be sure to identify the variables in the equation; that is, tell what x and y stand for in the problem.  You must be able to read computer output: the constant is the y-intercept, and the named quantity is the slope.

Always put your explanation in context Do NOT make a deterministic statement The y-intercept provides an estimate for the value of y when x is zero. The slope provides information about the estimated amount that the y- variable changes (or the amount that the y-variable changes on average) for each unit change in the x-variable.

 Suppose we took a series of measurements on students’ foot length (X) and height (Y). We suspect that there is a positive, linear relationship between these two variables.

Predictor Coef StDev t-ratio p-value Constant Foot < (“Height” is the dependent variable) * Here, the LSRL would be Y-hat = 1.24X , where X is the foot length and y-hat is the predicted height. * Slope Interpretation: “On average, for every one inch increase in foot length, there is a 1.24 inch increase in predicted height.”

 Residual = observed y-value – predicted y- value (Y minus Y-hat) Positive residuals are associated with points lying above the line; negative residuals are associated with points lying below the line. To determine whether the model is a good fit for the data, examine a residual plot : The residuals should be randomly scattered above and below the horizontal axis with no pattern showing curvature.

 Extrapolation is predicting outside the observed set of data and can be risky. Interpreting the y- intercept often is meaningless when it involves extrapolation.  Interpolation is predicting within the observed data.  An influential point is one that noticeably affects the equation of the least squares line when it is added to or removed from the data set.  An outlier is a point that noticeably stands apart from the other points.

The correlation coefficient (r) gives information about the strength and direction of the linear relationship between two variables. The correlation coefficient gives information about how tightly points are clustered about a line. Always between -1 and 1. Special values: r = 1, r = -1, r = 0. The correlation coefficient is sensitive to the effect of outliers. Correlation NEVER has units and is the same for (y, x), (x, y), (kx, y), (x, ky). Correlation IS NOT CAUSATION. A correlation value close to 1 or -1 does not guarantee that a linear model is appropriate.

 r 2 is the “Coefficient of Determination” – it tells what percent of the variation in the observed y-values is explained by the linear relationship with the x-variable.  Sometimes it’s just pronounced “r- squared”!  Be sure that you can interpret r 2 in context.

FOR EXAMPLE: Predictor Coef StDev t-ratio p-value Constant Foot < R-sq = “93.4% of the variation in height (the Y’s) can be explained by our linear model with foot length being the explanatory variable (the X’s).”