Unit 4 Lesson 3 (5.3) Summarizing Bivariate Data 5.3: LSRL.

Slides:



Advertisements
Similar presentations
Warm up Use calculator to find r,, a, b. Chapter 8 LSRL-Least Squares Regression Line.
Advertisements

1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Summarizing Bivariate Data Introduction to Linear Regression.
LSRL Least Squares Regression Line
CHAPTER 3 Describing Relationships
Least Squares Regression Line (LSRL)
Linear Regression.
Least-Squares Regression Section 3.3. Why Create a Model? There are two reasons to create a mathematical model for a set of bivariate data. To predict.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Section 5.2: Linear Regression: Fitting a Line to Bivariate Data.
Chapter 3 Section 3.1 Examining Relationships. Continue to ask the preliminary questions familiar from Chapter 1 and 2 What individuals do the data describe?
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
3.2 Least Squares Regression Line. Regression Line Describes how a response variable changes as an explanatory variable changes Formula sheet: Calculator.
Scatterplot and trendline. Scatterplot Scatterplot explores the relationship between two quantitative variables. Example:
Chapters 8 & 9 Linear Regression & Regression Wisdom.
Chapter 5 Residuals, Residual Plots, & Influential points.
Chapter 5 Residuals, Residual Plots, Coefficient of determination, & Influential points.
A medical researcher wishes to determine how the dosage (in mg) of a drug affects the heart rate of the patient. DosageHeart rate
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.2 Least-Squares.
Examining Bivariate Data Unit 3 – Statistics. Some Vocabulary Response aka Dependent Variable –Measures an outcome of a study Explanatory aka Independent.
CHAPTER 5 Regression BPS - 5TH ED.CHAPTER 5 1. PREDICTION VIA REGRESSION LINE NUMBER OF NEW BIRDS AND PERCENT RETURNING BPS - 5TH ED.CHAPTER 5 2.
 Find the Least Squares Regression Line and interpret its slope, y-intercept, and the coefficients of correlation and determination  Justify the regression.
Chapter 3-Examining Relationships Scatterplots and Correlation Least-squares Regression.
Summarizing Bivariate Data
 Chapter 3! 1. UNIT 7 VOCABULARY – CHAPTERS 3 & 14 2.
POD 09/19/ B #5P a)Describe the relationship between speed and pulse as shown in the scatterplot to the right. b)The correlation coefficient, r,
Linear Regression Day 1 – (pg )
^ y = a + bx Stats Chapter 5 - Least Squares Regression
LEAST-SQUARES REGRESSION 3.2 Least Squares Regression Line and Residuals.
CHAPTER 3 Describing Relationships
Least Squares Regression Lines Text: Chapter 3.3 Unit 4: Notes page 58.
Chapter 7 Linear Regression. Bivariate data x – variable: is the independent or explanatory variable y- variable: is the dependent or response variable.
Chapter 5 Lesson 5.2 Summarizing Bivariate Data 5.2: LSRL.
Chapters 8 Linear Regression. Correlation and Regression Correlation = linear relationship between two variables. Summarize relationship with line. Called.
CHAPTER 5: Regression ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Unit 3 Correlation. Homework Assignment For the A: 1, 5, 7,11, 13, , 21, , 35, 37, 39, 41, 43, 45, 47 – 51, 55, 58, 59, 61, 63, 65, 69,
Describing Relationships. Least-Squares Regression  A method for finding a line that summarizes the relationship between two variables Only in a specific.
Least Squares Regression Textbook section 3.2. Regression LIne A regression line describes how the response variable (y) changes as an explanatory variable.
Describing Bivariate Relationships. Bivariate Relationships When exploring/describing a bivariate (x,y) relationship: Determine the Explanatory and Response.
1. Analyzing patterns in scatterplots 2. Correlation and linearity 3. Least-squares regression line 4. Residual plots, outliers, and influential points.
Chapter 3 LSRL. Bivariate data x – variable: is the independent or explanatory variable y- variable: is the dependent or response variable Use x to predict.
Chapter 5 LSRL. Bivariate data x – variable: is the independent or explanatory variable y- variable: is the dependent or response variable Use x to predict.
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Unit 4 LSRL.
LSRL.
Least Squares Regression Line.
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 5 LSRL.
LSRL Least Squares Regression Line
Chapter 4 Correlation.
Chapter 3.2 LSRL.
Least Squares Regression Line LSRL Chapter 7-continued
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3 Describing Relationships Section 3.2
Do Now What is the predicted English score when the math score is 10?
Chapter 5 LSRL.
Chapter 5 LSRL.
Chapter 5 LSRL.
Least-Squares Regression
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
A medical researcher wishes to determine how the dosage (in mg) of a drug affects the heart rate of the patient. Find the correlation coefficient & interpret.
CHAPTER 3 Describing Relationships
Presentation transcript:

Unit 4 Lesson 3 (5.3) Summarizing Bivariate Data 5.3: LSRL

b – is the slope –it is the approximate amount by which y increases when x increases by 1 unit a – is the y-intercept –it is the approximate height of the line when x = 0 –in some situations, the y-intercept has no meaning The LSRL is - (y-hat) means the predicted y Be sure to put the hat on the y Scatterplots frequently exhibit a linear pattern. When this is the case, it makes sense to summarize the relationship between the variables by finding a line that is as close as possible to the plots in the plot. This is done by calculating the line of best fit or Least Square Regression Line (LSRL). minimizes The LSRL is the line that minimizes the sum of the squares of the deviations from the line Let ’ s explore what this means...

(3,10) (6,2) Sum of the squares = y =.5(0) + 4 = 4 0 – 4 = -4 (0,0) y =.5(3) + 4 = – 5.5 = 4.5 y =.5(6) + 4 = 7 2 – 7 = -5 Suppose we have a data set that consists of the observations (0,0), (3,10) and 6,2). Let ’ s just fit a line to the data by drawing a line through what appears to be the middle of the points. Now find the vertical distance from each point to the line. Find the sum of the squares of these deviations (aka Residuals.)

(0,0) (3,10) (6,2) Sum of the squares = 54 Use a calculator to find the line of best fit Find the vertical deviations from the line -3 6 What is the sum of the deviations from the line? Will it always be zero? minimizes LSRL The line that minimizes the sum of the squares of the deviations from the line is the LSRL. Find the sum of the squares of the deviations from the line

Researchers are studying pomegranate's antioxidants properties to see if it might be helpful in the treatment of cancer. In one study, mice were injected with cancer cells and randomly assigned to one of three groups, plain water, water supplemented with.1% pomegranate fruit extract (PFE), and water supplemented with.2% PFE. The average tumor volume for mice in each group was recorded for several points in time. (x = number of days after injection of cancer cells in mice assigned to plain water and y = average tumor volume (in mm 3 ) x y Sketch a scatterplot for this data set.

Pomegranate study continued x = number of days after injection of cancer cells in mice assigned to plain water and y = average tumor volume x y Calculate the LSRL and the correlation coefficient. Interpret the slope and the correlation coefficient in context. The average volume of the tumor increases by approximately mm 3 for each day increase in the number of days after injection. Remember that an interpretation is stating the definition in context. There is a strong, positive, linear relationship between the average tumor volume and the number of days since injection. Does the intercept have meaning in this context? Why or why not?

Pomegranate study continued x = number of days after injection of cancer cells in mice assigned to plain water and y = average tumor volume x y Predict the average volume of the tumor for 20 days after injection. Predict the average volume of the tumor for 5 days after injection. Can volume be negative? This is the danger of extrapolation. The least- squares line should not be used to make predictions for y using x-values outside the range in the data set. Why? It is unknown whether the pattern observed in the scatterplot continues outside the range of x- values.

Extrapolation (cont.) A regression of mean age at first marriage for men vs. year fit to the first 4 decades of the 20 th century does not hold for later years:

Pomegranate study continued x = number of days after injection of cancer cells in mice assigned to plain water and y = average tumor volume x y Suppose we want to know how many days after injection of cancer cells would the average tumor size be 500 mm 3 ? Is this the appropriate regression line to answer this question? No, the slope of the line for predicting x is not and the intercepts are almost always different. Here is the appropriate regression line: The regression line of y on x should not be used to predict x, because it is not the line that minimizes the sum of the squared deviations in the x direction.

x = number of days after injection of cancer cells in mice assigned to plain water and y = average tumor volume x y Minitab, a statistical software package, was used to fit the least-squares regression line. Part of the resulting output is shown below. The regression equation is Predicted volume = days PredictorCoefSE CoefTP Constant Days intercept slope We will discuss what these numbers mean in the Chapter 13.

Homework Pg.174: #5.26, 5.27, 5.29, 5.33