Chapter 8 Linear Regression. Fat Versus Protein: An Example 30 items on the Burger King menu:

Slides:



Advertisements
Similar presentations
Chapter 3 Examining Relationships Lindsey Van Cleave AP Statistics September 24, 2006.
Advertisements

AP Statistics Section 3.2 C Coefficient of Determination
Chapter 8 Linear Regression
Least Squares Regression
Linear Regression.  The following is a scatterplot of total fat versus protein for 30 items on the Burger King menu:  The model won’t be perfect, regardless.
Linear Regression (C7-9 BVD). * Explanatory variable goes on x-axis * Response variable goes on y-axis * Don’t forget labels and scale * Statplot 1 st.
Chapter 8 Linear regression
Chapter 8 Linear regression
Linear Regression Copyright © 2010, 2007, 2004 Pearson Education, Inc.
Copyright © 2010 Pearson Education, Inc. Chapter 8 Linear Regression.
Chapter 8 Linear Regression.
Warm up Use calculator to find r,, a, b. Chapter 8 LSRL-Least Squares Regression Line.
Copyright © 2009 Pearson Education, Inc. Chapter 8 Linear Regression.
Chapter 8 Linear Regression © 2010 Pearson Education 1.
Statistics Residuals and Regression. Warm-up In northern cities roads are salted to keep ice from freezing on the roadways between 0 and -9.5 ° C. Suppose.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 8 Linear Regression.
Chapter 7 Linear Regression.
Scatter Diagrams and Linear Correlation
AP Statistics Mrs Johnson
Chapter 8: Linear Regression
Regression, Residuals, and Coefficient of Determination Section 3.2.
Chapter 8: Linear Regression
Correlation Correlation measures the strength of the LINEAR relationship between 2 quantitative variables. Labeled as r Takes on the values -1 < r < 1.
Ch4 Describing Relationships Between Variables. Section 4.1: Fitting a Line by Least Squares Often we want to fit a straight line to data. For example.
 The equation used to calculate Cab Fare is y = 0.75x where y is the cost and x is the number of miles traveled. 1. What is the slope in this equation?
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 8 Linear Regression.
Objective: Understanding and using linear regression Answer the following questions: (c) If one house is larger in size than another, do you think it affects.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 8 Linear Regression.
Chapter 8 Linear Regression *The Linear Model *Residuals *Best Fit Line *Correlation and the Line *Predicated Values *Regression.
Chapter 8 Linear Regression. Slide 8- 2 Fat Versus Protein: An Example The following is a scatterplot of total fat versus protein for 30 items on the.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 8 Linear Regression (3)
Chapter 8 Linear Regression
STA291 Statistical Methods Lecture LINEar Association o r measures “closeness” of data to the “best” line. What line is that? And best in what terms.
Examining Bivariate Data Unit 3 – Statistics. Some Vocabulary Response aka Dependent Variable –Measures an outcome of a study Explanatory aka Independent.
 Find the Least Squares Regression Line and interpret its slope, y-intercept, and the coefficients of correlation and determination  Justify the regression.
Chapter 8 Linear Regression HOW CAN A MODEL BE CREATED WHICH REPRESENTS THE LINEAR RELATIONSHIP BETWEEN TWO QUANTITATIVE VARIABLES?
Chapter 8 Linear Regression. Objectives & Learning Goals Understand Linear Regression (linear modeling): Create and interpret a linear regression model.
POD 09/19/ B #5P a)Describe the relationship between speed and pulse as shown in the scatterplot to the right. b)The correlation coefficient, r,
CHAPTER 8 Linear Regression. Residuals Slide  The model won’t be perfect, regardless of the line we draw.  Some points will be above the line.
AP Statistics HW: p. 165 #42, 44, 45 Obj: to understand the meaning of r 2 and to use residual plots Do Now: On your calculator select: 2 ND ; 0; DIAGNOSTIC.
Linear Regression Day 1 – (pg )
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 8- 1.
^ y = a + bx Stats Chapter 5 - Least Squares Regression
Linear Regression Chapter 8. Fat Versus Protein: An Example The following is a scatterplot of total fat versus protein for 30 items on the Burger King.
Chapters 8 Linear Regression. Correlation and Regression Correlation = linear relationship between two variables. Summarize relationship with line. Called.
Copyright © 2010 Pearson Education, Inc. Chapter 8 Linear Regression.
Copyright © 2009 Pearson Education, Inc. Chapter 8 Linear Regression.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 7, Slide 1 Chapter 7 Linear Regression.
Statistics 8 Linear Regression. Fat Versus Protein: An Example The following is a scatterplot of total fat versus protein for 30 items on the Burger King.
Honors Statistics Chapter 8 Linear Regression. Objectives: Linear model Predicted value Residuals Least squares Regression to the mean Regression line.
Part II Exploring Relationships Between Variables.
Training Activity 4 (part 2)
Sections Review.
CHAPTER 3 Describing Relationships
Finding the Best Fit Line
SCATTERPLOTS, ASSOCIATION AND RELATIONSHIPS
Chapter 8 Linear Regression.
Chapter 7 Linear Regression.
Chapter 8 – Linear Regression
1) A residual: a) is the amount of variation explained by the LSRL of y on x b) is how much an observed y-value differs from a predicted y-value c) predicts.
Finding the Best Fit Line
Chapter 8 Part 2 Linear Regression
^ y = a + bx Stats Chapter 5 - Least Squares Regression
Chapter 3 Describing Relationships Section 3.2
Warm-up: This table shows a person’s reported income and years of education for 10 participants. The correlation is .79. State the meaning of this correlation.
Chapter 8 Part 1 Linear Regression
Ch 4.1 & 4.2 Two dimensions concept
Section 3.2: Least Squares Regressions
Algebra Review The equation of a straight line y = mx + b
9/27/ A Least-Squares Regression.
Presentation transcript:

Chapter 8 Linear Regression

Fat Versus Protein: An Example 30 items on the Burger King menu:

linear model is an equation of a straight line through the data. – The points don’t all line up – straight line can summarize the general pattern

Residuals The scattered points are actual data, The corresponding values on the line are the predicted values, A residual is an error: difference between the actual and the predicted (the line)

A negative residual means the predicted value’s too big (an overestimate). A positive residual means the predicted value’s too small (an underestimate). A residual of zero means the line predicted exact

“Best Fit” Line We want the total residuals to be small as possible (minimizing error) The smaller the sum, the better the fit line Why the linear model is the line of best fit or regression line or least squares line The line always passes through the mean of the x variable and the mean of the y variable

Regression Line Algebra : Statistics: If the model is a good one, the data values will scatter closely around it.

slope (b 1 ) formula – In units of y per unit of x – Interpretation:

intercept (b 0 ) – In units of y – Usefulness of intercept:

Correlation and the Line Correlation coefficient tells us “how linear” So we will use r in the calculation of the slope Neg. Slope = Neg. Correlation Pos. Slope = Pos. Correlation Moving 1 st. dev. away from the mean in x moves us r st. dev. away from the mean in y (and visa versa) Ex: If you’re 1.5 st. dev. above avg. in GPA… Then how many st. dev. above on SAT?

Ways to find the Regression Line 1.Use the formulas for slope and intercept 2.Use raw data and run a stat -> calc -> LinReg 3.Use a given compute output

Burger King data fits the data well: – The equation is predict fat of a 30 g protein sandwich:

Check for same conditions: – Quantitative Variables – Straight Enough – No Outlier

To check whether a linear model is appropriate: 1.Look at the scatter plot of residuals vs. x var or y var (run a LinReg first) 2.If there is a pattern = linear model NOT appropriate 3.If there is NO pattern = linear model is appropriate

The residuals for the BK menu regression look appropriately boring:

The Residual St. Dev. The st. dev. of the residuals, s e, to be relatively small = spread/scatter around the line is small

R 2 —Coefficient of Strength It’s the square of r It’s a percent It’s between 0 and 100% The closer to 100 the stronger the linear model is at predicting Interpretation: “the % of the variation in y var that is explained by the x var” EX: Burger King Fat vs. Protein had R 2 = 69%.

Summary In order to use linear regression: 1.Check the original x, y scatter plot of straightness 2.Run a LinReg, then make a scatterplot of residuals vs x var….check for no pattern 3.Turn on diagnostic to check and run a LinReg to check R 2

Chapter 8 Assignment Pg: 192: #1-11 odd, odd, 27, 31, 35, 37, 45, 49