Regression BPS chapter 5 © 2006 W.H. Freeman and Company.

Slides:



Advertisements
Similar presentations
Chapter 3 Examining Relationships Lindsey Van Cleave AP Statistics September 24, 2006.
Advertisements

MATH 2400 Chapter 5 Notes. Regression Line Uses data to create a linear equation in the form y = ax + b where “a” is the slope of the line (unit rate.
Chapter 3 Bivariate Data
Chapter 6: Exploring Data: Relationships Lesson Plan
2nd Day: Bear Example Length (in) Weight (lb)
Warm up Use calculator to find r,, a, b. Chapter 8 LSRL-Least Squares Regression Line.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. Relationships Between Quantitative Variables Chapter 5.
AP Statistics Chapters 3 & 4 Measuring Relationships Between 2 Variables.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
Chapter 2: Looking at Data - Relationships /true-fact-the-lack-of-pirates-is-causing-global-warming/
LINEAR REGRESSION: What it Is and How it Works. Overview What is Bivariate Linear Regression? The Regression Equation How It’s Based on r Assumptions.
Relationships Between Quantitative Variables
Looking at Data-Relationships 2.1 –Scatter plots.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
Chapter Topics Types of Regression Models
Linear Regression and Correlation Analysis
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Regression, Residuals, and Coefficient of Determination Section 3.2.
Correlation & Regression
Objectives (BPS chapter 5)
Chapter 5 Regression. Chapter outline The least-squares regression line Facts about least-squares regression Residuals Influential observations Cautions.
Introduction to Linear Regression and Correlation Analysis
Chapter 6: Exploring Data: Relationships Lesson Plan Displaying Relationships: Scatterplots Making Predictions: Regression Line Correlation Least-Squares.
Lesson Least-Squares Regression. Knowledge Objectives Explain what is meant by a regression line. Explain what is meant by extrapolation. Explain.
Anthony Greene1 Regression Using Correlation To Make Predictions.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
AP STATISTICS LESSON 3 – 3 LEAST – SQUARES REGRESSION.
1 Chapter 10 Correlation and Regression 10.2 Correlation 10.3 Regression.
Correlation and Linear Regression INCM Correlation  Correlation coefficients assess strength of linear relationship between two quantitative variables.
Chapter 10 Correlation and Regression
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 19 Linear Patterns.
Chapters 8 & 9 Linear Regression & Regression Wisdom.
Chapter 5 Residuals, Residual Plots, & Influential points.
Verbal SAT vs Math SAT V: mean=596.3 st.dev=99.5 M: mean=612.2 st.dev=96.1 r = Write the equation of the LSRL Interpret the slope of this line Interpret.
Regression BPS chapter 5 © 2010 W.H. Freeman and Company.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
A medical researcher wishes to determine how the dosage (in mg) of a drug affects the heart rate of the patient. DosageHeart rate
WARM-UP Do the work on the slip of paper (handout)
Examining Bivariate Data Unit 3 – Statistics. Some Vocabulary Response aka Dependent Variable –Measures an outcome of a study Explanatory aka Independent.
CHAPTER 5 Regression BPS - 5TH ED.CHAPTER 5 1. PREDICTION VIA REGRESSION LINE NUMBER OF NEW BIRDS AND PERCENT RETURNING BPS - 5TH ED.CHAPTER 5 2.
Chapter 9: Correlation and Regression Analysis. Correlation Correlation is a numerical way to measure the strength and direction of a linear association.
 Find the Least Squares Regression Line and interpret its slope, y-intercept, and the coefficients of correlation and determination  Justify the regression.
Chapter 3-Examining Relationships Scatterplots and Correlation Least-squares Regression.
Chapter 2 Examining Relationships.  Response variable measures outcome of a study (dependent variable)  Explanatory variable explains or influences.
SWBAT: Calculate and interpret the residual plot for a line of regression Do Now: Do heavier cars really use more gasoline? In the following data set,
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 3 Association: Contingency, Correlation, and Regression Section 3.3 Predicting the Outcome.
Residuals Recall that the vertical distances from the points to the least-squares regression line are as small as possible.  Because those vertical distances.
LEAST-SQUARES REGRESSION 3.2 Least Squares Regression Line and Residuals.
SWBAT: Calculate and interpret the equation of the least-squares regression line Do Now: If data set A of (x, y) data has correlation r = 0.65, and a second.
Simple Linear Regression The Coefficients of Correlation and Determination Two Quantitative Variables x variable – independent variable or explanatory.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
CHAPTER 5: Regression ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Unit 3 Correlation. Homework Assignment For the A: 1, 5, 7,11, 13, , 21, , 35, 37, 39, 41, 43, 45, 47 – 51, 55, 58, 59, 61, 63, 65, 69,
Describing Relationships. Least-Squares Regression  A method for finding a line that summarizes the relationship between two variables Only in a specific.
Part II Exploring Relationships Between Variables.
Chapter 11 Linear Regression and Correlation. Explanatory and Response Variables are Numeric Relationship between the mean of the response variable and.
1. Analyzing patterns in scatterplots 2. Correlation and linearity 3. Least-squares regression line 4. Residual plots, outliers, and influential points.
Statistics 101 Chapter 3 Section 3.
Regression BPS 7e Chapter 5 © 2015 W. H. Freeman and Company.
Correlation and Simple Linear Regression
EQ: How well does the line fit the data?
Correlation and Simple Linear Regression
Review of Chapter 3 Examining Relationships
Simple Linear Regression and Correlation
Algebra Review The equation of a straight line y = mx + b
A medical researcher wishes to determine how the dosage (in mg) of a drug affects the heart rate of the patient. Find the correlation coefficient & interpret.
Presentation transcript:

Regression BPS chapter 5 © 2006 W.H. Freeman and Company

Sum of squared errors Which least-squares regression line would have a smaller sum of squared errors (SSE)? The line in Plot A. The line in Plot B. It would be the same for both plots.

Sum of squared errors (answer) Which least-squares regression line would have a smaller sum of squared errors (SSE)? The line in Plot A. The line in Plot B. It would be the same for both plots.

Scatterplots Look at the following scatterplot. What could we say about the relationship between r and the slope of the regression line? Since a is negative, r must also be negative. Since b is negative, r must also be negative. Since a is positive, r must also be positive. Since b is positive, r must also be positive.

Scatterplots (answer) Look at the following scatterplot. What could we say about the relationship between r and the slope of the regression line? Since a is negative, r must also be negative. Since b is negative, r must also be negative. Since a is positive, r must also be positive. Since b is positive, r must also be positive.

Slope Look at the following scatterplot. What would be a correct interpretation of the slope? As we increase our CO content by 1 mg, we increase the tar content by 1.01 mg. As we increase our CO content by 0.66 mg, we increase the tar content by 1.01 mg. As we increase our CO content by 0.66 mg, we increase the tar content by 0.66 mg. As we increase our CO content by 1 mg, we increase the tar content by 0.66 mg.

Slope (answer) Look at the following scatterplot. What would be a correct interpretation of the slope? As we increase our CO content by 1 mg, we increase the tar content by 1.01 mg. As we increase our CO content by 0.66 mg, we increase the tar content by 1.01 mg. As we increase our CO content by 0.66 mg, we increase the tar content by 0.66 mg. As we increase our CO content by 1 mg, we increase the tar content by 0.66 mg.

Regression line Look at the following least-squares regression line. If a person increased his/her weight by 10 pounds, by how much (in inches) would one expect to see their waist girth increase? 0.1332 1.332 99.994 1.332 + 9.9994

Regression line (answer) Look at the following least-squares regression line. If a person increased his/her weight by 10 pounds, by how much (in inches) would one expect to see their waist girth increase? 0.1332 1.332 99.994 1.332 + 9.9994

Regression line Look at the following least-squares regression line. The Y-intercept tells us the predicted waist girth for someone weighing how many pounds? 0.1332 9.9994 Cannot be determined from the graph.

Regression line (answer) Look at the following least-squares regression line. The Y-intercept tells us the predicted waist girth for someone weighing how many pounds? 0.1332 9.9994 Cannot be determined from the graph.

Residuals Look at the following least-squares regression line. Compare the squared errors (residuals) from the two Points A and B. Point A’s would be greater than Point B’s. Point A’s would be less than Point B’s. Point A’s would be equal to Point B’s. There is not enough information.

Residuals (answer) Look at the following least-squares regression line. Compare the squared errors (residuals) from the two Points A and B. Point A’s would be greater than Point B’s. Point A’s would be less than Point B’s. Point A’s would be equal to Point B’s. There is not enough information.

Percent of variation in Y What percent of the variation in the sisters’ heights can be explained by the heights of the brothers? 25.64% (0.558)2 = 31.14% 52.7% 55.8%

Percent of variation in Y (answer) What percent of the variation in the sisters’ heights can be explained by the heights of the brothers? 25.64% (0.558)2 = 31.14% 52.7% 55.8%

Correlation The correlation between math SAT score and total SAT score is about r = 0.9935. What is a correct conclusion that could be made? The least-squares regression line of Y on X would have slope = 0.9935. Math SAT scores explain about 98.7% (which is 0.99352) of the variation in the total SAT scores. About 99.35% of the time math SAT scores will accurately predict total SAT scores. Total SAT score is made up of 99.35% of the math SAT score.

Correlation (answer) The correlation between math SAT score and total SAT score is about r = 0.9935. What is a correct conclusion that could be made? The least-squares regression line of Y on X would have slope = 0.9935. Math SAT scores explain about 98.7% (which is 0.99352) of the variation in the total SAT scores. About 99.35% of the time math SAT scores will accurately predict total SAT scores. Total SAT score is made up of 99.35% of the math SAT score.

Residuals Residual equals

Residuals (answer) Residual equals

Residual plots Residual plots are used to Examine the relationship between two variables. Identify the mean and spread of the residuals. Check for independence of observations. Magnify violations of regression assumptions.

Residual plots (answer) Residual plots are used to Examine the relationship between two variables. Identify the mean and spread of the residuals. Check for independence of observations. Magnify violations of regression assumptions.

Residual plots The following are regression assumptions: The relationship between X and Y can be modeled with a straight line. The variation in the Y values does not depend on the value of X (constant variance). The residual plot shown below indicates the violation of which regression assumption? 1 2 Neither

Residual plots (answer) The following are regression assumptions: The relationship between X and Y can be modeled with a straight line. The variation in the Y values does not depend on the value of X (constant variance). The residual plot shown below indicates the violation of which regression assumption? 1 2 Neither

Residual plots The following are regression assumptions: The relationship between X and Y can be modeled with a straight line. The variation in the Y values does not depend on the value of X (constant variance). The residual plot shown below indicates the violation of which regression assumption? 1 2 Neither

Residual plots (answer) The following are regression assumptions: The relationship between X and Y can be modeled with a straight line. The variation in the Y values does not depend on the value of X (constant variance). The residual plot shown below indicates the violation of which regression assumption? 1 2 Neither

Correlation or regression Which of the following measures the direction and strength of the linear association between X and Y? Correlation Regression

Correlation or regression (answer) Which of the following measures the direction and strength of the linear association between X and Y? Correlation Regression

Correlation or regression Which of the following makes no distinction between explanatory and response variables? Correlation Regression

Correlation or regression (answer) Which of the following makes no distinction between explanatory and response variables? Correlation Regression

Correlation or regression Which of the following is used for prediction? Correlation Regression

Correlation or regression (answer) Which of the following is used for prediction? Correlation Regression

Regression line A regression line always passes through the point

Regression line (answer) A regression line always passes through the point

Correlation and slope Which of the following best describes the relationship between correlation and slope? The correlation of X and Y equals the slope of the regression line modeling the relationship between X and Y. When the correlation between X and Y is zero, the slope of the regression line modeling the relationship between X and Y is negative. The sign of the correlation between X and Y is the same as the sign of the slope of the regression line modeling the relationship between X and Y. The correlation between X and Y is not related to the slope of the regression line modeling the relationship between X and Y.

Correlation and slope (answer) Which of the following best describes the relationship between correlation and slope? The correlation of X and Y equals the slope of the regression line modeling the relationship between X and Y. When the correlation between X and Y is zero, the slope of the regression line modeling the relationship between X and Y is negative. The sign of the correlation between X and Y is the same as the sign of the slope of the regression line modeling the relationship between X and Y. The correlation between X and Y is not related to the slope of the regression line modeling the relationship between X and Y.

Regression line Which of the following best measures the strength of fit of a regression line? Correlation coefficient, r. Square of the correlation coefficient, r2. Square root of the correlation coefficient, .

Regression line (answer) Which of the following best measures the strength of fit of a regression line? Correlation coefficient, r. Square of the correlation coefficient, r2. Square root of the correlation coefficient, .

Causation Researchers interviewed a group of women with knee pain awaiting knee replacement surgery. They also interviewed a group of women from the same geographical area with no knee pain. These researchers reported that wearing high-heeled shoes caused the knee pain which required surgery. As a savvy consumer of statistics, you conclude that: Because this was only an observational study, the researchers should not make claims that the knee pain was caused by high heels. Because the study was a valid experiment, the researchers were valid in their claim about high heels causing pain.

Causation (answer) Researchers interviewed a group of women with knee pain awaiting knee replacement surgery. They also interviewed a group of women from the same geographical area with no knee pain. These researchers reported that wearing high-heeled shoes caused the knee pain which required surgery. As a savvy consumer of statistics, you conclude that: Because this was only an observational study, the researchers should not make claims that the knee pain was caused by high heels. Because the study was a valid experiment, the researchers were valid in their claim about high heels causing pain.

Linear regression The following graph indicates the presence of Extrapolation. An influential observation. A lurking variable.

Linear regression (answer) The following graph indicates the presence of Extrapolation. An influential observation. A lurking variable.

Linear regression The following graph shows the linear relationship between diamond size and price for diamonds size 0.35 carats or less. Using this relationship to predict the price of a diamond that is 1 carat is considered Extrapolation. An influential observation. Prediction.

Linear regression (answer) The following graph shows the linear relationship between diamond size and price for diamonds size 0.35 carats or less. Using this relationship to predict the price of a diamond that is 1 carat is considered Extrapolation. An influential observation. Prediction.

Linear regression The diamonds mentioned in the previous question were of the same cut and clarity. If diamonds of different cuts have different relationship between size and price, we would say that Type of cut is a lurking variable. Type of cut is a confounded variable. Type of cut should be ignored.

Linear regression (answer) The diamonds mentioned in the previous question were of the same cut and clarity. If diamonds of different cuts have different relationship between size and price, we would say that Type of cut is a lurking variable. Type of cut is a confounded variable. Type of cut should be ignored.