Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 12: Analyzing the Association Between Quantitative Variables: Regression Analysis Section.

Slides:



Advertisements
Similar presentations
Simple linear models Straight line is simplest case, but key is that parameters appear linearly in the model Needs estimates of the model parameters (slope.
Advertisements

Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester
Correlation and Regression
2nd Day: Bear Example Length (in) Weight (lb)
Simple Linear Regression Analysis
CHAPTER 3 Describing Relationships
Agresti/Franklin Statistics, 1 of 88 Chapter 11 Analyzing Association Between Quantitative Variables: Regression Analysis Learn…. To use regression analysis.
Simple Linear Regression Analysis
Chapter 2 – Simple Linear Regression - How. Here is a perfect scenario of what we want reality to look like for simple linear regression. Our two variables.
Regression, Residuals, and Coefficient of Determination Section 3.2.
Correlation & Regression
Descriptive Methods in Regression and Correlation
Introduction to Linear Regression and Correlation Analysis
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 12 Analyzing the Association Between Quantitative Variables: Regression Analysis Section.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-3 Regression.
Relationship of two variables
Copyright © 2012 Pearson Education, Inc. All rights reserved. Chapter 3 Simple Linear Regression.
ASSOCIATION: CONTINGENCY, CORRELATION, AND REGRESSION Chapter 3.
1 Chapter 3: Examining Relationships 3.1Scatterplots 3.2Correlation 3.3Least-Squares Regression.
Chapter 6: Exploring Data: Relationships Lesson Plan Displaying Relationships: Scatterplots Making Predictions: Regression Line Correlation Least-Squares.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Chapter 3 concepts/objectives Define and describe density curves Measure position using percentiles Measure position using z-scores Describe Normal distributions.
Chapter 14 Inference for Regression AP Statistics 14.1 – Inference about the Model 14.2 – Predictions and Conditions.
Chapter 14 Inference for Regression © 2011 Pearson Education, Inc. 1 Business Statistics: A First Course.
Inference for Regression Chapter 14. Linear Regression We can use least squares regression to estimate the linear relationship between two quantitative.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.3 Using Multiple Regression to Make Inferences.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.2 Least-Squares.
Agresti/Franklin Statistics, 1 of 88  Section 11.4 What Do We Learn from How the Data Vary Around the Regression Line?
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 12 Analyzing the Association Between Quantitative Variables: Regression Analysis Section.
Agresti/Franklin Statistics, 1 of 88 Chapter 11 Analyzing Association Between Quantitative Variables: Regression Analysis Learn…. To use regression analysis.
AP STATISTICS LESSON 14 – 1 ( DAY 1 ) INFERENCE ABOUT THE MODEL.
Chapter 14: Inference for Regression. A brief review of chapter 4... (Regression Analysis: Exploring Association BetweenVariables )  Bi-variate data.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 3 Association: Contingency, Correlation, and Regression Section 3.3 Predicting the Outcome.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-2 Correlation 10-3 Regression.
CHAPTER 3 Describing Relationships
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Describing the Relation between Two Variables 4.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 12 Analyzing the Association Between Quantitative Variables: Regression Analysis Section.
1 Chapter 12: Analyzing Association Between Quantitative Variables: Regression Analysis Section 12.1: How Can We Model How Two Variables Are Related?
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 12: Analyzing the Association Between Quantitative Variables: Regression Analysis Section.
11-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
1 Objective Given two linearly correlated variables (x and y), find the linear function (equation) that best describes the trend. Section 10.3 Regression.
Lecture Slides Elementary Statistics Twelfth Edition
CHAPTER 3 Describing Relationships
LEAST – SQUARES REGRESSION
Statistics 101 Chapter 3 Section 3.
CHAPTER 3 Describing Relationships
AP Statistics Chapter 14 Section 1.
CHAPTER 3 Describing Relationships
Simple Linear Regression - Introduction
Lecture Slides Elementary Statistics Thirteenth Edition
CHAPTER 29: Multiple Regression*
CHAPTER 26: Inference for Regression
Regression Models - Introduction
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 14 Inference for Regression
Chapter 3 Describing Relationships Section 3.2
CHAPTER 12 More About Regression
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 14 Inference for Regression
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Presentation transcript:

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 12: Analyzing the Association Between Quantitative Variables: Regression Analysis Section 12.1 Model How Two Variables Are Related

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 3 Regression Analysis The first step of a regression analysis is to identify the response and explanatory variables.  We use y to denote the response variable.  We use x to denote the explanatory variable.

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 4 The Scatterplot The first step in answering the question of association is to look at the data. A scatterplot is a graphical display of the relationship between the response variable (y) and the explanatory variable (x).

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 5 Example: The Strength Study An experiment was designed to measure the strength of female athletes. The goal of the experiment was to determine if there is an association between the maximum number of pounds that each individual athlete could bench press and the number of 60-pound bench presses that athlete could do.

Copyright © 2013, 2009, and 2007, Pearson Education, Inc high school female athletes participated in the study. The data consisted of the following variables:  x: the number of 60-pound bench presses an athlete could do.  y: maximum bench press. Example: The Strength Study

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 7 For the 57 females in this study, these variables are summarized by:  x: mean = 11.0, st. deviation = 7.1  y: mean = 79.9 lbs, st. dev. = 13.3 lbs Example: The Strength Study

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 8 Figure 12.1 Scatterplot for y=Maximum Bench Press and x=Number of 60-lb. Bench Presses. Example: The Strength Study

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 9 The Regression Line Equation When the scatterplot shows a linear trend, a straight line can be fitted through the data points to describe that trend. The regression line is: is the predicted value of the response variable y, is the y-intercept and is the slope.

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 10 Example: Regression Line Predicting Maximum Bench Press Table 12.1 MINITAB Printout for Regression Analysis of y=Maximum Bench Press (BP) and x =Number of 60-Pound Bench Presses (BP_60). TI-83+/84 output

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 11 The MINITAB output shows the following regression equation: BP = (BP_60) The y-intercept is 63.5 and the slope is The slope of 1.49 tells us that predicted maximum bench press increases by about 1.5 pounds for every additional 60-pound bench press an athlete can do. Example: Regression Line Predicting Maximum Bench Press

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 12 Outliers Check for outliers by plotting the data. The regression line can be pulled toward an outlier and away from the general trend of points.

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 13 Influential Points An observation can be influential in affecting the regression line when one or more of two things happen:  Its x value is low or high compared to the rest of the data.  It does not fall in the straight-line pattern that the rest of the data have

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 14 Residuals are Prediction Errors The regression equation is often called a prediction equation. The difference between an observed outcome and its predicted value is the prediction error, called a residual.

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 15 SUMMARY: Review of Residuals Each observation has a residual. A residual is the vertical distance between the data point and the regression line. The smaller the distance, the better the prediction.

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 16 We can summarize how near the regression line the data points fall by The regression line has the smallest sum of squared residuals and is called the least squares line. SUMMARY: Review of Residuals

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 17 Regression Model: A Line Describes How the Mean of y Depends on x At a given value of x, the equation:  Predicts a single value of the response variable.  But… we should not expect all subjects at that value of x to have the same value of y because variability occurs in the y values.

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 18 SUMMARY: The Regression Line The regression line connects the estimated means of y at the various x values. In summary,  Describes the relationship between x and the estimated means of y at the various values of x.

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 19 The population regression equation describes the relationship in the population between x and the means of y. The equation is denoted by: The Population Regression Equation

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 20 In the population regression equation, is a population y-intercept and is a population slope.  These are parameters, so in practice their values are unknown. In practice we estimate the population regression equation using the prediction equation for the sample data. The Population Regression Equation

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 21 The population regression equation merely approximates the actual relationship between x and the population means of y. It is a model.  A model is a simple approximation for how variables relate in the population. The Population Regression Equation

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 22 The Regression Model Figure 12.2 The Regression Model for the Means of y Is a Simple Approximation for the True Relationship. Question: Can you sketch a true relationship for which this model is a very poor approximation?

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 23 If the true relationship is far from a straight line, this regression model may be a poor one. Figure 12.3 The Straight-Line Regression Model Provides a Poor Approximation When the Actual Relationship Is Highly Nonlinear. Question: What type of mathematical function might you consider using for a regression model in this case? The Regression Model

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 24 Variability about the Line At each fixed value of x, variability occurs in the y values around their mean,. The probability distribution of y values at a fixed value of x is a conditional distribution. At each value of x, there is a conditional distribution of y values. An additional parameter describes the standard deviation of each conditional distribution.

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 25 A Statistical Model A statistical model never holds exactly in practice. It is merely an approximation for reality. Even though it does not describe reality exactly, a model is useful if the true relationship is close to what the model predicts.