Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 3 Association: Contingency, Correlation, and Regression Section 3.4 Cautions in Analyzing.

Slides:



Advertisements
Similar presentations
Section 4.2. Correlation and Regression Describe only linear relationship. Strongly influenced by extremes in data. Always plot data first. Extrapolation.
Advertisements

Chapter 4: More on Two- Variable Data.  Correlation and Regression Describe only linear relationships Are not resistant  One influential observation.
Agresti/Franklin Statistics, 1 of 52 Chapter 3 Association: Contingency, Correlation, and Regression Learn …. How to examine links between two variables.
Chapter 6: Exploring Data: Relationships Lesson Plan
Chapter 3 Association: Contingency, Correlation, and Regression
Chapter 9 Regression Wisdom
2.4 Cautions about Correlation and Regression. Residuals (again!) Recall our discussion about residuals- what is a residual? The idea for line of best.
Chapter 2: Looking at Data - Relationships /true-fact-the-lack-of-pirates-is-causing-global-warming/
Looking at Data-Relationships 2.1 –Scatter plots.
The Simple Regression Model
Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc. Chapter Describing the Relation between Two Variables 4.
Ch 2 and 9.1 Relationships Between 2 Variables
Basic Practice of Statistics - 3rd Edition
10. Introduction to Multivariate Relationships Bivariate analyses are informative, but we usually need to take into account many variables. Many explanatory.
Review for Final Exam Some important themes from Chapters 9-11 Final exam covers these chapters, but implicitly tests the entire course, because we use.
Chapter 5 Regression. Chapter 51 u Objective: To quantify the linear relationship between an explanatory variable (x) and response variable (y). u We.
Descriptive Methods in Regression and Correlation
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-3 Regression.
Slide Copyright © 2008 Pearson Education, Inc. Chapter 4 Descriptive Methods in Regression and Correlation.
1 10. Causality and Correlation ECON 251 Research Methods.
ASSOCIATION: CONTINGENCY, CORRELATION, AND REGRESSION Chapter 3.
2.4: Cautions about Regression and Correlation. Cautions: Regression & Correlation Correlation measures only linear association. Extrapolation often produces.
Looking at data: relationships - Caution about correlation and regression - The question of causation IPS chapters 2.4 and 2.5 © 2006 W. H. Freeman and.
Chapter 6: Exploring Data: Relationships Lesson Plan Displaying Relationships: Scatterplots Making Predictions: Regression Line Correlation Least-Squares.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.1 Using Several Variables to Predict a Response.
1 Chapter 10, Part 2 Linear Regression. 2 Last Time: A scatterplot gives a picture of the relationship between two quantitative variables. One variable.
Chapter 15 Describing Relationships: Regression, Prediction, and Causation Chapter 151.
BPS - 3rd Ed. Chapter 51 Regression. BPS - 3rd Ed. Chapter 52 u Objective: To quantify the linear relationship between an explanatory variable (x) and.
BPS - 5th Ed. Chapter 51 Regression. BPS - 5th Ed. Chapter 52 u Objective: To quantify the linear relationship between an explanatory variable (x) and.
1 EXPLORING RELATIONSHIPS BETWEEN QUANTITATIVE VARIABLES SCATTERPLOTS, ASSOCIATION, AND CORRELATION ADDITIONAL REFERENCE READING MATERIAL COURSEPACK PAGES.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 12: Analyzing the Association Between Quantitative Variables: Regression Analysis Section.
Lecture Presentation Slides SEVENTH EDITION STATISTICS Moore / McCabe / Craig Introduction to the Practice of Chapter 2 Looking at Data: Relationships.
Active Learning Lecture Slides For use with Classroom Response Systems Association: Contingency, Correlation, and Regression.
CHAPTER 5 Regression BPS - 5TH ED.CHAPTER 5 1. PREDICTION VIA REGRESSION LINE NUMBER OF NEW BIRDS AND PERCENT RETURNING BPS - 5TH ED.CHAPTER 5 2.
Chapter 5 Regression. u Objective: To quantify the linear relationship between an explanatory variable (x) and response variable (y). u We can then predict.
Copyright © 2010 Pearson Education, Inc. Chapter 9 Regression Wisdom.
Chapter 4 – Correlation and Regression before: examined relationship among 1 variable (test grades, metabolism, trip time to work, etc.) now: will examine.
10. Introduction to Multivariate Relationships Bivariate analyses are informative, but we usually need to take into account many variables. Many explanatory.
Lesson Correlation and Regression Wisdom. Knowledge Objectives Recall the three limitations on the use of correlation and regression. Explain what.
 What is an association between variables?  Explanatory and response variables  Key characteristics of a data set 1.
Lecture 5 Chapter 4. Relationships: Regression Student version.
Chapter 2 Examining Relationships.  Response variable measures outcome of a study (dependent variable)  Explanatory variable explains or influences.
Business Statistics for Managerial Decision Making
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 3 Association: Contingency, Correlation, and Regression Section 3.3 Predicting the Outcome.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-2 Correlation 10-3 Regression.
10. Introduction to Multivariate Relationships Bivariate analyses are informative, but we usually need to take into account many variables. Many explanatory.
1 Regression Line Part II Class Class Objective After this class, you will be able to -Evaluate Regression and Correlation Difficulties and Disasters.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 12 Analyzing the Association Between Quantitative Variables: Regression Analysis Section.
Chapter 9 Regression Wisdom. Getting the “Bends” Linear regression only works for data with a linear association. Curved relationships may not be evident.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 9 Regression Wisdom.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 3 Association: Contingency, Correlation, and Regression Section 3.1 The Association.
CHAPTER 5: Regression ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Chapter 5: 02/17/ Chapter 5 Regression. 2 Chapter 5: 02/17/2004 Objective: To quantify the linear relationship between an explanatory variable (x)
Chapter 3 Unusual points and cautions in regression.
Chapter 3 Association: Contingency, Correlation, and Regression Section 3.1 How Can We Explore the Association between Two Categorical Variables?
Cautions About Correlation and Regression
Lesson 13: Things To Watch out for
Cautions about Correlation and Regression
Chapter 2 Looking at Data— Relationships
Cautions about Correlation and Regression
Chapter 2 Looking at Data— Relationships
Chapter 1 Linear Functions
3 4 Chapter Describing the Relation between Two Variables
Review of Chapter 3 Examining Relationships
Looking at data: relationships - Caution about correlation and regression - The question of causation IPS chapters 2.4 and 2.5 © 2006 W. H. Freeman and.
Chapter 3 Association: Contingency, Correlation, and Regression
4.2 Cautions about Correlation and Regression
Ch 9.
Chapter 4: More on Two-Variable Data
Presentation transcript:

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 3 Association: Contingency, Correlation, and Regression Section 3.4 Cautions in Analyzing Associations

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 3 Extrapolation: Using a regression line to predict y-values for x-values outside the observed range of the data.  Riskier the farther we move from the range of the given x-values.  There is no guarantee that the relationship given by the regression equation holds outside the range of sampled x-values. Extrapolation Is Dangerous

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 4 One reason to plot the data before you do a correlation or regression analysis is to check for unusual observations. Search for observations that are regression outliers, being well removed from the trend that the rest of the data follow. Be Cautious of Influential Outliers

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 5 A regression outlier is an observation that lies far away from the trend that the rest of the data follows. An observation is influential if  its x value is relatively low or high compared to the remainder of the data.  the observation is a regression outlier. Outliers and Influential Points Influential observations tend to pull the regression line toward that data point and away from the rest of the data points.

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 6 Figure 3.18 An Observation Is a Regression Outlier if it is Far Removed from the Trend that the Rest of the Data Follow. The top two points are regression outliers. Not all regression outliers are influential in affecting the correlation or slope. Question: Which regression outlier in this figure is influential? Outliers and Influential Points

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 7 In a regression analysis, suppose that as x goes up, y also tends to go up (or down). Can we conclude that there’s a causal connection, with changes in x causing changes in y?  A strong correlation between x and y means that there is a strong linear association that exists between the two variables.  A strong correlation between x and y, does not mean that x causes y to change. Correlation Does Not Imply Causation

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 8 Data are available for all fires in Chicago last year on x = number of firefighters at the fire and y = cost of damages due to the fire. 1. Would you expect the correlation to be negative, zero, or positive? 2. If the correlation is positive, does this mean that having more firefighters at a fire causes the damages to be worse? Yes or No? 3. Identify a third variable that could be considered a common cause of x and y:  Distance from the fire station  Intensity of the fire  Size of the fire Correlation Does Not Imply Causation

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 9 A lurking variable is a variable, usually unobserved, that influences the association between the variables of primary interest.  Ice cream sales and drowning – lurking variable = temperature  Reading level and shoe size – lurking variable = age  Childhood obesity rate and GDP-lurking variable = time When two explanatory variables are both associated with a response variable but are also associated with each other, there is said to be confounding. Lurking variables are not measured in the study but have the potential for confounding. Lurking Variables & Confounding

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 10 Simpson’s Paradox: When the direction of an association between two variables changes after we include a third variable and analyze the data at separate levels of that third variable. Simpson’s Paradox

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 11 Simpson’s Paradox Example: Smoking and Health Probability of Death of Smoker = 139/582= 24% Probability of Death of Nonsmoker = 230/732= 31% This can’t be true that smoking improves your chances of living! What’s going on?! Is Smoking Actually Beneficial to Your Health? Table 3.7 Smoking Status and 20-Year Survival in Women

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 12 Break out Data by Age Table 3.8 Smoking Status and 20-Year Survival, for Four Age Groups Simpson’s Paradox Example: Smoking and Health

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 13 For instance, for smokers of age 18–34, from Table 3.8 the proportion who died was 5/( ) = 0.028, or 2.8% Could age explain the association? Table 3.9 Conditional Percentages of Deaths for Smokers and Nonsmokers, by Age Simpson’s Paradox Example: Smoking and Health

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 14 An association can look quite different after adjusting for the effect of a third variable by grouping the data according to the values of the third variable (age). Simpson’s Paradox Example: Smoking and Health Figure 3.23 MINITAB Bar Graph Comparing Percentage of Deaths for Smokers and Nonsmokers, by Age. This side-by-side bar graph shows the conditional percentages from Table 3.9.

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 15 Lurking variables can affect associations in many ways. For instance, a lurking variable may be a common cause of both the explanatory and response variable. In practice, there’s usually not a single variable that causally explains a response variable or the association between two variables. More commonly, there are multiple causes. When there are multiple causes, the association among them makes it difficult to study the effect of any single variable. The Effect of Lurking Variables on Associations

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 16 When two explanatory variables are both associated with a response variable but are also associated with each other, confounding occurs. It is difficult to determine whether either of them truly causes the response because a variable’s effect could be at least partly due to its association with the other variable. The Effect of Confounding on Associations