Section 7.2 ~ Interpreting Correlations Introduction to Probability and Statistics Ms. Young ~ room 113.

Slides:



Advertisements
Similar presentations
4.7 The coefficient of determination r2
Advertisements

Chapter 3: Describing Relationships
7.1 Seeking Correlation LEARNING GOAL
Section 4.3 ~ Measures of Variation
Section 7.1 ~ Seeking Correlation
Copyright © 2015, 2011, 2008 Pearson Education, Inc. Chapter 5, Unit E, Slide 1 Statistical Reasoning 5.
Chapter 41 Describing Relationships: Scatterplots and Correlation.
Correlation: Relationships Can Be Deceiving. The Impact Outliers Have on Correlation An outlier that is consistent with the trend of the rest of the data.
Describing Relationships: Scatterplots and Correlation
Chapter 7 Scatterplots, Association, Correlation Scatterplots and correlation Fitting a straight line to bivariate data © 2006 W. H. Freeman.
1 10. Causality and Correlation ECON 251 Research Methods.
Association between 2 variables We've described the distribution of 1 variable in Chapter 1 - but what if 2 variables are measured on the same individual?
Relationships Scatterplots and correlation BPS chapter 4 © 2006 W.H. Freeman and Company.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Is there a relationship between the lengths of body parts ?
Section 7.3 ~ Best-Fit Lines and Prediction Introduction to Probability and Statistics Ms. Young.
Math 2: Unit 6 Day 1 How do we use scatter plots, correlation, and linear regression?
Chapter 5 Correlation. Suppose we found the age and weight of a sample of 10 adults. Create a scatterplot of the data below. Is there any relationship.
Chapter 3 Correlation. Suppose we found the age and weight of a sample of 10 adults. Create a scatterplot of the data below. Is there any relationship.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-1 Review and Preview.
Correlation.
Quantitative Data Essential Statistics. Quantitative Data O Review O Quantitative data is any data that produces a measurement or amount of something.
Examining Relationships Prob. And Stat. 2.2 Correlation.
LECTURE UNIT 7 Understanding Relationships Among Variables Scatterplots and correlation Fitting a straight line to bivariate data.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 1 – Slide 1 of 30 Chapter 4 Section 1 Scatter Diagrams and Correlation.
1 Examining Relationships in Data William P. Wattles, Ph.D. Francis Marion University.
Research & Statistics Looking for Conclusions. Statistics Mathematics is used to organize, summarize, and interpret mathematical data 2 types of statistics.
Statistical Reasoning for everyday life Intro to Probability and Statistics Mr. Spering – Room 113.
Section 7.4 ~ The Search for Causality Introduction to Probability and Statistics Ms. Young.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.1 Scatterplots.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Objectives 2.1Scatterplots  Scatterplots  Explanatory and response variables  Interpreting scatterplots  Outliers Adapted from authors’ slides © 2012.
More about Correlation
Chapter 7 found in Unit 5 Correlation & Causality Section 1: Seeking Correlation Can't Type? press F11 Can’t Hear? Check: Speakers, Volume or Re-Enter.
Linear regression Correlation. Suppose we found the age and weight of a sample of 10 adults. Create a scatterplot of the data below. Is there any relationship.
The Big Picture Where we are coming from and where we are headed…
Statistical Reasoning for everyday life Intro to Probability and Statistics Mr. Spering – Room 113.
AP Statistics Monday, 26 October 2015 OBJECTIVE TSW investigate the role of correlation in statistics. EVERYONE needs a graphing calculator. DUE NOW –Gummi.
Chapter 4 Scatterplots and Correlation. Chapter outline Explanatory and response variables Displaying relationships: Scatterplots Interpreting scatterplots.
Regression Analysis: Part 2 Inference Dummies / Interactions Multicollinearity / Heteroscedasticity Residual Analysis / Outliers.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-2 Correlation 10-3 Regression.
Chapter 7 found in Unit 5 Correlation & Causality Section 1: Seeking Correlation Page 286 Can't Type? press F11 Can’t Hear? Check: Speakers, Volume or.
Copyright © 2009 Pearson Education, Inc. 7.1 Seeking Correlation LEARNING GOAL Be able to define correlation, recognize positive and negative correlations.
Slide Slide 1 Chapter 10 Correlation and Regression 10-1 Overview 10-2 Correlation 10-3 Regression 10-4 Variation and Prediction Intervals 10-5 Multiple.
7.1 Seeking Correlation LEARNING GOAL
Welcome to the Unit 5 Seminar Kristin Webster
Quantitative Data Essential Statistics.
Chapter 5 Correlation.
Correlation.
Examining Relationships
Scatterplots Chapter 6.1 Notes.
Is there a relationship between the lengths of body parts?
CHAPTER 7 LINEAR RELATIONSHIPS
7.2 Interpreting Correlations
7.3 Best-Fit Lines and Prediction
7.2 Interpreting Correlations
The Practice of Statistics in the Life Sciences Fourth Edition
7.2 Interpreting Correlations
Chapter 2 Looking at Data— Relationships
Chapter 5 Correlation.
7.3 Best-Fit Lines and Prediction
CHAPTER 3 Describing Relationships
Chapter 3 Scatterplots and Correlation.
Correlation.
6.2 Determining the Four Characteristics of an Association
CHAPTER 3 Describing Relationships
Summarizing Bivariate Data
Correlation and Causality
Honors Statistics Review Chapters 7 & 8
Presentation transcript:

Section 7.2 ~ Interpreting Correlations Introduction to Probability and Statistics Ms. Young ~ room 113

Objective Sec. 7.2 After this section you will be aware of important cautions concerning the interpretation of correlations, especially the effects of outliers, the effects of grouping data, and the crucial fact that correlation does not necessarily imply causality.

Beware of Outliers When examining a scatterplot to determine correlation, be aware of any outliers  They can greatly affect the correlation coefficient, possibly resulting in a misleading conclusion about the relationship between the variables The scatterplot below has an outlier located in the top right  With the outlier included, r = 0.880, which represents a very strong positive correlation  If you calculate the correlation coefficient without the outlier it is 0, which represents absolutely no correlation Even though outliers can mask the correlation, you should not remove them without having a strong reason to believe that they do not belong in the data set Sec. 7.2

Example 1 ~ Masked Correlation You’ve conducted a study to determine how the number of calories a person consumes in a day correlates with time spent in vigorous bicycling. Your sample consisted of ten women cyclists, all of approximately the same height and weight. Over a period of two weeks, you asked each woman to record the amount of time she spent cycling each day and what she ate on each of those days. You used the eating records to calculate the calories consumed each day. The diagram below shows each woman’s mean time spent cycling on the horizontal axis and mean caloric intake on the vertical axis. Do higher cycling times correspond to higher intake of calories? Sec. 7.2

Example 1 ~ Solution If you look at the data as a whole, your eye will probably tell you that there is a positive correlation in which greater cycling time tends to go with higher caloric intake. But the correlation is very weak, with a correlation coefficient of However, notice that two points are outliers: one representing a cyclist who cycled about a half-hour per day and consumed more than 3,000 calories, and the other representing a cyclist who cycled more than 2 hours per day on only 1,200 calories It’s difficult to explain the two outliers, given that all the women in the sample have similar heights and weights. We might therefore suspect that these two women either recorded their data incorrectly or were not following their usual habits during the two-week study. If we can confirm this suspicion, then we would have reason to delete the two data points as invalid. The correlation is quite strong without those two outlier points, and suggests that the number of calories consumed rises by a little more than 500 calories for each hour of cycling, but we should not remove the outliers without confirming our suspicion that they were invalid data points, and we should report our reasons for leaving them out. Sec. 7.2

Beware of Inappropriate Grouping Sometimes grouping data inappropriately can hide correlations  Data may appear to have no correlation, but when grouped differently, a correlation is apparent Ex. ~ Consider a study in which researchers seek a correlation between hours of TV watched per week and high school grade point average (GPA). They collect the 21 data pairs in Table 7.3. Sec. 7.2  The scatterplot shows virtually no correlation, and the correlation coefficient equals  The apparent conclusion is that TV viewing habits are unrelated to academic achievement

Beware of Inappropriate Grouping Cont’d… However, after further investigation, one astute researcher realizes that some of the students watched mostly educational programs, while others tended to watch comedies, dramas, and movies. She therefore divides the data set into two groups, one for the students who watched mostly educational television and one for the other students. Sec. 7.2

Beware of Inappropriate Grouping Cont’d… After graphing each of the groups separately, we find two very strong correlations:  A strong positive correlation for the students who watched educational programs (r = 0.855)  A strong negative correlation for the other students (r = ). Sec. 7.2

Beware of Inappropriate Grouping Cont’d… Sometimes data may appear to have a correlation, but when grouped differently there is no correlation  Ex. ~ Consider the data collected by a consumer group studying the relationship between the weights and prices of cars. The data set as a whole shows a strong positive correlation (r = 0.949) After closer examination, you can see that there are two rather distinct categories; light cars and heavy cars If you analyze the light cars alone, r = (nearly no correlation) If you analyze the heavy cars alone, r = (nearly no correlation) This false correlation occurred because of the separation between the two clusters Sec. 7.2

Correlation Does Not Imply Causality Just because numbers tell us that there is a correlation between two variables, it does not mean that it is necessarily true In other words, “correlation does not imply causality”, or one variable does not necessarily cause the other one Here are some possible explanations for a correlation  The correlation may be a coincidence Ex. ~ Super Bowl and the stock market (refer to ex. 2 on P.303)  Both correlated variables might be directly influenced by some common underlying cause Ex. ~ As eggnog sales increase in Pennsylvania, accident rates increase as well; the underlying cause would be that eggnog is typically sold in the winter and accidents are more common in the winter due to inclement weather  One of the correlated variables may actually be a cause of the other, but it may just be one of several causes Ex. ~ There is a correlation between smoking and lung cancer, but smoking is not the only way one can get lung cancer Sec. 7.2