Chapter 4 Scatterplots and Correlation. Explanatory and Response Variables u Interested in studying the relationship between two variables by measuring.

Slides:



Advertisements
Similar presentations
5/17/2015Chapter 41 Scatterplots and Correlation.
Advertisements

CHAPTER 4: Scatterplots and Correlation. Chapter 4 Concepts 2  Explanatory and Response Variables  Displaying Relationships: Scatterplots  Interpreting.
+ Scatterplots and Correlation Displaying Relationships: ScatterplotsThe most useful graph for displaying the relationship between two quantitative variables.
Chapter 41 Describing Relationships: Scatterplots and Correlation.
Looking at data: relationships - Correlation IPS chapter 2.2 Copyright Brigitte Baldi 2005 ©
CHAPTER 3 Describing Relationships
Describing Relationships: Scatterplots and Correlation
Basic Practice of Statistics - 3rd Edition
Chapter 7 Scatterplots, Association, Correlation Scatterplots and correlation Fitting a straight line to bivariate data © 2006 W. H. Freeman.
AP STATISTICS LESSON 3 – 1 EXAMINING RELATIONSHIPS SCATTER PLOTS.
LECTURE 2 Understanding Relationships Between 2 Numerical Variables
Chapter 5 Regression. Chapter 51 u Objective: To quantify the linear relationship between an explanatory variable (x) and response variable (y). u We.
Examining Relationships Prob. And Stat. CH.2.1 Scatterplots.
Relationships Scatterplots and correlation BPS chapter 4 © 2006 W.H. Freeman and Company.
Stat 1510: Statistical Thinking and Concepts Scatterplots and Correlation.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
LECTURE UNIT 7 Understanding Relationships Among Variables Scatterplots and correlation Fitting a straight line to bivariate data.
BPS - 3rd Ed. Chapter 41 Scatterplots and Correlation.
BPS - 3rd Ed. Chapter 41 Scatterplots and Correlation.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 3 Association: Contingency, Correlation, and Regression Section 3.2 The Association.
Looking at data: relationships Scatterplots IPS chapter 2.1 © 2006 W. H. Freeman and Company.
Chapter 14 Describing Relationships: Scatterplots and Correlation Chapter 141.
Essential Statistics Chapter 41 Scatterplots and Correlation.
Objectives (IPS Chapter 2.1)
BPS - 3rd Ed. Chapter 51 Regression. BPS - 3rd Ed. Chapter 52 u Objective: To quantify the linear relationship between an explanatory variable (x) and.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.1 Scatterplots.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.1 Scatterplots.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
CHAPTER 4 SCATTERPLOTS AND CORRELATION BPS - 5th Ed. Chapter 4 1.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
Chapter 7 Scatterplots, Association, and Correlation.
Chapter 5 Regression. u Objective: To quantify the linear relationship between an explanatory variable (x) and response variable (y). u We can then predict.
Relationships Scatterplots and correlation BPS chapter 4 © 2006 W.H. Freeman and Company.
Chapter 4 - Scatterplots and Correlation Dealing with several variables within a group vs. the same variable for different groups. Response Variable:
Chapter 4 Scatterplots and Correlation. Chapter outline Explanatory and response variables Displaying relationships: Scatterplots Interpreting scatterplots.
Relationships Scatterplots and correlation BPS chapter 4 © 2006 W.H. Freeman and Company.
Chapter 141 Describing Relationships: Scatterplots and Correlation.
BPS - 5th Ed. Chapter 41 Scatterplots and Correlation.
Chapter 14 STA 200 Summer I Scatter Plots A scatter plot is a graph that shows the relationship between two quantitative variables measured on the.
Lecture 4 Chapter 3. Bivariate Associations. Objectives (PSLS Chapter 3) Relationships: Scatterplots and correlation  Bivariate data  Scatterplots (2.
Chapter 5: 02/17/ Chapter 5 Regression. 2 Chapter 5: 02/17/2004 Objective: To quantify the linear relationship between an explanatory variable (x)
Essential Statistics Chapter 41 Scatterplots and Correlation.
CHAPTER 3 Describing Relationships
Variables Dependent variable: measures an outcome of a study
Basic Practice of Statistics - 3rd Edition
Daniela Stan Raicu School of CTI, DePaul University
Basic Practice of Statistics - 3rd Edition
Basic Practice of Statistics - 3rd Edition
Basic Practice of Statistics - 5th Edition
Daniela Stan Raicu School of CTI, DePaul University
Variables Dependent variable: measures an outcome of a study
CHAPTER 4: Scatterplots and Correlation
Chapter 2 Looking at Data— Relationships
Warmup In a study to determine whether surgery or chemotherapy results in higher survival rates for a certain type of cancer, whether or not the patient.
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 4 - Scatterplots and Correlation
Basic Practice of Statistics - 3rd Edition
Chapter 3 Scatterplots and Correlation.
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Essential Statistics Scatterplots and Correlation
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Basic Practice of Statistics - 3rd Edition
CHAPTER 3 Describing Relationships
Presentation transcript:

Chapter 4 Scatterplots and Correlation

Explanatory and Response Variables u Interested in studying the relationship between two variables by measuring both variables on the same individuals. –a response variable measures an outcome of a study –an explanatory variable explains or influences changes in a response variable –sometimes there is no distinction

Question In a study to determine whether surgery or chemotherapy results in higher survival rates for a certain type of cancer, whether or not the patient survived is one variable, and whether they received surgery or chemotherapy is the other. Which is the explanatory variable and which is the response variable?

u Graphs the relationship between two quantitative (numerical) variables measured on the same individuals. u If a distinction exists, plot the explanatory variable on the horizontal (x) axis and plot the response variable on the vertical (y) axis. Scatterplot

Relationship between mean SAT verbal score and percent of high school grads taking SAT Scatterplot

u Look for overall pattern and deviations from this pattern ( linear, curved, clusters, no pattern) u Describe pattern by form, direction, and strength of the relationship u Look for outliers Scatterplot

Linear Relationship Some relationships are such that the points of a scatterplot tend to fall along a straight line -- linear relationship

Direction u Positive association –above-average values of one variable tend to accompany above-average values of the other variable, and below-average values tend to occur together u Negative association –above-average values of one variable tend to accompany below-average values of the other variable, and vice versa

Positive association: High values of one variable tend to occur together with high values of the other variable. Negative association: High values of one variable tend to occur together with low values of the other variable.

One way to remember this: The equation for this line is y = 5. x is not involved. No relationship: x and y vary independently. Knowing x tells you nothing about y.

Examples From a scatterplot of college students, there is a positive association between verbal SAT score and GPA. For used cars, there is a negative association between the age of the car and the selling price.

Examples of Relationships

Measuring Strength & Direction of a Linear Relationship u How closely does a non-horizontal straight line fit the points of a scatterplot? u The correlation coefficient (often referred to as just correlation): r –measure of the strength of the relationship: the stronger the relationship, the larger the magnitude of r. –measure of the direction of the relationship: positive r indicates a positive relationship, negative r indicates a negative relationship.

Correlation Coefficient u special values for r :  a perfect positive linear relationship would have r = +1  a perfect negative linear relationship would have r = -1  if there is no linear relationship, or if the scatterplot points are best fit by a horizontal line, then r = 0  Note: r must be between -1 and +1, inclusive u both variables must be quantitative; no distinction between response and explanatory variables u r has no units; does not change when measurement units are changed (ex: ft. or in.)

Examples of Correlations

u Husband’s versus Wife’s ages v r =.94 u Husband’s versus Wife’s heights v r =.36 u Professional Golfer’s Putting Success: Distance of putt in feet versus percent success v r = -.94

Not all Relationships are Linear Miles per Gallon versus Speed u Linear relationship? u Correlation is close to zero.

Not all Relationships are Linear Miles per Gallon versus Speed u Curved relationship. u Correlation is misleading.

Problems with Correlations u Outliers can inflate or deflate correlations (see next slide) u Groups combined inappropriately may mask relationships (a third variable) –groups may have different relationships when separated

Outliers and Correlation For each scatterplot above, how does the outlier affect the correlation? AB A: outlier decreases the correlation B: outlier increases the correlation

Example: IQ score and grade point average a)Describe what this plot shows in words. b)Describe the direction, shape, and strength. Are there outliers? c)What is the deal with these people?

Correlation Calculation u Suppose we have data on variables X and Y for n individuals: x 1, x 2, …, x n and y 1, y 2, …, y n u Each variable has a mean and std dev:

Case Study Per Capita Gross Domestic Product and Average Life Expectancy for Countries in Western Europe

Case Study CountryPer Capita GDP (x)Life Expectancy (y) Austria Belgium Finland France Germany Ireland Italy Netherlands Switzerland United Kingdom

Case Study xy = = sum = s x =1.532s y =0.795

Case Study

“r” doesn’t distinguish explanatory and response variables The correlation coefficient, r, treats x and y symmetrically. “Time to swim” is the explanatory variable here and belongs on the x axis. However, in either plot r is the same (r = −0.75). r = -0.75

Changing the units of variables does not change the correlation coefficient “r,” because we get rid of all our units when we standardize (get z-scores). “r” has no unit