Looking at data: relationships - Correlation IPS chapter 2.2 Copyright Brigitte Baldi 2005 ©

Objectives (IPS chapter 2.2)
Correlation
- The correlation coefficient r
- r does not distinguish x and y
- r has no units
- r ranges from −1 to +1
- r measures strength of linear relationships
- Influential points

The correlation coefficient r
The correlation coefficient is a measure of the direction and strength of a linear relationship. It is calculated using the mean and the standard deviation of both the x and y variables.
Correlation can only be used to describe quantitative variables. Categorical variables don't have means and standard deviations.
Time to swim: x̄ = 35, s_x = 0.7. Pulse rate: ȳ = 140, s_y = 9.5.
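For reference, the standard textbook definition of r, written with the means and standard deviations named above, is sketched below. It assumes n paired observations (x_i, y_i) and sample standard deviations s_x and s_y computed with n − 1 in the denominator; the index i and the symbol n are notation introduced here.

```latex
r = \frac{1}{n-1} \sum_{i=1}^{n}
    \left( \frac{x_i - \bar{x}}{s_x} \right)
    \left( \frac{y_i - \bar{y}}{s_y} \right)
```

Each factor in parentheses is a z-score, so r is essentially the average product of the standardized x and y values.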

Part of the calculation involves finding z, the standardized score we used when working with the normal distribution. You DON'T want to do this by hand. Make sure you learn how to use your calculator!
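If software is available instead of a calculator, the z-score calculation can be sketched in a few lines of Python. This is only an illustration: the swim times and pulse rates below are made-up values, not the data behind the slides, and the helper name `correlation` is hypothetical.

```python
from statistics import mean, stdev

def correlation(xs, ys):
    """Pearson r as the sum of z-score products divided by n - 1."""
    n = len(xs)
    x_bar, y_bar = mean(xs), mean(ys)
    s_x, s_y = stdev(xs), stdev(ys)  # sample standard deviations (n - 1)
    z_x = [(x - x_bar) / s_x for x in xs]  # standardized x values
    z_y = [(y - y_bar) / s_y for y in ys]  # standardized y values
    return sum(a * b for a, b in zip(z_x, z_y)) / (n - 1)

# Hypothetical data: swim time (minutes) and pulse rate (beats per minute)
swim_time = [34.1, 34.6, 35.0, 35.4, 35.9]
pulse = [152, 144, 141, 137, 126]

r = correlation(swim_time, pulse)
print(round(r, 3))
```

Because the made-up pulse rates fall steadily as swim time rises, r comes out strongly negative.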

Standardization allows us to compare correlations between data sets whose variables are measured in different units, or whose variables are different altogether. For instance, we might want to compare the correlation between [swim time and pulse] with the correlation between [swim time and breathing rate].

Correlation does not distinguish x and y
The correlation coefficient r treats x and y symmetrically. "Time to swim" is the explanatory variable here, and belongs on the x axis. However, in either plot r is the same (r = −0.75).

Correlation has no units
Changing the units of variables does not change the correlation coefficient r, because we get rid of all our units when we standardize (get z-scores). The z-score plot is the same for both plots.
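Both properties, symmetry in x and y and independence from units, can be checked numerically. The sketch below uses made-up swim times and pulse rates and a hypothetical helper named `correlation`; it swaps the two variables and then converts minutes to seconds.

```python
import math
from statistics import mean, stdev

def correlation(xs, ys):
    """Pearson r as the sum of z-score products divided by n - 1."""
    n = len(xs)
    x_bar, y_bar = mean(xs), mean(ys)
    s_x, s_y = stdev(xs), stdev(ys)
    return sum((x - x_bar) / s_x * (y - y_bar) / s_y
               for x, y in zip(xs, ys)) / (n - 1)

# Hypothetical data: swim time (minutes) and pulse rate (beats per minute)
minutes = [34.1, 34.6, 35.0, 35.4, 35.9]
pulse = [152, 144, 141, 137, 126]
seconds = [60 * m for m in minutes]  # same times in different units

r_original = correlation(minutes, pulse)
r_swapped = correlation(pulse, minutes)   # x and y exchanged
r_seconds = correlation(seconds, pulse)   # units changed

print(math.isclose(r_original, r_swapped),
      math.isclose(r_original, r_seconds))
```

Both comparisons hold, since standardizing removes the units and the product of z-scores does not depend on which variable is called x.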

Correlation ranges from −1 to +1
r quantifies the strength and direction of a linear relationship between two quantitative variables.
Strength: how closely the points follow a straight line.
Direction: r is positive when individuals with higher x values tend to have higher values of y, and negative when they tend to have lower values of y.

When the points scatter less around the line (less variability in one or both variables about the trend), the correlation gets stronger (r gets closer to +1 or −1).
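To illustrate the effect of scatter on r, the sketch below builds two made-up data sets around the same line y = 2x, one with small deviations and one with deviations four times as large. The alternating "noise" pattern and the helper name `correlation` are assumptions chosen only to keep the example deterministic.

```python
from statistics import mean, stdev

def correlation(xs, ys):
    """Pearson r as the sum of z-score products divided by n - 1."""
    n = len(xs)
    x_bar, y_bar = mean(xs), mean(ys)
    s_x, s_y = stdev(xs), stdev(ys)
    return sum((x - x_bar) / s_x * (y - y_bar) / s_y
               for x, y in zip(xs, ys)) / (n - 1)

x = list(range(1, 11))
# Hypothetical deviations from the line: +1 for odd x, -1 for even x
noise = [1 if v % 2 else -1 for v in x]

y_tight = [2 * v + 1 * e for v, e in zip(x, noise)]  # little scatter
y_loose = [2 * v + 4 * e for v, e in zip(x, noise)]  # four times the scatter

r_tight = correlation(x, y_tight)
r_loose = correlation(x, y_loose)
print(round(r_tight, 2), round(r_loose, 2))
```

The data set with less scatter about the line gives the value of r closer to +1.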

Correlation only describes linear relationships
No matter how strong the association, r does not describe curved relationships.
Note: You can sometimes transform a non-linear association to a linear form, for instance by taking the logarithm. You can then calculate a correlation using the transformed data.
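A small sketch of the logarithm idea, using made-up data that follow an exact exponential curve (y doubles with every unit of x). The helper name `correlation` is hypothetical.

```python
import math
from statistics import mean, stdev

def correlation(xs, ys):
    """Pearson r as the sum of z-score products divided by n - 1."""
    n = len(xs)
    x_bar, y_bar = mean(xs), mean(ys)
    s_x, s_y = stdev(xs), stdev(ys)
    return sum((x - x_bar) / s_x * (y - y_bar) / s_y
               for x, y in zip(xs, ys)) / (n - 1)

x = [1, 2, 3, 4, 5, 6, 7, 8]
y = [2 ** v for v in x]            # hypothetical exponential growth
log_y = [math.log(v) for v in y]   # transformed response

r_raw = correlation(x, y)      # curved relationship: r understates it
r_log = correlation(x, log_y)  # after the transform the points fall on a line

print(round(r_raw, 2), math.isclose(r_log, 1.0))
```

Even though y is perfectly determined by x, r on the raw data stays well below 1; after taking logarithms the relationship is exactly linear and r is 1 up to rounding error.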

Influential points
Correlations are calculated using means and standard deviations, and thus are NOT resistant to outliers.
Just moving one point away from the general trend here decreases the correlation from −0.91 to −0.75.
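The same kind of experiment can be sketched in code. The data below are made up (they are not the points in the slide's plot), and `correlation` is a hypothetical helper; the only change between the two data sets is the y value of the last point.

```python
from statistics import mean, stdev

def correlation(xs, ys):
    """Pearson r as the sum of z-score products divided by n - 1."""
    n = len(xs)
    x_bar, y_bar = mean(xs), mean(ys)
    s_x, s_y = stdev(xs), stdev(ys)
    return sum((x - x_bar) / s_x * (y - y_bar) / s_y
               for x, y in zip(xs, ys)) / (n - 1)

x = [1, 2, 3, 4, 5, 6, 7, 8]
y_original = [16, 14, 13, 11, 9, 8, 6, 4]  # hypothetical, nearly linear
y_moved = y_original[:-1] + [14]           # last point pulled off the trend

r_original = correlation(x, y_original)
r_moved = correlation(x, y_moved)
print(round(r_original, 2), round(r_moved, 2))
```

Moving a single point is enough to pull r noticeably toward 0, because that point shifts both the mean and the standard deviation of y.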

Adding two outliers decreases r from 0.95 to … Try it out for yourself on the companion book website.

Review examples
1) What is the explanatory variable? Describe the form, direction and strength of the relationship. Estimate r. (Axis label: in 1000's.) r = 0.94
2) If women always marry men 2 years older than themselves, what is the correlation of the ages between husband and wife? age man = age woman + 2 is the equation for a straight line, so r = 1.
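The second review example can be checked numerically. The wives' ages below are made up, and `correlation` is a hypothetical helper; any set of ages with some variation would give the same result.

```python
import math
from statistics import mean, stdev

def correlation(xs, ys):
    """Pearson r as the sum of z-score products divided by n - 1."""
    n = len(xs)
    x_bar, y_bar = mean(xs), mean(ys)
    s_x, s_y = stdev(xs), stdev(ys)
    return sum((x - x_bar) / s_x * (y - y_bar) / s_y
               for x, y in zip(xs, ys)) / (n - 1)

# Hypothetical ages of wives; each husband is exactly 2 years older
age_woman = [22, 25, 29, 31, 36, 42, 50]
age_man = [a + 2 for a in age_woman]

r = correlation(age_woman, age_man)
print(math.isclose(r, 1.0))
```

Adding a constant shifts every husband's age and the mean by the same amount, so the z-scores are unchanged and the points lie exactly on a line with positive slope.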

Thought quiz on correlation
1. Why is there no distinction between explanatory and response variable in correlation?
2. Why do both variables have to be quantitative?
3. How does changing the units of one variable affect a correlation?
4. What is the effect of outliers on correlations?
5. Why doesn't a tight fit to a horizontal line imply a strong correlation?