Chapter 7 -Part 1 Correlation. Correlation Topics zCo-relationship between two variables. zLinear vs Curvilinear relationships zPositive vs Negative relationships.

Slides:



Advertisements
Similar presentations
Bivariate Analyses.
Advertisements

Describing Relationships Using Correlation and Regression
Education 793 Class Notes Joint Distributions and Correlation 1 October 2003.
Overview Correlation Regression -Definition
Correlation & Regression Chapter 15. Correlation statistical technique that is used to measure and describe a relationship between two variables (X and.
Chapter 15 (Ch. 13 in 2nd Can.) Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression.
Chapter 5 Introduction to Inferential Statistics.
Chapter 5 Introduction to Inferential Statistics.
Simple Linear Regression
Correlation 2 Computations, and the best fitting line.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Basic Statistical Concepts Psych 231: Research Methods in Psychology.
The Simple Regression Model
Basic Statistical Concepts
Statistics Psych 231: Research Methods in Psychology.
Correlation 2 Computations, and the best fitting line.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
Chapter 7 -Part 1 Correlation. Correlation Topics zCorrelational research – what is it and how do you do “co-relational” research? zThe three questions:
SIMPLE LINEAR REGRESSION
Statistics for the Social Sciences Psychology 340 Fall 2006 Relationships between variables.
Chapter 5 Introduction to Inferential Statistics.
1 Simple Linear Regression Chapter Introduction In this chapter we examine the relationship among interval variables via a mathematical equation.
Chapter 7 -Part 1 Correlation. Correlation Topics zCorrelational research – what is it and how do you do “co-relational” research? zThe three questions:
REGRESSION AND CORRELATION
SIMPLE LINEAR REGRESSION
Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology.
Lecture 17: Correlations – Describing Relationships Between Two Variables 2011, 11, 22.
1 Chapter 17: Introduction to Regression. 2 Introduction to Linear Regression The Pearson correlation measures the degree to which a set of data points.
Correlation 10/30. Relationships Between Continuous Variables Some studies measure multiple variables – Any paired-sample experiment – Training & testing.
Correlation and Regression A BRIEF overview Correlation Coefficients l Continuous IV & DV l or dichotomous variables (code as 0-1) n mean interpreted.
SIMPLE LINEAR REGRESSION
Correlation Scatter Plots Correlation Coefficients Significance Test.
Correlation and regression 1: Correlation Coefficient
Covariance and correlation
Correlation.
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
Chapter 13 Statistics © 2008 Pearson Addison-Wesley. All rights reserved.
© 2008 Pearson Addison-Wesley. All rights reserved Chapter 1 Section 13-6 Regression and Correlation.
Stats/Methods I JEOPARDY. Jeopardy CorrelationRegressionZ-ScoresProbabilitySurprise $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
1.6 Linear Regression & the Correlation Coefficient.
Hypothesis of Association: Correlation
Introduction to Linear Regression
 Graph of a set of data points  Used to evaluate the correlation between two variables.
Correlation Correlation is used to measure strength of the relationship between two variables.
Introduction to Probability and Statistics Thirteenth Edition Chapter 12 Linear Regression and Correlation.
Psych 230 Psychological Measurement and Statistics Pedro Wolf September 23, 2009.
Objective: Understanding and using linear regression Answer the following questions: (c) If one house is larger in size than another, do you think it affects.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter 11 Correlation and Simple Linear Regression Statistics for Business (Econ) 1.
Describing Relationships Using Correlations. 2 More Statistical Notation Correlational analysis requires scores from two variables. X stands for the scores.
11/23/2015Slide 1 Using a combination of tables and plots from SPSS plus spreadsheets from Excel, we will show the linkage between correlation and linear.
Chapter Bivariate Data (x,y) data pairs Plotted with Scatter plots x = explanatory variable; y = response Bivariate Normal Distribution – for.
Creating a Residual Plot and Investigating the Correlation Coefficient.
Chapter 9: Correlation and Regression Analysis. Correlation Correlation is a numerical way to measure the strength and direction of a linear association.
Midterm Review Ch 7-8. Requests for Help by Chapter.
1 Data Analysis Linear Regression Data Analysis Linear Regression Ernesto A. Diaz Department of Mathematics Redwood High School.
Advanced Statistical Methods: Continuous Variables REVIEW Dr. Irina Tomescu-Dubrow.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Chapter 15: Correlation. Correlations: Measuring and Describing Relationships A correlation is a statistical method used to measure and describe the relationship.
Week 2 Normal Distributions, Scatter Plots, Regression and Random.
Chapter 12: Correlation and Linear Regression 1.
Chapter 13 Simple Linear Regression
Correlation.
Computations, and the best fitting line.
Correlation 10/27.
Correlation 10/27.
CHAPTER 10 Correlation and Regression (Objectives)
Chapter 2 Looking at Data— Relationships
Correlation and Regression
Presentation transcript:

Chapter 7 -Part 1 Correlation

Correlation Topics zCo-relationship between two variables. zLinear vs Curvilinear relationships zPositive vs Negative relationships zStrength of relationship

Mythical relationship between Baseball and Football performance Al Ben Chuck David Ed Frank George Baseball skill Very good Very poor Good Terrible Poor Average Excellent Football skill Very good Very poor Good Terrible Poor Average Excellent Is this a linear relationship? Baseball skill predicts football skill. Football skill predicts baseball skill. There is a strong relationship.

First we must arrange the scores in “order” Baseball skill Terrible Very Poor Poor Average Good Very Good Excellent Football skill Terrible Very Poor Poor Average Good Very Good Excellent David Ben Ed Frank Chuck Al George

Then we plot the scores * Ben * Ed * Frank * Chuck * Al * David * George Excellent Terrible Very Good Good Average Poor Very Poor ExcellentTerribleVery GoodGoodAveragePoorVery Poor Football Skill Baseball Skill This is definitely a linear relationship!

Let’s get more abstract? Excellent Terrible Very Good Good Average Poor Very Poor ExcellentTerribleVery GoodGoodAveragePoorVery Poor Football Skill Baseball Skill X Y

Linear or nonlinear? Let’s look at another set of values. Football skill Terrible Average Very Good Excellent Good Poor Baseball skill Terrible Very Poor Poor Average Good Very Good Excellent David Ben Ed Frank Chuck Al George Is this a linear relationship?

Is this linear? * Ben* Ed * Frank * Chuck * Al * David * George Excellent Terrible Very Good Good Average Poor Very Poor ExcellentTerribleVery GoodGoodAveragePoorVery Poor Football Skill Baseball Skill NO! It is best described by a curved line. It is a curvilinear relationship!

Positive vs Negative relationships zIn a positive relationship, as one value increases the other value tends to increase as well. Example: The longer a sailboat is, the more it tends to cost. As length goes up, price tends to go up. zIn a negative relationship, as one value increases, the other value decreases. Example: The older a sailboat is, the less it tends to cost. As years go up, price tends to go down.

Positive vs Negative scatterplot Negative relationship Positive relationship

Correlation Characteristics Linear vs Curvilinear

The strength of a relationship tells us approximately how the dots will fall around a best fitting line. zPerfect - scores fall exactly on a straight line. zStrong - most scores fall near the line. zModerate - some are near the line, some not. zWeak – lots of scores fall close to the line, but many fall quite far from it. zIndependent - the scores are not close to the line and form a circular or square pattern

Strength of a relationship Perfect

Strength of a relationship Strong

Strength of a relationship Moderate

Strength of a relationship Independent

What is this relationship?

What is this?

What is this?

What is this?

Comparing apples to oranges? Use t scores! zYou can use correlation to look for the relationship between ANY two values that you can measure of a single subject. zHowever, there may not be any relationship (independent). zA correlation tells us if scores are consistently similar on two measures, consistently different from each other, or have no real pattern

Comparing apples to oranges? Use t scores! zTo compare scores on two different variables, you transform them into t X and t Y scores. zt X and t Y scores can be directly compared to each other to see whether they are consistently similar, consistently quite different, or show no consistent pattern of similarity or difference

Similar t X and t Y scores = positive correlation. dissimilar = negative correlation. No pattern = independence. zWhen t scores are consistently more similar than different, we have a positive correlation. zWhen t scores are consistently more different than similar, we have a negative correlation. zWhen t scores show no consistent pattern of similarity or difference, we have independence.

Comparing variables zAnxiety symptoms, e.g., heartbeat, with number of hours driving to class. zHat size with drawing ability. zMath ability with verbal ability. zNumber of children with IQ. zTurn them all into t scores

Pearson’s Correlation Coefficient zcoefficient - noun, a number that serves as a measure of some property. zThe correlation coefficient indexes the consistency and direction of a correlation zPearson’s rho (  ) is the parameter that characterizes the strength and direction of a linear relationship (and only a linear relationship) between two population variables. zPearsons r is a least squares, unbiased estimate of rho.

Pearson’s Correlation Coefficient zr and rho vary from to zA negative value indicates a negative relationship; a positive value indicates a positive relationship. zValues of r close to or indicate a strong (consistent) relationship; values close to indicate a weak (inconsistent) or independent relationship.

r, strength and direction Perfect, positive+1.00 Strong, positive+.75 Moderate, positive+.50 Weak, positive+.25 Independent.00 Weak, negative -.25 Moderate, negative -.50 Strong, negative -.75 Perfect, negative -1.00

Calculating Pearson’s r zSelect a random sample from a population; obtain scores on two variables, which we will call X and Y. zConvert all the scores into t scores.

Calculating Pearson’s r zFirst, subtract the t Y score from the t X score in each pair. zThen square all of the differences and add them up, that is,  (t X - t Y ) 2.

Calculating Pearson’s r zEstimate the average squared distance between Z X and Z Y by dividing by the sum of squared differences by(n P - 1), that is,  ( t X - t Y ) 2 / (n P - 1) zTo turn this estimate into Pearson’s r, use the formula r =1 - (1/2  ( t X - t Y ) 2 / (n P - 1))

Note seeming exception zUsually we divide a sum of squared deviations around a mean by df to estimate the variance. zHere the sum of squares is not around a mean and we are not estimating a variance. zSo you divide  ( t X - t Y ) 2 by (n P - 1) zn P - 1 is not df for corr & regression (df REG = n P - 2)

Example: Calculate t scores for X DATA  X=30 N= 5 X=6.00 MS W = 40.00/(5-1) = 10 s X = 3.16 (X - X) X - X t x =(X-X)/ s SS W = 40.00

Calculate t scores for Y DATA  Y=55 N= 5 Y=11.00 MS W = 10.00/(5-1) = 2.50 s Y = 1.58 (Y - Y) Y - Y (t y =Y - Y) / s SS W = 10.00

Calculate r t Y t X t X - t Y (t X - t Y )  (t X - t Y ) 2 / (n P - 1)=0.200 r = (1/2 * (  (t X - t Y ) 2 / (n P - 1))) r = (1/2 *.200) = =.900  (t X - t Y ) 2 =0.80 This is a very strong, positive relationship.