Presentation is loading. Please wait.

Presentation is loading. Please wait.

PSY 307 – Statistics for the Behavioral Sciences

Similar presentations


Presentation on theme: "PSY 307 – Statistics for the Behavioral Sciences"— Presentation transcript:

1 PSY 307 – Statistics for the Behavioral Sciences
Chapter 6 – Correlation

2 Midterm Results Top score = 45 Top score for curve = 45 40-53 A 7
36-39 B 4 31-35 C 2 27-30 D 8 0-26 F 3 24

3 Aleks/Holcomb Hint

4 To Find the Cutoff Scores
If you know the mean and standard deviation, you can find what x values cut off certain percentages. Solve for k then multiply the k value by the SD and add/subtract that number from the mean to get the cutoff scores.

5 Does Aleks Quiz 1 Predict Midterm Scores?

6 Adding a Prediction (Regression) Line Provides More Information

7 Does Time Spent on Aleks Predict Quiz Grades?

8 Sometimes the Relationship is Not Linear
r = .47 (quadratic)

9 Lying With Statistics This is the graph as published in a Wall Street Journal editorial (7/13), where they claimed that reducing corporate taxes results in greater revenue. Treating Norway as an outlier, the data instead shows that as taxes increase, so do revenues – the opposite conclusion. Which is right? The correct graph is the one with the best fit – where most of the data points are close to the line drawn (right).

10 Describing Relationships
Positive relationship – high values tend to go with high values, low with low. Negative relationship – high values tend to go with low values, low with high. No relationship – no regularity appears between pairs of scores in two distributions.

11 Relationship Does Not Imply Causality
A relationship can exist without being a CAUSAL relationship. Correlation does not imply causation. Third variable problem -- a third variable is causing both of the variables you are measuring to change – e.g., popsicles & drowning. The direction of causality cannot be determined from the r statistic.

12 Chocolate and Nobel Prizes

13 Scatterplots One variable is measured on the x-axis, the other on the y-axis. Positive relationship – a cluster of dots sloping upward from the lower left to the upper right. Negative relationship – a cluster of dots sloping down from upper left to lower right. No relationship – no apparent slope.

14 Example Positive Correlations

15 Example Negative Correlations
Note that the line slopes in the opposite direction, from upper left to lower right. r=-.54

16 Strength of Relationship
The more closely the dots approximate a straight line, the stronger the relationship. A perfect relationship forms a straight line. Dots forming a line reflect a linear relationship. Dots forming a curved or bent line reflect a curvilinear relationship.

17 More Examples

18 Correlation Coefficient
Pearson’s r –a measure of how well a straight line describes the cluster of dots in a plot. Ranges from -1 to 1. The sign indicates a positive or negative relationship. The value of r indicates strength of relationship. Pearson’s r is independent of units of measure.

19 Interpreting Pearson’s r
The value of r needed to assert a strong relationship depends on: The size of n What is being measured. Pearson’s r is NOT the percent or proportion of a perfect relationship. Correlation is not causation. Experimentation is used to confirm a suspected causal relationship.

20 Calculating Pearson’s r
S zxzy r = _______ n – 1 This formula is most useful when the scores are already z-scores. Computational formulas – use whichever is most convenient for the data at hand.

21 Sum of the Products (SP)

22 Computational Formulas

23 Outliers An outlier that is near where the regression line might normally go, increases the r value. r=.457 r=.336 An outlier away from the regression line decreases the r value.

24 Dealing with Outliers Outliers can dramatically change the value of the r correlation coefficient. Always produce a scatterplot and inspect for outliers before calculating r. Sometimes outliers can be omitted. Sometimes r cannot be used.

25 Other Correlation Coefficients
Spearman’s rho (r) – based on ranks rather than values. Used with ordinal data (qualitative data that can be ordered least to most). Point biserial correlation -- correlations between quantitative data and two coded categories. Cramer’s phi – correlation between two ordered qualitative categories.


Download ppt "PSY 307 – Statistics for the Behavioral Sciences"

Similar presentations


Ads by Google