Introduction to Statistics Introduction to Statistics Correlation Chapter 15 Apr 29-May 4, 2010 Classes #28-29.

Slides:



Advertisements
Similar presentations
Association Between Variables Measured at the Interval-Ratio Level
Advertisements

Lesson 10: Linear Regression and Correlation
Inference for Linear Regression (C27 BVD). * If we believe two variables may have a linear relationship, we may find a linear regression line to model.
13- 1 Chapter Thirteen McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Correlation and Regression
Describing Relationships Using Correlation and Regression
Correlation & Regression Chapter 15. Correlation statistical technique that is used to measure and describe a relationship between two variables (X and.
Chapter 15 (Ch. 13 in 2nd Can.) Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression.
CORRELATON & REGRESSION
PPA 501 – Analytical Methods in Administration Lecture 8 – Linear Regression and Correlation.
PPA 415 – Research Methods in Public Administration
Linear Regression and Correlation
SIMPLE LINEAR REGRESSION
Correlation and Regression. Correlation What type of relationship exists between the two variables and is the correlation significant? x y Cigarettes.
CORRELATION COEFFICIENTS What Does a Correlation Coefficient Indicate? What is a Scatterplot? Correlation Coefficients What Could a Low r mean? What is.
10-2 Correlation A correlation exists between two variables when the values of one are somehow associated with the values of the other in some way. A.
SIMPLE LINEAR REGRESSION
Correlation and Regression Analysis
Aim: How do we calculate and interpret correlation coefficients with SPSS? SPSS Assignment Due Friday 2/12/10.
Week 9: Chapter 15, 17 (and 16) Association Between Variables Measured at the Interval-Ratio Level The Procedure in Steps.
Chapter 9 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 What is a Perfect Positive Linear Correlation? –It occurs when everyone has the.
Correlation and Regression Quantitative Methods in HPELS 440:210.
Chapter 14 in 1e Ch. 12 in 2/3 Can. Ed. Association Between Variables Measured at the Ordinal Level Using the Statistic Gamma and Conducting a Z-test for.
Correlation and Linear Regression
Correlation and Linear Regression
Correlation and Linear Regression Chapter 13 Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
STATISTICS ELEMENTARY C.M. Pascual
Chapter Ten Introduction to Hypothesis Testing. Copyright © Houghton Mifflin Company. All rights reserved.Chapter New Statistical Notation The.
PRED 354 TEACH. PROBILITY & STATIS. FOR PRIMARY MATH Lesson 14 Correlation & Regression.
Chapter 12 Correlation and Regression Part III: Additional Hypothesis Tests Renee R. Ha, Ph.D. James C. Ha, Ph.D Integrative Statistics for the Social.
SIMPLE LINEAR REGRESSION
Week 12 Chapter 13 – Association between variables measured at the ordinal level & Chapter 14: Association Between Variables Measured at the Interval-Ratio.
Linear Regression and Correlation
Correlation and Regression
Correlation.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Chapter 15 Correlation and Regression
1 Chapter 9. Section 9-1 and 9-2. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Anthony Greene1 Correlation The Association Between Variables.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Hypothesis of Association: Correlation
Figure 15-3 (p. 512) Examples of positive and negative relationships. (a) Beer sales are positively related to temperature. (b) Coffee sales are negatively.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Elementary Statistics Correlation and Regression.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 15: Correlation and Regression Part 2: Hypothesis Testing and Aspects of a Relationship.
Psych 230 Psychological Measurement and Statistics Pedro Wolf September 23, 2009.
Correlation & Regression Chapter 15. Correlation It is a statistical technique that is used to measure and describe a relationship between two variables.
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter 16 Data Analysis: Testing for Associations.
Correlation & Regression Correlation does not specify which variable is the IV & which is the DV.  Simply states that two variables are correlated. Hr:There.
The Statistical Imagination Chapter 15. Correlation and Regression Part 2: Hypothesis Testing and Aspects of a Relationship.
Chapter Bivariate Data (x,y) data pairs Plotted with Scatter plots x = explanatory variable; y = response Bivariate Normal Distribution – for.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 12 Testing for Relationships Tests of linear relationships –Correlation 2 continuous.
Introduction to Statistics Introduction to Statistics Correlation Chapter 15 April 23-28, 2009 Classes #27-28.
Chapter 7 Calculation of Pearson Coefficient of Correlation, r and testing its significance.
Linear Regression and Correlation Chapter GOALS 1. Understand and interpret the terms dependent and independent variable. 2. Calculate and interpret.
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Correlation. u Definition u Formula Positive Correlation r =
SOCW 671 #11 Correlation and Regression. Uses of Correlation To study the strength of a relationship To study the direction of a relationship Scattergrams.
Chapter Eleven Performing the One-Sample t-Test and Testing Correlation.
1 MVS 250: V. Katch S TATISTICS Chapter 5 Correlation/Regression.
Slide 1 © 2002 McGraw-Hill Australia, PPTs t/a Introductory Mathematics & Statistics for Business 4e by John S. Croucher 1 n Learning Objectives –Understand.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Linear Regression and Correlation Chapter 13.
Chapter 15 Association Between Variables Measured at the Interval-Ratio Level.
Solutions to Healey 2/3e #13.1 (1e #15.1) Turnout by Unemployment and by Negative Campaigning.
Chapter Thirteen McGraw-Hill/Irwin
Presentation transcript:

Introduction to Statistics Introduction to Statistics Correlation Chapter 15 Apr 29-May 4, 2010 Classes #28-29

Correlation Chapter 15: –Correlation pp –Not responsible for remainder of the chapter

Correlation A statistical technique that is used to measure and describe a relationship between two variables –For example: GPA and TD’s scored Statistics exam scores and amount of time spent studying

Notation A correlation requires two scores for each individual –One score from each of the two variables –They are normally identified as X and Y

Three characteristics of X and Y are being measured… The direction of the relationship –Positive or negative The form of the relationship –Usually linear form The strength or consistency of the relationship –Perfect correlation = 1.00; no consistency would be 0.00 –Therefore, a correlation measures the degree of relationship between two variables on a scale from 0.00 to 1.00.

Assumptions There are 3 main assumptions… –1. The dependent and independent are normally distributed. We can test this by looking at the histograms for the two variables –2. The relationship between X and Y is linear. We can check this by looking at the scattergram –3. The relationship is homoscedastic. We can test homoscedasticity by looking at the scattergram and observing that the data points form a “roughly symmetrical, cigar-shaped pattern” about the regression line. If the above 3 assumptions have been met, then we can use correlation and test r for significance

Pearson r The most commonly used correlation Measures the degree of straight-line relationship Computation: r = SP / (SS X )(SS Y )

Example 1 A researcher predicts that there is a high correlation between scores on the stats final exam (100 pts max) and scores on the university’s exit exam for graduating seniors (330 pts max)

Example 1 X Y X ,444 2,704 8,100 9,025 22,173 Y 2 25,600 32,400 44,100 57, ,100 XY 4,800 6,840 9,360 18,900 22,800 62,700 (  X) (  X 2 ) (  Y) (  Y 2 ) (  XY)

Example 1 SS X SS X = X2 X2 X2 X2 - (  X) 2 (  X) 2 = 22, = n 5 = 22, /5 = 22, ,605 = 3,568 SS Y = Y2 Y2 - (  Y) 2 = 192, = n 5 = 192, ,900/5 = 192, ,180 = 3,920

Example 1 SP =  XY  XY - (  X)(  Y) (  X)(  Y) = n 62,700 - (305)(970) 5 = 62, ,850/5 = 62, ,170 = 3,530

Example 1 r = SP / (SS X )(SS Y ) = 3,530 / (3,568)(3,920) = 3,530 / 13,986,560 = 3,530 / 3, =.944

Pearson Correlation: “Rule of Thumb” If r = 1.00 Perfect Correlation If r = 1.00 Perfect Correlation +.70 to +.99 Very strong positive relationship +.40 to +.69 Strong positive relationship +.30 to +.39 Moderate positive relationship +.20 to +.29 Weak positive relationship +.01 to +.19 No or negligible relationship -.01 to -.19 No or negligible relationship -.20 to -.29 Weak negative relationship -.30 to -.39 Moderate negative relationship -.40 to -.69 Strong negative relationship -.70 or higher Very strong negative relationship

Example 1: Interpretation An r of indicates an extremely strong relationship between scores on the stats final exam and scores on the exit exam. As scores on the stats final go up so too do scores on the exit exam. An r of indicates an extremely strong relationship between scores on the stats final exam and scores on the exit exam. As scores on the stats final go up so too do scores on the exit exam. –But we are not finished with the interpretation  See next slide 

Interpretation (Continued) Coefficient of Determination (r 2 ) The value r 2 is called the coefficient of determination because it measures the proportion in variability in one variable that can be determined from the relationship with the other variable The value r 2 is called the coefficient of determination because it measures the proportion in variability in one variable that can be determined from the relationship with the other variable –For example:  A correlation of r =.944 means that r 2 =.891 (or 89.1%) of the variability in the Y scores can be predicted from the relationship with the X scores

Coefficient of Determination (r 2 ) and Interpret: The coefficient of determination is r 2 =.891. Scores on the stats final exam, by itself, accounts for 89.1% of the variation of the exit exam scores.

Example 2 A researcher predicts that there is a high correlation between years of education and voter turnout –She chooses Alamosa, Boston, Chicago, Detroit, and NYC to test her theory

Example 2 The scores on each variable are displayed in table format: –Y = % Turnout –X = Years of Education CityXY Alamosa Boston Chicago Detroit NYC13.070

Scatterplot The relationship between X and Y is linear.

Make a Computational Table XY X2X2X2X2 Y2Y2Y2Y2XY ∑ X = 62.5 ∑Y = 318 ∑ X 2 = ∑Y 2 = ∑XY =

Example 2 SS X SS X = X2 X2 X2 X2 - (  X) 2 (  X) 2 = = n 5 = /5 = – = 0.9 SS Y = Y2 Y2 - (  Y) 2 = = n 5 = /5 = – =

Example 2 SP =  XY  XY - (  X)(  Y) (  X)(  Y) = n (62.5)(318) 5 = /5 = – = 11.40

Example 2: Find Pearson r r= SP / (SS X )(SS Y ) = 11.4 / (0.9)(149.2) = 11.4 / = 11.4/ =.984

Example 2: Interpretation An r of indicates an extremely strong relationship between years of education and voter turnout for these five cities. As level of education increases, % turnout increases. An r of indicates an extremely strong relationship between years of education and voter turnout for these five cities. As level of education increases, % turnout increases. –But we are not finished with the interpretation  See next slide 

Coefficient of Determination (r 2 ) and Interpret: The coefficient of determination is r 2 =.968. Education, by itself, accounts for 96.8% of the variation in voter turnout.

Pearson’s r Had the relationship between % college educated and turnout, r =.32. –This relationship would have been positive and weak to moderate. Had the relationship between % college educated and turnout, r = –This relationship would have been negative and weak.

Hypothesis Testing with Pearson We can have a two-tailed hypothesis: H o : ρ = 0.0 H 1 : ρ ≠ 0.0 We can have a one-tailed hypothesis: H o : ρ = 0.0 H 1 : ρ 0.0) Note that ρ (rho) is the population parameter, while r is the sample statistic

Find r critical See Table B.6 (page 537) –You need to know the alpha level –You need to know the sample size –See that we always will use: df = n-2

Find r calculated See previous slides for formulas

Make you decision… r calculated < r critical then Retain H 0 r calculated > r critical then Reject H 0

Always include a brief summary of your results: Was it positive or negative? Was it significant ? Explain the correlation Explain the variation –Coefficient of Determination (r 2 )

Credits Example using Healey P. 418 Problem 15.1