Correlation Analysis

Correlation Analysis: Introduction Management questions frequently revolve around the study of relationships between two or more variables, so a relational hypothesis is necessary. Correlation analysis serves several objectives: the strength, direction, shape and other features of the relationship may be discovered. With correlation, one calculates an index that measures the nature of the relationship between variables.

Bivariate Correlation Analysis (BCA) The Pearson (product moment) correlation coefficient varies over a range of +1 to -1. The designation r symbolizes the coefficient’s estimate of linear association based on sample data. The coefficient ρ (rho) represents the population correlation.

BCA contd. Correlation coefficients reveal the magnitude and direction of relationships. The magnitude is the degree to which variables move in unison or in opposition. The size of a correlation of +.40 is the same as one of -.40; the sign says nothing about size. In both cases the degree of correlation is modest.

BCA contd. Direction tells us whether two variables move in the same direction or in opposite directions. When variables move in the same direction, they have a positive relationship: as one increases, the other also increases. Family income, for example, is positively related to household food expenditure. As income increases, food expenditures increase.

BCA contd. Some variables move in opposite directions, e.g., the prices of products or services and the demand for them. These variables are inversely related. The absence of a relationship is expressed by a coefficient of approximately zero.
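The three cases described so far (positive, inverse, and no relationship) can be illustrated with a small sketch. The function name pearson_r and all of the figures below are invented for illustration; they are not taken from the presentation.

```python
import math

def pearson_r(xs, ys):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length (at least 2 values)")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-products and sums of squares of the deviations from the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return sxy / math.sqrt(sxx * syy)

# Hypothetical data, made up for this sketch only.
income = [20, 30, 40, 50, 60]
food_spend = [5, 7, 8, 11, 12]    # rises with income
price = [1, 2, 3, 4, 5]
demand = [90, 80, 65, 50, 45]     # falls as price rises
unrelated = [2, 1, 0, 1, 2]       # no linear trend against price

r_pos = pearson_r(income, food_spend)   # close to +1
r_neg = pearson_r(price, demand)        # close to -1
r_zero = pearson_r(price, unrelated)    # approximately 0

print(f"positive: {r_pos:+.2f}  inverse: {r_neg:+.2f}  none: {r_zero:+.2f}")
```

Note that the last series has a clear U shape even though its coefficient is about zero: r only captures linear association, which is one reason to look at a plot as well.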

Scatterplots for Exploring Relationships Scatterplots are essential for understanding the relationships between variables. They provide a means of visually inspecting data that a list of values for two variables cannot. Both the direction and the shape of a relationship are conveyed in a plot. The magnitude of the relationship can also be seen.

Simple Bivariate (i.e., two-variable) plot:

Correlation Matrix The correlation is one of the most common and most useful statistics. A correlation is a single number that describes the degree of relationship between two variables.
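When more than two variables are involved, the pairwise coefficients are usually arranged in a correlation matrix. The following is a minimal sketch of that idea; the helper names and the three small columns of data are hypothetical and serve only to show the layout.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation computed from deviations about the means."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

def correlation_matrix(columns):
    """Return a nested dict: matrix[a][b] is the correlation of columns a and b."""
    names = list(columns)
    return {a: {b: pearson_r(columns[a], columns[b]) for b in names} for a in names}

# Hypothetical data, invented for this sketch.
data = {
    "height": [68, 71, 62, 75, 58],
    "self_esteem": [4.1, 4.6, 3.8, 4.4, 3.2],
    "age": [30, 25, 41, 36, 28],
}

matrix = correlation_matrix(data)
for row in data:
    print(row.ljust(12), "  ".join(f"{matrix[row][col]:+.2f}" for col in data))
```

The matrix is symmetric and has 1.0 on its diagonal, since every variable correlates perfectly with itself.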

Correlation Example Let's assume that we want to look at the relationship between two variables, height (in inches) and self esteem. Our hypothesis is that height affects one's self esteem. (We are not considering the direction of causality here; it's not likely that self esteem causes one’s height.) Data on the height and self esteem of twenty individuals are collected. We know that the average height differs for males and females, so, to keep this example simple, it uses males only. Height is measured in inches. Self esteem is measured as the average of ten 1-to-5 rating items (where higher scores mean higher self esteem). Here's the data for the 20 cases: Table 1 in WORD Format

Calculating the Correlation:
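A standard computational form of the Pearson coefficient, written in terms of the sums used in this example (N pairs, with X = height and Y = self esteem), is given here. It is assumed, not confirmed, to be the form shown on the original slide.

```latex
r = \frac{N\sum XY - \left(\sum X\right)\left(\sum Y\right)}
         {\sqrt{\left[N\sum X^{2} - \left(\sum X\right)^{2}\right]
                \left[N\sum Y^{2} - \left(\sum Y\right)^{2}\right]}}
```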

Example contd. We use the symbol r to stand for the correlation. The value of r will always be between -1.0 and +1.0. If the correlation is negative, we have a negative relationship; if it's positive, the relationship is positive. The calculation uses N = 20, ∑X = 1308 and ∑Y = 75.1, together with ∑XY, ∑X² and ∑Y² computed from the same table.

Example contd. Plugging these values into the formula given above, we get the value of r: r = 0.73. So the correlation for our twenty cases (.73) shows a fairly strong positive relationship. It seems that there is a relationship between height and self esteem, at least in this made-up data!
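The "plug the sums into the formula" step can be sketched in a few lines. This assumes the usual computational form of Pearson's r based on N, ∑X, ∑Y, ∑XY, ∑X² and ∑Y². The function name and the five data pairs are hypothetical; they are not the twenty cases from Table 1, so the result is not expected to equal 0.73.

```python
import math

def pearson_r_from_sums(n, sum_x, sum_y, sum_xy, sum_x2, sum_y2):
    """Pearson's r from the six summary quantities of the computational formula."""
    numerator = n * sum_xy - sum_x * sum_y
    denominator = math.sqrt(
        (n * sum_x2 - sum_x ** 2) * (n * sum_y2 - sum_y ** 2)
    )
    return numerator / denominator

# Hypothetical heights (inches) and self-esteem scores, invented for this sketch.
heights = [68, 71, 62, 75, 58]
esteem = [4.1, 4.6, 3.8, 4.4, 3.2]

n = len(heights)
sum_x = sum(heights)
sum_y = sum(esteem)
sum_xy = sum(x * y for x, y in zip(heights, esteem))
sum_x2 = sum(x * x for x in heights)
sum_y2 = sum(y * y for y in esteem)

r = pearson_r_from_sums(n, sum_x, sum_y, sum_xy, sum_x2, sum_y2)
print(f"r = {r:.2f}")
```

Working from the sums is convenient for hand calculation, although for large values it can lose more floating-point precision than a version based on deviations from the means.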

Testing the Significance of a Correlation After having computed a correlation, we can determine the probability that the observed correlation occurred by chance. This can be done by conducting a significance test. Most often we are interested in determining the probability that the correlation is a real one and not a chance occurrence. In this case, we are testing two mutually exclusive hypotheses:

Testing the Significance Null Hypothesis: r = 0. Alternative Hypothesis: r ≠ 0. The easiest way to test this hypothesis is to find a statistics book that has a table of critical values of r. Most introductory statistics texts have a table like this. As in all hypothesis testing, we first need to determine the significance level.

Testing the Significance contd. Here, we will use the common significance level of alpha = .05. This means that we are conducting a test where the odds that the correlation is a chance occurrence are no more than 5 out of 100. Second, before we look up the critical value in a table, we also have to compute the degrees of freedom (df). The df is simply equal to N - 2 or, in this example, 20 - 2 = 18.

Testing the Significance contd. Finally, we have to decide whether we are doing a one-tailed or a two-tailed test. In this example, since we have no strong prior theory to suggest whether the relationship between height and self esteem would be positive or negative, we will opt for the two-tailed test.

Testing the Significance contd. With the following three pieces of information: the significance level (alpha = .05), the degrees of freedom (df = 18), and the type of test (two-tailed), we can now test the significance of the correlation we have found. The critical value in the statistics book is .4438.
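Tables of critical values of r are derived from the t distribution, using the relation r = t / sqrt(t² + df). The sketch below reproduces the tabled value from the two-tailed critical t for df = 18 at alpha = .05. That t value (about 2.101) is taken from a standard t table, not from the slides, and the function name is hypothetical.

```python
import math

def critical_r(t_critical, df):
    """Convert a critical t value into the matching critical correlation."""
    return t_critical / math.sqrt(t_critical ** 2 + df)

n = 20
df = n - 2  # degrees of freedom for a correlation: N - 2

# Assumption: two-tailed critical t for df = 18 at alpha = .05, from a standard t table.
T_CRIT_TWO_TAILED = 2.101

r_crit = critical_r(T_CRIT_TWO_TAILED, df)
print(f"df = {df}, critical r = {r_crit:.4f}")
```

With these inputs the result rounds to .4438, matching the value read from the table.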

Testing the Significance contd. This means that if the computed correlation is greater than .4438 or less than -.4438 (remember, this is a two-tailed test), we can conclude that the odds are less than 5 out of 100 that this is a chance occurrence. Since our computed correlation of 0.73 is considerably higher, we conclude that it is not a chance finding and that the correlation is "statistically significant" (given the parameters of the test). We can reject the null hypothesis and accept the alternative.
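The decision rule can be written out as a short sketch. It compares |r| with the tabled critical value and, as a cross-check, computes the equivalent t statistic, t = r · sqrt(df / (1 − r²)). The function names are hypothetical. The values r = 0.73, N = 20 and .4438 come from the example above.

```python
import math

def t_statistic(r, n):
    """t statistic for testing a correlation against zero, with df = n - 2."""
    df = n - 2
    return r * math.sqrt(df / (1 - r ** 2))

def is_significant(r, critical_value):
    """Two-tailed decision: significant if r lies beyond +/- the critical value."""
    return abs(r) > critical_value

r = 0.73           # computed correlation from the example
n = 20             # number of cases
critical = 0.4438  # tabled critical r for df = 18, alpha = .05, two-tailed

t = t_statistic(r, n)
decision = is_significant(r, critical)

print(f"t = {t:.2f}, reject the null hypothesis: {decision}")
```

Comparing t with the critical t for df = 18 leads to the same decision as comparing r with the critical r, because the two tests are algebraically equivalent.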