Covariance and Correlation


Covariance and Correlation
Questions: What does it mean to say that two variables are associated with one another? How can we mathematically formalize the concept of association?

Limitation of covariance
One limitation of the covariance is that its size depends on the variability of the variables. As a consequence, it can be difficult to evaluate the magnitude of the covariation between two variables. If the amount of variability is small, then the highest possible value of the covariance will also be small. If there is a large amount of variability, the maximum covariance can be large.
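To make the scale dependence concrete, here is a minimal sketch. The scores are hypothetical (made up for illustration), and the population formula (dividing by N) is assumed.

```python
def mean(values):
    return sum(values) / len(values)


def covariance(xs, ys):
    """Population covariance: the average product of deviations from the means."""
    mx, my = mean(xs), mean(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / len(xs)


# Hypothetical scores, invented for this illustration.
x = [1, 2, 3, 4, 5]
y = [2, 1, 4, 3, 5]

cov_small = covariance(x, y)

# Same pattern of association, but every score is 10 times more spread out.
cov_large = covariance([10 * v for v in x], [10 * v for v in y])

print(cov_small, cov_large)
```

The association between the two variables is the same in both cases, yet the second covariance is 100 times larger, which is why the raw covariance is hard to interpret on its own.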

Limitations of covariance
Ideally, we would like to evaluate the magnitude of the covariance relative to the maximum possible covariance. How can we determine the maximum possible covariance?

Go vary with yourself
Let’s first note that, of all the variables a variable may covary with, it will covary with itself most strongly. In fact, the “covariance of a variable with itself” is an alternative way to define variance:
$$\operatorname{cov}(X, X) = \frac{1}{N}\sum_{i=1}^{N}(X_i - \bar{X})(X_i - \bar{X}) = s_X^2$$

Go vary with yourself
Thus, if we were to divide the covariance of a variable with itself by the variance of the variable, we would obtain a value of 1:
$$\frac{\operatorname{cov}(X, X)}{s_X \times s_X} = 1$$
This will give us a standard for evaluating the magnitude of the covariance. Note: I’ve written the variance of X as $s_X \times s_X$ because the variance is the SD squared.
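A quick numerical check of this idea, as a sketch with hypothetical scores. The population formulas (dividing by N) are assumed, which is why the standard library's `pvariance` and `pstdev` are used rather than their sample counterparts.

```python
import math
import statistics


def covariance(xs, ys):
    """Population covariance: the average product of deviations from the means."""
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / len(xs)


# Hypothetical scores, invented for this illustration.
x = [1, 2, 3, 4, 5]

cov_xx = covariance(x, x)        # covariance of x with itself
var_x = statistics.pvariance(x)  # population variance of x
sd_x = statistics.pstdev(x)      # population SD of x

# Dividing by SD * SD (the variance) gives the standard value of 1.
ratio = cov_xx / (sd_x * sd_x)

print(math.isclose(cov_xx, var_x), math.isclose(ratio, 1.0))
```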

Go vary with yourself
However, we are interested in evaluating the covariance of a variable with another variable (not with itself), so we must derive a maximum possible covariance for these situations too. By extension, the covariance between two variables cannot be any greater than the product of the SDs for the two variables. Thus, if we divide by $s_X s_Y$, we can evaluate the magnitude of the covariance relative to 1:
$$\frac{\operatorname{cov}(X, Y)}{s_X s_Y}$$
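One way to see why the product of the SDs is the ceiling is the Cauchy-Schwarz inequality. This is a sketch of a standard argument that the slides themselves do not spell out; population formulas with N in the denominator are assumed.

```latex
\begin{align*}
\Bigl(\sum_{i=1}^{N} (X_i - \bar{X})(Y_i - \bar{Y})\Bigr)^2
  &\le \sum_{i=1}^{N} (X_i - \bar{X})^2 \, \sum_{i=1}^{N} (Y_i - \bar{Y})^2
  && \text{(Cauchy--Schwarz)} \\
\operatorname{cov}(X, Y)^2 &\le s_X^2 \, s_Y^2
  && \text{(divide both sides by } N^2\text{)} \\
\lvert \operatorname{cov}(X, Y) \rvert &\le s_X \, s_Y
  && \text{(take square roots)}
\end{align*}
```

Equality holds only when the deviations of Y are an exact multiple of the deviations of X, that is, when the points fall on a straight line.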

Spine-tingling moment
Important: What we’ve done is take the covariance and “standardize” it. The standardized value will never be greater than 1 (or smaller than −1). The larger the absolute value of this index, the stronger the association between two variables.

Spine-tingling moment
When expressed this way, the covariance is called a correlation. The correlation is defined as a standardized covariance:
$$r = \frac{\operatorname{cov}(X, Y)}{s_X s_Y}$$

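Correlation
The correlation can also be defined as the average product of z-scores, because the two equations are identical:
$$r = \frac{\operatorname{cov}(X, Y)}{s_X s_Y} = \frac{1}{N}\sum_{i=1}^{N} z_{X,i}\, z_{Y,i}, \qquad z_{X,i} = \frac{X_i - \bar{X}}{s_X}, \quad z_{Y,i} = \frac{Y_i - \bar{Y}}{s_Y}$$
The correlation, r, is a quantitative index of the association between two variables. When the average of the z-score products is positive, there is a positive correlation; when it is negative, a negative correlation.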
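The equivalence of the two definitions can be checked numerically. The following is a minimal sketch with hypothetical scores; the function names are illustrative, and population formulas (dividing by N) are assumed throughout.

```python
import math


def mean(values):
    return sum(values) / len(values)


def covariance(xs, ys):
    """Population covariance: the average product of deviations from the means."""
    mx, my = mean(xs), mean(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / len(xs)


def sd(values):
    """Population SD: the square root of the covariance of a variable with itself."""
    return math.sqrt(covariance(values, values))


def z_scores(values):
    m, s = mean(values), sd(values)
    return [(v - m) / s for v in values]


# Hypothetical scores, invented for this illustration.
x = [1, 2, 3, 4, 5]
y = [2, 1, 4, 3, 5]

# Definition 1: the covariance divided by the product of the SDs.
r_standardized = covariance(x, y) / (sd(x) * sd(y))

# Definition 2: the average product of z-scores.
r_z = mean([zx * zy for zx, zy in zip(z_scores(x), z_scores(y))])

print(r_standardized, r_z)
```

Both routes give the same value (0.8 for these scores, up to floating-point rounding), because dividing each deviation by its SD before averaging is the same as dividing the averaged product by both SDs afterwards.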

The mean of each variable is zero. A, D, and B are above the mean on both variables. E and C are below the mean on both variables. F is above the mean on x, but below the mean on y.

+ × + = +;  − × + = −;  + × − = −;  − × − = +


Correlation
The value of r can range between −1 and +1. If r = 0, then there is no linear association between the two variables. If r = +1 (or −1), then there is a perfect positive (or negative) linear relationship between the two variables.
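The three landmark values can be reproduced with small hypothetical data sets, as in the sketch below (population formulas assumed; the helper names are illustrative). The last case is a reminder that r indexes linear association: y is completely determined by x there, yet r is 0.

```python
import math


def mean(values):
    return sum(values) / len(values)


def covariance(xs, ys):
    """Population covariance: the average product of deviations from the means."""
    mx, my = mean(xs), mean(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / len(xs)


def correlation(xs, ys):
    """Covariance divided by the product of the two population SDs."""
    sd_x = math.sqrt(covariance(xs, xs))
    sd_y = math.sqrt(covariance(ys, ys))
    return covariance(xs, ys) / (sd_x * sd_y)


# Hypothetical scores, invented for this illustration.
x = [-2, -1, 0, 1, 2]

r_pos = correlation(x, [2 * v + 3 for v in x])  # points on a rising line
r_neg = correlation(x, [-v for v in x])         # points on a falling line
r_zero = correlation(x, [v * v for v in x])     # a symmetric curve, not a line

print(r_pos, r_neg, r_zero)
```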

[Figure: three panels labeled r = +1, r = 0, and r = −1]

Correlation
The absolute size of the correlation corresponds to the magnitude or strength of the relationship. When a correlation is strong (e.g., r = .90), people above the mean on x are substantially more likely to be above the mean on y than they would be if the correlation were weak (e.g., r = .10).
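A small simulation can illustrate this claim. It is only a sketch: it assumes the two variables are standard normal z-scores with correlation r (an assumption the slides do not make), and the function name, sample size, and seed are arbitrary choices.

```python
import math
import random


def share_above_mean_on_y(r, n=50_000, seed=0):
    """Among simulated people above the mean on x, return the share who are
    also above the mean on y. Assumes bivariate normal z-scores with correlation r."""
    rng = random.Random(seed)
    above_x = 0
    above_both = 0
    for _ in range(n):
        zx = rng.gauss(0, 1)
        # Build zy so that it has unit variance and correlates r with zx.
        zy = r * zx + math.sqrt(1 - r * r) * rng.gauss(0, 1)
        if zx > 0:
            above_x += 1
            if zy > 0:
                above_both += 1
    return above_both / above_x


strong = share_above_mean_on_y(0.90)
weak = share_above_mean_on_y(0.10)

print(round(strong, 2), round(weak, 2))
```

Under the normality assumption the theoretical share is 1/2 + arcsin(r)/π, which is roughly 0.86 for r = .90 and roughly 0.53 for r = .10, so the simulated shares should land near those values.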

[Figure: three panels labeled r = +.70, r = +.30, and r = +1]

Correlation
Advantages and uses of the correlation coefficient: (1) it provides an easy way to quantify the association between two variables; (2) it employs z-scores, so the variance of each variable is standardized to 1; (3) it is the foundation for many statistical applications.