Math 3680 Lecture #18 Correlation. The Correlation Coefficient: Intuition.

Slides:



Advertisements
Similar presentations
1 Objective Investigate how two variables (x and y) are related (i.e. correlated). That is, how much they depend on each other. Section 10.2 Correlation.
Advertisements

Chapter 10 Relationships between variables
Correlation and Regression Analysis
Reading a scatterplot Examples: –Mars rocks (sulfate is measured as a percentage & redness is measured as a ratio of red to blue in light spectra) –American.
SIMPLE LINEAR REGRESSION
Statistics for the Social Sciences Psychology 340 Fall 2006 Relationships between variables.
SIMPLE LINEAR REGRESSION
Correlation & Regression Math 137 Fresno State Burger.
1 Chapter 10 Correlation and Regression We deal with two variables, x and y. Main goal: Investigate how x and y are related, or correlated; how much they.
Correlation and Linear Regression
STATISTICS ELEMENTARY C.M. Pascual
Descriptive Methods in Regression and Correlation
Linear Regression.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
SIMPLE LINEAR REGRESSION
Chapters 8 and 9: Correlations Between Data Sets Math 1680.
Sections 9-1 and 9-2 Overview Correlation. PAIRED DATA Is there a relationship? If so, what is the equation? Use that equation for prediction. In this.
Data Analysis 17 Data are obtained from a random sample of adult women with regard to their ages and their monthly expenditures on health products. The.
Is there a relationship between the lengths of body parts ?
1 Chapter 9. Section 9-1 and 9-2. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Chapter 13 Statistics © 2008 Pearson Addison-Wesley. All rights reserved.
Describing the Relation Between Two Variables
9.1 Correlation Key Concepts: –Scatter Plots –Correlation –Sample Correlation Coefficient, r –Hypothesis Testing for the Population Correlation Coefficient,
EC339: Lecture 3 Chapter 2: Correlation and univariate analysis: A step back.
MA-250 Probability and Statistics Nazar Khan PUCIT Lecture 5.
Examining Relationships Prob. And Stat. 2.2 Correlation.
Probabilistic and Statistical Techniques 1 Lecture 24 Eng. Ismail Zakaria El Daour 2010.
© 2008 Pearson Addison-Wesley. All rights reserved Chapter 1 Section 13-6 Regression and Correlation.
The Correlation Coefficient. Social Security Numbers.
1 Chapter 10 Correlation and Regression 10.2 Correlation 10.3 Regression.
LECTURE 5 Correlation.
4.1 Scatter Diagrams and Correlation. 2 Variables ● In many studies, we measure more than one variable for each individual ● Some examples are  Rainfall.
2 Variable Stats Does the amount of time spent on homework improve your grade? If you eat more junk food will you gain weight? Does the amount of rainfall.
Chapter 3 Section 3.1 Examining Relationships. Continue to ask the preliminary questions familiar from Chapter 1 and 2 What individuals do the data describe?
Chapter 10 Correlation and Regression
 Graph of a set of data points  Used to evaluate the correlation between two variables.
STA291 Statistical Methods Lecture 10. Measuring something over time… o Like a company’s stock value: o It kind of just sits there, making your eyes glaze.
The Correlational Research Strategy
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
U Describes the relationship between two or more variables. Describes the strength of the relationship in terms of a number from -1.0 to Describes.
CHAPTER 4: TWO VARIABLE ANALYSIS E Spring.
Creating a Residual Plot and Investigating the Correlation Coefficient.
4.2 Correlation The Correlation Coefficient r Properties of r 1.
Section 5.1: Correlation. Correlation Coefficient A quantitative assessment of the strength of a relationship between the x and y values in a set of (x,y)
Correlation The apparent relation between two variables.
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics.
Correlation. Correlation is a measure of the strength of the relation between two or more variables. Any correlation coefficient has two parts – Valence:
Scatter Diagrams scatter plot scatter diagram A scatter plot is a graph that may be used to represent the relationship between two variables. Also referred.
Scatterplots Chapter 7 Definition and components Describing Correlation Correlation vs. Association.
Scatter Diagram of Bivariate Measurement Data. Bivariate Measurement Data Example of Bivariate Measurement:
Introduction to Engineering Microsoft Excel - 2 Agenda Scatter Plots Trend Lines.
The Correlational Research Strategy Chapter 12. Correlational Research The goal of correlational research is to describe the relationship between variables.
Lecture 29 Dr. MUMTAZ AHMED MTH 161: Introduction To Statistics.
7.1 Draw Scatter Plots and Best Fitting Lines Pg. 255 Notetaking Guide Pg. 255 Notetaking Guide.
.  Relationship between two sets of data  The word Correlation is made of Co- (meaning "together"), and Relation  Correlation is Positive when the.
Discovering Mathematics Week 9 – Unit 6 Graphs MU123 Dr. Hassan Sharafuddin.
Correlation and Median-Median Line Statistics Test: Oct. 20 (Wednesday)
CORRELATION ANALYSIS.
1 MVS 250: V. Katch S TATISTICS Chapter 5 Correlation/Regression.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
Correlation and Regression Ch 4. Why Regression and Correlation We need to be able to analyze the relationship between two variables (up to now we have.
Correlation and Regression Stats. T-Test Recap T Test is used to compare two categories of data – Ex. Size of finch beaks on Baltra island vs. Isabela.
Correlation and Regression. O UTLINE Introduction  10-1 Scatter plots.  10-2 Correlation.  10-3 Correlation Coefficient.  10-4 Regression.
Slide Slide 1 Chapter 10 Correlation and Regression 10-1 Overview 10-2 Correlation 10-3 Regression 10-4 Variation and Prediction Intervals 10-5 Multiple.
Week 2 Normal Distributions, Scatter Plots, Regression and Random.
Sit in your permanent seat
Correlation and Regression
CORRELATION ANALYSIS.
Correlation & Trend Lines
Presentation transcript:

Math 3680 Lecture #18 Correlation

The Correlation Coefficient: Intuition

Lingo: Age = independent variable Height = dependent variable

Correlations can be either strong or weak. Also, they can be either positive or negative, as illustrated in the following figures. The correlation coefficient, denoted by r, is a measure of linear association – of clustering around a line. It turns out that r always lies between -1 and 1.

Summary: Five numbers quantify the relationship between the two variables:  The average of the x-values  The SD of the x-values  The average of the y-values  The SD of the y-values  The correlation coefficient, r. In a while we will discuss how to compute r. For now, we will focus on its significance.

Note: 1. r = 0.80 does not mean 80% of the points are on the line. 2. r = 0.80 does not indicate twice as much linearity as a scatter plot with r = 0.40

Choose from the following options for each of the examples: exactly -1; nearly -1; somewhat negative; close to 0; somewhat positive; nearly 1; exactly 1 Example. If women always married men who were five years older, the correlation between the ages of husbands and wives would be ________________.

Choose from the following options for each of the examples: exactly -1; nearly -1; somewhat negative; close to 0; somewhat positive; nearly 1; exactly 1 Example. The correlation between a student’s age and his/her mother’s age is ___________.

Choose from the following options for each of the examples: exactly -1; nearly -1; somewhat negative; close to 0; somewhat positive; nearly 1; exactly 1 Example. The correlation between a car’s age and fuel efficiency is _____________.

Choose from the following options for each of the examples: exactly -1; nearly -1; somewhat negative; close to 0; somewhat positive; nearly 1; exactly 1 Example. On a 10-question multiple choice test, the correlation between number of correct answers and number of wrong answers is ____________.

Choose from the following options for each of the examples: exactly -1; nearly -1; somewhat negative; close to 0; somewhat positive; nearly 1; exactly 1 Example. Darts are thrown at a dartboard. The correlation between the x-coordinate of a dart’s landing and its y-coordinate is ____________.

Example. A study finds the following data for the IQs in a population: Husbands: Average = 100,SD = 15 Wives:Average = 100,SD = 15 r = 0.6 Draw a rough, rough sketch of the scatter diagram for this data.

The Correlation Coefficient: Computation

How to compute the correlation coefficient Convert the x-values and y-values into standard units. The average of the products (divide by n - 1, not n, just like with sample standard deviations) is the correlation coefficient.

Excel: =CORREL(A1:A6,B1:B6) To make a scatter plot in Excel 2007: select the data, and on the Insert tab, choose Scatter and click on the picture with only markers. For Excel 2003 or before, see book p. 416

Exercise:

Questions 1.How would we arrange the data so that r was as positive as possible? As negative as possible? 2.Suppose we interchange the x and y columns? Does the correlation coefficient change?

Questions 3. Suppose we add 5 to all of the x-values. Does the correlation coefficient change? 4. Suppose we triple all of the y-values. Does the correlation coefficient change?

The blue dots were obtained by tripling the x- and y- values of the red dots; the y-values were then increased by 8. Which set has the higher correlation coefficient?

These “changes of scale” do not affect the strength of the association. The association is measured in relative terms, not absolute terms.

Find the correlation coefficient of the following five data sets: yx yx yx yx yx yx yx r = 6 / 7 r = 11 / 14