BIVARIATE DATA: CORRELATION AND REGRESSION Two variables of interest: X, Y. GOAL: Quantify association between X and Y: correlation. Predict value of Y.

Slides:



Advertisements
Similar presentations
Describing the Relation Between Two Variables
Advertisements

REGRESSION Want to predict one variable (say Y) using the other variable (say X) GOAL: Set up an equation connecting X and Y. Linear regression linear.
Statistics for the Social Sciences Psychology 340 Fall 2006 Relationships between variables.
Correlation A correlation exists between two variables when one of them is related to the other in some way. A scatterplot is a graph in which the paired.
Correlation Relationship between Variables. Statistical Relationships What is the difference between correlation and regression? Correlation: measures.
Correlation: Relationship between Variables
Chapter 9: Correlation and Regression
Correlation & Regression Math 137 Fresno State Burger.
Chapter 21 Correlation. Correlation A measure of the strength of a linear relationship Although there are at least 6 methods for measuring correlation,
Linear Regression Analysis
Correlation vs. Causation What is the difference?.
Correlation and Regression A BRIEF overview Correlation Coefficients l Continuous IV & DV l or dichotomous variables (code as 0-1) n mean interpreted.
Chapter 7 Scatterplots and Correlation Scatterplots: graphical display of bivariate data Correlation: a numerical summary of bivariate data.
Correlation and Regression
Correlation and Regression
Correlation By Dr.Muthupandi,. Correlation Correlation is a statistical technique which can show whether and how strongly pairs of variables are related.
Correlation and Regression
Chapter 3 Describing Bivariate Data General Objectives: Sometimes the data that are collected consist of observations for two variables on the same experimental.
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
Describing the Relation Between Two Variables
Unit 3 Section : Correlation  Correlation – statistical method used to determine whether a relationship between variables exists.  The correlation.
WELCOME TO THETOPPERSWAY.COM.
1 Examining Relationships in Data William P. Wattles, Ph.D. Francis Marion University.
4.1 Scatter Diagrams and Correlation. 2 Variables ● In many studies, we measure more than one variable for each individual ● Some examples are  Rainfall.
© 2010 Pearson Prentice Hall. All rights reserved. CHAPTER 12 Statistics.
Exploring Relationships Between Variables Chapter 7 Scatterplots and Correlation.
Correlation Analysis. A measure of association between two or more numerical variables. For examples height & weight relationship price and demand relationship.
Correlation.
CHAPTER 3 INTRODUCTORY LINEAR REGRESSION. Introduction  Linear regression is a study on the linear relationship between two variables. This is done by.
Introduction to Correlation Analysis. Objectives Correlation Types of Correlation Karl Pearson’s coefficient of correlation Correlation in case of bivariate.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Describing Relationships Using Correlations. 2 More Statistical Notation Correlational analysis requires scores from two variables. X stands for the scores.
CORRELATION. Correlation key concepts: Types of correlation Methods of studying correlation a) Scatter diagram b) Karl pearson’s coefficient of correlation.
Chapter Bivariate Data (x,y) data pairs Plotted with Scatter plots x = explanatory variable; y = response Bivariate Normal Distribution – for.
April 1 st, Bellringer-April 1 st, 2015 Video Link Worksheet Link
Chapter 4 Summary Scatter diagrams of data pairs (x, y) are useful in helping us determine visually if there is any relation between x and y values and,
Chapter 9: Correlation and Regression Analysis. Correlation Correlation is a numerical way to measure the strength and direction of a linear association.
Scatter Diagrams scatter plot scatter diagram A scatter plot is a graph that may be used to represent the relationship between two variables. Also referred.
Scatter Diagram of Bivariate Measurement Data. Bivariate Measurement Data Example of Bivariate Measurement:
2.5 Using Linear Models A scatter plot is a graph that relates two sets of data by plotting the data as ordered pairs. You can use a scatter plot to determine.
Correlation & Regression
Correlations AP Psychology. Correlations  Co-relation  It describes the relationship b/w two variables.  Example #1  How is studying related to grades?
9.1 - Correlation Correlation = relationship between 2 variables (x,y): x= independent or explanatory variable y= dependent or response variable Types.
ContentDetail  Two variable statistics involves discovering if two variables are related or linked to each other in some way. e.g. - Does IQ determine.
Summarizing Data Graphical Methods. Histogram Stem-Leaf Diagram Grouped Freq Table Box-whisker Plot.
1 MVS 250: V. Katch S TATISTICS Chapter 5 Correlation/Regression.
GOAL: I CAN USE TECHNOLOGY TO COMPUTE AND INTERPRET THE CORRELATION COEFFICIENT OF A LINEAR FIT. (S-ID.8) Data Analysis Correlation Coefficient.
Chapter Correlation and Regression 1 of 84 9 © 2012 Pearson Education, Inc. All rights reserved.
Correlation and Regression. O UTLINE Introduction  10-1 Scatter plots.  10-2 Correlation.  10-3 Correlation Coefficient.  10-4 Regression.
Introduction Many problems in Engineering, Management, Health Sciences and other Sciences involve exploring the relationships between two or more variables.
Correlation & Linear Regression Using a TI-Nspire.
Part II Exploring Relationships Between Variables.
Two-Variable Data Analysis
CHAPTER 10 & 13 Correlation and Regression
Exploring Relationships Between Variables
Warm Up Scatter Plot Activity.
Chapter 5 STATISTICS (PART 4).
SIMPLE LINEAR REGRESSION MODEL
Correlation A Lecture for the Intro Stat Course
Correlation and Regression
CHAPTER 10 Correlation and Regression (Objectives)
Simple Linear Regression
1) A residual: a) is the amount of variation explained by the LSRL of y on x b) is how much an observed y-value differs from a predicted y-value c) predicts.
Ch. 11: Quantifying and Interpreting Relationships Among Variables
Descriptive Analysis and Presentation of Bivariate Data
2. Find the equation of line of regression
Correlation and Regression
Correlation & Regression
CORRELATION & REGRESSION compiled by Dr Kunal Pathak
Presentation transcript:

BIVARIATE DATA: CORRELATION AND REGRESSION Two variables of interest: X, Y. GOAL: Quantify association between X and Y: correlation. Predict value of Y from the value of X: regression. EXAMPLES: (height, weight), (yrs. of education, salary), (hrs. of studying, exam score), (SAT score, GPA), (chemical reaction time, temperature), (rainfall, runoff volume), (demand, price), etc. BIVARIATE DATA: PAIRS (X, Y): (x1, y1), (x2, y2), …, (xn, yn). (xi, yi) – i th observation, values of X and y on the i th “subject”. Correlation studies: study type and amount of association between X and Y. Regression studies: aim to predict Y from X by constructing a simple equation relating Y to X.

CORRELATION GRAPHICAL REPRESENTATION OF BIVARIATE DATA – SCATTER PLOT: plot observations (x1, y1), (x2, y2), …, (xn, yn) as points on the plane. Types of association/relationship between vars: positive, negative, none. Positive association: Two variables are positively associated if large values of one tend to be associated (occur) with large values of the other variable and small values of one tend to be associated with small values of the other variable. Example: Height and weight are usually positively associated Positive association

CORRELATION, contd. Negative association: Two variables are negatively associated if large values of one tend to be associated (occur) with small values of the other. The variables tend to “move in opposite directions”. No association: If there is no association, the points in the scatter plot show no pattern. Negative association No association E.g. High demand often occurs with low price.

CORRELATION COEFFICIENT Measure of strength of association: correlation coefficient: r xy. Data: (x 1, y 1 ), (x 2, y 2 ), …, (x n, y n ). Sample statistics: sample means: Sample standard deviations: s x and s y. Sample correlation coefficient: Correlation coefficient measures strength of LINEAR association.

PROPERTIES OF THE SAMPLE CORRELATION COEFFICIENT r XY. r xy > 0 indicates positive association between X and Y. r xy < 0 indicates negative association between X and Y. r xy ≈ 0 indicates no association between X and Y. -1 ≤ r xy ≤ 1, the closer | r xy | to 1, the stronger the relationship between X and Y. Computational formula for r: given sample stats we can compute r as

CORRELATION COEFFICIENT AND ASSOCIATION r = 0.75r = 0.9 r = 0.28 r = 0.5 r = Almost perfect association, but not linear, r small. Strong association Moderate association Weak association No association

CORRELATION, CONTD. Correlation does not imply CAUSATION! Watch out for hidden (lurking) variables. Example. Study of fires. X=amount of damage, Y = # of firefighters. r XY ≈0.85. The more firefighters, the more damage? Hidden variable: Size of the fire.

EXAMPLE In a study of income an savings, data was collected from 10 households. Both savings and income are reported in thousands of $ in the following table. Find the correlation coefficient between income and savings. Solution: Summary statistics: X=income, Y=savings Σx i = 463, Σx 2 i = 23533, Σy i = 27.4, Σy 2 i = , Σx i y i = incomesavings There is a strong positive association between family income and savings.

EXAMPLE, MINITAB Correlations (Pearson) Correlation of income and savings = 0.963, P-Value = 0.000