Correlation and Regression

Presentation transcript:

Correlation and Regression

Correlation
A correlation is a quantitative relationship between two interval or ratio level variables. What type of relationship exists between the two variables, and is the correlation significant?
Explanatory (independent) variable x and response (dependent) variable y:
- Hours of training (x), number of accidents (y)
- Shoe size (x), height (y)
- Cigarettes smoked per day (x), lung capacity (y)
- Score on SAT (x), grade point average (y)
- Height (x), IQ (y)

Correlation
- Measures and describes the strength and direction of the relationship.
- Bivariate techniques require two variable scores from the same individuals (dependent and independent variables).
- Multivariate techniques apply when more than two variables are involved (e.g., the effect of advertising and prices on sales).
- Variables must be measured on a ratio or interval scale.

Scatter Plots and Types of Correlation: negative correlation. As x increases, y decreases. x = hours of training (horizontal axis), y = number of accidents (vertical axis).

Scatter Plots and Types of Correlation: positive correlation. As x increases, y increases. x = math SAT score, y = GPA.

Scatter Plots and Types of Correlation: no linear correlation. x = height, y = IQ.

Scatter Plots and Types of Correlation: a strong, negative relationship, but non-linear!

Correlation Coefficient
The correlation coefficient r is a measure of the strength and direction of a linear relationship between two variables. The range of r is from –1 to 1.
- If r is close to 1, there is a strong positive correlation.
- If r is close to –1, there is a strong negative correlation.
- If r is close to 0, there is no linear correlation.

Outliers
Outliers are dangerous. Here we have a spurious correlation of r = 0.68. Without IBM, r = 0.48; without IBM and GE, r = 0.21.

Application
x = absences, y = final grade. [Data table and scatter plot of final grade against absences.]

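Computation of r
[Table with columns x, y, xy, $x^2$, $y^2$ and their column sums.] The sums feed the standard computational formula $r = \dfrac{n\sum xy - \sum x \sum y}{\sqrt{n\sum x^2 - (\sum x)^2}\,\sqrt{n\sum y^2 - (\sum y)^2}}$.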
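The column headings on this slide (x, y, xy, x², y²) point to the usual sums-based computation of r. The following is a minimal sketch of that computation, assuming the standard formula r = (nΣxy − ΣxΣy) / (√(nΣx² − (Σx)²) · √(nΣy² − (Σy)²)). The function name `pearson_r` and the data are hypothetical, chosen for illustration; they are not the deck's absences data.

```python
import math

def pearson_r(xs, ys):
    """Sample correlation coefficient from the sums of x, y, xy, x^2 and y^2."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length samples with at least 2 pairs")
    sx, sy = sum(xs), sum(ys)
    sxy = sum(x * y for x, y in zip(xs, ys))
    sxx = sum(x * x for x in xs)
    syy = sum(y * y for y in ys)
    denom = math.sqrt(n * sxx - sx ** 2) * math.sqrt(n * syy - sy ** 2)
    if denom == 0:
        raise ValueError("r is undefined when x or y is constant")
    return (n * sxy - sx * sy) / denom

# Hypothetical data (not from the slides): hours of training vs. accidents.
hours = [1, 2, 3, 4, 5]
accidents = [9, 8, 6, 5, 2]
r = pearson_r(hours, accidents)
print(f"r = {r:.4f}")  # negative: accidents fall as training hours rise
```

A value of r near –1 here reflects the strong negative pattern built into the made-up data.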

Hypothesis Test for Significance
r is the correlation coefficient for the sample. The correlation coefficient for the population is $\rho$ (rho). For a two-tail test for significance:
$H_0: \rho = 0$ (The correlation is not significant)
$H_a: \rho \neq 0$ (The correlation is significant)
The sampling distribution for r is a t-distribution with n – 2 d.f. Standardized test statistic: $t = \dfrac{r}{\sqrt{\dfrac{1 - r^2}{n - 2}}}$

Test of Significance
The correlation between the number of times absent and a final grade is r = –0.975. There were seven pairs of data. Test the significance of this correlation. Use $\alpha$ = 0.01.
1. Write the null and alternative hypotheses. $H_0: \rho = 0$ (The correlation is not significant); $H_a: \rho \neq 0$ (The correlation is significant)
2. State the level of significance. $\alpha$ = 0.01
3. Identify the sampling distribution. A t-distribution with 5 degrees of freedom

4. Find the critical value. From the t-table (df\p) with 5 d.f., the critical values are $\pm t_0$ = ±4.032.
5. Find the rejection region. The rejection regions are t < –4.032 and t > 4.032.
6. Find the test statistic.

7. Make your decision. t = –9.811 falls in the rejection region (below $-t_0$ = –4.032). Reject the null hypothesis.
8. Interpret your decision. There is a significant negative correlation between the number of times absent and final grades.
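Steps 6 and 7 can be checked numerically. The sketch below assumes the usual standardized test statistic t = r / √((1 − r²)/(n − 2)) and takes r = –0.975 (the value squared later in this deck), n = 7 pairs, and the critical value 4.032 quoted on the slides. The function name `t_statistic` is illustrative only.

```python
import math

def t_statistic(r, n):
    """Standardized test statistic for H0: rho = 0, with n - 2 degrees of freedom."""
    if n < 3:
        raise ValueError("need at least 3 pairs")
    if abs(r) >= 1:
        raise ValueError("t is undefined for |r| = 1")
    return r / math.sqrt((1 - r ** 2) / (n - 2))

r, n = -0.975, 7      # values taken from the slides
t_crit = 4.032        # two-tail critical value for alpha = 0.01, 5 d.f. (from the slides)

t = t_statistic(r, n)
reject_h0 = abs(t) > t_crit
print(f"t = {t:.2f}, reject H0: {reject_h0}")
```

With r rounded to three decimals, the result agrees with the slide's t = –9.811 to about two decimal places, and the decision to reject the null hypothesis is unchanged.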

The Line of Regression
Regression indicates the degree to which the variation in one variable, Y, is related to or can be explained by the variation in another variable, X. Once you know there is a significant linear correlation, you can write an equation describing the relationship between the x and y variables. This equation is called the line of regression or least squares line.
The equation of a line may be written as y = mx + b, where m is the slope of the line and b is the y-intercept.
The line of regression is: $\hat{y} = mx + b$
The slope m is: $m = \dfrac{n\sum xy - \sum x \sum y}{n\sum x^2 - (\sum x)^2}$
The y-intercept is: $b = \bar{y} - m\bar{x}$

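Best fitting straight line
[Figure: revenue (vertical axis) against ad $ (horizontal axis) with the fitted line. Legend: $(x_i, y_i)$ = a data point; $(x_i, \hat{y}_i)$ = a point on the line with the same x-value; the vertical distance between the two = a residual.]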

Calculate m and b. Write the equation of the line of regression with x = number of absences and y = final grade. [Table with columns x, y, xy, $x^2$, $y^2$.] The line of regression is: $\hat{y} = -3.924x + b$

The Line of Regression
[Scatter plot of final grade against absences with the fitted line.] m = –3.924 and b = [value lost in the transcript]. Note that the point $(\bar{x}, \bar{y})$, with $\bar{x}$ = 8.143, is on the line.
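The slope and intercept calculation can be sketched in a few lines. This assumes the standard least squares formulas m = (nΣxy − ΣxΣy) / (nΣx² − (Σx)²) and b = ȳ − m·x̄. The function name `fit_line` and the data are hypothetical (the deck's absences table did not survive in the transcript), so the numbers below are not the slide's m and b.

```python
def fit_line(xs, ys):
    """Least squares slope m and intercept b for the line y_hat = m*x + b."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length samples with at least 2 pairs")
    sx, sy = sum(xs), sum(ys)
    sxy = sum(x * y for x, y in zip(xs, ys))
    sxx = sum(x * x for x in xs)
    denom = n * sxx - sx ** 2
    if denom == 0:
        raise ValueError("slope is undefined when all x values are equal")
    m = (n * sxy - sx * sy) / denom
    b = sy / n - m * sx / n          # b = y_bar - m * x_bar
    return m, b

# Hypothetical data (not from the slides).
xs = [1, 2, 3, 4, 5]
ys = [9, 8, 6, 5, 2]
m, b = fit_line(xs, ys)

# The point (x_bar, y_bar) always lies on the fitted line.
x_bar, y_bar = sum(xs) / len(xs), sum(ys) / len(ys)
on_line = abs((m * x_bar + b) - y_bar) < 1e-9
print(f"y_hat = {m:.3f}x + {b:.3f}, mean point on line: {on_line}")
```

The final check mirrors the slide's remark that the mean point is on the line; it follows directly from how b is defined.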

Predicting y Values
The regression line can be used to predict values of y for values of x falling within the range of the data. The regression equation for number of times absent and final grade is $\hat{y} = -3.924x + b$. Use this equation to predict the expected grade for a student with (a) 3 absences and (b) 12 absences.
(a) $\hat{y} = -3.924(3) + b$
(b) $\hat{y} = -3.924(12) + b$
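The restriction to x values within the range of the data can be made explicit in code. The sketch below is one possible way to do that: a hypothetical `predict` helper that refuses to extrapolate. The slope, intercept, and x range are made-up illustration values, not the deck's regression equation.

```python
def predict(m, b, x, x_min, x_max):
    """Return m*x + b, refusing x values outside the observed range of the data."""
    if not (x_min <= x <= x_max):
        raise ValueError(f"x={x} is outside the observed range [{x_min}, {x_max}]")
    return m * x + b

# Hypothetical fitted line and observed x range (assumptions for illustration).
m, b = -1.7, 11.1
x_min, x_max = 1, 5

y_hat = predict(m, b, 3, x_min, x_max)   # 3 lies inside [1, 5]

try:
    predict(m, b, 12, x_min, x_max)      # 12 lies outside [1, 5]
    extrapolated = True
except ValueError:
    extrapolated = False

print(f"y_hat at x=3: {y_hat:.2f}, extrapolation allowed: {extrapolated}")
```

Raising an error is a deliberately strict choice; a warning would also be reasonable when mild extrapolation is acceptable.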

Strength of the Association
The coefficient of determination, $r^2$, measures the strength of the association and is the ratio of explained variation in y to the total variation in y.
The correlation coefficient of number of times absent and final grade is r = –0.975. The coefficient of determination is $r^2$ = (–0.975)$^2$ ≈ 0.9506.
Interpretation: About 95% of the variation in final grades can be explained by the number of times a student is absent. The other 5% is unexplained and can be due to sampling error or other variables such as intelligence, amount of time studied, etc.
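The description of r² as a ratio of explained to total variation can be verified on a small example. The sketch below fits a least squares line to hypothetical data (not the deck's absences data), computes explained variation Σ(ŷ − ȳ)² and total variation Σ(y − ȳ)², and compares their ratio with the square of r. It also reproduces the slide's arithmetic for r = –0.975.

```python
import math

# Hypothetical data (not from the slides).
xs = [1, 2, 3, 4, 5]
ys = [9, 8, 6, 5, 2]
n = len(xs)
x_bar, y_bar = sum(xs) / n, sum(ys) / n

# Deviation sums; algebraically equivalent to the sums-based formulas for m and r.
sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
sxx = sum((x - x_bar) ** 2 for x in xs)
syy = sum((y - y_bar) ** 2 for y in ys)

m = sxy / sxx                 # least squares slope
b = y_bar - m * x_bar         # least squares intercept
r = sxy / math.sqrt(sxx * syy)

explained = sum((m * x + b - y_bar) ** 2 for x in xs)  # variation of fitted values
total = syy                                            # total variation in y
ratio = explained / total

print(f"explained/total = {ratio:.4f}, r^2 = {r ** 2:.4f}")

# The slide's own figure: r = -0.975 gives r^2 of about 0.9506.
r2_slide = round((-0.975) ** 2, 4)
print(f"r^2 for r = -0.975: {r2_slide}")
```

The equality of the two quantities holds for a simple least squares line with one explanatory variable, which is the setting of this deck.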