Correlation & Regression Chapter 10. Outline Section 10-1Introduction Section 10-2Scatter Plots Section 10-3Correlation Section 10-4Regression Section.

Slides:



Advertisements
Similar presentations
Correlation and regression Dr. Ghada Abo-Zaid
Advertisements

1 Objective Investigate how two variables (x and y) are related (i.e. correlated). That is, how much they depend on each other. Section 10.2 Correlation.
Correlation and Regression
Correlation and Regression
Chapter 4 Describing the Relation Between Two Variables
Chapter 2: Looking at Data - Relationships /true-fact-the-lack-of-pirates-is-causing-global-warming/
Correlation and Regression
Linear Regression and Correlation Analysis
Correlation and Regression. Correlation What type of relationship exists between the two variables and is the correlation significant? x y Cigarettes.
Chapter 9: Correlation and Regression
Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc. Chapter Describing the Relation between Two Variables 4.
Describing Relationships: Scatterplots and Correlation
Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
Review for Final Exam Some important themes from Chapters 9-11 Final exam covers these chapters, but implicitly tests the entire course, because we use.
1 Chapter 10 Correlation and Regression We deal with two variables, x and y. Main goal: Investigate how x and y are related, or correlated; how much they.
Correlation and Linear Regression
STATISTICS ELEMENTARY C.M. Pascual
Correlation and Regression
Correlation and Regression
Chapter Correlation and Regression 1 of 84 9 © 2012 Pearson Education, Inc. All rights reserved.
Correlation.
Correlation and Regression
Sections 9-1 and 9-2 Overview Correlation. PAIRED DATA Is there a relationship? If so, what is the equation? Use that equation for prediction. In this.
Is there a relationship between the lengths of body parts ?
1 Chapter 9. Section 9-1 and 9-2. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-1 Review and Preview.
Unit 3 Section : Correlation  Correlation – statistical method used to determine whether a relationship between variables exists.  The correlation.
EQT 272 PROBABILITY AND STATISTICS
Probabilistic and Statistical Techniques 1 Lecture 24 Eng. Ismail Zakaria El Daour 2010.
INTRODUCTORY LINEAR REGRESSION SIMPLE LINEAR REGRESSION - Curve fitting - Inferences about estimated parameter - Adequacy of the models - Linear.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Correlation and Regression The Big Picture From Bluman 5 th Edition slides © McGraw Hill.
Correlation and Regression
Chapter 10 Correlation and Regression
Correlation Analysis. A measure of association between two or more numerical variables. For examples height & weight relationship price and demand relationship.
Correlation & Regression
Psych 230 Psychological Measurement and Statistics Pedro Wolf September 23, 2009.
Unit 10 Correlation and Regression McGraw-Hill, Bluman, 7th ed., Chapter 10 1.
Basic Concepts of Correlation. Definition A correlation exists between two variables when the values of one are somehow associated with the values of.
Correlation & Regression Chapter 15. Correlation It is a statistical technique that is used to measure and describe a relationship between two variables.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter 13 Multiple Regression
Multiple Correlation and Regression
Chapter 10 Correlation and Regression Lecture 1 Sections: 10.1 – 10.2.
Chapter 7 Calculation of Pearson Coefficient of Correlation, r and testing its significance.
1 MVS 250: V. Katch S TATISTICS Chapter 5 Correlation/Regression.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
Go to Table of Content Correlation Go to Table of Content Mr.V.K Malhotra, the marketing manager of SP pickles pvt ltd was wondering about the reasons.
Video Conference 1 AS 2013/2012 Chapters 10 – Correlation and Regression 15 December am – 11 am Puan Hasmawati Binti Hassan
Unit 3 Sections 10-1 & What we will be able to do throughout this chapter…  Determine relationships between two or more variables  Determine strengths.
REGRESSION AND CORRELATION SIMPLE LINEAR REGRESSION 10.2 SCATTER DIAGRAM 10.3 GRAPHICAL METHOD FOR DETERMINING REGRESSION 10.4 LEAST SQUARE METHOD.
Correlation and Regression. O UTLINE Introduction  10-1 Scatter plots.  10-2 Correlation.  10-3 Correlation Coefficient.  10-4 Regression.
Slide Slide 1 Chapter 10 Correlation and Regression 10-1 Overview 10-2 Correlation 10-3 Regression 10-4 Variation and Prediction Intervals 10-5 Multiple.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 9 l Simple Linear Regression 9.1 Simple Linear Regression 9.2 Scatter Diagram 9.3 Graphical.
Slide 1 Copyright © 2004 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-1 Overview Overview 10-2 Correlation 10-3 Regression-3 Regression.
CHAPTER 10 & 13 Correlation and Regression
Review and Preview and Correlation
Correlation & Regression
CHS 221 Biostatistics Dr. wajed Hatamleh
Correlation and Regression
Chapter 5 STATISTICS (PART 4).
SIMPLE LINEAR REGRESSION MODEL
Correlation and Regression
Elementary Statistics
Correlation and Regression
CHAPTER 10 Correlation and Regression (Objectives)
Correlation and Regression
Chapter 10 Correlation and Regression
Correlation and Regression Lecture 1 Sections: 10.1 – 10.2
Presentation transcript:

Correlation & Regression Chapter 10

Outline Section 10-1Introduction Section 10-2Scatter Plots Section 10-3Correlation Section 10-4Regression Section 10-5Coefficient of Determination and Standard Error of the Estimate Section 10-7Summary

Section 10-1introduction RECALL: Inferential Statistics is a branch of statistics that consists of generalizing from samples to populations, performing hypothesis testing, determining relationships among variables, and making predictions Our focus in this chapter will involve determining whether a relationship between two or more numerical or quantitative variables exists

Examples A businessperson may want to know whether the volume of sales for a given month is related to the amount of advertising the firm does that month. Educators are interested in determining whether the number of hours a student studies is related to the student’s score on a particular exam A zoologist may want to know whether the birth weight of a certain animal is related to its life span Medical researchers are interested in such questions as Is caffeine related to heart damage? Is there a relationship between a person’s age and his/her blood pressure?

Introduction CorrelationRegression A statistical method used to determine whether a relationship between two or more variables exists A statistical method used to describe the nature of the relationship (i.e., the mathematical relationship) between variables.

Questions to answer Are two or more variables related? If so, what is the strength of the relationship? What type of relationship exists? ◦Linear or nonlinear? What kind of prediction can be made from the relationship? And, how valuable is the prediction?

How to answer the questions Are two or more variables related? AND If so, what is the strength of the relationship? Visual Interpretation ◦Scatter plots  Consider overall pattern  Direction (slope)  Outliers (possible influential points) Algebraic Interpretation ◦Correlation Coefficient  Numerical measure of the strength of the relationship between two quantitative variables

How to answer the questions What type of relationship exists? Two Types of Relationships Simple Has two variables (paired or bivariate data) Independent variable, x Dependent variable, y Can be positive or negative Positive—both variables increase or decrease at same time Negative –one variable increases, as another variable decreases, or vice versa Multiple ◦Two or more independent variables are used to predict one dependent variable

How to answer the questions What kind of predictions can be made from the relationship? Predictions are within in the scope of the sample data only Some predictions are more accurate than others, due to the strength of the relationship ◦The stronger the relationship is between variables, the more accurate the prediction is

Section 10-2scatter plots About Scatter Plots ◦First review section 2.5 regarding Paired Data and Scatter Plots (p ) ◦See questions 1-14 p. 88

Identify the variables Independent Variable Dependent Variable What we typically call “x” Input variable Variable that is controlled or manipulated Plotted on Horizontal axis What we typically call “y” Output variable Variable that CANNOT be controlled or manipulated Plotted on Vertical axis (dependent variable) depends on (independent variable)

Scatterplots of Paired Data Figure 10-2

ley. Scatterplots of Paired Data Figure 10-2

Listed below are the weights (in pounds) and the highway fuel consumption (in mi/gal) of randomly selected cars. Is there a linear correlation between weight and highway fuel consumption? What does the result suggest about a national program to reduce the consumption of imported oil? Weight (lbs) Fuel Consumption (mpg)

Section 10-3correlation How to measure the strength of a linear relationship present in the scatter plot? Calculate the linear correlation coefficient!

Correlation coefficient Aka Pearson product moment correlation coefficient (PPMC) Named after Karl Pearson who pioneered the research in this area Computed from the sample data measures the strength and direction of a LINEAR relationship between two variables Symbol for the sample correlation coefficient is r Symbol for the population correlation coefficient is  (Greek letter rho) Range is from -1 to 1 inclusive

Notation for the Linear Correlation Coefficient n represents the number of pairs of data present.  denotes the addition of the items indicated.  x denotes the sum of all x- values.  x 2 indicates that each x- value should be squared and then those squares added. (  x ) 2 indicates that the x- values should be added and the total then squared.  xy indicates that each x -value should be first multiplied by its corresponding y- value. After obtaining all such products, find their sum. r represents linear correlation coefficient for a sample.  represents linear correlation coefficient for a population.

The linear correlation coefficient r measures the strength of a linear relationship between the paired values in a sample. Calculators can compute r. We will use MINITAB and EXCEL.

Round to three decimal places so that it can be compared to critical values in Table I, p Rounding the Linear Correlation Coefficient r

Is the Correlation coefficient significant? Use Table I on page 642 ◦will be stated  Measure error level in the calculation ◦Degrees of Freedom, n – 2  Designated by d.f.  The number of variables allowed before the calculation is determined (see. P. 358) Compare the absolute value of r to the Table I value for given alpha value and the d.f. based on n

Listed below are the weights (in pounds) and the highway fuel consumption (in mi/gal) of randomly selected cars. Is there a linear correlation between weight and highway fuel consumption? What does the result suggest about a national program to reduce the consumption of imported oil? Weight (lbs) Fuel Consumption (mpg)

Causation & Correlation Can we conclude that if two variables are related (correlation) that one variable causes the other variable? Correlation does not imply causation Let’s say that again: Correlation does not imply causation

Causation & Correlation If there is a strong linear correlation between two variables, then one of the following situations may be true about the relationship between the two variables: There is a direct cause-and-effect relationship between the two variables. That is, x causes y Heat causes ice to melt There is a reverse cause-and-effect relationship between the two variables. That is, y causes x A researcher believes excessive coffee consumption causes nervousness, BUT it may be that an extremely nervous person craves coffee to calm his nerves

Causation & Correlation  The relationship may be caused by the interaction of several other variables ◦ A researcher may find a significant relationship between students’ high school grades and college grades. But there are probably many other variables involved, such as IQ, hours of study, influence of parents, motivation, age, and instructors.  The apparent relationship may be strictly a coincidence ◦ A researcher may find a significant relationship between the increase in the number of people who are exercising and the increase in the number of people who are committing crimes. But so-called common sense suggests that any relationship between these two values is coincidental.

Causation & Correlation The relationship may be caused by a third variable (lurking variable) A statistician correlated the number of deaths due to drowning and the number of soft drinks consumed daily during the summer, but we can not say the soft drink is necessarily responsible for the deaths, since both variables may be related to heat and humidity