Linear Regression Modeling with Data. The BIG Question Did you prepare for today? If you did, mark yes and estimate the amount of time you spent preparing.

Slides:



Advertisements
Similar presentations
Simple Linear Regression and Correlation by Asst. Prof. Dr. Min Aung.
Advertisements

Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Correlation and Regression
Correlation Correlation is the relationship between two quantitative variables. Correlation coefficient (r) measures the strength of the linear relationship.
© The McGraw-Hill Companies, Inc., 2000 CorrelationandRegression Further Mathematics - CORE.
CORRELATON & REGRESSION
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
SIMPLE LINEAR REGRESSION
ASSESSING THE STRENGTH OF THE REGRESSION MODEL. Assessing the Model’s Strength Although the best straight line through a set of points may have been found.
Linear Regression and Correlation Analysis
Correlation A correlation exists between two variables when one of them is related to the other in some way. A scatterplot is a graph in which the paired.
Correlation and Regression. Correlation What type of relationship exists between the two variables and is the correlation significant? x y Cigarettes.
Regression Chapter 10 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
Chapter 9: Correlation and Regression
SIMPLE LINEAR REGRESSION
Lecture 5 Correlation and Regression
Correlation and Linear Regression Chapter 13 Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Linear Regression.
SIMPLE LINEAR REGRESSION
Correlation and Regression
Introduction to Linear Regression and Correlation Analysis
Correlation and Regression
Correlation.
Correlation and Regression
© The McGraw-Hill Companies, Inc., 2000 Business and Finance College Principles of Statistics Lecture 10 aaed EL Rabai week
9.1 Correlation Key Concepts: –Scatter Plots –Correlation –Sample Correlation Coefficient, r –Hypothesis Testing for the Population Correlation Coefficient,
CHAPTER 14 MULTIPLE REGRESSION
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-5 Multiple Regression.
© The McGraw-Hill Companies, Inc., Chapter 11 Correlation and Regression.
1 Chapter 10 Correlation and Regression 10.2 Correlation 10.3 Regression.
Chapter 10 Correlation and Regression
Production Planning and Control. A correlation is a relationship between two variables. The data can be represented by the ordered pairs (x, y) where.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
Elementary Statistics Correlation and Regression.
CHAPTER 3 INTRODUCTORY LINEAR REGRESSION. Introduction  Linear regression is a study on the linear relationship between two variables. This is done by.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Click to edit Master title style Midterm 3 Wednesday, June 10, 1:10pm.
Correlation MEASURING ASSOCIATION Establishing a degree of association between two or more variables gets at the central objective of the scientific enterprise.
Chapter 9 Correlation and Regression.
Chapter 4 Summary Scatter diagrams of data pairs (x, y) are useful in helping us determine visually if there is any relation between x and y values and,
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Scatter Diagrams scatter plot scatter diagram A scatter plot is a graph that may be used to represent the relationship between two variables. Also referred.
June 30, 2008Stat Lecture 16 - Regression1 Inference for relationships between variables Statistics Lecture 16.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Example x y We wish to check for a non zero correlation.
.  Relationship between two sets of data  The word Correlation is made of Co- (meaning "together"), and Relation  Correlation is Positive when the.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
Go to Table of Content Correlation Go to Table of Content Mr.V.K Malhotra, the marketing manager of SP pickles pvt ltd was wondering about the reasons.
© The McGraw-Hill Companies, Inc., Chapter 10 Correlation and Regression.
Correlation and Regression Elementary Statistics Larson Farber Chapter 9 Hours of Training Accidents.
Pearson’s Correlation The Pearson correlation coefficient is the most widely used for summarizing the relation ship between two variables that have a straight.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 9 l Simple Linear Regression 9.1 Simple Linear Regression 9.2 Scatter Diagram 9.3 Graphical.
Correlation & Linear Regression Using a TI-Nspire.
Correlation and Regression
Lecture #25 Tuesday, November 15, 2016 Textbook: 14.1 and 14.3
Correlation and Linear Regression
Regression and Correlation
Correlation and Simple Linear Regression
CHAPTER 10 Correlation and Regression (Objectives)
Correlation and Simple Linear Regression
2. Find the equation of line of regression
Correlation and Simple Linear Regression
Correlation and Regression
SIMPLE LINEAR REGRESSION
SIMPLE LINEAR REGRESSION
Topic 8 Correlation and Regression Analysis
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Linear Regression Modeling with Data

The BIG Question Did you prepare for today? If you did, mark yes and estimate the amount of time you spent preparing on your frequency log.

Problem Suppose we are given the following data about father and son heights to analyze. What can we conclude about it?

Connect How about if we formulate a hypothesis to investigate such as: Is there a correlation between a father’s height and his son’s height? : There is a correlation between a father’s height and his son’s height. : There is no correlation between a father’s height and his son’s height. Is there anything we have studied that can help you think where to start?

Definitions For a problem such as this one, we are trying to determine if there is a relationship between two variables. This is called a correlation. The data can be represented as ordered pairs (x, y). Does anyone recall what the x and y are called? The x-variable is the independent (or explanatory) variable and the y-variable is the dependent (or response) variable. This is similar to the concepts you have seen in algebra. In our example, the father’s height is the independent variable and the son’s height is the dependent variable.

Scatter plot A scatter plot is a plotting of the ordered pairs (x, y) which is used to see what kind of correlation two variables might have. Example 1: What kind of correlation would you guess these data sets to have? Negative Linear Correlation Positive Linear Correlation Nonlinear CorrelationNo Correlation

Father and Son Data Scatter plot Using SPSS, I loaded the father and son height data into the software. I then generated a scatter plot for the data which looks like: What kind of correlation does it look like it might have? Looks like a positive linear relationship.

Question Is there a way can we can calculate to find out if there is a correlation and how strong it might be? The correlation coefficient, denoted as r, gives us a measure of the strength and direction of a linear relationship between two variables. The population correlation coefficient is denoted as ρ. How do we calculate the correlation coefficient? The formula is: Where n is the number of data pairs.

What is the correlation coefficient for the father and son data? Using SPSS we have the following output: This is the correlation coefficient. 0 1 If r = -1 there is a perfect negative correlation If r is close to 0 there is no linear correlation If r = 1 there is a perfect positive correlation ● About where.668 is. What is the range for the correlation coefficient?

Analysis Since the correlation coefficient is.668, this implies there seems to be a positive linear relationship between a father’s height and his son’s height. However, does this imply that this relationship is significant enough to use it to predict if it would hold as a population correlation coefficient for ρ? We would use r as the test statistic and could use the standardized test statistic t with degrees of freedom n - 2. How do we calculate the t statistic here?

Hypothesis testing for significance Testing the null hypothesis that there is no linear relationship between the independent and dependent variables, we would use the model: : ρ = 0 : ρ ≠ 0  Degrees of freedom would be 11 – 2 = 9. Thus at a.05 significance, the rejection region starts at - = and = Example

Calculate and Summarize By running a model analysis in SPSS we have: At the.05 level of significance, the t-value is The test statistic lies inside of the rejection region which starts at Thus there is enough evidence to reject the null hypothesis and conclude there is a significant linear correlation between a father’s height and his son’s height.

Finding the Regression Line Now that we know that there is a significant linear correlation between a father and son’s height, we can find the regression line. The regression line is the line that best models the data. It can be used to predict the value of y given a value of x. In SPSS we find the regression line to the right:

Question Can we find the exact equation of the regression line? Yes, the equation is similar to the equation of a line from algebra. Who recalls the equation of a line?