Anthony Greene1 Regression Using Correlation To Make Predictions.

Slides:



Advertisements
Similar presentations
Chapter 12 Simple Linear Regression
Advertisements

Forecasting Using the Simple Linear Regression Model and Correlation
13- 1 Chapter Thirteen McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Regression BPS chapter 5 © 2006 W.H. Freeman and Company.
Regression Analysis Simple Regression. y = mx + b y = a + bx.
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Learning Objectives Copyright © 2004 John Wiley & Sons, Inc. Bivariate Correlation and Regression CHAPTER Thirteen.
Simple Linear Regression. G. Baker, Department of Statistics University of South Carolina; Slide 2 Relationship Between Two Quantitative Variables If.
1 Simple Linear Regression and Correlation The Model Estimating the Coefficients EXAMPLE 1: USED CAR SALES Assessing the model –T-tests –R-square.
Definition  Regression Model  Regression Equation Y i =  0 +  1 X i ^ Given a collection of paired data, the regression equation algebraically describes.
Correlation & Regression Chapter 15. Correlation statistical technique that is used to measure and describe a relationship between two variables (X and.
2.2 Correlation Correlation measures the direction and strength of the linear relationship between two quantitative variables.
9. SIMPLE LINEAR REGESSION AND CORRELATION
Regression and Correlation
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
SIMPLE LINEAR REGRESSION
Chapter Topics Types of Regression Models
Linear Regression and Correlation Analysis
Linear Regression MARE 250 Dr. Jason Turner.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Introduction to Probability and Statistics Linear Regression and Correlation.
SIMPLE LINEAR REGRESSION
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Pertemua 19 Regresi Linier
Least Squares Regression
Introduction to Regression Analysis, Chapter 13,
Linear Regression/Correlation
Correlation & Regression Math 137 Fresno State Burger.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
Linear Regression and Correlation
Descriptive Methods in Regression and Correlation
Linear Regression.
Regression and Correlation Methods Judy Zhong Ph.D.
SIMPLE LINEAR REGRESSION
Introduction to Linear Regression and Correlation Analysis
Slide Copyright © 2008 Pearson Education, Inc. Chapter 4 Descriptive Methods in Regression and Correlation.
Simple Linear Regression Models
Biostatistics Unit 9 – Regression and Correlation.
1 FORECASTING Regression Analysis Aslı Sencer Graduate Program in Business Information Systems.
Chapter 6 & 7 Linear Regression & Correlation
Review of Statistical Models and Linear Regression Concepts STAT E-150 Statistical Methods.
AP STATISTICS LESSON 3 – 3 LEAST – SQUARES REGRESSION.
1 Chapter 10 Correlation and Regression 10.2 Correlation 10.3 Regression.
Business Research Methods William G. Zikmund Chapter 23 Bivariate Analysis: Measures of Associations.
Linear Regression Least Squares Method: the Meaning of r 2.
Chapter 3 Section 3.1 Examining Relationships. Continue to ask the preliminary questions familiar from Chapter 1 and 2 What individuals do the data describe?
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
Introduction to Probability and Statistics Thirteenth Edition Chapter 12 Linear Regression and Correlation.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
Regression BPS chapter 5 © 2010 W.H. Freeman and Company.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
CHAPTER 5 Regression BPS - 5TH ED.CHAPTER 5 1. PREDICTION VIA REGRESSION LINE NUMBER OF NEW BIRDS AND PERCENT RETURNING BPS - 5TH ED.CHAPTER 5 2.
CORRELATION. Correlation key concepts: Types of correlation Methods of studying correlation a) Scatter diagram b) Karl pearson’s coefficient of correlation.
Lecture 10: Correlation and Regression Model.
Chapter Thirteen Copyright © 2006 John Wiley & Sons, Inc. Bivariate Correlation and Regression.
MARE 250 Dr. Jason Turner Linear Regression. Linear regression investigates and models the linear relationship between a response (Y) and predictor(s)
CHAPTER 5 CORRELATION & LINEAR REGRESSION. GOAL : Understand and interpret the terms dependent variable and independent variable. Draw a scatter diagram.
Chapter 8: Simple Linear Regression Yang Zhenlin.
^ y = a + bx Stats Chapter 5 - Least Squares Regression
LEAST-SQUARES REGRESSION 3.2 Least Squares Regression Line and Residuals.
Regression Analysis. 1. To comprehend the nature of correlation analysis. 2. To understand bivariate regression analysis. 3. To become aware of the coefficient.
Simple Linear Regression The Coefficients of Correlation and Determination Two Quantitative Variables x variable – independent variable or explanatory.
Lecture 10 Introduction to Linear Regression and Correlation Analysis.
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
Correlation and Regression Ch 4. Why Regression and Correlation We need to be able to analyze the relationship between two variables (up to now we have.
Inference for Least Squares Lines
Regression Analysis Week 4.
^ y = a + bx Stats Chapter 5 - Least Squares Regression
Chapter 14, part C Goodness of Fit..
Presentation transcript:

Anthony Greene1 Regression Using Correlation To Make Predictions

Anthony Greene2 Making a prediction To obtain the predicted value of y based on a known value of x and a known correlation. Note what happens for positive and negative values of r and for high and low values of r and for near-zero values of r.

Anthony Greene3 Graph of y = 5 – 3 x

Anthony Greene4 y-Intercept and Slope For a linear equation y = a + bx, the constant a is the y-intercept and the constant b is the slope. x and y are related variables

Anthony Greene5 Straight-line graphs of three linear equations Y = a + bX a = y-intercept b = slope (rise/run)

Anthony Greene6 Graphical Interpretation of Slope The straight-line graph of the linear equation y = a +bx slopes upward if b > 0, slopes downward if b < 0, and is horizontal if b = 0

Anthony Greene7 Graphical interpretation of slope

Anthony Greene8 Four data points

Anthony Greene9 Scatter plot

Anthony Greene10 Two possible straight-line fits to the data points

Anthony Greene11 Determining how well the data points in are fit by Line A Vs.Line B

Anthony Greene12 Least-Squares Criterion The straight line that best fits a set of data points is the one having the smallest possible sum of squared errors. Recall that the sum of squared errors is error variance.

Anthony Greene13 Regression Line and Regression Equation Regression line: The straight line that best fits a set of data points according to the least-squares criterion. Regression equation: The equation of the regression line.

Anthony Greene14 The best-fit line minimizes the distance between the actual data and the predicted value

Anthony Greene15 Residual, e, of a data point

Anthony Greene16 We define SS x, SS P and SS y by Notation Used in Regression and Correlation

Anthony Greene17 Regression Equation The regression equation for a set of n data points is

Anthony Greene18 The relationship between b and r That is, the regression slope is just the correlation coefficient scaled up to the right size for the variables x and y.

Anthony Greene19

Anthony Greene20 Criterion for Finding a Regression Line Before finding a regression line for a set of data points, draw a scatter diagram. If the data points do not appear to be scattered about a straight line, do not determine a regression line.

Anthony Greene21 Linear regression requires linear data: (a) Data points scattered about a curve (b) Inappropriate straight line fit to the data Higher order regression equations exist but are outside the range of this course

Anthony Greene22 Uniform Variance Math Proficiency By Grade

Anthony Greene23 Assumptions for Regression Inferences

Anthony Greene24 Table for obtaining the three sums of squares for the used car data

Anthony Greene25 Regression line and data points for used car data What is a fair asking price for a 2.5 year old car? So since the price unit is $100s, the best prediction is $17,271

Anthony Greene26 Extrapolation in the used car example

27 Total sum of squares, SST: The variation in the observed values of the response variable: Regression sum of squares, SSR: The variation in the observed values of the response variable that is explained by the regression: Error sum of squares, SSE: The variation in the observed values of the response variable that is not explained by the regression: Sums of Squares in Regression

Anthony Greene28 Regression Identity The total sum of squares equals the regression sum of squares plus the error sum of squares. In symbols, SST = SSR + SSE.

Anthony Greene29 Graphical portrayal of regression for used cars y = a + bx

Anthony Greene30 What sort of things could regression be used for? Any instance where a known correlation exists, regression can be used to predict a new score. Examples: 1. If you knew that there was a past correlation between the amount of study time and the grade on an exam, you could make a good prediction about the grade before it happened. 2. If you knew that certain features of a stock correlate with its price, you can use regression to predict the price before it happens.

Anthony Greene31 Regression Example: Low Correlation Find the regression equation for predicting height based on knowledge of weight. The existing data is for 10 male stats students?

Anthony Greene32 X Y

Anthony Greene33 X Y XY X 2 Y 2

Anthony Greene34 X Y XY X 2 Y 2 

Anthony Greene35 SS x =  x 2 - (  x) 2 /n = 465, ,472.4 = 32,372 S P =  xy -  x  y/n= 151, , b=S P /SS x, so b = 1,213/32,372=0.03 a = (1/n)(  y-b  x), so a = 0.1( ) = 66 So, Y=0.03x+66 X Y XY X 2 Y 2 2, , ,844 52,147  ^

Anthony Greene36 Y=0.03x+66 ^

Anthony Greene37 Regression Example: High Correlation Find the regression equation for predicting probability of a teenage suicide attempt based on weekly heroine usage.

38 XYXYX2X2 Y2Y

39 XYXYX2X2 Y2Y

40 XYXYX2X2 Y2Y Σ

41 n = 21 SS x =  x 2 - (  x) 2 /n = = 84 S P =  xy -  x  y/n= – = 9.14 b=S P /SS x, so b = 9.14/84 = a=(1/n)(  y-b  x), so a = (1/21)( ) = So, Y= 0.109x XYXYX2X2 Y2Y Σ ^

Anthony Greene42 Why Is It Called Regression? For low correlations, the predicted value is close to the mean For zero correlations the prediction is the mean Only for perfect correlations R 2 = 1.0 do the predicted scores show as much variation as the actual scores Since perfect correlations are rare, we say that the predicted scores show regression towards the mean