Topics: Regression Simple Linear Regression: one dependent variable and one independent variable Multiple Regression: one dependent variable and two or.

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

Chap 12-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 12 Simple Regression Statistics for Business and Economics 6.
Forecasting Using the Simple Linear Regression Model and Correlation
Correlation and Linear Regression.
Inference for Regression
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Learning Objectives Copyright © 2004 John Wiley & Sons, Inc. Bivariate Correlation and Regression CHAPTER Thirteen.
Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester
Learning Objectives 1 Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Correlation and Regression
Simple Linear Regression and Correlation
© The McGraw-Hill Companies, Inc., 2000 CorrelationandRegression Further Mathematics - CORE.
Chapter 12 Simple Linear Regression
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
Chapter 12 Simple Regression
Statistics for Business and Economics
Multiple Regression Models. The Multiple Regression Model The relationship between one dependent & two or more independent variables is a linear function.
The Simple Regression Model
Lesson #32 Simple Linear Regression. Regression is used to model and/or predict a variable; called the dependent variable, Y; based on one or more independent.
SIMPLE LINEAR REGRESSION
Chapter Topics Types of Regression Models
Correlation and Regression. Correlation What type of relationship exists between the two variables and is the correlation significant? x y Cigarettes.
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Simple Linear Regression Analysis
T-tests and ANOVA Statistical analysis of group differences.
Lecture 5 Correlation and Regression
Correlation & Regression
Correlation and Regression
Lecture 16 Correlation and Coefficient of Correlation
Regression and Correlation Methods Judy Zhong Ph.D.
Introduction to Linear Regression and Correlation Analysis
Section #6 November 13 th 2009 Regression. First, Review Scatter Plots A scatter plot (x, y) x y A scatter plot is a graph of the ordered pairs (x, y)
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
Simple Linear Regression One reason for assessing correlation is to identify a variable that could be used to predict another variable If that is your.
© 2001 Prentice-Hall, Inc. Statistics for Business and Economics Simple Linear Regression Chapter 10.
Introduction to Linear Regression
EQT 373 Chapter 3 Simple Linear Regression. EQT 373 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value.
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
Elementary Statistics Correlation and Regression.
Correlation & Regression
Regression. Population Covariance and Correlation.
Examining Relationships in Quantitative Research
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
Basic Concepts of Correlation. Definition A correlation exists between two variables when the values of one are somehow associated with the values of.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
MARKETING RESEARCH CHAPTER 18 :Correlation and Regression.
Lecture 10: Correlation and Regression Model.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
Correlation & Regression Analysis
Regression Analysis. 1. To comprehend the nature of correlation analysis. 2. To understand bivariate regression analysis. 3. To become aware of the coefficient.
Regression Analysis Deterministic model No chance of an error in calculating y for a given x Probabilistic model chance of an error First order linear.
Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 7: Regression.
Correlation and Regression Elementary Statistics Larson Farber Chapter 9 Hours of Training Accidents.
Bivariate Regression. Bivariate Regression analyzes the relationship between two variables. Bivariate Regression analyzes the relationship between two.
REGRESSION G&W p
Statistics for Managers using Microsoft Excel 3rd Edition
Inference for Regression
Correlation and Simple Linear Regression
Correlation and Regression
Correlation and Simple Linear Regression
Product moment correlation
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Topics: Regression Simple Linear Regression: one dependent variable and one independent variable Multiple Regression: one dependent variable and two or more independent variables.

Correlation A correlation describes a relationship between two variables Correlation tries to answer the following questions: –What is the relationship between variable X and variable Y? –How are the scores on one measure associated with scores on another measure? –To what extent do the high scores on one variable go with the high scores on the second variable?

Simple Linear Regression Understanding relationships between variables: –Prediction –Explanation

Design Requirements and Assumptions Two continuous variables Variables are linearly related Random Sampling Independence Bivariate Normality N >= 30

Example You are the admissions committee in the Sociology department of a large west coast University. You are trying to make decisions about who to admit to the Master’s program. You would like to be able to predict how well the applicants you are deciding about will do at your school. Your department has been analyzing the performance of it’s graduate students over the years. One thing it has been looking at it is relationship between undergraduate GPA and graduate GPA. From regression analyses done over the years, you are able to make some educated guesses about how applicants will perform once admitted.

How Used in Making Predictions

The Regression Coefficient? What Slope? What Altitude?

Fitting the Regression Line: The Best Fit (Least Squares) Y'= a + b y X The predicted value of Y(Y') for a value of X is computed by: –Multiplying a score (X) by the regression coefficient (b y ) –Adding the regression constant (a) to this product The prediction of Y from X based on linear relationship of X and Y so that errors are minimized

Least Squares Fit: Visual * * Where the average squared distance of the points from the regression line is minimized

Minimizing Prediction Error: What that Means (For Math Types)

The Regression Coefficient: Close Your Eyes if You Don’t Want the Derivation b y = r xy (s y /s x ) –b y = regression coefficient –r = correlation between X and Y –s y = standard deviation of Y –s x = standard deviation of X Compute b y : divide the standard deviation of Y (s y ) by the standard deviation of X (s x ) then multiply by the Pearson correlation (r xy )between X and Y

The Constant (a): More Math Regression Constant (a): the altitude of the regression line; the value where the regression line intercepts Y where X = 0 (the Y intercept) a = Y - b y X –a = the regression constant –Y = mean of Y –b y = regression coefficient –X = mean of X Compute a: multiply X (mean of X) by the regression coefficient (b y ) and then subtract that product from Y (mean of Y)

Plotting Regression Line Need compute two predicted scores: –For X (undergrad GPA) = 2.75 Y’ = a + b y X = (2.75) = 3.59 –For X (undergrad GPA) = 3.60 Y’ = a + b y X = (3.60) = 3.79 Draw regression line through scatter plot using these two points

Plotting the Regression Line: Visual * *

Errors of Prediction

Standard Error of Estimate The magnitude of the error made in estimating Y from X: a measure of dispersion around the regression line The average error of prediction

The Standard Error of Estimate: A Visual Representation Graduate GPA Undergraduate GPA

Standard Error of Estimate: Another Visual Representation Y

Is the prediction worth pursuing? Standard error Amount of variance explained by X Testing the regression coefficient (b) for significance

Explaining Variance: How much? Total Variance Predicted Variance Unpredicted Variance Y

Assessing Prediction Accuracy: Explaining Variance Total Variance: = Predicted variance + Residual (unexplained) variance Coefficient of Determination (r 2 ):Proportion of total variance in Y that has been predicted by variable X (r 2 = s 2 y’ /s 2 y ) –Our example: r =.56, so r 2 =.3136 Coefficient of Non-Determination (1-r 2): : Proportion of total variance in Y that is not predicted by X –Our example: 1- r 2 = =.69

Proportion of Explained (Predicted) and Unexplained (Residual) Variance X Y r xy =.56 r 2 =.31 (31%) Explained variance (1-r 2 ) =.69 (69%) Unexplained variance

t-Test for Individual Regression Coefficients (b y ) H 0 :  = 0 (where  is the population regression coefficient) H 1 :  not= 0 Compute a t statistic: T = (b -  )/s b = b/s b (how many standard error points b is from the hypothesized population parameter under the null hypothesis,  = 0 )

t-Test of b: Our Example t =.24/.12 = 2.00 Set alpha at.05 (two-tailed) Figure out df (N-2): 8 t critical (05/2,8) = Decision: t observed (2.00) < t critical (2.306) so do not reject the null hypothesis Conclusion: cannot conclude that the slope is significantly different from 0 in the population.

Our Conclusion: Do not reject the null hypothesis * *

Warnings Simple regression assumes a straight line relationship Outliers can control regression results Assumes random samples for making proper generalizations Regression is correlational and does not show a causal link between x causes y