Simple Regression Model

Slides:



Advertisements
Similar presentations
Chap 12-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 12 Simple Regression Statistics for Business and Economics 6.
Advertisements

Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
 Coefficient of Determination Section 4.3 Alan Craig
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Multiple Regression Analysis
Chapter 13 Multiple Regression
Chapter 12 Simple Regression
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Simple Linear Regression Basic Business Statistics 11 th Edition.
Chapter 12 Multiple Regression
Lesson #32 Simple Linear Regression. Regression is used to model and/or predict a variable; called the dependent variable, Y; based on one or more independent.
Topics: Regression Simple Linear Regression: one dependent variable and one independent variable Multiple Regression: one dependent variable and two or.
Linear Regression Example Data
Empirical Estimation Review EconS 451: Lecture # 8 Describe in general terms what we are attempting to solve with empirical estimation. Understand why.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Chapter 6 (cont.) Regression Estimation. Simple Linear Regression: review of least squares procedure 2.
Chapter 13 Simple Linear Regression
1 Simple Linear Regression 1. review of least squares procedure 2. inference for least squares lines.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Introduction to Multiple Regression Statistics for Managers.
Chapter 2 Overview of the Data Mining Process 1. Introduction Data Mining – Predictive analysis Tasks of Classification & Prediction Core of Business.
Statistics for Business and Economics 7 th Edition Chapter 11 Simple Regression Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Introduction to Linear Regression and Correlation Analysis
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Statistics for Business and Economics 8 th Edition Chapter 11 Simple Regression Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Measures of relationship Dr. Omar Al Jadaan. Agenda Correlation – Need – meaning, simple linear regression – analysis – prediction.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
1 1 Slide Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple Coefficient of Determination n Model Assumptions n Testing.
Statistics for Business and Economics 7 th Edition Chapter 11 Simple Regression Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Ch4 Describing Relationships Between Variables. Pressure.
Managerial Economics Demand Estimation. Scatter Diagram Regression Analysis.
You want to examine the linear dependency of the annual sales of produce stores on their size in square footage. Sample data for seven stores were obtained.
Statistics for the Social Sciences Psychology 340 Fall 2013 Correlation and Regression.
Simple Linear Regression One reason for assessing correlation is to identify a variable that could be used to predict another variable If that is your.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 12-1 Correlation and Regression.
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Applied Quantitative Analysis and Practices LECTURE#22 By Dr. Osman Sadiq Paracha.
EQT 373 Chapter 3 Simple Linear Regression. EQT 373 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value.
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
Ch4 Describing Relationships Between Variables. Section 4.1: Fitting a Line by Least Squares Often we want to fit a straight line to data. For example.
MBP1010H – Lecture 4: March 26, Multiple regression 2.Survival analysis Reading: Introduction to the Practice of Statistics: Chapters 2, 10 and 11.
Chap 13-1 Copyright ©2012 Pearson Education, Inc. publishing as Prentice Hall Chap 13-1 Chapter 13 Simple Linear Regression Basic Business Statistics 12.
Regression Lesson 11. The General Linear Model n Relationship b/n predictor & outcome variables form straight line l Correlation, regression, t-tests,
ANOVA for Regression ANOVA tests whether the regression model has any explanatory power. In the case of simple regression analysis the ANOVA test and the.
Statistics for Business and Economics 8 th Edition Chapter 11 Simple Regression Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
Slide 1 DSCI 5340: Predictive Modeling and Business Forecasting Spring 2013 – Dr. Nick Evangelopoulos Lecture 2: Review of Multiple Regression (Ch. 4-5)
Statistics for Business and Economics 8 th Edition Chapter 11 Simple Regression Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Lecture 10: Correlation and Regression Model.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
Multiple Regression I 1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 4 Multiple Regression Analysis (Part 1) Terry Dielman.
Statistics for Business and Economics 8 th Edition Chapter 11 Simple Regression Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch.
© 2001 Prentice-Hall, Inc.Chap 13-1 BA 201 Lecture 18 Introduction to Simple Linear Regression (Data)Data.
Regression Analysis. 1. To comprehend the nature of correlation analysis. 2. To understand bivariate regression analysis. 3. To become aware of the coefficient.
Statistics for Managers Using Microsoft® Excel 5th Edition
Introduction to Multiple Regression Lecture 11. The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & 2 or more.
Regression Analysis Deterministic model No chance of an error in calculating y for a given x Probabilistic model chance of an error First order linear.
Chapter 12 Simple Linear Regression.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 14-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
Simple linear regression and correlation Regression analysis is the process of constructing a mathematical model or function that can be used to predict.
REGRESSION REVISITED. PATTERNS IN SCATTER PLOTS OR LINE GRAPHS Pattern Pattern Strength Strength Regression Line Regression Line Linear Linear y = mx.
Chapter 12 Simple Regression Statistika.  Analisis regresi adalah analisis hubungan linear antar 2 variabel random yang mempunyai hub linear,  Variabel.
The simple linear regression model and parameter estimation
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Simple Linear Regression
Correlation and Regression
Ch 4.1 & 4.2 Two dimensions concept
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Regression Concept & Examples, Latent Variables, & Partial Least Squares (PLS)

Simple Regression Model Make prediction about the starting salary of a current college graduate Data set of starting salaries of recent college graduates Data Set Compute Average Salary How certain are of this prediction? There is variability in the data.

Simple Regression Model Use total variation as an index of uncertainty about our prediction Compute Total Variation The smaller the amount of total variation the more accurate (certain) will be our prediction.

Simple Regression Model How “explain” the variability - Perhaps it depends on the student’s GPA Salary GPA

Simple Regression Model Find a linear relationship between GPA and starting salary As GPA increases/decreases starting salary increases/decreases

Simple Regression Model Least Squares Method to find regression model Choose a and b in regression model (equation) so that it minimizes the sum of the squared deviations – actual Y value minus predicted Y value (Y-hat)

Simple Regression Model How good is the model? a= 4,779 & b = 5,370 A computer program computed these values u-hat is a “residual” value The sum of all u-hats is zero The sum of all u-hats squared is the total variance not explained by the model “unexplained variance” is 7,425,926

Simple Regression Model Total Variation = 23,000,000

Simple Regression Model Total Unexplained Variation = 7,425,726

Simple Regression Model Relative Goodness of Fit Summarize the improvement in prediction using regression model Compute R2 – coefficient of determination Regression Model (equation) a better predictor than guessing the average salary The GPA is a more accurate predictor of starting salary than guessing the average R2 is the “performance measure“ for the model. Predicted Starting Salary = 4,779 + 5,370 * GPA

Detailed Regression Example

Data Set Obs # Salary GPA Months Work 1 20000 2.8 48 2 24500 3.4 24 3 23000 3.2 4 25000 3.8 5 6 22500 36 7 27500 4.0 20 8 19000 2.6 9 24000 10 28500 12

Scatter Plot - GPA vs Salary

Scatter Plot - Work vs Salary

Pearson Correlation Coefficients -1 <= r <= 1   Salary GPA Months Work 1 0.898007 -0.93927 -0.82993

Three Regressions Salary = f(GPA) Salary = f(Work) Salary = f(GPA, Work) Interpret Excel Output

Interpreting Results Regression Statistics Statistical Significance Multiple R, R2, R2adj Standard Error Sy Statistical Significance t-test p-value F test

Regression Statistics Table Multiple R R = square root of R2 R2 Coefficient of Determination R2adj used if more than one x variable Standard Error Sy This is the sample estimate of the standard deviation of the error (actual – predicted)

ANOVA Table Table 1 gives the F statistic Tests the claim there is no significant relationship between your all of your independent and dependent variables The significance F value is a p-value should reject the claim: Of NO significant relationship between your independent and dependent variables if p< Generally  = 0.05

Regression Coefficients Table Coefficients Column gives b0 , b1, ,b2 , … , bn values for the regression equation. The b0 is the intercept b1value is next to your independent variable x1 b2 is next to your independent variable x2. b3 is next to your independent variable x3

Regression Coefficients Table p values for individual t tests each independent variables t test - tests the claim that there is no relationship between the independent variable (in the corresponding row) and your dependent variable. Should reject the claim Of NO significant relationship between your independent variable (in the corresponding row) and dependent variable if p<.

Regression Statistics Salary = f(GPA) Regression Statistics f(GPA) Multiple R 0.898006642 R Square 0.806415929 Adjusted R Square 0.78221792 Standard Error 1479.019946 Observations 10 ANOVA   df SS MS F Significance F Regression 1 72900000 33.32571 0.00041792 Residual 8 17500000 2187500 Total 9 90400000   Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 1928.571429 3748.677 0.514467 0.620833 -6715.89326 10573.04 GPA 6428.571429 1113.589 5.772843 0.000418 3860.63173 8996.511

Regression Statistics Salary = f(Work) Regression Statistics f(Work)  Multiple R 0.939265177 R Square 0.882219073 Adjusted R Square 0.867496457 Standard Error 1153.657002 Observations 10 ANOVA   df SS MS F Significance F Regression 1 79752604.17 79752604 59.92271 5.52993E-05 Residual 8 10647395.83 1330924 Total 9 90400000   Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 30691.66667 1010.136344 30.38369 1.49E-09 28362.28808 33021.0453 Months Work -227.864583 29.43615619 -7.74098 5.53E-05 -295.7444812 -159.98469

Regression Statistics Salary = f(GPA, Work) Regression Statistics  f(GPA,Work) Multiple R 0.962978985 R Square 0.927328525 Adjusted R Square 0.906565246 Standard Error 968.7621974 Observations 10 ANOVA   df SS MS F Significance F Regression 2 83830499 41915249 44.66195 0.00010346 Residual 7 6569501 938500.2 Total 9 90400000   Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 19135.92896 5608.184 3.412144 0.011255 5874.682112 32397.176 GPA 2725.409836 1307.468 2.084495 0.075582 -366.2602983 5817.08 Months Work -151.2124317 44.30826 -3.41274 0.011246 -255.9848174 -46.440046

Compare Three “Models” Regression Statistics f(GPA) Multiple R 0.898006642 R Square 0.806415929 Adjusted R Square 0.78221792 Standard Error 1479.019946 Observations 10 Regression Statistics f(Work)  Multiple R 0.939265177 R Square 0.882219073 Adjusted R Square 0.867496457 Standard Error 1153.657002 Observations 10 Regression Statistics  f(GPA,Work) Multiple R 0.962978985 R Square 0.927328525 Adjusted R Square 0.906565246 Standard Error 968.7621974 Observations 10

Latent Variables (Theoretical Entities)

Latent Variables Latent Variables Explanatory Variables that are not directly measured Identified by “Exploratory Factor Analysis” Confirmed by “Confirmatory Factor Analysis” Statistical Methods for Latent Variables Principles Components Analysis PLS SEM

Example: Confirmatory Factor Analysis Intention to Use Travelocity Website

Research Instrument