Download presentation

1
**Regression Concept & Examples, Latent Variables, & Partial Least Squares (PLS)**

2
**Simple Regression Model**

Make prediction about the starting salary of a current college graduate Data set of starting salaries of recent college graduates Data Set Compute Average Salary How certain are of this prediction? There is variability in the data.

3
**Simple Regression Model**

Use total variation as an index of uncertainty about our prediction Compute Total Variation The smaller the amount of total variation the more accurate (certain) will be our prediction.

4
**Simple Regression Model**

How “explain” the variability - Perhaps it depends on the student’s GPA Salary GPA

5
**Simple Regression Model**

Find a linear relationship between GPA and starting salary As GPA increases/decreases starting salary increases/decreases

6
**Simple Regression Model**

Least Squares Method to find regression model Choose a and b in regression model (equation) so that it minimizes the sum of the squared deviations – actual Y value minus predicted Y value (Y-hat)

7
**Simple Regression Model**

How good is the model? a= 4,779 & b = 5,370 A computer program computed these values u-hat is a “residual” value The sum of all u-hats is zero The sum of all u-hats squared is the total variance not explained by the model “unexplained variance” is 7,425,926

8
**Simple Regression Model**

Total Variation = 23,000,000

9
**Simple Regression Model**

Total Unexplained Variation = 7,425,726

10
**Simple Regression Model**

Relative Goodness of Fit Summarize the improvement in prediction using regression model Compute R2 – coefficient of determination Regression Model (equation) a better predictor than guessing the average salary The GPA is a more accurate predictor of starting salary than guessing the average R2 is the “performance measure“ for the model. Predicted Starting Salary = 4, ,370 * GPA

11
**Detailed Regression Example**

12
**Data Set Obs # Salary GPA Months Work 1 20000 2.8 48 2 24500 3.4 24 3**

23000 3.2 4 25000 3.8 5 6 22500 36 7 27500 4.0 20 8 19000 2.6 9 24000 10 28500 12

13
**Scatter Plot - GPA vs Salary**

14
**Scatter Plot - Work vs Salary**

15
**Pearson Correlation Coefficients -1 <= r <= 1**

Salary GPA Months Work 1

16
**Three Regressions Salary = f(GPA) Salary = f(Work)**

Salary = f(GPA, Work) Interpret Excel Output

17
**Interpreting Results Regression Statistics Statistical Significance**

Multiple R, R2, R2adj Standard Error Sy Statistical Significance t-test p-value F test

18
**Regression Statistics Table**

Multiple R R = square root of R2 R2 Coefficient of Determination R2adj used if more than one x variable Standard Error Sy This is the sample estimate of the standard deviation of the error (actual – predicted)

19
**ANOVA Table Table 1 gives the F statistic Tests the claim**

there is no significant relationship between your all of your independent and dependent variables The significance F value is a p-value should reject the claim: Of NO significant relationship between your independent and dependent variables if p< Generally = 0.05

20
**Regression Coefficients Table**

Coefficients Column gives b0 , b1, ,b2 , … , bn values for the regression equation. The b0 is the intercept b1value is next to your independent variable x1 b2 is next to your independent variable x2. b3 is next to your independent variable x3

21
**Regression Coefficients Table**

p values for individual t tests each independent variables t test - tests the claim that there is no relationship between the independent variable (in the corresponding row) and your dependent variable. Should reject the claim Of NO significant relationship between your independent variable (in the corresponding row) and dependent variable if p<.

22
**Regression Statistics**

Salary = f(GPA) Regression Statistics f(GPA) Multiple R R Square Adjusted R Square Standard Error Observations 10 ANOVA df SS MS F Significance F Regression 1 Residual 8 Total 9 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept GPA

23
**Regression Statistics**

Salary = f(Work) Regression Statistics f(Work) Multiple R R Square Adjusted R Square Standard Error Observations 10 ANOVA df SS MS F Significance F Regression 1 E-05 Residual 8 Total 9 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 1.49E-09 Months Work 5.53E-05

24
**Regression Statistics**

Salary = f(GPA, Work) Regression Statistics f(GPA,Work) Multiple R R Square Adjusted R Square Standard Error Observations 10 ANOVA df SS MS F Significance F Regression 2 Residual 7 Total 9 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept GPA Months Work

25
**Compare Three “Models”**

Regression Statistics f(GPA) Multiple R R Square Adjusted R Square Standard Error Observations 10 Regression Statistics f(Work) Multiple R R Square Adjusted R Square Standard Error Observations 10 Regression Statistics f(GPA,Work) Multiple R R Square Adjusted R Square Standard Error Observations 10

26
**Latent Variables (Theoretical Entities)**

27
**Latent Variables Latent Variables**

Explanatory Variables that are not directly measured Identified by “Exploratory Factor Analysis” Confirmed by “Confirmatory Factor Analysis” Statistical Methods for Latent Variables Principles Components Analysis PLS SEM

28
**Example: Confirmatory Factor Analysis Intention to Use Travelocity Website**

29
Research Instrument

Similar presentations

© 2024 SlidePlayer.com Inc.

All rights reserved.

To make this website work, we log user data and share it with processors. To use this website, you must agree to our Privacy Policy, including cookie policy.

Ads by Google