Regression Examples. Gas Mileage 1993 SOURCES: Consumer Reports: The 1993 Cars - Annual Auto Issue (April 1993), Yonkers, NY: Consumers Union. PACE New.

Slides:



Advertisements
Similar presentations
Topic 12: Multiple Linear Regression
Advertisements

Regression and correlation methods
Chapter 12 Simple Linear Regression
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
EPI 809/Spring Probability Distribution of Random Error.
Simple Linear Regression and Correlation
Chapter 12 Simple Linear Regression
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Creating Graphs on Saturn GOPTIONS DEVICE = png HTITLE=2 HTEXT=1.5 GSFMODE = replace; PROC REG DATA=agebp; MODEL sbp = age; PLOT sbp*age; RUN; This will.
Chapter 13 Multiple Regression
Multiple regression analysis
Chapter 11: Inferential methods in Regression and Correlation
Econ Prof. Buckles1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 4. Further Issues.
1Prof. Dr. Rainer Stachuletz Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 4. Further Issues.
1 Review of Correlation A correlation coefficient measures the strength of a linear relation between two measurement variables. The measure is based on.
Chapter Topics Types of Regression Models
Linear Regression and Correlation Analysis
1 Regression Analysis Regression used to estimate relationship between dependent variable (Y) and one or more independent variables (X). Consider the variable.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 11 th Edition.
This Week Continue with linear regression Begin multiple regression –Le 8.2 –C & S 9:A-E Handout: Class examples and assignment 3.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
Correlation and Linear Regression
Introduction to Linear Regression and Correlation Analysis
Inference for regression - Simple linear regression
Correlation and Linear Regression
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
© 2002 Prentice-Hall, Inc.Chap 14-1 Introduction to Multiple Regression Model.
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
Ms. Khatijahhusna Abd Rani School of Electrical System Engineering Sem II 2014/2015.
1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide Simple Linear Regression Part A n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
1 1 Slide © 2004 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation.
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation Note: Homework Due Thursday.
INTRODUCTORY LINEAR REGRESSION SIMPLE LINEAR REGRESSION - Curve fitting - Inferences about estimated parameter - Adequacy of the models - Linear.
6-3 Multiple Regression Estimation of Parameters in Multiple Regression.
Regression For the purposes of this class: –Does Y depend on X? –Does a change in X cause a change in Y? –Can Y be predicted from X? Y= mX + b Predicted.
1 1 Slide Simple Linear Regression Coefficient of Determination Chapter 14 BA 303 – Spring 2011.
© 2001 Prentice-Hall, Inc. Statistics for Business and Economics Simple Linear Regression Chapter 10.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y is.
Multiple Regression Petter Mostad Review: Simple linear regression We define a model where are independent (normally distributed) with equal.
1 Javier Aparicio División de Estudios Políticos, CIDE Primavera Regresión.
6-3 Multiple Regression Estimation of Parameters in Multiple Regression.
Topic 13: Multiple Linear Regression Example. Outline Description of example Descriptive summaries Investigation of various models Conclusions.
Regression Analysis Relationship with one independent variable.
Inference with computer printouts. Coefficie nts Standard Errort StatP-value Lower 95% Upper 95% Intercept
Simple Linear Regression. Data available : (X,Y) Goal : To predict the response Y. (i.e. to obtain the fitted response function f(X)) Least Squares Fitting.
Chapter 9: Correlation and Regression Analysis. Correlation Correlation is a numerical way to measure the strength and direction of a linear association.
1 Experimental Statistics - week 12 Chapter 12: Multiple Regression Chapter 13: Variable Selection Model Checking.
Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.
1 Experimental Statistics - week 13 Multiple Regression Miscellaneous Topics.
Significance Tests for Regression Analysis. A. Testing the Significance of Regression Models The first important significance test is for the regression.
1 Experimental Statistics - week 12 Chapter 11: Linear Regression and Correlation Chapter 12: Multiple Regression.
11-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
1 Experimental Statistics - week 11 Chapter 11: Linear Regression and Correlation.
INTRODUCTION TO MULTIPLE REGRESSION MULTIPLE REGRESSION MODEL 11.2 MULTIPLE COEFFICIENT OF DETERMINATION 11.3 MODEL ASSUMPTIONS 11.4 TEST OF SIGNIFICANCE.
Stats Methods at IC Lecture 3: Regression.
Multiple Regression.
Lecture 11: Simple Linear Regression
Statistics for Business and Economics (13e)
Slides by JOHN LOUCKS St. Edward’s University.
Multiple Regression.
St. Edward’s University
Presentation transcript:

Regression Examples

Gas Mileage 1993 SOURCES: Consumer Reports: The 1993 Cars - Annual Auto Issue (April 1993), Yonkers, NY: Consumers Union. PACE New Car & Truck 1993 Buying Guide (1993), Milwaukee, WI: Pace Publications Inc. Specifications are given for 93 new car models for the 1993 year.

Gas Mileage vs Weight Several measures are available, such as price, mpg ratings, engine size, cylinders, weight, horsepower, etc. We consider the relationship between weight and highway mpg. Since more fuel is needed to move more weight, an increase in weight should result in a decrease in mpg.

Pearson Correlation Coefficients, N = 93 Prob > |r| under H0: Rho=0 mpg weight mpg <.0001 weight <.0001 SAS Output

The REG Procedure Model: MODEL1 Dependent Variable: mpg Number of Observations Read 93 Number of Observations Used 93 Analysis of Variance Sum of Mean Source DF Squares Square F Value Pr > F Model <.0001 Error Corrected Total Root MSE R-Square Dependent Mean Adj R-Sq Coeff Var Parameter Estimates Parameter Standard Variable DF Estimate Error t Value Pr > |t| Intercept <.0001 weight <.0001

Interpretation From the graph it is evident that there is a fairly strong negative correlation between weight and mpg. The correlation output tells us that r=-.81. The regression output, under parameter estimates, tells us that the equation for the least squares line (best fit line) is mpg= *weight. Also, there are standard errors for the estimates that can be used to build confidence intervals, and there are t- values and p-values for a test of the hypothesis H o : Parameter=0 (vs not =0). Since the p-value for weight is.0001, we can reject H o and conclude that there is a linear relationship between weight and mpg (weight contributes information about mpg, or helps to predict mpg).

Gas Mileage vs Engine Size Generally speaking, larger engines burn more fuel (but is this due to weight?). We can check the relationship between liters (engine displacement) and mpg. We expect a negative relationship, since a larger engine would tend to decrease mpg.

Pearson Correlation Coefficients, N = 93 Prob > |r| under H0: Rho=0 mpg liters mpg <.0001 liters <.0001

The REG Procedure Model: MODEL1 Dependent Variable: mpg Number of Observations Read 93 Number of Observations Used 93 Analysis of Variance Sum of Mean Source DF Squares Square F Value Pr > F Model <.0001 Error Corrected Total Root MSE R-Square Dependent Mean Adj R-Sq Coeff Var Parameter Estimates Parameter Standard Variable DF Estimate Error t Value Pr > |t| Intercept <.0001 liters <.0001

Interpretation From the first graph it is evident that there is a negative relationship between weight and mpg. However, the pattern is not purely linear. It seems to be some kind of curve. Thus we will not expect a linear analysis to tell the whole story. The correlation output tells us that r=-.63. The regression output, tells us that the equation for the least squares line is mpg=37.68 – 3.22*liters. The p-value for liters is.0001, so we conclude that mpg has a linear relationship with engine size. Which is a better predictor of mpg, engine size or weight? We can use the R-square value to determine that. For weight it is.6572, and for liters it is.3929.

More Advanced Interpretation Which is a better predictor of mpg, engine size or weight? We can use the R-square value to determine that. For weight it is.6572, and for liters it is R-square is an indication of the proportion of changes in y that are accounted for by x, so a larger value corresponds to a better predictor. Thus weight is a better predictor than engine size. The graphs show the best fit line and a best fit parabola (quadratic equation). The latter is provided for comparison purposes only. Note that the quadratic equation, even though it fits the points better, may not be a better model, because it shows mpg rising as engine sizes get very large. This does not make sense.

Brief Look at Multiple Regression Now you might think, what if we wanted to use both weight and engine size to predict mpg? This idea is called multiple regression, and it involves making an equation with two or more “x” variables to predict y. The next regression output shows this. Compare the R-square and p-values to previous results.

The REG Procedure Model: MODEL1 Dependent Variable: mpg Number of Observations Read 93 Number of Observations Used 93 Analysis of Variance Sum of Mean Source DF Squares Square F Value Pr > F Model <.0001 Error Corrected Total Root MSE R-Square Dependent Mean Adj R-Sq Coeff Var Parameter Estimates Parameter Standard Variable DF Estimate Error t Value Pr > |t| Intercept <.0001 liters weight <.0001

Interpretation The least squares equation is mpg= *liters *weight. The R-square for weight alone is In the new model, it is It has gone up, but not much. This means that adding engine size to the equation does not improve predicted mpg very much. The p-value for weight is still very small, but the p-value for liters is now suspiciously large. Using alpha=.05, we would not reject that the coefficient of liters is zero, which means we are not able to detect a contribution to mpg.