7 - 1 8 - 1 Chapter 8: Regression Analysis PowerPoint Slides Prepared By: Alan Olinsky Bryant University Management Science: The Art of Modeling with Spreadsheets,

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

Lesson 10: Linear Regression and Correlation
Correlation and regression
Copyright © 2006 The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 ~ Curve Fitting ~ Least Squares Regression Chapter.
13- 1 Chapter Thirteen McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Learning Objectives Copyright © 2004 John Wiley & Sons, Inc. Bivariate Correlation and Regression CHAPTER Thirteen.
Regression Analysis Using Excel. Econometrics Econometrics is simply the statistical analysis of economic phenomena Here, we just summarize some of the.
MANAGEMENT SCIENCE The Art of Modeling with Spreadsheets STEPHEN G. POWELL KENNETH R. BAKER Compatible with Analytic Solver Platform FOURTH EDITION CHAPTER.
Chapter 10 Simple Regression.
Regression Analysis. Unscheduled Maintenance Issue: l 36 flight squadrons l Each experiences unscheduled maintenance actions (UMAs) l UMAs costs $1000.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Linear Regression and Correlation
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
The Simple Regression Model
SIMPLE LINEAR REGRESSION
Gordon Stringer, UCCS1 Regression Analysis Gordon Stringer.
Simple Linear Regression Analysis
SIMPLE LINEAR REGRESSION
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Simple Linear Regression and Correlation
Spreadsheet Problem Solving
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.
MANAGEMENT SCIENCE The Art of Modeling with Spreadsheets STEPHEN G. POWELL KENNETH R. BAKER Compatible with Analytic Solver Platform FOURTH EDITION CHAPTER.
Copyright ©2011 Pearson Education 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft Excel 6 th Global Edition.
Simple Linear Regression
Correlation and Linear Regression
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 11 Regression.
Regression and Correlation Methods Judy Zhong Ph.D.
SIMPLE LINEAR REGRESSION
Introduction to Linear Regression and Correlation Analysis
Linear Regression and Correlation
Correlation and Linear Regression
Chapter 191 Data Analysis and Data Presentation Chapter 19 Achieving Quality Through Continual Improvement Claude W. Burrill / Johannes Ledolter Published.
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft.
STATISTICS: BASICS Aswath Damodaran 1. 2 The role of statistics Aswath Damodaran 2  When you are given lots of data, and especially when that data is.
CPE 619 Simple Linear Regression Models Aleksandar Milenković The LaCASA Laboratory Electrical and Computer Engineering Department The University of Alabama.
Simple Linear Regression Models
Bivariate Regression (Part 1) Chapter1212 Visual Displays and Correlation Analysis Bivariate Regression Regression Terminology Ordinary Least Squares Formulas.
3 - 1 Chapter 3: Basic Excel Skills Management Science: The Art of Modeling With Spreadsheets, 2e PowerPoint Slides Prepared By: Alan Olinsky Bryant University.
7 - 1 Chapter 7: Data Analysis for Modeling PowerPoint Slides Prepared By: Alan Olinsky Bryant University Management Science: The Art of Modeling with.
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
4 - 1 Chapter 4: Advanced Excel Skills Management Science: The Art of Modeling with Spreadsheets, 3e S.G. Powell K.R. Baker © John Wiley and Sons, Inc.
Chapter 5: Regression Analysis Part 1: Simple Linear Regression.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
1 11 Simple Linear Regression and Correlation 11-1 Empirical Models 11-2 Simple Linear Regression 11-3 Properties of the Least Squares Estimators 11-4.
Reid & Sanders, Operations Management © Wiley 2002 Forecasting 8 C H A P T E R.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
7 - 1 Chapter 7: Data Analysis in the Service of Modeling The Art of Modeling with Spreadsheets S.G. Powell and K.R. Baker © John Wiley and Sons, Inc.
Chapter 9: Short-Term Forecasting PowerPoint Slides Prepared By: Alan Olinsky Bryant University Management Science: The Art of Modeling with.
MANAGEMENT SCIENCE The Art of Modeling with Spreadsheets STEPHEN G. POWELL KENNETH R. BAKER Compatible with Analytic Solver Platform FOURTH EDITION OPTIMIZATION.
Lesson 14 - R Chapter 14 Review. Objectives Summarize the chapter Define the vocabulary used Complete all objectives Successfully answer any of the review.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
6 - 1 Chapter 6: Analysis Using Spreadsheets PowerPoint Slides Prepared By: Alan Olinsky Bryant University Management Science: The Art of Modeling with.
Chapter 13 Simple Linear Regression
Chapter 4: Basic Estimation Techniques
Regression Analysis AGEC 784.
Chapter 17: Optimization In Simulation
Correlation and Regression
CHAPTER 29: Multiple Regression*
SIMPLE LINEAR REGRESSION
SIMPLE LINEAR REGRESSION
Presentation transcript:

Chapter 8: Regression Analysis PowerPoint Slides Prepared By: Alan Olinsky Bryant University Management Science: The Art of Modeling with Spreadsheets, 2e S.G. Powell K.R. Baker © John Wiley and Sons, Inc.

8 - 2 Modeling Relationships  In some circumstances, data can be valuable in helping to determine the parameters in a relationship or its structural form.  The process of using data to formulate relationships is known as regression analysis.  In this approach, we identify one variable as the response variable, which means that it can be predicted from the values of other variables.  Those other variables are called explanatory variables.

8 - 3 Types of Regression Models  Regression models that involve one explanatory variable are called simple regressions  When two or more explanatory variables are involved, the relationships are called multiple regressions.  Regression models are also divided into linear and nonlinear models, depending on whether the relationship between the response and explanatory variables is linear or nonlinear.

8 - 4 Estimating Relationships  Scatter plot – visualize association  Correlation:  n – number of pairs of observations for x, y  s x, s y – standard deviations of x, y  r – measures strength of linear relationship between x and y

8 - 5 r-statistic  Independent of units of measurement  Lies in range [-1, 1]  r > 0 – positive association  r < 0 – negative association  r close to 1 (or –1) implies a strong association  r close to 0 implies a weak association  Excel function: CORREL(xrange,yrange)

8 - 6 Simple Linear Regression  y = a + bx + e  y - dependent variable  x - independent variable  e - an “error” term.  Constants a and b represent the intercept and slope, respectively, of the regression line.

8 - 7 Error Term in Regression  Unexplained “noise” in the relationship  May represent limitations of knowledge  Or may represent random deviations of the dependent variable from its mean, y

8 - 8 Regression Goal  Want to find line to most closely match the observed relationship between x and y  Define “most closely” as minimizing sum of squared differences between observed and model values Minimizing sum of differences would set y equal to its mean Penalizes large differences more than small differences

8 - 9 Performing Regression  Residuals: e i = y i – y = y i – (a + bx i )  Sum of squared differences between observations and model : SS =  The regression problem: choose a and b to minimize SS

Regression Analysis  Assumes residuals are normally distributed with mean 0  Regression parameters can be calculated directly from the data  Simpler to use Excel’s regression tool (Under Data Analysis menu)

Goodness of Fit  Coefficient of determination: R 2  Lies in range [0, 1]  Closer to one – better fit  Measures how much of the variation in y- values is explained by model 1 – perfect match to model 0 – equation explains none of observed variation

Regression Window

Regression Output R Squared Degree of significance (under 0.1 is significant) Estimate for a Estimate for b P values of under 0.1 are statistically significant

Regression Statistics  Four measures are used to judge the statistical qualities of a regression: R 2 : Measures the percent of variation in the explanatory variable accounted for by the regression model. F-statistic (Significance F): Measures the probability of observing the given R2 (or higher) when all the true regression coefficients are zero. p-value: Measures the probability of observing the given estimate of the regression coefficient (or a larger value, positive or negative) when the true coefficient is zero. Confidence interval: Gives a range within which the true regression coefficient lies with given probability.

Simple Nonlinear Regression  A straight line may not be the most plausible description of dependency, e.g., y = ax b.  Can follow previous ideas to minimize sum of squared differences No Excel functions or simple formulas  Or can transform non-linear relationship into linear one, e.g., log y = log a + b log x Give up some intuition for convenience

Multiple Linear Regression  Multiple independent variables y = a 0 + a 1 x 1 + a 2 x 2 + … + a m x m + e  Work with n observations – each has: One observation of dependent variable One observation each of the m independent variables  Seek to minimize the sum of squared differences  Put all independent variables into x-range in Excel’s regression tool

Regression Output Coefficient of multiple determination Coefficients of regression equation P values of under 0.1 are statistically significant Square root of R square Accounts for presence of multiple variables

Values to Include in Regression  Ideally pick values that can be justified based on practical or theoretical grounds  Could choose set that generates largest value of adjusted R 2  Also could choose based on those with significant p-values for coefficients  Remember that good models require good forecasts for the independent variables.

Regression Assumptions  Errors in the regression model: Follow a Normal distribution Are mutually independent Have the same variance  Linearity is assumed to hold

*Using the Excel Tools Trendline and LINEST  Excel provides several alternative methods for performing regression analysis.  Trendline is a charting option that allows the user to fit one of six families of curves to a set of data and to add the resulting regression line to the plot.  LINEST is an array function that can be used to compute regression statistics and use them directly as parameters in a model.

*Trendline  The Trendline option appears in Excel only when a chart has been selected.  Trendline offers the option to fit any one of the following six families of curves: Linear: y = a + bx Logarithmic:y = a+ b×ln(x) Polynomial:y = a + bx + cx 2 + dx 3 + …(the user selects the Order of the polynomial, which is the largest exponent of x). Power:y = ax b Exponential: y = ae bx Moving average: y = average of previous n y-values (the user selects the Period for the moving average, which is n, the number of previous values used to calculate the result).

Trendline Window

Linear Trendline Linear trendline for BPI data

Power Trendline Power trendline for BPI data

*LINEST  LINEST is an Excel function that calculates regression parameters and measures of goodness-of-fit for simple or multiple regressions.  It is one of a set of functions including SLOPE, INTERCEPT, and TREND, which can be used as alternatives to the Data Analysis add-in.  LINEST is an array function, which means that it physically occupies more than one cell in the spreadsheet.  Like all Excel functions, it is linked to the underlying data, so if the data change the regression parameters calculated by LINEST change automatically.  This is not true of regression results calculated using the Analysis Toolpak: if the underlying data change, the user must be careful to re-run the Regression procedure.

Forecasting Model Using LINEST.

Function Wizard for LINEST

Summary  Modeling is the central task for the analyst and data collection and statistical analysis support the modeling task where appropriate.  When sensitivity testing indicates that certain parameters or relationships must be determined precisely, we often collect data and perform statistical analysis to refine the parameters and relations in our models.  Regression analysis is a means for using data to help formulate relationships among variables.  All regression methods are based on the idea of fitting a family of curves to data by choosing parameters that minimize the sum of squared residuals.

Summary  The simplest regression model is a linear relationship with one explanatory variable, although regression can also be applied in cases where there are multiple explanatory variables and nonlinear relationships..  The most complete method in Excel is the Regression option within the Analysis Toolpak add-in.  Other useful methods include Trendline, which can be used to fit any one of six families of curves to plotted data, and LINEST, which can be used to calculate regression estimates dynamically.

Copyright 2008 John Wiley & Sons, Inc. All rights reserved. Reproduction or translation of this work beyond that permitted in section 117 of the 1976 United States Copyright Act without express permission of the copyright owner is unlawful. Request for further information should be addressed to the Permissions Department, John Wiley & Sons, Inc. The purchaser may make back-up copies for his/her own use only and not for distribution or resale. The Publisher assumes no responsibility for errors, omissions, or damages caused by the use of these programs or from the use of the information herein.