Regression Analysis AGEC 784.

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

Forecasting Using the Simple Linear Regression Model and Correlation
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Simple Linear Regression. Start by exploring the data Construct a scatterplot  Does a linear relationship between variables exist?  Is the relationship.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Chapter 10 Simple Regression.
Regression Analysis. Unscheduled Maintenance Issue: l 36 flight squadrons l Each experiences unscheduled maintenance actions (UMAs) l UMAs costs $1000.
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
The Simple Regression Model
SIMPLE LINEAR REGRESSION
Simple Linear Regression Analysis
SIMPLE LINEAR REGRESSION
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Lecture 19 Simple linear regression (Review, 18.5, 18.8)
Correlation & Regression
Regression and Correlation Methods Judy Zhong Ph.D.
SIMPLE LINEAR REGRESSION
Introduction to Linear Regression and Correlation Analysis
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Correlation and Linear Regression
STATISTICS: BASICS Aswath Damodaran 1. 2 The role of statistics Aswath Damodaran 2  When you are given lots of data, and especially when that data is.
CPE 619 Simple Linear Regression Models Aleksandar Milenković The LaCASA Laboratory Electrical and Computer Engineering Department The University of Alabama.
Simple Linear Regression Models
Chapter 8: Regression Analysis PowerPoint Slides Prepared By: Alan Olinsky Bryant University Management Science: The Art of Modeling with Spreadsheets,
Introduction to Linear Regression
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
Production Planning and Control. A correlation is a relationship between two variables. The data can be represented by the ordered pairs (x, y) where.
Go to Table of Content Single Variable Regression Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
Introduction to Probability and Statistics Thirteenth Edition Chapter 12 Linear Regression and Correlation.
Chapter 5: Regression Analysis Part 1: Simple Linear Regression.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Lecture 10: Correlation and Regression Model.
7 - 1 Chapter 7: Data Analysis in the Service of Modeling The Art of Modeling with Spreadsheets S.G. Powell and K.R. Baker © John Wiley and Sons, Inc.
Regression Analysis Deterministic model No chance of an error in calculating y for a given x Probabilistic model chance of an error First order linear.
Bivariate Regression. Bivariate Regression analyzes the relationship between two variables. Bivariate Regression analyzes the relationship between two.
Chapter 13 Simple Linear Regression
The simple linear regression model and parameter estimation
Chapter 4: Basic Estimation Techniques
Chapter 20 Linear and Multiple Regression
Inference for Least Squares Lines
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Statistics for Managers using Microsoft Excel 3rd Edition
Correlation and Simple Linear Regression
Basic Estimation Techniques
Linear Regression and Correlation Analysis
Simple Linear Regression
Chapter 11 Simple Regression
Chapter 13 Simple Linear Regression
Correlation and Simple Linear Regression
Correlation and Regression
CHAPTER 29: Multiple Regression*
6-1 Introduction To Empirical Models
Linear Regression/Correlation
Prepared by Lee Revere and John Large
Correlation and Simple Linear Regression
Simple Linear Regression
Correlation and Regression
SIMPLE LINEAR REGRESSION
Simple Linear Regression and Correlation
SIMPLE LINEAR REGRESSION
Chapter Thirteen McGraw-Hill/Irwin
Chapter 13 Simple Linear Regression
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Regression Analysis AGEC 784

Modeling Relationships In some circumstances, data can be valuable in helping to determine the parameters in a relationship or its structural form. The process of using data to formulate relationships is known as regression analysis. In this approach, we identify one variable as the response variable, which means that it can be predicted from the values of other variables. Those other variables are called explanatory variables.

Types of Regression Models Regression models that involve one explanatory variable are called simple regressions When two or more explanatory variables are involved, the relationships are called multiple regressions. Regression models are also divided into linear and nonlinear models, depending on whether the relationship between the response and explanatory variables is linear or nonlinear.

Estimating Relationships Scatter plot – visualize association Correlation: n – number of pairs of observations for x, y sx, sy – standard deviations of x, y r – measures strength of linear relationship between x and y

r-statistic Independent of units of measurement Lies in range [-1, 1] r > 0 – positive association r < 0 – negative association r close to 1 (or –1) implies a strong association r close to 0 implies a weak association Excel function: CORREL(xrange,yrange)

Simple Linear Regression y = a + bx + e y - dependent variable x - independent variable e - an “error” term. Constants a and b represent the intercept and slope, respectively, of the regression line.

Error Term in Regression Unexplained “noise” in the relationship May represent limitations of knowledge Or may represent random deviations of the dependent variable from its mean, y

Regression Goal Want to find line to most closely match the observed relationship between x and y Define “most closely” as minimizing sum of squared differences between observed and model values Minimizing sum of differences would set y equal to its mean Penalizes large differences more than small differences

Performing Regression Residuals: ei = yi – y = yi – (a + bxi) Sum of squared differences between observations and model : SS = The regression problem: choose a and b to minimize SS

Regression Analysis Assumes residuals are normally distributed with mean 0 Regression parameters can be calculated directly from the data Simpler to use Excel’s regression tool (Under Data Analysis menu)

Goodness of Fit Coefficient of determination: R2 Lies in range [0, 1] Closer to one – better fit Measures how much of the variation in y-values is explained by model 1 – perfect match to model 0 – equation explains none of observed variation

Regression Window

Regression Output R Squared Degree of significance (under 0.1 is significant) P values of under 0.1 are statistically significant Estimate for a Estimate for b

Regression Statistics Four measures are used to judge the statistical qualities of a regression: R2: Measures the percent of variation in the explanatory variable accounted for by the regression model. F-statistic (Significance F): Measures the probability of observing the given R2 (or higher) when all the true regression coefficients are zero. p-value: Measures the probability of observing the given estimate of the regression coefficient (or a larger value, positive or negative) when the true coefficient is zero. Confidence interval: Gives a range within which the true regression coefficient lies with given probability.

Simple Nonlinear Regression A straight line may not be the most plausible description of dependency, e.g., y = axb . Can follow previous ideas to minimize sum of squared differences No Excel functions or simple formulas Or can transform non-linear relationship into linear one, e.g., log y = log a + b log x Give up some intuition for convenience

Multiple Linear Regression Multiple independent variables y = a0 + a1x1 + a2x2 + … + amxm + e Work with n observations – each has: One observation of dependent variable One observation each of the m independent variables Seek to minimize the sum of squared differences Put all independent variables into x-range in Excel’s regression tool

Regression Output Square root of R square Coefficient of multiple determination Accounts for presence of multiple variables P values of under 0.1 are statistically significant Coefficients of regression equation

Values to Include in Regression Ideally pick values that can be justified based on practical or theoretical grounds Could choose set that generates largest value of adjusted R2 Also could choose based on those with significant p-values for coefficients Remember that good models require good forecasts for the independent variables.

Regression Assumptions Errors in the regression model: Follow a Normal distribution Are mutually independent Have the same variance Linearity is assumed to hold