Single Variable Regression
Farrokh Alemi, Ph.D.; Kashif Haqqi, M.D.

Presentation transcript:

Additional Reading
For additional reading, see Chapter 14 and Chapter 15 in Michael R. Middleton's Data Analysis Using Excel, Duxbury Thomson Publishers. The example described in this lecture is based in part on Chapter 17 and Chapter 18 of Keller and Warrack's Statistics for Management and Economics, Fifth Edition, Duxbury Thomson Learning. Any introductory statistics book that covers single and multiple variable regression is also suitable.

Which Approach Is Appropriate When?
Choosing the right method for the data is the key statistical skill you need to have. You may want to review the decision tool that we have organized to help you choose the right statistical method.

Do I Need to Know the Formulas?
You do not need to know the exact formulas, but you do need to know where they are in your reference book. You need to understand the concepts behind them and the general statistical concepts embedded in their use. You do not need to be able to do correlation and regression by hand, but you must be able to do them on a computer using Excel or other software.

Table of Content
Objectives
Purpose of Regression
Correlation or Regression?
First Order Linear Model
Probabilistic Linear Relationship
Estimating Regression Parameters
Assumptions
Sum of Squares
Tests
Percent of Variation Explained
Example
Regression Analysis in Excel
Normal Probability Plot
Residual Plot
Goodness of Fit
ANOVA for Regression

Objectives
To learn the assumptions behind, and the interpretation of, single and multiple variable regression.
To use Excel to calculate regressions and test hypotheses.

Purpose of Regression
To determine whether the values of one or more variables are related to the response variable.
To predict the value of one variable based on the values of one or more other variables.
To test hypotheses.

Correlation or Regression?
Use correlation if you are interested only in whether a relationship exists.
Use regression if you are interested in building a mathematical model that can predict the response variable.
Use regression if you are interested in the relative effectiveness of several variables in predicting the response variable.

First Order Linear Model
A deterministic mathematical model relating y to x: $y = \beta_0 + \beta_1 x$
$\beta_0$ is the intercept with the y axis, the value of y at the point where x = 0.
$\beta_1$ is the slope of the line, the ratio of rise to run in the figure. It measures the change in y for one unit of change in x.

Probabilistic Linear Relationship
The relationship between x and y is not always exact; observations do not always fall on a straight line. To accommodate this, we introduce a random error term, referred to as epsilon: $y = \beta_0 + \beta_1 x + \varepsilon$
The task of regression analysis is then to estimate the parameters $b_0$ and $b_1$ in the equation $\hat{y} = b_0 + b_1 x$ so that the difference between $y$ and $\hat{y}$ is minimized.

Estimating Regression Parameters
The red dots show the observations. The solid line shows the estimated regression line. The vertical distance between each observation and the solid line is called the residual. The regression line is chosen to minimize the sum of the squared residuals (the differences between the line and the observations).
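The least-squares idea on this slide can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `fit_line` and the five data points are hypothetical and are not the lecture's data or Excel's implementation. The sketch uses the standard closed-form estimates, slope = Sxy / Sxx and intercept = mean(y) - slope * mean(x).

```python
def fit_line(x, y):
    """Return (b0, b1) that minimize the sum of squared residuals."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of cross-products (Sxy) and sum of squares of x (Sxx) around the means
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    b1 = sxy / sxx              # estimated slope
    b0 = mean_y - b1 * mean_x   # estimated intercept
    return b0, b1


# Hypothetical illustration data, not the lecture's data set
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

b0, b1 = fit_line(x, y)
# Residual = observed y minus the value predicted by the fitted line
residuals = [yi - (b0 + b1 * xi) for xi, yi in zip(x, y)]
sse = sum(r ** 2 for r in residuals)
print(f"y_hat = {b0:.2f} + {b1:.2f} * x, sum of squared residuals = {sse:.2f}")
```

For these made-up points the fitted line has intercept 2.2 and slope 0.6, and the residuals sum to zero, as they always do when an intercept is estimated by least squares.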

Assumptions
The dependent (response) variable is measured on an interval scale.
The probability distribution of the error is Normal with a mean of zero.
The standard deviation of the error is constant and does not depend on the values of x.
The error term associated with any particular value of Y is independent of the error terms associated with other values of Y.

Sum of Squares
Variation in y = SSR + SSE, where SSR is the sum of squares explained by the regression and SSE is the sum of squares for error.
MSR divided by MSE is the test statistic for the ability of the regression to explain the data.
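The decomposition on this slide can be checked numerically. The following is a minimal sketch, assuming a single predictor (so the regression has 1 degree of freedom and the error has n - 2); the function name `regression_anova` and the data are hypothetical and chosen only for illustration.

```python
def regression_anova(x, y):
    """Return the sums of squares, mean squares and F for a one-predictor fit."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    b1 = sxy / sxx
    b0 = mean_y - b1 * mean_x
    y_hat = [b0 + b1 * xi for xi in x]

    sst = sum((yi - mean_y) ** 2 for yi in y)               # total variation in y
    ssr = sum((yh - mean_y) ** 2 for yh in y_hat)           # explained by regression
    sse = sum((yi - yh) ** 2 for yi, yh in zip(y, y_hat))   # left in the residuals

    msr = ssr / 1          # one predictor, so 1 degree of freedom
    mse = sse / (n - 2)    # n - 2 degrees of freedom for error
    return {"SST": sst, "SSR": ssr, "SSE": sse,
            "MSR": msr, "MSE": mse, "F": msr / mse}


# Hypothetical illustration data, not the lecture's data set
table = regression_anova([1, 2, 3, 4, 5], [2, 4, 5, 4, 5])
for name, value in table.items():
    print(f"{name}: {value:.2f}")
```

The printed SSR and SSE add up to SST, which is the identity stated on the slide. Turning F into a probability requires the F distribution, which Excel reports directly and which is not part of this sketch.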

Tests
The hypothesis that the regression equation does not explain variation in Y can be tested using the F test.
The hypothesis that the coefficient for x is zero can be tested using the t statistic.
The hypothesis that the intercept is zero can be tested using the t statistic.
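As a rough sketch of how the two t statistics are formed, the code below divides each estimated coefficient by its standard error, using the usual textbook formulas for a single-predictor regression. The function name `coefficient_t_statistics` and the data are hypothetical. Converting a t statistic to a p-value needs the t distribution with n - 2 degrees of freedom, which is left to Excel or other software here.

```python
import math


def coefficient_t_statistics(x, y):
    """Return (t for intercept, t for slope) in a one-predictor regression."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    b1 = sxy / sxx
    b0 = mean_y - b1 * mean_x

    sse = sum((yi - (b0 + b1 * xi)) ** 2 for xi, yi in zip(x, y))
    mse = sse / (n - 2)  # estimated error variance

    se_b1 = math.sqrt(mse / sxx)                          # standard error of slope
    se_b0 = math.sqrt(mse * (1 / n + mean_x ** 2 / sxx))  # standard error of intercept
    return b0 / se_b0, b1 / se_b1


# Hypothetical illustration data, not the lecture's data set
t_b0, t_b1 = coefficient_t_statistics([1, 2, 3, 4, 5], [2, 4, 5, 4, 5])
print(f"t for intercept = {t_b0:.3f}, t for slope = {t_b1:.3f}")
```

With one predictor, the square of the slope's t statistic equals the F statistic of the regression, so the F test and the t test on the slope lead to the same conclusion.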

Percent of Variation Explained
$R^2$ is the coefficient of determination. The minimum $R^2$ is zero and the maximum is 1. $1 - R^2$ is the share of variation left unexplained. If Y is not related to X, or is related in a non-linear fashion, then $R^2$ will be small. Adjusted $R^2$ shows the value of $R^2$ after adjustment for degrees of freedom. It protects against an $R^2$ that is artificially high because more variables have been added to the model.
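A short sketch of both quantities follows, assuming the common definitions $R^2 = 1 - SSE/SST$ and adjusted $R^2 = 1 - (1 - R^2)(n - 1)/(n - k - 1)$, where k is the number of predictors. The function name `r_squared` and the numbers are hypothetical; the fitted values are those of the line 2.2 + 0.6x at x = 1 to 5.

```python
def r_squared(y, y_hat, n_predictors=1):
    """Return (R^2, adjusted R^2) for observed y and fitted values y_hat."""
    n = len(y)
    mean_y = sum(y) / n
    sst = sum((yi - mean_y) ** 2 for yi in y)               # total variation
    sse = sum((yi - yh) ** 2 for yi, yh in zip(y, y_hat))   # unexplained variation
    r2 = 1 - sse / sst
    # Penalize R^2 for the degrees of freedom used by the predictors
    adjusted = 1 - (1 - r2) * (n - 1) / (n - n_predictors - 1)
    return r2, adjusted


# Hypothetical illustration values, not the lecture's data set
y = [2, 4, 5, 4, 5]
y_hat = [2.8, 3.4, 4.0, 4.6, 5.2]  # fitted values from the line 2.2 + 0.6 * x

r2, adj_r2 = r_squared(y, y_hat)
print(f"R^2 = {r2:.3f}, adjusted R^2 = {adj_r2:.3f}")
```

Adjusted $R^2$ is never larger than $R^2$, and the gap widens as predictors are added without a matching drop in the unexplained variation.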

Example
Is waiting time related to satisfaction ratings?
What is predicted to happen to satisfaction ratings if waiting time reaches 15 minutes?

Regression Analysis in Excel
1. Select Tools.
2. Select Data Analysis.
3. Select regression analysis.
4. Identify the x and y data, which must be of equal length.
5. Ask for residual plots to test the assumptions.
6. Ask for a normal probability plot to test the Normality assumption.

Normal Probability Plot
The Normal probability plot compares the percent of errors falling in particular bins to the percentage expected from a Normal distribution. If the Normality assumption is met, the plot should look like a straight line.
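One common way to build such a plot by hand is to pair the sorted residuals with the quantiles a standard Normal distribution would produce at the same ranks. The sketch below takes that approach, which is an assumption on our part and may differ in detail from the plot Excel draws. The function name `normal_probability_points`, the plotting position (i - 0.5) / n and the residual values are all hypothetical choices for illustration.

```python
from statistics import NormalDist


def normal_probability_points(residuals):
    """Pair each sorted residual with its expected standard Normal quantile."""
    n = len(residuals)
    ordered = sorted(residuals)
    std_normal = NormalDist()  # mean 0, standard deviation 1
    # Plotting position (i - 0.5) / n is one of several conventions in use
    theoretical = [std_normal.inv_cdf((i - 0.5) / n) for i in range(1, n + 1)]
    return list(zip(theoretical, ordered))


# Hypothetical residuals, not taken from the lecture's example
residuals = [-0.8, 0.6, 1.0, -0.6, -0.2]
points = normal_probability_points(residuals)
for quantile, residual in points:
    print(f"expected quantile {quantile:6.3f}   residual {residual:5.2f}")
```

If the points, plotted as (expected quantile, residual), fall close to a straight line, the Normality assumption looks reasonable; strong curvature suggests it does not hold.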

Residual Plot
Tests that the residuals have a mean of zero and a constant standard deviation.
Tests that the residuals do not depend on the values of x.

Linear Equation
Satisfaction = […] - 4.8 * Waiting time
At 15 minutes of waiting time, satisfaction is predicted to be: […] - 4.8 * 15 = […]
The t statistics for both the intercept and the waiting time coefficient are statistically significant. The hypotheses that the coefficients are zero are rejected.

Goodness of Fit
57% of the variation in satisfaction ratings is explained by the equation.
43% of the variation in satisfaction ratings is left unexplained.

ANOVA for Regression
The regression model has a mean sum of squares (MSR) of 347. The mean sum of squares for error (MSE) is 33. Note that the error term is called the residual in Excel. The F statistic is 10; the probability of observing this statistic is […]. The hypothesis that the MSR and MSE are equal is rejected. Significant variation is explained by the regression.

Take Home Lesson
Regression is based on the sum of squares (SS) approach, similar to ANOVA.
Regression assumptions can be examined by looking at the residuals.
Several hypotheses can be tested using regression analysis.