REGRESSION DIAGNOSTICS

Slides:



Advertisements
Similar presentations
ASSUMPTION CHECKING In regression analysis with Stata
Advertisements

Week 13 November Three Mini-Lectures QMM 510 Fall 2014.
Items to consider - 3 Multicollinearity
Prediction, Correlation, and Lack of Fit in Regression (§11. 4, 11
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: What it Is and How it Works. Overview What is Bivariate Linear Regression? The Regression Equation How It’s Based on r Assumptions.
Lecture 25 Multiple Regression Diagnostics (Sections )
Additional Topics in Regression Analysis
Lecture 24 Multiple Regression (Sections )
Regression Diagnostics Checking Assumptions and Data.
Slide Copyright © 2010 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Business Statistics First Edition.
EPI809/Spring Testing Individual Coefficients.
Finding help. Stata manuals You have all these as pdf! Check the folder /Stata12/docs.
Forecasting Revenue: An Example of Regression Model Building Setting: Possibly a large set of predictor variables used to predict future quarterly revenues.
Simple linear regression and correlation analysis
1 MULTI VARIATE VARIABLE n-th OBJECT m-th VARIABLE.
Forecasting Revenue: An Example of Regression Model Building Setting: Possibly a large set of predictor variables used to predict future quarterly revenues.
Roger B. Hammer Assistant Professor Department of Sociology Oregon State University Conducting Social Research Multiple Collinearity, Serial Correlation,
6-3 Multiple Regression Estimation of Parameters in Multiple Regression.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Anaregweek11 Regression diagnostics. Regression Diagnostics Partial regression plots Studentized deleted residuals Hat matrix diagonals Dffits, Cook’s.
Dr. C. Ertuna1 Issues Regarding Regression Models (Lesson - 06/C)
6-3 Multiple Regression Estimation of Parameters in Multiple Regression.
REGRESSION DIAGNOSTICS Fall 2013 Dec 12/13. WHY REGRESSION DIAGNOSTICS? The validity of a regression model is based on a set of assumptions. Violation.
Assumption checking in “normal” multiple regression with Stata.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Model Building and Model Diagnostics Chapter 15.
Environmental Modeling Basic Testing Methods - Statistics III.
 Relationship between education level, income, and length of time out of school  Our new regression equation: is the predicted value of the dependent.
I271B QUANTITATIVE METHODS Regression and Diagnostics.
Applied Quantitative Analysis and Practices LECTURE#30 By Dr. Osman Sadiq Paracha.
B AD 6243: Applied Univariate Statistics Multiple Regression Professor Laku Chidambaram Price College of Business University of Oklahoma.
Statistical Data Analysis 2010/2011 M. de Gunst Lecture 10.
KNN Ch. 3 Diagnostics and Remedial Measures Applied Regression Analysis BUSI 6220.
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
1 Regression Review Population Vs. Sample Regression Line Residual and Standard Error of Regression Interpretation of intercept & slope T-test, F-test.
Multiple Regression Analysis Regression analysis with two or more independent variables. Leads to an improvement.
CE 525. ESRI VIDEO Take notes! is/player.cfm
PADM 692 | Data Analysis II Session II Linear Regression Diagnostics March 17, 2012 University of La Verne Soomi Lee, PhD Copyright © by Soomi Lee Do not.
Market Intelligence Class 11. Regression - Basics Terminology – In simple regression with a single variable, we get a zero-order effect (full effect)
Multiple Regression Analysis Bernhard Kittel Center for Social Science Methodology University of Oldenburg.
Chapter 12 REGRESSION DIAGNOSTICS AND CANONICAL CORRELATION.
Ch5 Relaxing the Assumptions of the Classical Model
Chapter 11 Multiple Regression
Model Diagnostics Political Analysis II.
The Problem of Large Correlations Among the Independent Variables
Kakhramon Yusupov June 15th, :30pm – 3:00pm Session 3
REGRESSION DIAGNOSTIC II: HETEROSCEDASTICITY
Regression Diagnostics
Regression Analysis Simple Linear Regression
REGRESSION DIAGNOSTIC I: MULTICOLLINEARITY
Chapter 12: Regression Diagnostics
Lab 9 – Regression Diagnostics
Fundamentals of regression analysis
بحث في التحليل الاحصائي SPSS بعنوان :
Serial Correlation and Heteroskedasticity in Time Series Regressions
REGRESSION DIAGNOSTIC I: MULTICOLLINEARITY
I271b Quantitative Methods
Serial Correlation and Heteroscedasticity in
Regression Diagnostics
Adequacy of Linear Regression Models
Adequacy of Linear Regression Models
Regression Forecasting and Model Building
Checking Assumptions Primary Assumptions Secondary Assumptions
Chapter 13 Additional Topics in Regression Analysis
Adequacy of Linear Regression Models
Adequacy of Linear Regression Models
Multicollinearity What does it mean? A high degree of correlation amongst the explanatory variables What are its consequences? It may be difficult to separate.
Serial Correlation and Heteroscedasticity in
Presentation transcript:

REGRESSION DIAGNOSTICS 1

Problems Influentials and outliers Heteroscedasticity Autocorrelation Multicollinearity – relationship of independent variables

EXCURSUS - RESIDUALS 3

Residuals - review Unstandardized residuals H = hat matrix Predicted residuals

Residuals - review Standardized residuals Jackknife residuals

I. Influentials If omitted from computation big change in regression coeffs can be found. Goal: to find and exclude

Influentials -diagnosis DFBETA(-i)=b-b(-i) Rule of thumb: Problem if NDFBETA>2/√n Note : NDFFIT problem if NDFFIT>2/√(n/p)

II. Heteroscedasticity Assumption for regression: variance of error is the same for all values of indep. variable Checking: Charts for residuals vs. ind. vars Consequence: big standard errors of coeffs, t-tests statistically insig. Tests - Glejser, Goldfeld-Quandt tests Analytical solution: weighted LS (WLS)

Glejser’s test Model for residuals on ind. vars :

III. Multicollinearity Estimate: Strong relationship between ind. variables: X´X is singular matrix or nearly singular Consequence: standard errors of coeffs are inflated, t-test statistically insignificant, estimates are not stable

Multicollinearity Diagnosis: Correlation of ind. vars – cor. coeff>0,8 Other options: a) Tolerance (1-R2j) b) VIF = 1/(1-R2j) VIF diagonál cells R-1 c) Conditon index: square root of ratio: max lambda/min lambda ROT* > 30 → problem *ROT=Rules of thumb

Multicollinearity Solution Ignore  Leave out variable Get more data Use factor analysis (see later) Ridge regression Biased estimates but smaller standard erorrs (slight change in diagonal’s elements)

IV. Autocorrelation Assumption for regression: variance of error is independent for individual observations Checking: Charts for residuals vs. time Consequence: big standard errors of coeffs, t-tests statistically insig. Tests - Durbin-Watson Solution: weighted LS Autocorrelation is present in time series (usually not used in social sciences)