Roger B. Hammer Assistant Professor Department of Sociology Oregon State University Conducting Social Research Specification: Choosing the Independent.

Slides:



Advertisements
Similar presentations
Further Inference in the Multiple Regression Model Hill et al Chapter 8.
Advertisements

All Possible Regressions and Statistics for Comparing Models
Chapter 5 Multiple Linear Regression
Multiple Regression Analysis
Chapter 12 Simple Linear Regression
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Specifying an Econometric Equation and Specification Error
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
MULTIPLE REGRESSION. OVERVIEW What Makes it Multiple? What Makes it Multiple? Additional Assumptions Additional Assumptions Methods of Entering Variables.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
Chapter 10 Simple Regression.
Statistics for Managers Using Microsoft® Excel 5th Edition
Statistics for Managers Using Microsoft® Excel 5th Edition
January 6, morning session 1 Statistics Micro Mini Multiple Regression January 5-9, 2008 Beth Ayers.
The Simple Regression Model
1 Chapter 9 Variable Selection and Model building Ray-Bing Chen Institute of Statistics National University of Kaohsiung.
Lecture 11 Multivariate Regression A Case Study. Other topics: Multicollinearity  Assuming that all the regression assumptions hold how good are our.
Further Inference in the Multiple Regression Model Prepared by Vera Tabakova, East Carolina University.
Topic 3: Regression.
Multiple Linear Regression
Ch. 14: The Multiple Regression Model building
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 11 th Edition.
Linear Regression Models Powerful modeling technique Tease out relationships between “independent” variables and 1 “dependent” variable Models not perfect…need.
Chapter 15: Model Building
Multiple Linear Regression A method for analyzing the effects of several predictor variables concurrently. - Simultaneously - Stepwise Minimizing the squared.
Correlation & Regression
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition
Problems with Incorrect Functional Form You cannot compare R 2 between two different functional forms. ▫ Why? TSS will be different. One should also remember.
Chapter 12 Multiple Regression and Model Building.
Simple Linear Regression Models
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Selecting Variables and Avoiding Pitfalls Chapters 6 and 7.
Roger B. Hammer Assistant Professor Department of Sociology Oregon State University Conducting Social Research Multiple Collinearity, Serial Correlation,
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 15 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
Shonda Kuiper Grinnell College. Statistical techniques taught in introductory statistics courses typically have one response variable and one explanatory.
Model Selection1. 1. Regress Y on each k potential X variables. 2. Determine the best single variable model. 3. Regress Y on the best variable and each.
Basics of Regression Analysis. Determination of three performance measures Estimation of the effect of each factor Explanation of the variability Forecasting.
Welcome to Econ 420 Applied Regression Analysis Study Guide Week Six.
Roger B. Hammer Assistant Professor Department of Sociology Oregon State University Conducting Social Research Ordinary Least Squares Regression.
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Lecture 10: Correlation and Regression Model.
Chapter 4 The Classical Model Copyright © 2011 Pearson Addison-Wesley. All rights reserved. Slides by Niels-Hugo Blunch Washington and Lee University.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 12 Testing for Relationships Tests of linear relationships –Correlation 2 continuous.
 Relationship between education level, income, and length of time out of school  Our new regression equation: is the predicted value of the dependent.
Chapter 22: Building Multiple Regression Models Generalization of univariate linear regression models. One unit of data with a value of dependent variable.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
Chap 6 Further Inference in the Multiple Regression Model
B AD 6243: Applied Univariate Statistics Multiple Regression Professor Laku Chidambaram Price College of Business University of Oklahoma.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 10 th Edition.
1 Building the Regression Model –I Selection and Validation KNN Ch. 9 (pp )
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
Using SPSS Note: The use of another statistical package such as Minitab is similar to using SPSS.
4-1 MGMG 522 : Session #4 Choosing the Independent Variables and a Functional Form (Ch. 6 & 7)
Yandell – Econ 216 Chap 15-1 Chapter 15 Multiple Regression Model Building.
Stats Methods at IC Lecture 3: Regression.
Why Model? Make predictions or forecasts where we don’t have data.
Regression Analysis AGEC 784.
Inference for Least Squares Lines
Chapter 9 Multiple Linear Regression
Further Inference in the Multiple Regression Model
Multivariate Analysis Lec 4
Quantitative Methods Simple Regression.
Multiple Regression Models
Linear Model Selection and regularization
Simple Linear Regression
Regression Analysis.
Chapter 13 Additional Topics in Regression Analysis
Presentation transcript:

Roger B. Hammer Assistant Professor Department of Sociology Oregon State University Conducting Social Research Specification: Choosing the Independent Variables

Common Assumptions 1)Errors are independent, that is not correlated with X or the errors of other cases. 2)Errors have identical distributions (iid), with mean zero and equal variance (Homoscedasticity) for every value of X. 3)Errors are normally distributed. Conducting Social Research

Violation of Assumptions Nonconstant error variance Correlation among errors Nonnormal errors Conducting Social Research Other Problems Omitted variables Nonlinear relationships Influential cases

Omitted Variables An important variable that is left out of the model Causes Unmeasured Variable Forgotten Variable Consequences No coefficient for the Omitted Variable Bias in the other Coefficients No longer Minimum Variance Conducting Social Research

Omitted Variable Bias (Specification Bias) Conducting Social Research

Ramsey’s Regression Specification Error Test Conducting Social Research

Irrelevant Variables An unimportant variable that is included in the model Causes Error in Theory Measurement Error Improper Variable Selection Procedure Consequence No longer Minimum Variance Conducting Social Research

Irrelevant Variable (Not biased)

Irrelevant Variable (Nonminimum Variance) Conducting Social Research

1.Theory: Is the variable’s place in the equation unambiguous and theoretically sound? 2.t-Test: Is the variable’s estimated coefficient significant in the expected direction. 3.Adjusted Coefficient of Determination: Does the overall fit of the equation improve when the variable is added. 4.Bias: Do other variables’ coefficients change significantly when the variable is added to the equation? Conducting Social Research Model Selection Principles

When the four criteria do not agree, use careful judgment and do not rely on a single criterion. Conducting Social Research Model Selection Principles

1.Rely on theory rather than statistical fit as much as possible. 2.Minimize the number of equations estimated. 3.Reveal “all” alternative specifications estimated. Conducting Social Research Model Selection Principles

1.Searching tends to exploit chance patterns in the sample. 2.If variables are correlated, searching will often exclude one or more of them making it appear unimportant. 3.If variables are positively correlated but their effect on the dependent variable is opposite, one or both may be excluded. Conducting Social Research Model Selection-Searching

1.Forward inclusion: Start with no variables in the model and then add variables based on the largest increase in R 2. 2.Backward elimination: Start with all possible variables in the model and then delete variables that produce the smallest decrease in R 2. 3.Stepwise selection: Iteratively including and eliminating variables based on R 2, F- test, or t-tests with a stopping rule. Conducting Social Research Model Selection-Searching

1.Statistical significance is overstated because the significance of the previous models is ignored. 2.The researchers criteria for selecting various results are not disclosed. Conducting Social Research Model Selection-Searching

1.Purposely estimating a number of alternative specifications the determine the robustness of the model. 2.Attempts to find results that are significant in a number of specifications. 3.Opposite Selection Searching, which attempts to find one specification that is different from the other specifications. Conducting Social Research Sensitivity Analysis

1.Develop a model using one sample. 2.Validate the model using a second sample. Conducting Social Research Model Validation (Data Mining with Dual Samples)

1.Information Criteria are measures of goodness of fit or uncertainty for the range of values of the data. 2.Information criteria measure the difference between a given model and the “true” underlying model. 3.AIC is a function of the number of observations n, the SSE and the number of parameters p. 4.There are two parts, lack of fit term and a parameter penalty term. Conducting Social Research Information Criteria

Conducting Social Research Akaike Information Criterion

1.As the number of parameters in the model increases, the lack of fit term decreases while the penalty term increases. 2.The model with the smallest AIC is deemed the “best” model since it minimizes the difference from the given model to the “true” model. Conducting Social ResearchAIC

Bayesian Information Criterion (Schwarz Criterion)

1.The BIC is an increasing function of RSS and an increasing function of k. 2.Unexplained variation in the dependent variable and the number of explanatory variables increase the value of BIC. 3.Lower BIC implies either fewer explanatory variables, better fit, or both. 4.The BIC penalizes free parameters more strongly than does the Akaike information criterion. Conducting Social ResearchBIC

1.“Excessive” intercorrelations among X variables. 2.The degree of collinearity can be defined as R 2 for a model that regresses X k variable on all the other X variables, the proportion of the variance of X k explained by the other X’s. 3.The tolerance of X k is the proportion of its variance not shared by the other X’s, 1 - R 2. Conducting Social Research Multiple Collinearity