Model Selection II: datasets with several explanatory variables

Slides:



Advertisements
Similar presentations
Quantitative Methods Model Selection II: datasets with several explanatory variables.
Advertisements

Qualitative predictor variables
Topic 12: Multiple Linear Regression
Multicollinearity.
More on understanding variance inflation factors (VIFk)
Quantitative Methods Using more than one explanatory variable.
Note 14 of 5E Statistics with Economics and Business Applications Chapter 12 Multiple Regression Analysis A brief exposition.
Statistics 350 Lecture 16. Today Last Day: Introduction to Multiple Linear Regression Model Today: More Chapter 6.
19-1 Chapter Nineteen MULTIVARIATE ANALYSIS: An Overview.
Quantitative Methods Using more than one explanatory variable.
Every achievement originates from the seed of determination. 1Random Effect.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 11 th Edition.
Stat 217 – Day 26 Regression, cont.. Last Time – Two quantitative variables Graphical summary  Scatterplot: direction, form (linear?), strength Numerical.
Slide 1 Larger is better case (Golf Ball) Linear Model Analysis: SN ratios versus Material, Diameter, Dimples, Thickness Estimated Model Coefficients for.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 11 th Edition.
A (second-order) multiple regression model with interaction terms.
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Business Statistics, 4e by Ken Black Chapter 15 Building Multiple Regression Models.
CHAPTER 14 MULTIPLE REGRESSION
Chapter 12: Linear Regression 1. Introduction Regression analysis and Analysis of variance are the two most widely used statistical procedures. Regression.
Name: Angelica F. White WEMBA10. Teach students how to make sound decisions and recommendations that are based on reliable quantitative information During.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 12-1 Correlation and Regression.
Chap 14-1 Copyright ©2012 Pearson Education, Inc. publishing as Prentice Hall Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics.
Chapter 7 Relationships Among Variables What Correlational Research Investigates Understanding the Nature of Correlation Positive Correlation Negative.
Copyright ©2011 Nelson Education Limited Linear Regression and Correlation CHAPTER 12.
Exam 1 Review. Data referenced throughout review The Federal Trade Commission annually rates varieties of domestic cigarettes according to their tar,
Solutions to Tutorial 5 Problems Source Sum of Squares df Mean Square F-test Regression Residual Total ANOVA Table Variable.
Sequential sums of squares … or … extra sums of squares.
1 Nested (Hierarchical) Designs In certain experiments the levels of one factor (eg. Factor B) are similar but not identical for different levels of another.
1 Every achievement originates from the seed of determination.
Multiple regression. Example: Brain and body size predictive of intelligence? Sample of n = 38 college students Response (Y): intelligence based on the.
Overview of our study of the multiple linear regression model Regression models with more than one slope parameter.
Multiple Regression I 1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 4 Multiple Regression Analysis (Part 1) Terry Dielman.
1 Building the Regression Model –I Selection and Validation KNN Ch. 9 (pp )
Inference for regression - More details about simple linear regression IPS chapter 10.2 © 2006 W.H. Freeman and Company.
Using SPSS Note: The use of another statistical package such as Minitab is similar to using SPSS.
Multicollinearity. Multicollinearity (or intercorrelation) exists when at least some of the predictor variables are correlated among themselves. In observational.
Interaction regression models. What is an additive model? A regression model with p-1 predictor variables contains additive effects if the response function.
Agenda 1.Exam 2 Review 2.Regression a.Prediction b.Polynomial Regression.
Designs for Experiments with More Than One Factor When the experimenter is interested in the effect of multiple factors on a response a factorial design.
732G21/732G28/732A35 Lecture 6. Example second-order model with one predictor 2 Electricity consumption (Y)Home size (X)
1 Multiple Regression. 2 Model There are many explanatory variables or independent variables x 1, x 2,…,x p that are linear related to the response variable.
Model selection and model building. Model selection Selection of predictor variables.
Design and Analysis of Experiments
General Full Factorial Design
Multiple Regression.
Chapter 15 Multiple Regression Model Building
Chapter 20 Linear and Multiple Regression
Chapter 15 Multiple Regression and Model Building
Introduction to Regression Lecture 6.2
Chapter 13 Created by Bethany Stubbe and Stephan Kogitz.
9/19/2018 ST3131, Lecture 6.
Business Statistics, 4e by Ken Black
...Relax... 9/21/2018 ST3131, Lecture 3 ST5213 Semester II, 2000/2001
Cases of F-test Problems with Examples
Inference for Regression Lines
CHAPTER 29: Multiple Regression*
Solutions for Tutorial 3
Solutions of Tutorial 10 SSE df RMS Cp Radjsq SSE1 F Xs c).
Multiple Regression.
Regression Models - Introduction
Shudong Wang, NWEA Liru Zhang, Delaware DOE G. Gage Kingsbury, NWEA
Using more than one explanatory variable
Multiple Regression Chapter 14.
Model Selection I: principles of model choice and designed experiments
CHAPTER 14 MULTIPLE REGRESSION
Solutions of Tutorial 9 SSE df RMS Cp Radjsq SSE1 F Xs c).
Essentials of Statistics for Business and Economics (8e)
Business Statistics, 4e by Ken Black
Analysis of Covariance
Regression Models - Introduction
Presentation transcript:

Model Selection II: datasets with several explanatory variables Quantitative Methods Model Selection II: datasets with several explanatory variables

The problem of model choice Model Selection II: several explanatory variables The problem of model choice

The problem of model choice Model Selection II: several explanatory variables The problem of model choice

The problem of model choice Model Selection II: several explanatory variables The problem of model choice With 5 x-variables, there are 25=32 possible models, not including interactions. If we include two-way interactions without squared terms, there are 1x1 + 5x1 + 10x2 + 10x8 + 5x64 + 1x1024 = 1450 models If we do allow squared terms, there are 1x1 + 5x2 + 10x8 + 10x64 + 5x1024 + 1x32768 = 38619 models. With multiple models, there are many p-values and possible “right-leg/left-leg” and “poets’ dates” effects.

The problem of model choice Model Selection II: several explanatory variables The problem of model choice Economy of variables Multiplicity of p-values Marginality

The problem of model choice Model Selection II: several explanatory variables The problem of model choice

Model Selection II: several explanatory variables Economy of variables

Model Selection II: several explanatory variables Economy of variables

Economy of variables Model Selection II: several explanatory variables all variables increase R2 F<1 - adding the variable decreased R2 adj F>1 - adding the variable increased R2 adj

Economy of variables Model Selection II: several explanatory variables continuous

Model Selection II: several explanatory variables Economy of variables

Economy of variables Model Selection II: several explanatory variables (Predictions for datapoint 39)

Multiplicity of p-values Model Selection II: several explanatory variables Multiplicity of p-values

Multiplicity of p-values Model Selection II: several explanatory variables Multiplicity of p-values

Multiplicity of p-values Model Selection II: several explanatory variables Multiplicity of p-values Focus, don’t fish - reduce number of X-variables - use outside information to decide on inclusion - use outside information to decide on exclusion Stringency - reduce nominal p-value Combine model terms - for once, reverse the usual splitting

Multiplicity of p-values Model Selection II: several explanatory variables Multiplicity of p-values

Multiplicity of p-values Model Selection II: several explanatory variables Multiplicity of p-values DF SeqSS 1 366.9 1 42.7 1 14.7 3 424.3 MS=424.3/3=141.4 F = 141.4/108.9 = 1.30 on 3 and 30 DF Single p-value from Minitab using CDF: p=0.293 CDF 1.30 K1; F 3 30. LET K2=1-K1

Model Selection II: several explanatory variables Stepwise regression

Model Selection II: several explanatory variables Stepwise regression

Stepwise regression Model Selection II: several explanatory variables General Linear Model: LRGWHAL versus Source DF Seq SS Adj SS Adj MS F P VIS 1 61.166 61.166 61.166 193.35 0.000 Error 230 72.759 72.759 0.316 Total 231 133.925 Term Coef SE Coef T P Constant -4.52464 0.06116 -73.98 0.000 VIS 0.125222 0.009005 13.91 0.000

Stepwise regression Model Selection II: several explanatory variables General Linear Model: LRGWHAL versus Source DF Seq SS Adj SS Adj MS F P VIS 1 61.166 61.166 61.166 193.35 0.000 Error 230 72.759 72.759 0.316 Total 231 133.925 Term Coef SE Coef T P Constant -4.52464 0.06116 -73.98 0.000 VIS 0.125222 0.009005 13.91 0.000

Stepwise regression Model Selection II: several explanatory variables General Linear Model: LRGWHAL versus Source DF Seq SS Adj SS Adj MS F P VIS 1 61.166 61.166 61.166 193.35 0.000 Error 230 72.759 72.759 0.316 Total 231 133.925 Term Coef SE Coef T P Constant -4.52464 0.06116 -73.98 0.000 VIS 0.125222 0.009005 13.91 0.000

Stepwise regression Model Selection II: several explanatory variables General Linear Model: LRGWHAL versus Source DF Seq SS Adj SS Adj MS F P VIS 1 61.166 61.166 61.166 193.35 0.000 Error 230 72.759 72.759 0.316 Total 231 133.925 Term Coef SE Coef T P Constant -4.52464 0.06116 -73.98 0.000 VIS 0.125222 0.009005 13.91 0.000

Model Selection II: several explanatory variables Stepwise regression

Stepwise regression Model Selection II: several explanatory variables Forward ≠ Backward Forward = Backward

Model Selection II: several explanatory variables Stepwise regression

Model Selection II: several explanatory variables Stepwise regression

Model Selection II: several explanatory variables Stepwise regression

Model Selection II: several explanatory variables Stepwise regression

Model Selection II: several explanatory variables Stepwise regression

Random Effects Read Chapter 12 Last words… Model Selection II: several explanatory variables Last words… Economy of variables: prediction, adjusted R2 Multiplicity: outside information, focussing, stringency, combining model terms Stepwise regressions not usually suitable -- but are for initial sifting of a large number of potential predictors in a preliminary study Random Effects Read Chapter 12