Regression Models for Quantitative (Numeric) and Qualitative (Categorical) Predictors KNNL – Chapter 8.

Slides:



Advertisements
Similar presentations
Xuhua Xia Fitting Several Regression Lines Many applications of statistical analysis involves a continuous variable as dependent variable (DV) but both.
Advertisements

Multiple Linear Regression Polynomial Regression.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
12 Multiple Linear Regression CHAPTER OUTLINE
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 13 Nonlinear and Multiple Regression.
Qualitative Variables and
EPI809/Spring Models With Two or More Quantitative Variables.
Introduction the General Linear Model (GLM) l what “model,” “linear” & “general” mean l bivariate, univariate & multivariate GLModels l kinds of variables.
Lecture 27 Polynomial Terms for Curvature Categorical Variables.
General Linear Models -- #1 things to remember b weight interpretations 1 quantitative predictor 1 quantitative predictor & non-linear component 1 2-group.
Multiple Linear Regression
1 G Lect 11M Binary outcomes in psychology Can Binary Outcomes Be Studied Using OLS Multiple Regression? Transforming the binary outcome Logistic.
Analysis of Individual Variables Descriptive – –Measures of Central Tendency Mean – Average score of distribution (1 st moment) Median – Middle score (50.
CHAPTER 6 ECONOMETRICS x x x x x Dummy Variable Regression Models Dummy, or indicator, variables take on values of 0 or 1 to indicate the presence or absence.
Meta-Analysis and Meta- Regression Airport Noise and Home Values J.P. Nelson (2004). “Meta-Analysis of Airport Noise and Hedonic Property Values: Problems.
Review for Final Exam Some important themes from Chapters 9-11 Final exam covers these chapters, but implicitly tests the entire course, because we use.
Quantitative Business Analysis for Decision Making Multiple Linear RegressionAnalysis.
Multiple Linear Regression Response Variable: Y Explanatory Variables: X 1,...,X k Model (Extension of Simple Regression): E(Y) =  +  1 X 1 +  +  k.
Objectives of Multiple Regression
Estimation of Demand Prof. Ravikesh Srivastava Lecture-8.
Simple Linear Regression
Chapter 12 Multiple Regression and Model Building.
 Combines linear regression and ANOVA  Can be used to compare g treatments, after controlling for quantitative factor believed to be related to response.
CHAPTER 9 DUMMY VARIABLE REGRESSION MODELS
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Business Statistics, 4e by Ken Black Chapter 15 Building Multiple Regression Models.
Model Selection1. 1. Regress Y on each k potential X variables. 2. Determine the best single variable model. 3. Regress Y on the best variable and each.
Chapter 8: Regression Models for Quantitative and Qualitative Predictors Ayona Chatterjee Spring 2008 Math 4813/5813.
CHAPTER 17 Model Building
Copyright © 2012 Pearson Education, Inc. All rights reserved. Chapter 5 Principles of Model Building.
Multiple Regression I KNNL – Chapter 6. Models with Multiple Predictors Most Practical Problems have more than one potential predictor variable Goal is.
Multiple regression models Experimental design and data analysis for biologists (Quinn & Keough, 2002) Environmental sampling and analysis.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Linear Regression Analysis 5E Montgomery, Peck & Vining 1 Chapter 8 Indicator Variables.
Multiple Regression BPS chapter 28 © 2006 W.H. Freeman and Company.
Curvilinear Regression
Copyright © 2009 Cengage Learning 18.1 Chapter 20 Model Building.
Plotting Linear Main Effects Models Interpreting 1 st order terms w/ & w/o interactions Coding & centering… gotta? oughta? Plotting single-predictor models.
Analysis of Covariance Combines linear regression and ANOVA Can be used to compare g treatments, after controlling for quantitative factor believed to.
Single-Factor Studies KNNL – Chapter 16. Single-Factor Models Independent Variable can be qualitative or quantitative If Quantitative, we typically assume.
Scatterplots and Correlations
9.1 Chapter 9: Dummy Variables A Dummy Variable: is a variable that can take on only 2 possible values: yes, no up, down male, female union member, non-union.
Curvilinear Regression. Monotonic but Non-Linear The relationship between X and Y may be monotonic but not linear. The linear model can be tweaked to.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice- Hall, Inc. Chap 14-1 Business Statistics: A Decision-Making Approach 6 th Edition.
1 Multiple Regression and Correlation KNN Ch. 6 CC Ch 3.
Simple Linear Regression The Coefficients of Correlation and Determination Two Quantitative Variables x variable – independent variable or explanatory.
Analysis of Covariance KNNL – Chapter 22. Analysis of Covariance Goal: To Compare treatments (1-Factor or Multiple Factors) after Controlling for Numeric.
The General Linear Model. Estimation -- The General Linear Model Formula for a straight line y = b 0 + b 1 x x y.
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
12b - 1 © 2000 Prentice-Hall, Inc. Statistics Multiple Regression and Model Building Chapter 12 part II.
1. Refresher on the general linear model, interactions, and contrasts UCL Linguistics workshop on mixed-effects modelling in R May 2016.
Chapter 13 Nonlinear and Multiple Regression
Chapter 11 Multiple Regression
Regression Chapter 6 I Introduction to Regression
Linear Regression CSC 600: Data Mining Class 13.
Multiple Regression Analysis and Model Building
Introduction to the General Linear Model (GLM)
Business Statistics, 4e by Ken Black
Multiple Regression II
Regression Model Building - Diagnostics
Nonlinear Relationships
Multiple Regression II
Rainfall Example The data set contains cord yield (bushes per acre) and rainfall (inches) in six US corn-producing states (Iowa, Nebraska, Illinois, Indiana,
Indicator Variables Often, a data set will contain categorical variables which are potential predictor variables. To include these categorical variables.
Regression Model Building - Diagnostics
Soc 3306a Lecture 11: Multivariate 4
Nonlinear Relationships
Regression Model Building - Diagnostics
Regression and Categorical Predictors
Business Statistics, 4e by Ken Black
Analysis of Covariance
Presentation transcript:

Regression Models for Quantitative (Numeric) and Qualitative (Categorical) Predictors KNNL – Chapter 8

Polynomial Regression Models Useful in 2 Settings: – True relation between response and predictor is polynomial – True relation is complex nonlinear function that can be approximated by polynomial in specific range of X-levels Models with 1 Predictor: Including p polynomial terms in model, creates p-1 “bends” – 2 nd order Model: E{Y} =  0 +  1 x +  2 x 2 (x = centered X) – 3 rd order Model: E{Y} =  0 +  1 x +  2 x 2 +  3 x 3 Response Surfaces with 2 (or more) predictors – 2 nd order model with 2 Predictors:

Modeling Strategies Use Extra Sums of Squares and General Linear Tests to compare models of increasing complexity (higher order) Use coding in fitting models (centered/scaled) predictors to reduce multicollinearity when conducting testing. Fit models in original units, or back-transform for plotting on original scale * (see below for quadratic) For Response Surfaces, include multiple replicates at center point for goodness-of-fit tests

Regression Models with Interaction Term(s) Interaction  Effect (Slope) of one predictor variable depends on the level other predictor variable(s) Formulated by including cross-product term(s) among predictor variables 2 Variable Models: E{Y} =  0 +  1 X 1 +  2 X 2 +  3 X 1 X 2

Qualitative Predictors Often, we wish to include categorical variables as predictors (e.g. gender, region of country, …) Trick: Create dummy (indicator) variable(s) to represent effects of levels of the categorical variables on response Problem: If variable has c categories, and we create c dummy variables, the model is not full rank when we include intercept Solution: Create c – 1 dummy variables, leaving one level as the control/baseline/reference category

Example – Salary vs Experience by Region Solution, just use the Region 1 dummy (X 2 ) and the region 2 dummy (X 3 ), making Region 3 the “reference” region (Note: it is arbitrary which region is the reference)

Example – Salary vs Experience by Region

Interactions Between Qualitative and Quantitative Predictors We can allow the slope wrt to a Quantitative Predictor to differ across levels of the Categorical Predictor Trick: Create cross-product terms between Quantitative Predictor and each of the c-1 dummy variables Can conduct General Linear Test to determine whether slopes differ (or t-test when qualitative predictor has c=2 levels) These models generalize to any number of quantitative and qualitative predictors