The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Calculating interaction effects from OLS coefficients: Interaction between 1 categorical.

Slides:



Advertisements
Similar presentations
Dummy Variables Dummy variables refers to the technique of using a dichotomous variable (coded 0 or 1) to represent the separate categories of a nominal.
Advertisements

The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Planning a speech and designing effective slides Jane E. Miller, PhD.
Some Terms Y =  o +  1 X Regression of Y on X Regress Y on X X called independent variable or predictor variable or covariate or factor Which factors.
Statistics for Managers Using Microsoft® Excel 5th Edition
1 Simple Linear Regression Chapter Introduction In this chapter we examine the relationship among interval variables via a mathematical equation.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 11 th Edition.
Correlation and Regression Analysis
Copyright ©2011 Pearson Education 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft Excel 6 th Global Edition.
Structural Equation Models – Path Analysis
Example of Simple and Multiple Regression
Objectives of Multiple Regression
The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Organizing data in tables and charts: Different criteria for different tasks Jane.
Logarithmic specifications Jane E. Miller, PhD The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition.
Soc 3306a Lecture 10: Multivariate 3 Types of Relationships in Multiple Regression.
The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Creating effective tables and charts Jane E. Miller, PhD.
The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Calculating interaction patterns from logit coefficients: Interaction between two.
Comparing overall goodness of fit across models
The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Calculating the shape of a polynomial from regression coefficients Jane E. Miller,
Statistics and Quantitative Analysis U4320 Segment 8 Prof. Sharyn O’Halloran.
The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Presenting statistical results to nonstatistical audiences Jane E. Miller, PhD.
Soc 3306a Multiple Regression Testing a Model and Interpreting Coefficients.
The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Preparing speaker’s notes and practicing your talk Jane E. Miller, PhD.
The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Implementing “generalization, example, exception”: Behind-the-scenes work for summarizing.
Writing about ratios Jane E. Miller, PhD The Chicago Guide to Writing about Numbers, 2nd Edition.
Chapter 9 Analyzing Data Multiple Variables. Basic Directions Review page 180 for basic directions on which way to proceed with your analysis Provides.
Chapter 17 Partial Correlation and Multiple Regression and Correlation.
The Chicago Guide to Writing about Numbers, 2 nd edition. Basics of writing about numbers: Reporting one number Jane E. Miller, PhD.
School of Information - The University of Texas at Austin LIS 397.1, Introduction to Research in Library and Information Science LIS Introduction.
The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Writing prose to present results of interactions Jane E. Miller, PhD.
The Chicago Guide to Writing about Numbers, 2 nd edition. Preparing speaker’s notes and practicing your talk Jane E. Miller, PhD.
The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Data structure for a discrete-time event history analysis Jane E. Miller, PhD.
Planning how to create the variables you need from the variables you have Jane E. Miller, PhD The Chicago Guide to Writing about Numbers, 2 nd edition.
The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Criteria for choosing a reference category Jane E. Miller, PhD.
The Chicago Guide to Writing about Numbers, 2nd Edition. Planning a speech and designing effective slides Jane E. Miller, PhD.
The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Defining the Goldilocks problem Jane E. Miller, PhD.
Chapter 13 Multiple Regression
The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Conducting post-hoc tests of compound coefficients using simple slopes for a categorical.
The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Interpreting multivariate OLS and logit coefficients Jane E. Miller, PhD.
Standardized coefficients Jane E. Miller, PhD The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition.
September 18-19, 2006 – Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development Conducting and interpreting multivariate analyses.
The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Choosing tools to present numbers: Tables, charts, and prose Jane E. Miller, PhD.
The Chicago Guide to Writing about Numbers, 2 nd edition. Choosing a comparison group Jane E. Miller, PhD.
The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Resolving the Goldilocks problem: Variables and measurement Jane E. Miller, PhD.
Overview of Regression Analysis. Conditional Mean We all know what a mean or average is. E.g. The mean annual earnings for year old working males.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
Introduction to testing statistical significance of interactions Jane E. Miller, PhD The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition.
Testing statistical significance of differences between coefficients Jane E. Miller, PhD The Chicago Guide to Writing about Multivariate Analysis, 2nd.
The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Visualizing shapes of interaction patterns between two categorical independent.
The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Conducting post-hoc tests of compound coefficients using simple slopes for a categorical.
© 2001 Prentice-Hall, Inc.Chap 13-1 BA 201 Lecture 18 Introduction to Simple Linear Regression (Data)Data.
The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Visualizing shapes of interaction patterns with continuous independent variables.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 10 th Edition.
The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Resolving the Goldilocks problem: Presenting results Jane E. Miller, PhD.
The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Creating charts to present interactions Jane E. Miller, PhD.
Approaches to testing statistical significance of interactions Jane E. Miller, PhD The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition.
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Resolving the Goldilocks problem: Model specification Jane E. Miller, PhD.
Introduction Many problems in Engineering, Management, Health Sciences and other Sciences involve exploring the relationships between two or more variables.
Overview of categorical by categorical interactions: Part I: Concepts, definitions, and shapes Interactions in regression models occur when the association.
Calculating interaction effects from OLS coefficients: Interaction between two categorical independent variables Jane E. Miller, PhD As discussed in the.
Using alternative reference categories to test statistical significance of an interaction This podcast is the last in the series on testing statistical.
SIMPLE LINEAR REGRESSION MODEL
Creating variables and specifying models to test for interactions between two categorical independent variables This lecture is the third in the series.
Introduction to interactions in regression models: Concepts and equations Jane E. Miller, PhD Interactions in regression models occur when the association.
Overview of categorical by continuous interactions: Part II: Variables, specifications, and calculations Interactions in regression models occur when.
Chapter 14 Multiple Regression
Testing whether a multivariate specification can be simplified
Presentation transcript:

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Calculating interaction effects from OLS coefficients: Interaction between 1 categorical and 1 continuous independent variable Jane E. Miller, PhD

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Overview General equation for a model with main effects and interactions Coding of main effects and interaction terms Solving for the interaction pattern based on estimated coefficients – Intercept – Slope Graphical depiction of the sum of coefficients for particular combinations of the independent variables

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Review: Contingency of coefficients in an interaction model Y = β 0 + β 1 X 1 + β 2 X 2 + β 3 X 1 _ X 2, Inclusion of the interaction term X 1 _ X 2 means that the β i s on the main effects terms X 1 and X 2 no longer apply to all values of X 1 and X 2. – The main effects and interactions β i s for X 1 and X 2 are contingent upon one another and cannot be considered separately.

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Review: Implications for interpreting main effects and interaction βs Y = β 0 + β 1 X 1 + β 2 X 2 + β 3 X 1 _ X 2, In the interaction model: – β 1 estimates the effect of X 1 on Y when X 2 = 0, – β 2 estimates the effect of X 2 on Y when X 1 = 0, – β 3 must also be considered in order to calculate the shape of the overall pattern among X 1, X 2, and Y. E.g., when X 1 and X 2 take on other values.

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Review: Some possible patterns of association between IPR, race, and birth weight IPR BW IPR BW IPR BW IPR BW White Black No racial difference in IPR/BW relation: intercept and slope same for blacks & whites. Blacks & whites have same intercept but different slope of IPR/BW curves Blacks & whites have different slope and intercepts of IPR/BW curves Blacks & whites have same slope but different intercepts of IPR/BW curves

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. General equation for predicted value of DV based on an interaction model The general equation to calculate the predicted value of the dependent variable includes – main effects coefficients – interaction term coefficients – values of the independent variables = β 0 + (β NHB × NHB) + (β IPR × IPR) + (β NHB_IPR × NHB_IPR)

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Calculating overall effect of interaction for specific case characteristics = β 0 + (β NHB × NHB) + (β IPR × IPR) + (β NHB_IPR × NHB_IPR) Each coefficient is multiplied by the value of the associated variable for cases with the characteristics of interest. To see which coefficients pertain to which cases, fill in values of variables for different combinations of race and the income-to-poverty ratio (IPR).

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Example: Estimated coefficients β Intercept 3,106 Main effect terms Non-Hispanic black (NHB) –177 Income-to-poverty ratio (IPR) 23 Interaction term NHB_IPR –5–5 IPR = family income ($) / Federal Poverty Level for a family of that size and age composition. Reference category: Non-Hispanic whites.

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Interpreting the intercept The intercept β 0 from an OLS model is an estimate of the level of the dependent variable when continuous variables take the value 0, for infants in the reference category for all categorical variables. In a model where – The dependent variable is birth weight in grams. – The reference category is specified to be non-Hispanic white infants. β 0 is an estimate of birth weight when IPR = 0, for non- Hispanic white infants.

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Review: Coding of main effect and interaction term variables: race and income Case characteristics – SELECTED VALUES Variables Main effects termsInteraction term NHBIPRNHB_IPR Non-H white & IPR = Non-H white & IPR = Non-H white & IPR = For a two-category race variable (non-Hispanic white = reference category). E.g., IPR = 0.5 means family income is half the Federal Poverty Level (FPL); IPR = 2.0 means family income is twice the FPL. Reference category

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Calculating the value of the intercept for one group NHBIPRNHB_IPR Non-H white & IPR = = β 0 + (β NHB × NHB) + (β IPR × IPR) + (β NHB_IPR × NHB_IPR ) The intercept for non-Hispanic whites is calculated: = β 0 + (β NHB × 0) + (β IPR × 0.0) + (β NHB_IPR × 0.0) = β 0 Thus, the intercept for non-Hispanic white infants (when IPR = 0) collapses to include only β 0 because all of the other coefficients in the formula are multiplied by a value of 0.

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Interpreting the IPR/birth weight pattern IPR is a continuous variable – The coefficient is an estimate of the effect on the dependent for a 1-unit increase in the continuous IV, with categorical variables set to their reference category values. So β IPR estimates the increment in birth weight for every one-unit increase in IPR (e.g., from family income at the poverty line to twice the poverty line) – It is the slope of the IPR/birth weight curve for infants in the reference category, in this case, non-Hispanic white infants.

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Calculating values for the IPR/birth weight curve for white infants NHBIPRNHB_IPR Non-H white & IPR = = β 0 + (β NHB × 0) + (β IPR × 1.5) + (β NHB_IPR × 0) = β 0 + (β IPR × 1.5) Because non-Hispanic whites are the reference category for race, the equation collapses to include only the IPR main effect (β IPR ) because the other coefficients are multiplied by 0. = β 0 + (β IPR × IPR) = β 0 + (β NHB × NHB) + (β IPR × IPR) + (β NHB_IPR × NHB_IPR)

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Calculating values for the IPR/birth weight curve for white infants NHBIPRNHB_IPR Non-H white & IPR = = β 0 + (β NHB × 0) + (β IPR × 3.0) + (β NHB_IPR × 0) = β 0 + β IPR × 3.0 = β 0 + (β NHB × NHB) + (β IPR × IPR) + (β NHB_IPR × NHB_IPR)

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Interpreting the race main effect The main effect β NHB estimates the difference in birth weight between non-Hispanic black infants and those in the reference category (non- Hispanic whites), when continuous variables are set at the value 0. It is an estimate of the difference in intercept between black and white infants when IPR is 0.

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Calculating the intercept for different values of the categorical variable NHBIPRNHB_IPR Non-H white & IPR = NHBIPRNHB_IPR Non-H black & IPR = As we saw a moment ago, for the intercept for non-Hispanic whites is calculated: = β 0 + (β NHB × 0) + (β IPR × 0.0) + (β NHB_IPR × 0.0) = β 0 For non-Hispanic blacks, the intercept is calculated: = β 0 + (β NHB × 1) + (β IPR × 0.0) + (β NHB_IPR × 0.0) = β 0 + β NHB

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. More on the race main effect It is an estimate of the difference in intercept between black and white infants when IPR is 0. = β 0 + β NHB = 3,106 + (– 177) = 2,929 In other words, black infants born to families with an IPR of zero have a predicted birth weight of 2,929 grams. – or 177 grams LOWER than that of their white counterparts.

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Calculating values for the IPR/birth weight curve for white infants Because non-Hispanic whites are the reference category for race, the equation collapses to include only the IPR main effect (β IPR ) because the other coefficients are multiplied by 0. = β 0 + (β NHB × NHB) + (β IPR × IPR) + (β NHB_IPR × NHB_IPR) = β 0 + (β NHB × 0) + (β IPR × IPR) + (β NHB_IPR × 0) = β 0 + (β IPR × IPR)

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Calculating values for the IPR birth weight curve for black infants NHBIPRNHB_IPR Non-H black & IPR = = β 0 + (β NHB × 1) + (β IPR × 1.5) + (β NHB_IPR × 1.5) For Non-Hispanic blacks, the equation includes all three terms (β NHB, β IPR, and β NHB_IPR ) because each of those coefficients is multiplied by a non-zero value.

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Interpreting the coefficient on the interaction between race and IPR The slope – for blacks = β IPR + β NHB_IPR = 23 + (–5) = 18 – for whites = β IPR = 23 The race_IPR coefficient tests whether the slope of the IPR/birth weight pattern is different for non-Hispanic black infants than for their non- Hispanic white counterparts. – β NHB_IPR is thus the estimated difference in slope for blacks compared to whites.

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. More on the race/IPR interaction The estimated coefficients mean that each 1- unit increase in IPR is associated with  23 grams more birth weight among non-Hispanic white infants.  18 grams more birth weight among non-Hispanic black infants.  Thos values are the slopes of the respective IPR/BW curves for the two racial/ethnic groups.

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Preparing to graph the slope of IPR/birthweight by race For infants in the reference category (non- Hispanic white), – Multiply selected values of IPR by β IPR and add to β 0 to obtain predicted birth weight at interesting values of IPR. For non-Hispanic black infants, – Multiply selected values of IPR by β IPR + β NHB_IPR then add to β 0 + β NHB.

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Calculated birth weight by race for selected values of IPR IPR (family income in multiples of the FPL) Non-Hispanic whiteNon-Hispanic black FormulaResultFormulaResult 0 = β × β IPR = 3, ×23 3,106 = β 0 + β NHB + 0 × (β IPR + β NHB_IPR ) = 3,106 – × (23 – 5) 2,929 1 = β 0 + 1× β IPR = 3, ×23 = 3, ,129 = β 0 + β NHB + 1 × (β IPR + β NHB_IPR ) = 3,106 – × (23 – 5) = 2, × (18) = 2, ,947 … 6 = β × β IPR = 3, ×23 = 3, ,244 = β 0 + β NHB + 6 × (β IPR + β NHB_IPR ) = 3,106 – × (23 – 5) = 2, × (18) = 2, ,037 β 0 = 3,106; β IPR = 23; β NHB = –177; β NHB_IPR = –5

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Use a spreadsheet to calculate and graph the interaction Spreadsheets can – Store The estimated coefficients The input values of the independent variables The correct generalized formula to calculate the predicted values for many combinations of the IVs involved in the interaction – Graph the overall pattern See spreadsheet template and voice-over explanation

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. 3,200 3,100 3,000 2,900 2,800 * Ref cat = Reference category = non-Hispanic white infants. = β 0 + β NHB = 3,106 + ( – 177) = 2,929 = intercept for black infants 142 3,300 Birth weight (grams) IPR 0 = β 0 = intercept = 3,106 = predicted BW for ref cat * = β IPR = 23 = slope of IPR/ BW curve for ref cat * = β IPR + β NHB_IPR = 23 – 5 = 18 = slope of IPR/ BW curve for non-Hispanic black infants Predicted birth weight by race/ethnicity and IPR 6

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Overall shape of the race/IPR/ birth weight pattern Based on this set of βs, black infants have – a lower birth weight than whites at all IPR levels. Negative coefficient on the NHB main effect yields a lower intercept for blacks than for whites. – a slower rate of birth weight increase as IPR rises. Negative coefficient on NHB_IPR, which yields a shallower slope of the IPR/birth weight curve for blacks than for whites. Thus the deficit in birth weight for blacks widens with increasing IPR.

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Using the three-way chart to verify your multivariate results Check the pattern calculated from the estimated coefficients against the simple three-way chart. If the shapes are wildly inconsistent with one another, probably reflects an error in either – How you specified the model, or – How you calculated the overall pattern from the coefficients. Small changes in the shape or size of the pattern may occur due to controlling for other variables in your multivariate model.

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Summary An interaction between a continuous and a categorical independent variable will yield differences in the intercept and/or slope of the association between the continuous IV and the DV. Calculating the overall shape of an interaction requires adding together the pertinent main effects and interaction term βs for combinations of the categorical IV and selected values of the continuous IV in the interaction. – A spreadsheet can be helpful for storing and organizing the βs, input values, and formulas.

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Suggested resources Chapters 9 and 16 of Miller, J.E The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Chapters 8 and 9 of Cohen et al Applied Multiple Regression/Correlation Analysis for the Behavioral Sciences, 3rd Edition. Florence, KY: Routledge.

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Supplemental online resources Podcasts – Introduction to interactions – Creating variables to test for interactions – Specifying models to test for interactions – Interpreting multivariate regression coefficients Spreadsheet template for calculating overall effect of an interaction between a categorical and a continuous independent variable.

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Suggested practice exercises Study guide to The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. – Question #4 in the problem set for Chapter 16 – Suggested course extensions for Chapter 16 “Applying statistics and writing” exercise #2.

The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Contact information Jane E. Miller, PhD Online materials available at