Unit 2a: Dealing “Empirically” with Nonlinear Relationships © Andrew Ho, Harvard Graduate School of EducationUnit 2a – Slide 1

Slides:



Advertisements
Similar presentations
Unit 4a: Basic Logistic (Binomial Logit) Regression Analysis © Andrew Ho, Harvard Graduate School of EducationUnit 4a – Slide 1
Advertisements

Logistic Regression Psy 524 Ainsworth.
Forecasting Using the Simple Linear Regression Model and Correlation
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Correlation & Regression Chapter 15. Correlation statistical technique that is used to measure and describe a relationship between two variables (X and.
© Willett, Harvard University Graduate School of Education, 5/21/2015S052/I.3(b) – Slide 1 More details can be found in the “Course Objectives and Content”
Lecture 8 Relationships between Scale variables: Regression Analysis
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: What it Is and How it Works. Overview What is Bivariate Linear Regression? The Regression Equation How It’s Based on r Assumptions.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 14 Using Multivariate Design and Analysis.
Statistics for Managers Using Microsoft® Excel 5th Edition
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #18.
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
The Simple Regression Model
Topic 3: Regression.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 11 th Edition.
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Correlation and Regression Analysis
Unit 5c: Adding Predictors to the Discrete Time Hazard Model © Andrew Ho, Harvard Graduate School of EducationUnit 5c– Slide 1
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.
Linear Regression/Correlation
S052/Shopping Presentation – Slide #1 © Willett, Harvard University Graduate School of Education S052: Applied Data Analysis Shopping Presentation: A.
Review for Final Exam Some important themes from Chapters 9-11 Final exam covers these chapters, but implicitly tests the entire course, because we use.
Unit 5c: Adding Predictors to the Discrete Time Hazard Model © Andrew Ho, Harvard Graduate School of EducationUnit 5c– Slide 1
Unit 4c: Taxonomies of Logistic Regression Models © Andrew Ho, Harvard Graduate School of EducationUnit 4c – Slide 1
Unit 3b: From Fixed to Random Intercepts © Andrew Ho, Harvard Graduate School of EducationUnit 3b – Slide 1
Copyright ©2011 Pearson Education 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft Excel 6 th Global Edition.
Correlation & Regression
Unit 2b: Dealing “Rationally” with Nonlinear Relationships © Andrew Ho, Harvard Graduate School of EducationUnit 2b – Slide 1
Unit 4c: Taxonomies of Logistic Regression Models © Andrew Ho, Harvard Graduate School of EducationUnit 4c – Slide 1
Unit 4b: Fitting the Logistic Model to Data © Andrew Ho, Harvard Graduate School of EducationUnit 4b – Slide 1
© Willett, Harvard University Graduate School of Education, 8/27/2015S052/I.3(c) – Slide 1 More details can be found in the “Course Objectives and Content”
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition
Chapter 13: Inference in Regression
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft.
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
Unit 5b: The Logistic Regression Approach to Life Table Analysis © Andrew Ho, Harvard Graduate School of EducationUnit 5b– Slide 1
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
Regression Analysis. Scatter plots Regression analysis requires interval and ratio-level data. To see if your data fits the models of regression, it is.
SEM: Basics Byrne Chapter 1 Tabachnick SEM
Unit 1c: Detecting Influential Data Points and Assessing Their Impact © Andrew Ho, Harvard Graduate School of EducationUnit 1c – Slide 1
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
S052/Shopping Presentation – Slide #1 © Willett, Harvard University Graduate School of Education S052: Applied Data Analysis What Would You Like To Know.
Regression. Types of Linear Regression Model Ordinary Least Square Model (OLS) –Minimize the residuals about the regression linear –Most commonly used.
September 18-19, 2006 – Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development Conducting and interpreting multivariate analyses.
Unit 3a: Introducing the Multilevel Regression Model © Andrew Ho, Harvard Graduate School of EducationUnit 3a – Slide 1
1 Regression Analysis The contents in this chapter are from Chapters of the textbook. The cntry15.sav data will be used. The data collected 15 countries’
© Willett, Harvard University Graduate School of Education, 12/16/2015S052/I.1(d) – Slide 1 More details can be found in the “Course Objectives and Content”
Chapter 14: Inference for Regression. A brief review of chapter 4... (Regression Analysis: Exploring Association BetweenVariables )  Bi-variate data.
Correlation & Regression Analysis
I271B QUANTITATIVE METHODS Regression and Diagnostics.
Residual Analysis Purposes –Examine Functional Form (Linear vs. Non- Linear Model) –Evaluate Violations of Assumptions Graphical Analysis of Residuals.
© Willett, Harvard University Graduate School of Education, 1/19/2016S052/I.2(a) – Slide 1 More details can be found in the “Course Objectives and Content”
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 10 th Edition.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
AP Statistics Section 15 A. The Regression Model When a scatterplot shows a linear relationship between a quantitative explanatory variable x and a quantitative.
Univariate Point Estimation Confidence Interval Estimation Bivariate: Linear Regression Multivariate: Multiple Regression 1 Chapter 4: Statistical Approaches.
BPS - 5th Ed. Chapter 231 Inference for Regression.
Week 2 Normal Distributions, Scatter Plots, Regression and Random.
Chapter 15 Multiple Regression Model Building
CHAPTER 26: Inference for Regression
Prepared by Lee Revere and John Large
Simple Linear Regression
Basic Practice of Statistics - 3rd Edition Inference for Regression
CHAPTER 12 More About Regression
Product moment correlation
Correlation and Regression Lecture 1 Sections: 10.1 – 10.2
Presentation transcript:

Unit 2a: Dealing “Empirically” with Nonlinear Relationships © Andrew Ho, Harvard Graduate School of EducationUnit 2a – Slide 1

Revisiting the importance of the linearity assumption Understanding Tukey’s Ladder and the Rule of the Bulge Transformations and the Box-Cox Procedure © Andrew Ho, Harvard Graduate School of Education Unit 2a– Slide 2 Multiple Regression Analysis (MRA) Multiple Regression Analysis (MRA) Do your residuals meet the required assumptions? Test for residual normality Use influence statistics to detect atypical datapoints If your residuals are not independent, replace OLS by GLS regression analysis Use Individual growth modeling Specify a Multi-level Model If time is a predictor, you need discrete- time survival analysis… If your outcome is categorical, you need to use… Binomial logistic regression analysis (dichotomous outcome) Multinomial logistic regression analysis (polytomous outcome) If you have more predictors than you can deal with, Create taxonomies of fitted models and compare them. Form composites of the indicators of any common construct. Conduct a Principal Components Analysis Use Cluster Analysis Use non-linear regression analysis. Transform the outcome or predictor If your outcome vs. predictor relationship is non-linear, Use Factor Analysis: EFA or CFA? Course Roadmap: Unit 2a Today’s Topic Area

© Andrew Ho, Harvard Graduate School of Education Unit 2a– Slide 3 “In the population, …Assumption How Does Failure of the Assumption Affect OLS Regression Analysis? Linear Outcome/Predictor Relationships … the bivariate relationship between the outcome and each predictor must be linear.” If the modeled relationship is not linear, then it will be misrepresented by the linear regression analysis, and the fundamental underpinnings of the entire analysis are at risk:  OLS-estimated regression slope will not represent the population relationship.  Assumptions about the population residuals (sometimes called, simply, “errors”) will be violated.  Estimated residuals will be incorrect.  Statistical inference will be incorrect. High-priority conditions must be met for accurate statistical inference with linear OLS regression. (Most of this falls under the heading of “independent and identically normally distributed errors.”

© Andrew Ho, Harvard Graduate School of EducationUnit 2a – Slide 4 Two General Approaches to Fitting Nonlinear Relationships  Use theory, or knowledge of the field, to postulate a non-linear model for the hypothesized relationship between outcome and predictor.  Use nonlinear regression analysis to fit the postulated trend, and conduct all of your statistical inference there.  Interpret the parameter estimates directly, and produce plots of findings.  Use theory, or knowledge of the field, to postulate a non-linear model for the hypothesized relationship between outcome and predictor.  Use nonlinear regression analysis to fit the postulated trend, and conduct all of your statistical inference there.  Interpret the parameter estimates directly, and produce plots of findings. Next Class Harder to apply, easier to interpret Theory-Driven, “Rational” Approach  Find an ad-hoc transformation of either the outcome or the predictor, or both, that renders their relationship linear.  Use regular linear regression analysis to fit a linear trend in the transformed world, and conduct all statistical inference there.  De-transform fitted model to produce plots of findings, and tell the substantive story in the untransformed world.  Find an ad-hoc transformation of either the outcome or the predictor, or both, that renders their relationship linear.  Use regular linear regression analysis to fit a linear trend in the transformed world, and conduct all statistical inference there.  De-transform fitted model to produce plots of findings, and tell the substantive story in the untransformed world. Today’s Class Easier to apply, harder to interpret Data-Driven, “Empirical” Approach

© Andrew Ho, Harvard Graduate School of EducationUnit 2a – Slide 5 Nancy Bayley’s Infant #8: The Development of Intelligence DatasetBAYLEY.txt OverviewIQ as a function of age for a female infant, from birth to age 60 months. Source Target child is a female infant (infant #8) from the Berkeley Growth and Guidance Study. More Info To learn more about the data, consult:  The overview of the Oakland and Berkeley Growth and Guidance Studies at the Carolina Population Center. Carolina Population Center  Glen Elder’s presentation on “Longitudinal Studies and the Life Course, the 1960s and 1970s,” prepared for the anniversary of the Institute of Human Development, UC Berkeley (2003).Longitudinal Studies and the Life Course, the 1960s and 1970s Sample sizeOne infant, over 21 occasions of measurement. Last updatedOctober 6, 2007 Structure of Dataset Col. # Variable Name Variable DescriptionVariable Metric/Labels 1IQ Infant’s score on the Bayley Scales of Infant DevelopmentBayley Scales of Infant Development Continuous raw score 2AGEAge of infantMonths IQ AGE IQ AGE This analysis comes with some caveats. 1) We’re interested in the nature of individual growth over time (addressed later in this class and in S-077), and 2) we aren’t fully accounting for differences between individuals (we only have 1), or 3) the problem of “autocorrelation” that can arise in time series data. “Adjacent” errors may not be independent!

© Andrew Ho, Harvard Graduate School of EducationUnit 2a – Slide 6 Well, a simple linear fit doesn’t look like it’s going to suffice. RQ: What is the functional form of the growth trajectory?

© Andrew Ho, Harvard Graduate School of EducationUnit 2a – Slide 7 Use residual plots for better diagnosis of regression assumptions Residual plots (including those introduced in Unit 1d with standardized residuals) are far better at detecting nonlinearity than straight scatterplots. These statistics seem quite compelling but are deeply misleading. R-squared understates the strength of the nonlinear relationship, and interpreting the slope of a line, as well as its significance, is an exercise in describing a poorly specified model.

© Andrew Ho, Harvard Graduate School of EducationUnit 5 / Page 8 UP Middle rung: No transformation (power = 1) Middle rung: No transformation (power = 1) Upper rungs: Higher powers (power > 1) Upper rungs: Higher powers (power > 1) Really low rungs: Inverses (power < 0) Really low rungs: Inverses (power < 0) Increasing power Decreasing power Lower rungs: Roots (0 < power < 1) Lower rungs: Roots (0 < power < 1).. DOWN

© Andrew Ho, Harvard Graduate School of EducationUnit 2a – Slide 9 Which transformation? For which variable? UP Increasing power Decreasing power.. DOWN

© Andrew Ho, Harvard Graduate School of EducationUnit 1b – Slide 10

© Andrew Ho, Harvard Graduate School of EducationUnit 1b – Slide 11

© Andrew Ho, Harvard Graduate School of EducationUnit 2a – Slide 12 A Rough, Shallow, Data-Driven Approach Original correlation Best of these correlations

© Andrew Ho, Harvard Graduate School of EducationUnit 2a – Slide 13 “Starting” or “Tuning” your Transformation by Adding a Constant  Power and log transformations become problematic with negative and zero values.  Even with all positive values, we often add 1 by convention to “start” or “tune” the transformation.  This ends up making a small difference and offers another indication of how arbitrary and shallow this data-driven process can be.  Power and log transformations become problematic with negative and zero values.  Even with all positive values, we often add 1 by convention to “start” or “tune” the transformation.  This ends up making a small difference and offers another indication of how arbitrary and shallow this data-driven process can be. Add the starting/tuning constant BEFORE you transform. Untransformed Transformed, no starting constant. Add a starting constant of 1 prior to transformation.

Unit 2a – Slide 14 The Box-Cox Procedure: A Formal, Still Shallow, Data-Driven Approach © Andrew Ho, Harvard Graduate School of Education The Box-Cox procedure will overstate R- sq and inflate your Type I error rate (false alarms) if used uncritically. It capitalizes on chance variation in the sample that leads to a fit that does not generalize to the population (overfitting) UP.. DOWN

Unit 2a – Slide 15© Andrew Ho, Harvard Graduate School of Education All we’re doing is plotting this equation over a scatterplot of our data

Unit 2a – Slide 16© Andrew Ho, Harvard Graduate School of Education All we’re doing is plotting this equation over a scatterplot of our data

Unit 2a – Slide 17 The Transformed World and the Untransformed World © Andrew Ho, Harvard Graduate School of Education Regression line in the transformed space (bending the points): Implied function in the untransformed space (bending the line)

© Andrew Ho, Harvard Graduate School of EducationUnit 1b – Slide 18 UntransformedTransformed Horizontal vs. vertical acceleration. Sometimes accompanied by telltale decrease in the density of observations (positive skew)