Does your logistic regression model suck?. PERFECTION!

Slides:



Advertisements
Similar presentations
Continued Psy 524 Ainsworth
Advertisements

ANNMARIA DE MARS, Ph.D. The Julia Group Logistic Regression: For when your data really do fit in neat little boxes.
1 Regression as Moment Structure. 2 Regression Equation Y =  X + v Observable Variables Y z = X Moment matrix  YY  YX  =  YX  XX Moment structure.
Logistic Regression Psy 524 Ainsworth.
Penalized Maximum Likelihood Logistic Regression
Overview of Logistics Regression and its SAS implementation
Structural Equation Modeling
P449. p450 Figure 15-1 p451 Figure 15-2 p453 Figure 15-2a p453.
GRA 6020 Multivariate Statistics; The Linear Probability model and The Logit Model (Probit) Ulf H. Olsson Professor of Statistics.
Introduction to Logistic Regression. Simple linear regression Table 1 Age and systolic blood pressure (SBP) among 33 adult women.
CHAPTER 4 ECONOMETRICS x x x x x Multiple Regression = more than one explanatory variable Independent variables are X 2 and X 3. Y i = B 1 + B 2 X 2i +
Notes on Logistic Regression STAT 4330/8330. Introduction Previously, you learned about odds ratios (OR’s). We now transition and begin discussion of.
Circular statistics Maximum likelihood Local likelihood Kenneth D. Harris 4/3/2015.
Logistic Regression – Basic Relationships
Introduction to Multilevel Modeling Using SPSS
Structural Equation Modeling 3 Psy 524 Andrew Ainsworth.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Categorical Data Analysis School of Nursing “Categorical Data Analysis 2x2 Chi-Square Tests and Beyond (Multiple Categorical Variable Models)” Melinda.
Logistic regression is used when a few conditions are met: 1. There is a dependent variable. 2. There are two or more independent variables. 3. The dependent.
Temperature correction of energy consumption time series Sumit Rahman, Methodology Advisory Service, Office for National Statistics.
2-1 MGMG 522 : Session #2 Learning to Use Regression Analysis & The Classical Model (Ch. 3 & 4)
Non-Linear Models. Non-Linear Growth models many models cannot be transformed into a linear model The Mechanistic Growth Model Equation: or (ignoring.
CJT 765: Structural Equation Modeling Class 7: fitting a model, fit indices, comparingmodels, statistical power.
Non-Linear Models. Non-Linear Growth models many models cannot be transformed into a linear model The Mechanistic Growth Model Equation: or (ignoring.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-5 Multiple Regression.
Multilevel Linear Models Field, Chapter 19. Why use multilevel models? Meeting the assumptions of the linear model – Homogeneity of regression coefficients.
Estimation Kline Chapter 7 (skip , appendices)
CS 782 – Machine Learning Lecture 4 Linear Models for Classification  Probabilistic generative models  Probabilistic discriminative models.
Multivariate Data Analysis Chapter 5 – Discrimination Analysis and Logistic Regression.
Logistic Regression. Conceptual Framework - LR Dependent variable: two categories with underlying propensity (yes/no) (absent/present) Independent variables:
Linear vs. Logistic Regression Log has a slightly better ability to represent the data Dichotomous Prefer Don’t Prefer Linear vs. Logistic Regression.
Copyright © 2005 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Managerial Economics Thomas Maurice eighth edition Chapter 4.
Regression Correlation Background Defines relationship between two variables X and Y R ranges from -1 (perfect negative correlation) 0 (No correlation)
Today Ensemble Methods. Recap of the course. Classifier Fusion
Measurement Models: Exploratory and Confirmatory Factor Analysis James G. Anderson, Ph.D. Purdue University.
ELEC 303 – Random Signals Lecture 18 – Classical Statistical Inference, Dr. Farinaz Koushanfar ECE Dept., Rice University Nov 4, 2010.
The McGraw-Hill Companies, Inc. 2006McGraw-Hill/Irwin DSS-ESTIMATING COSTS.
Regression Analysis Part C Confidence Intervals and Hypothesis Testing
Slide 1 The Kleinbaum Sample Problem This problem comes from an example in the text: David G. Kleinbaum. Logistic Regression: A Self-Learning Text. New.
Logistic Regression. Linear Regression Purchases vs. Income.
Scatter Diagrams scatter plot scatter diagram A scatter plot is a graph that may be used to represent the relationship between two variables. Also referred.
MTMM Byrne Chapter 10. MTMM Multi-trait multi-method – Traits – the latent factors you are trying to measure – Methods – the way you are measuring the.
1 Chapter 16 logistic Regression Analysis. 2 Content Logistic regression Conditional logistic regression Application.
NON-LINEAR REGRESSION Introduction Section 0 Lecture 1 Slide 1 Lecture 6 Slide 1 INTRODUCTION TO Modern Physics PHYX 2710 Fall 2004 Intermediate 3870 Fall.
Chapter 14 Chi-Square Tests.  Hypothesis testing procedures for nominal variables (whose values are categories)  Focus on the number of people in different.
Applied Epidemiologic Analysis - P8400 Fall 2002 Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.
LOGISTIC REGRESSION Binary dependent variable (pass-fail) Odds ratio: p/(1-p) eg. 1/9 means 1 time in 10 pass, 9 times fail Log-odds ratio: y = ln[p/(1-p)]
Logistic Regression Saed Sayad 1www.ismartsoft.com.
Oyindamola B. Yusuf (Ph.D) Department of Epidemiology and Medical Statistics, College of Medicine, University of Ibadan, Ibadan, Nigeria 1.
Using SPSS Note: The use of another statistical package such as Minitab is similar to using SPSS.
Nonparametric Statistics
Four way analysis Nursing home residence Gender Age Death.
REGRESSION MODEL FITTING & IDENTIFICATION OF PROGNOSTIC FACTORS BISMA FAROOQI.
LOGISTIC REGRESSION. Purpose  Logistical regression is regularly used when there are only two categories of the dependent variable and there is a mixture.
Chapter 4. The Normality Assumption: CLassical Normal Linear Regression Model (CNLRM)
Nonparametric Statistics
Logistic Regression.
Notes on Logistic Regression
Basic Estimation Techniques
CJT 765: Structural Equation Modeling
Basic Estimation Techniques
G Lecture 6 Multilevel Notation; Level 1 and Level 2 Equations
Nonparametric Statistics
Working with missing Data
Lecture 5 Unsupervised Learning in fully Observed Directed and Undirected Graphical Models.
1.7 Nonlinear Regression.
5.2 Least-Squares Fit to a Straight Line
Regression and Categorical Predictors
Chapter 18: The Chi-Square Statistic
Presentation transcript:

Does your logistic regression model suck?

PERFECTION!

This is bad Model Convergence Status Quasi-complete separation of data points detected. Warning: The maximum likelihood estimate may not exist. Warning: The LOGISTIC procedure continues in spite of the above warning. Results shown are based on the last maximum likelihood iteration. Validity of the model fit is questionable.

Complete separation XGroup

If you don’t go to church you will never die

Quasi-complete separation Like complete separation BUT one or more points where the points have both values

there is not a unique maximum likelihood estimate

“FOR ANY DICHOTOMOUS INDEPENDENT VARIABLE IN A LOGISTIC REGRESSION, IF THERE IS A ZERO IN THE 2 X 2 TABLE FORMED BY THAT VARIABLE AND THE DEPENDENT VARIABLE, THE ML ESTIMATE FOR THE REGRESSION COEFFICIENT DOES NOT EXIST.” Depressing words from Paul Allison

What the hell happened?

Solution? Collect more data. Figure out why your data are missing and fix that. Delete the category that has the zero cell.. Delete the variable that is causing the problem

Model Fit Statistics

Bonus fact! You can compare nested models using the model fit statistics and test if one model is superior to the other