Dependent Variable Discrete  2 values – binomial  3 or more discrete values – multinomial  Skewed – e.g. Poisson Continuous  Non-normal.

Slides:



Advertisements
Similar presentations
Chapter 2 Describing Contingency Tables Reported by Liu Qi.
Advertisements

© Department of Statistics 2012 STATS 330 Lecture 32: Slide 1 Stats 330: Lecture 32.
Brief introduction on Logistic Regression
- Word counts - Speech error counts - Metaphor counts - Active construction counts Moving further Categorical count data.
Analysis of Categorical Data Nick Jackson University of Southern California Department of Psychology 10/11/
Log-linear and logistic models Generalised linear model ANOVA revisited Log-linear model: Poisson distribution logistic model: Binomial distribution Deviances.
Week 3. Logistic Regression Overview and applications Additional issues Select Inputs Optimize complexity Transforming Inputs.
Presentation and Data  Short Courses  Intro to SAS  Download Data to Desktop 1.
PROC GLIMMIX: AN OVERVIEW
Part V The Generalized Linear Model Chapter 16 Introduction.
Binary Response Lecture 22 Lecture 22.
1 Experimental design and analyses of experimental data Lesson 6 Logistic regression Generalized Linear Models (GENMOD)
Linear statistical models 2009 Models for continuous, binary and binomial responses  Simple linear models regarded as special cases of GLMs  Simple linear.

GRA 6020 Multivariate Statistics; The Linear Probability model and The Logit Model (Probit) Ulf H. Olsson Professor of Statistics.
Generalised linear models
GRA 6020 Multivariate Statistics; The Linear Probability model and The Logit Model (Probit) Ulf H. Olsson Professor of Statistics.
Linear statistical models 2008 Model diagnostics  Residual analysis  Outliers  Dependence  Heteroscedasticity  Violations of distributional assumptions.
Log-linear and logistic models Generalised linear model ANOVA revisited Log-linear model: Poisson distribution logistic model: Binomial distribution Deviances.
Log-linear and logistic models
Logistic Regression Biostatistics 510 March 15, 2007 Vanessa Perez.
Generalised linear models Generalised linear model Exponential family Example: Log-linear model - Poisson distribution Example: logistic model- Binomial.
OLS versus MLE Example YX Here is the data:
Linear and generalised linear models Purpose of linear models Least-squares solution for linear models Analysis of diagnostics Exponential family and generalised.
Linear statistical models 2009 Count data  Contingency tables and log-linear models  Poisson regression.
Regression Model Building Setting: Possibly a large set of predictor variables (including interactions). Goal: Fit a parsimonious model that explains variation.
Generalized Linear Models
Poisson Regression Caution Flags (Crashes) in NASCAR Winston Cup Races L. Winner (2006). “NASCAR Winston Cup Race Results for ,” Journal.
Review of Lecture Two Linear Regression Normal Equation
SAS Lecture 5 – Some regression procedures Aidan McDermott, April 25, 2005.
MODELS OF QUALITATIVE CHOICE by Bambang Juanda.  Models in which the dependent variable involves two ore more qualitative choices.  Valuable for the.
1 G Lect 11W Logistic Regression Review Maximum Likelihood Estimates Probit Regression and Example Model Fit G Multiple Regression Week 11.
Lecture 6 Generalized Linear Models Olivier MISSA, Advanced Research Skills.
Fixed vs. Random Effects Fixed effect –we are interested in the effects of the treatments (or blocks) per se –if the experiment were repeated, the levels.
Chapter 3: Generalized Linear Models 3.1 The Generalization 3.2 Logistic Regression Revisited 3.3 Poisson Regression 1.
ALISON BOWLING THE GENERAL LINEAR MODEL. ALTERNATIVE EXPRESSION OF THE MODEL.
Today: Lab 9ab due after lecture: CEQ Monday: Quizz 11: review Wednesday: Guest lecture – Multivariate Analysis Friday: last lecture: review – Bring questions.
2 December 2004PubH8420: Parametric Regression Models Slide 1 Applications - SAS Parametric Regression in SAS –PROC LIFEREG –PROC GENMOD –PROC LOGISTIC.
Generalized Linear Models All the regression models treated so far have common structure. This structure can be split up into two parts: The random part:
Linear Model. Formal Definition General Linear Model.
Different Distributions David Purdie. Topics Application of GEE to: Binary outcomes: – logistic regression Events over time (rate): –Poisson regression.
When and why to use Logistic Regression?  The response variable has to be binary or ordinal.  Predictors can be continuous, discrete, or combinations.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
The Triangle of Statistical Inference: Likelihoood Data Scientific Model Probability Model Inference.
Forecasting Choices. Types of Variable Variable Quantitative Qualitative Continuous Discrete (counting) Ordinal Nominal.
1 GLM I: Introduction to Generalized Linear Models By Curtis Gary Dean Distinguished Professor of Actuarial Science Ball State University By Curtis Gary.
1 STA 617 – Chp10 Models for matched pairs Summary  Describing categorical random variable – chapter 1  Poisson for count data  Binomial for binary.
1 Follow the three R’s: Respect for self, Respect for others and Responsibility for all your actions.
Statistics……revisited
LOGISTIC REGRESSION Binary dependent variable (pass-fail) Odds ratio: p/(1-p) eg. 1/9 means 1 time in 10 pass, 9 times fail Log-odds ratio: y = ln[p/(1-p)]
Statistics 2: generalized linear models. General linear model: Y ~ a + b 1 * x 1 + … + b n * x n + ε There are many cases when general linear models are.
Remembering way back: Generalized Linear Models Ordinary linear regression What if we want to model a response that is not Gaussian?? We may have experiments.
1 Fighting for fame, scrambling for fortune, where is the end? Great wealth and glorious honor, no more than a night dream. Lasting pleasure, worry-free.
Chapter 8: Using Basic Statistical Procedures “33⅓% of the mice used in the experiment were cured by the test drug; 33⅓% of the test population were unaffected.
Nonparametric Statistics
Unit 32: The Generalized Linear Model
Nonparametric Statistics
BINARY LOGISTIC REGRESSION
A priori violations In the following cases, your data violates the normality and homoskedasticity assumption on a priori grounds: (1) count data  Poisson.
B&A ; and REGRESSION - ANCOVA B&A ; and
Generalized Linear Models
Generalized Linear Models
Generalized Linear Models (GLM) in R
Introduction to logistic regression a.k.a. Varbrul
Quantitative Methods What lies beyond?.
Nonparametric Statistics
What is Regression Analysis?
Quantitative Methods What lies beyond?.
Introduction to Logistic Regression
Presentation transcript:

Dependent Variable Discrete  2 values – binomial  3 or more discrete values – multinomial  Skewed – e.g. Poisson Continuous  Non-normal

Link Function Connection between dependent variable and predictor:  Logit – ln(p/(1-p))  Probit – inverse normal  Other nonlinear connections (exponential, logarithmic, power, etc.)

Function link(y) = a + b1*x1 + b2*x2 + … + bn*xn + e) The link function should connect the (discrete) dependent observation to the linear predictor.  y = inverse-link (a + b1*x1 …)

Link Functions DistributionLink Normal, gamma, PoissonLinear, log, power BinomialLogit, probit MultinomialLog(x1/(1 – x2 - … - xn))

Solution  Requires numeric solution (rather than algebraic for traditional GLM)

Significance  Wald statistic  Likelihood Ratio statistic  Score statistic

Residuals  Pearson residuals – based on observed – predicted values  Deviance residuals – contribution to log likelihood statistic  Leverage  Studentized  Cook’s D

Models  ANOVA  Regression  ANCOVA  More complex linear models

SAS  PROC GENMOD: procedure call  CLASS: categorical (ANOVA) variables  MODEL: dependent= independent

MODEL  Model= dependent  Model = events/trials = (ratio of events divided by number of trials for summarized binomial responses)

Model Options  CORR, COVB: parameter correlations or covariances  DIST= lists the assumed distribution of the dependent variable (see SAS docs)  LINK= specifies the link function. SAS will pick a default for a DIST if you don’t  Type1 (sequential), Type3 (partial), Wald statistics  P (predicted estimates) R (residuals)