1 Fighting for fame, scrambling for fortune, where is the end? Great wealth and glorious honor, no more than a night dream. Lasting pleasure, worry-free.

Slides:



Advertisements
Similar presentations
Chapter 2 Describing Contingency Tables Reported by Liu Qi.
Advertisements

© Department of Statistics 2012 STATS 330 Lecture 32: Slide 1 Stats 330: Lecture 32.
Brief introduction on Logistic Regression
Analysis of Categorical Data Nick Jackson University of Southern California Department of Psychology 10/11/
Logistic Regression Example: Horseshoe Crab Data
1. Analyzing Non-Normal Data with Generalized Linear Models 2010 LISA Short Course Series Sai Wang, Dept. of Statistics.
Proportion Data Harry R. Erwin, PhD School of Computing and Technology University of Sunderland.
Loglinear Models for Contingency Tables. Consider an IxJ contingency table that cross- classifies a multinomial sample of n subjects on two categorical.
Log-Linear Models & Dependent Samples Feng Ye, Xiao Guo, Jing Wang.
Chapter 8 Logistic Regression 1. Introduction Logistic regression extends the ideas of linear regression to the situation where the dependent variable,
Linear statistical models 2008 Binary and binomial responses The response probabilities are modelled as functions of the predictors Link functions: the.
Instructor: K.C. Carriere
Log-linear modeling and missing data A short course Frans Willekens Boulder, July
Statistics 200b. Chapter 5. Chapter 4: inference via likelihood now Chapter 5: applications to particular situations.
Generalised linear models
Log-linear and logistic models Generalised linear model ANOVA revisited Log-linear model: Poisson distribution logistic model: Binomial distribution Deviances.
Log-linear and logistic models
EPI 809/Spring Multiple Logistic Regression.
Linear statistical models 2008 Count data, contingency tables and log-linear models Expected frequency: Log-linear models are linear models of the log.
Linear statistical models 2009 Count data  Contingency tables and log-linear models  Poisson regression.
Log-linear analysis Summary. Focus on data analysis Focus on underlying process Focus on model specification Focus on likelihood approach Focus on ‘complete-data.
Regression Model Building Setting: Possibly a large set of predictor variables (including interactions). Goal: Fit a parsimonious model that explains variation.
Generalized Linear Models
April 6, 2010 Generalized Linear Models 2010 LISA Short Course Series Mark Seiss, Dept. of Statistics.
1 B. The log-rate model Statistical analysis of occurrence-exposure rates.
Logistic Regression In logistic regression the outcome variable is binary, and the purpose of the analysis is to assess the effects of multiple explanatory.
Simple Linear Regression
Chapter 3: Generalized Linear Models 3.1 The Generalization 3.2 Logistic Regression Revisited 3.3 Poisson Regression 1.
Generalized Linear Models All the regression models treated so far have common structure. This structure can be split up into two parts: The random part:
Linear Model. Formal Definition General Linear Model.
November 5, 2008 Logistic and Poisson Regression: Modeling Binary and Count Data LISA Short Course Series Mark Seiss, Dept. of Statistics.
1 The greatest achievement in life is to be able to get up again from failure.
Forecasting Choices. Types of Variable Variable Quantitative Qualitative Continuous Discrete (counting) Ordinal Nominal.
1 GLM I: Introduction to Generalized Linear Models By Curtis Gary Dean Distinguished Professor of Actuarial Science Ball State University By Curtis Gary.
Assessing Binary Outcomes: Logistic Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Generalized Linear Models (GLMs) & Categorical Data Analysis (CDA) in R Hong Tran, April 21, 2015.
STA 216 Generalized Linear Models Meets: 2:50-4:05 T/TH (Old Chem 025) Instructor: David Dunson 219A Old Chemistry, Teaching.
Introduction to logistic regression and Generalized Linear Models July 14, 2011 Introduction to Statistical Measurement and Modeling Karen Bandeen-Roche,
Joyful mood is a meritorious deed that cheers up people around you like the showering of cool spring breeze.
1 STA 517 – Chp4 Introduction to Generalized Linear Models 4.3 GENERALIZED LINEAR MODELS FOR COUNTS  count data - assume a Poisson distribution  counts.
1 STA 617 – Chp10 Models for matched pairs Summary  Describing categorical random variable – chapter 1  Poisson for count data  Binomial for binary.
Generalized Linear Models (GLMs) and Their Applications.
Log-linear Models HRP /03/04 Log-Linear Models for Multi-way Contingency Tables 1. GLM for Poisson-distributed data with log-link (see Agresti.
1 Follow the three R’s: Respect for self, Respect for others and Responsibility for all your actions.
Discrepancy between Data and Fit. Introduction What is Deviance? Deviance for Binary Responses and Proportions Deviance as measure of the goodness of.
Statistics 2: generalized linear models. General linear model: Y ~ a + b 1 * x 1 + … + b n * x n + ε There are many cases when general linear models are.
Remembering way back: Generalized Linear Models Ordinary linear regression What if we want to model a response that is not Gaussian?? We may have experiments.
1 Say good things, think good thoughts, and do good deeds.
Beginning Statistics Table of Contents HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant Systems, Inc.
Dependent Variable Discrete  2 values – binomial  3 or more discrete values – multinomial  Skewed – e.g. Poisson Continuous  Non-normal.
Categorical Data Analysis
Logistic Regression and Odds Ratios Psych DeShon.
Nonparametric Statistics
Week 7: General linear models Overview Questions from last week What are general linear models? Discussion of the 3 articles.
 Naïve Bayes  Data import – Delimited, Fixed, SAS, SPSS, OBDC  Variable creation & transformation  Recode variables  Factor variables  Missing.
Nonparametric Statistics
BINARY LOGISTIC REGRESSION
A priori violations In the following cases, your data violates the normality and homoskedasticity assumption on a priori grounds: (1) count data  Poisson.
Generalized Linear Models
STA 216 Generalized Linear Models
Generalized Linear Models
Introduction to logistic regression a.k.a. Varbrul
Quantitative Methods What lies beyond?.
STA 216 Generalized Linear Models
Nonparametric Statistics
Quantitative Methods What lies beyond?.
Do whatever is needed to finish…
Joyful mood is a meritorious deed that cheers up people around you
MPHIL AdvancedEconometrics
Presentation transcript:

1 Fighting for fame, scrambling for fortune, where is the end? Great wealth and glorious honor, no more than a night dream. Lasting pleasure, worry-free forever, who can attain?

2 Categorical Data Analysis Chapter 4: Introduction to Generalized Linear Models

3 3-Way Tables Response ClinicTreatmentSuccessFailure 1A1812 B 8 2A28 B832 TotalA20 B 40

4 Marginal vs. Conditional Marginal independence: marginal odds ratio =1 Conditional independence: conditional odds ratios =1 The observed effect of X on Y might simply reflect effects of other covariates on both X and Y

5 Generalized Linear Models (GLM) 3 components: Random component (Y1, Y2, …, Yn): n independent observations from a distribution in the exponential family (not necessary to be i.i.d.): for i=1,2,…,n, where a, b, c are all positive functions. e.g. poisson, binomial, exponential, normal

6 Systematic component: the right hand side of the model equation; often a linear combination of explanatory variables: Or in matrix format or X’B

7 Link component: the link between u(=E(Y)) and X’B In a model equation g(u)=X’B, g(.) is called the link function, a monotonic differentiable function –Most common link: Canonical link e.g. normal, binomial, poisson

Canonical Link Normal data: identity link Binary data: logit link Count data: log link 8

Deviance Deviance measures the loss of information from data reduction Saturated model: the most general model which fits each observation –Could be each subject’s response or –Could be each cell frequency 9

10 GLMs for Binary Data Linear probability model (identity link) Logistic regression model (logit link) Probit model (probit link)

11 Example: Snoring vs. Heart Disease Snoring (score) Observed P(H.D.) Linear fitLogit fitProbit fit Never (0) Occasionally (2) Almost daily (4) Daily (5) Note: The fits will change if relative spacings between scores change.

Example: 2x2 Tables 12 Binary covariate X and response Y Logit link GLM:

13 GLMs for Count Data Poisson loglinear model Count data: certain events occur over time, space or alike, e.g. the # of car accidents of a random sample of 100 drivers in 2005 Rate data: count/(time or space or alike), e.g. the car accident rates of a random sample of 100 drivers in 2005 (Sec , p. 385)

Example: Horseshoe Crabs Data: Table 4.3 (p. 127) Y= # of satellites X= carapace width 1.Poisson loglinear model 2.Poisson GLM with identity link 14

15 Inference for GLMs Goodness of fit: –Measure: deviance –Test: Likelihood-Ratio (LR) tests Model comparison: L-R tests Residuals: Pearson and standardized

16 Example: Insecticide vs Beetles dosage# of beetles exposed # of dead beetles ………

17 Types of Models for Statistical Analysis Random component LinkSystematic component ModelChapter NormalIdentityContinuousRegression NormalIdentityCategoricalANOVA NormalIdentityMixedANCOVA BinomialLogitMixedLogistic regression 5, 6 PoissonLogMixedLoglinear8 multinomialGenerali- zed logit MixedMultinomial response 7