Conditional Logistic Regression Epidemiology/Biostats VHM812/802 Winter 2016, Atlantic Veterinary College, PEI Raju Gautam.

Slides:



Advertisements
Similar presentations
M2 Medical Epidemiology
Advertisements

Logistic Regression Psy 524 Ainsworth.
Basic epidemiologic analysis with Stata Biostatistics 212 Lecture 5.
What is Interaction for A Binary Outcome? Chun Li Department of Biostatistics Center for Human Genetics Research September 19, 2007.
SC968: Panel Data Methods for Sociologists Random coefficients models.
Matched designs Need Matched analysis. Incorrect unmatched analysis. cc cc exp,exact Proportion | Exposed Unexposed | Total Exposed
1 BINARY CHOICE MODELS: LOGIT ANALYSIS The linear probability model may make the nonsense predictions that an event will occur with probability greater.
1 BINARY CHOICE MODELS: PROBIT ANALYSIS In the case of probit analysis, the sigmoid function F(Z) giving the probability is the cumulative standardized.
Multilevel Models 4 Sociology 8811, Class 26 Copyright © 2007 by Evan Schofer Do not copy or distribute without permission.
Lecture 17: Regression for Case-control Studies BMTRY 701 Biostatistical Methods II.
Analysis and presentation of Case-control study data Chihaya Koriyama February 14 (Lecture 1)
Logistic Regression Multivariate Analysis. What is a log and an exponent? Log is the power to which a base of 10 must be raised to produce a given number.
1 Logistic Regression EPP 245 Statistical Analysis of Laboratory Data.
In previous lecture, we highlighted 3 shortcomings of the LPM. The most serious one is the unboundedness problem, i.e., the LPM may make the nonsense predictions.
Sociology 601 Class 28: December 8, 2009 Homework 10 Review –polynomials –interaction effects Logistic regressions –log odds as outcome –compared to linear.
BIOST 536 Lecture 14 1 Lecture 14 – Stratified Models Outline  Description of stratified models  Exact and approximate conditional likelihood  Comparison.
BIOST 536 Lecture 3 1 Lecture 3 – Overview of study designs Prospective/retrospective  Prospective cohort study: Subjects followed; data collection in.
BIOST 536 Lecture 9 1 Lecture 9 – Prediction and Association example Low birth weight dataset Consider a prediction model for low birth weight (< 2500.
EPI 809/Spring Multiple Logistic Regression.
BIOST 536 Lecture 12 1 Lecture 12 – Introduction to Matching.
In previous lecture, we dealt with the unboundedness problem of LPM using the logit model. In this lecture, we will consider another alternative, i.e.
Sociology 601 Class 23: November 17, 2009 Homework #8 Review –spurious, intervening, & interactions effects –stata regression commands & output F-tests.
BIOST 536 Lecture 4 1 Lecture 4 – Logistic regression: estimation and confounding Linear model.
BIOST 536 Thompson1 Modeling the association between a binary outcome, Y, and an “exposure”, X Slides are from Research Professor M. Thompson.
BINARY CHOICE MODELS: LOGIT ANALYSIS
Statistics Idiots Guide! Dr. Hamda Qotba, B.Med.Sc, M.D, ABCM.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 10) Slideshow: binary choice logit models Original citation: Dougherty, C. (2012) EC220.
Stratification and Adjustment
Unit 6: Standardization and Methods to Control Confounding.
Analysis of Categorical Data
Methods Workshop (3/10/07) Topic: Event Count Models.
1 The Receiver Operating Characteristic (ROC) Curve EPP 245 Statistical Analysis of Laboratory Data.
1 BINARY CHOICE MODELS: PROBIT ANALYSIS In the case of probit analysis, the sigmoid function is the cumulative standardized normal distribution.
Basic epidemiologic analysis with Stata Biostatistics 212 Lecture 5.
Making a figure with Stata or Excel Biostatistics 212 Lecture 7.
Department of Epidemiology and Public Health Unit of Biostatistics and Computational Sciences Regression models for binary and survival data PD Dr. C.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Analysis of time-stratified case-crossover studies in environmental epidemiology using Stata Aurelio Tobías Spanish Council for Scientific Research (CSIC),
Basic Biostatistics Prof Paul Rheeder Division of Clinical Epidemiology.
1 היחידה לייעוץ סטטיסטי אוניברסיטת חיפה פרופ’ בנימין רייזר פרופ’ דוד פרג’י גב’ אפרת ישכיל.
When and why to use Logistic Regression?  The response variable has to be binary or ordinal.  Predictors can be continuous, discrete, or combinations.
BIOST 536 Lecture 11 1 Lecture 11 – Additional topics in Logistic Regression C-statistic (“concordance statistic”)  Same as Area under the curve (AUC)
Lecture 3 Linear random intercept models. Example: Weight of Guinea Pigs Body weights of 48 pigs in 9 successive weeks of follow-up (Table 3.1 DLZ) The.
Lecture 18 Ordinal and Polytomous Logistic Regression BMTRY 701 Biostatistical Methods II.
BIOST 536 Lecture 1 1 Lecture 1 - Introduction Overview of course  Focus is on binary outcomes  Some ordinal outcomes considered Simple examples Definitions.
Biostat 200 Lecture 8 1. Where are we Types of variables Descriptive statistics and graphs Probability Confidence intervals for means and proportions.
Multiple Regression  Similar to simple regression, but with more than one independent variable R 2 has same interpretation R 2 has same interpretation.
1 Chapter 16 logistic Regression Analysis. 2 Content Logistic regression Conditional logistic regression Application.
Using Propensity Score Matching in Observational Services Research Neal Wallace, Ph.D. Portland State University February
The dangers of an immediate use of model based methods The chronic bronchitis study: bronc: 0= no 1=yes poll: pollution level cig: cigarettes smokes per.
Applied Epidemiologic Analysis - P8400 Fall 2002 Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.
LOGISTIC REGRESSION Binary dependent variable (pass-fail) Odds ratio: p/(1-p) eg. 1/9 means 1 time in 10 pass, 9 times fail Log-odds ratio: y = ln[p/(1-p)]
2006 Chicago Actuarial Association Workshop Predictors of Exceptional Human Longevity Dr. Leonid A. Gavrilov, Ph.D. Dr. Natalia S. Gavrilova, Ph.D. Center.
Exact Logistic Regression
1 Ordinal Models. 2 Estimating gender-specific LLCA with repeated ordinal data Examining the effect of time invariant covariates on class membership The.
THE CHI-SQUARE TEST BACKGROUND AND NEED OF THE TEST Data collected in the field of medicine is often qualitative. --- For example, the presence or absence.
Birthweight (gms) BPDNProp Total BPD (Bronchopulmonary Dysplasia) by birth weight Proportion.
1 BINARY CHOICE MODELS: LOGIT ANALYSIS The linear probability model may make the nonsense predictions that an event will occur with probability greater.
Analysis of matched data Analysis of matched data.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: Multiple, Logistic and Proportional Hazards Regression.
Methods of Presenting and Interpreting Information Class 9.
Logistic Regression APKC – STATS AFAC (2016).
Discussion: Week 4 Phillip Keung.
Lecture 18 Matched Case Control Studies
Introduction to Logistic Regression
Problems with infinite solutions in logistic regression
Case-control studies: statistics
Presentation transcript:

Conditional Logistic Regression Epidemiology/Biostats VHM812/802 Winter 2016, Atlantic Veterinary College, PEI Raju Gautam

Purpose Matched data (i.e Matched case-control design) Eliminate nuisance parameters (i.e parameters we are not interested in) 2

Logistic regression recap 3

Conditional likelihood 4

Conditional inference Inference about β in two ways – Exact (i.e exact logistic regression), based on permutation distribution of sufficient statistics – Asymptotic (conditional logistic regression), based on maximizing the conditional likelihood (cMLE): analysis of matched or stratified data 5

Conditional logistic regression Matched case-control study design – Types of matching: one (1:1) or several (1:m) controls matched to each case – Exposure variable recorded for cases and controls Purpose of matching: – Make cases and controls equal on known confounders – Emphasize difference on exposure variable – Commonly used matching variables: age, sex, location, time Comparison within (not across) matched sets 6

Conditional logistic… 7

8

Example Dataset SAL_OUTBRK (VER) – Subset of real dataset from S. typhimurium outbreak (Denmark 1996) – 39 cases (diseased persons), 73 controls and 17 variables – Matched for age, sex and residence (1-2 per case) – Exposure variables obtained by interviews Study aim – Determine the source of Salmonella outbreak 9

Description of Data VariableDescriptionValues match_grpmatched set idnominal casecontrolcase-control status0/1 (control/case) ageage (years) gender 0/1 (male/female) eatbeefate beef in prev. 72 hours0/1 (no/yes) eatporkate pork in prev. 72 hours0/1 (no/yes) eatpoulage poultry in prev. 72 hours0/1 (no/yes) eateggsage eggs in prev. 72 hours0/1 (no/yes) slt_aate pork from sl.house A0/1 (no/yes) dlr_aage pork from wholesaler A0/1 (no/yes) ……… Variableeatporkeatbeefslt_adlr_a Status case control Sample data: Matched set # 23 10

Simple descriptive methods for matched study design Dichotomous (and categorical) exposure variable – Mantel-Haenszel statistic (1:1 matching ~ McNemar’s test) Continuous exposure variables – Paired t-test or equivalent non-parametric test – If 1:m matching use average among controls 11

Simple stratified analysis with STATA Matching group | OR [95% Conf. Interval] Crude | M-H combined | Test of homogeneity (Tarone) chi2(38) = Pr>chi2 = Test that combined OR = 1: Mantel-Haenszel chi2(1) = 9.48 Pr>chi2 = M-H estimate cannot be generalized to data with many covariates, whereas conditional likelihood permits that. Exposure variable (binary): slt_a 12

Conditional logistic in STATA Use clogit command – clogit casecontrol slt_a, group(match_grp) or Conditional (fixed-effects) logistic regression Number of obs = 112 LR chi2(1) = Prob > chi2 = Log likelihood = Pseudo R2 = casecontrol | Odds Ratio Std. Err. z P>|z| [95% Conf. Interval] slt_a |

Compare with OLR.logit casecontrol slt_a, or Logistic regression Number of obs = 112 LR chi2(1) = 8.27 Prob > chi2 = Log likelihood = Pseudo R2 = casecontrol | Odds Ratio Std. Err. z P>|z| [95% Conf. Interval] slt_a | _cons | clogit casecontrol slt_a, group(match_grp) or Conditional (fixed-effects) logistic regression Number of obs = 112 LR chi2(1) = Prob > chi2 = Log likelihood = Pseudo R2 = casecontrol | Odds Ratio Std. Err. z P>|z| [95% Conf. Interval] slt_a |

Model building Similar to OLS – Perform univariable/bivariable analysis – Identify important variables – Build model using stepwise forward selection Let us consider in the “sal_outbreak” data – slt_a (P=0.004), dlr_a (P=0.02) and eateggs (P=0.17) are important – Use stepwise forward selection for model building using these variables 15

Model building… Conditional (fixed-effects) logistic regression Number of obs = 83 LR chi2(2) = Prob > chi2 = Log likelihood = Pseudo R2 = casecontrol | Odds Ratio Std. Err. z P>|z| [95% Conf. Interval] slt_a | dlr_a | Add dlr_a to the original model with slt_a Try adding interaction effect Here the two variables are highly collinear, so we omit Decide whether dlr_a should stay or not Add the third variable and so on…, until you have a final model In our case, slt_a remains as the only predictor 16

Model Diagnostics Model evaluation by residuals/diagnostics (CLR specific) – With predict (STATA 13) – With clfit (add on) 17