SJS SDI_131 Design of Statistical Investigations Stephen Senn 13 Cohort Studies.

Presentation transcript:

SJS SDI_131 Design of Statistical Investigations Stephen Senn 13 Cohort Studies

SJS SDI_132 Two Major Types of Epidemiological Observational Study
This section is partly based on Rothman.
Cohort study
–Sometimes referred to as a prospective study
–But this term is best avoided, since some cohort studies are not prospective
Case-control study
–Sometimes referred to as a retrospective study
–But this term is also best avoided, since some cohort studies are retrospective

SJS SDI_133 Cohorts
A cohort was the tenth part of a legion of Roman soldiers.
Epidemiologists use the term to mean a group of individuals followed over time
–Analogy of a body of marching men
In demography it is sometimes used to distinguish generational from cross-sectional approaches.

SJS SDI_134 Cohort Study
The epidemiological equivalent of a clinical trial
Subjects are compared according to exposure
Followed up for outcome
However, unlike a clinical trial, the exposure is not assigned

SJS SDI_135 Example Obs_2 John Snow & Cholera, London 1854
Two different companies supplied water to London
–Lambeth: 26,107 houses, 14 houses with fatal attacks
–Southwark and Vauxhall: 40,046 houses, 286 houses with fatal attacks
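The comparison between the two companies can be made explicit as a risk per 10,000 houses and a risk ratio. A minimal Python sketch, using only the house counts quoted on the slide; the function name `risk_per_10000` is illustrative and does not come from the slides:

```python
# Houses supplied and houses with fatal attacks, as quoted on the slide.
lambeth_houses, lambeth_fatal = 26_107, 14
sv_houses, sv_fatal = 40_046, 286  # Southwark and Vauxhall

def risk_per_10000(events, total):
    """Houses with a fatal attack per 10,000 houses supplied."""
    return 10_000 * events / total

lambeth_risk = risk_per_10000(lambeth_fatal, lambeth_houses)
sv_risk = risk_per_10000(sv_fatal, sv_houses)

# Ratio of the two risks: Southwark and Vauxhall relative to Lambeth.
risk_ratio = sv_risk / lambeth_risk

print(f"Lambeth: {lambeth_risk:.1f} per 10,000 houses")
print(f"Southwark and Vauxhall: {sv_risk:.1f} per 10,000 houses")
print(f"Risk ratio: {risk_ratio:.1f}")
```

On these counts the risk is roughly 5 versus 71 per 10,000 houses, a ratio of about 13. This is a crude comparison and says nothing about how the two groups of houses may have differed in other respects.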

SJS SDI_136 Obs_2 Notes
Sampling is by exposure
–Lambeth company versus Southwark & Vauxhall company
The study is retrospective, however
–Snow obtained data once the outbreak was known

SJS SDI_137 Population at Risk
Population chosen should be capable (in principle) of suffering the event of interest
Standard requirement that the population at risk must be free of the disease of interest at outset
–Argument is that you cannot develop a disease if you already have it
WARNING: Consider how this agrees with our notions of causality.

SJS SDI_138 Closed and Open Cohorts
Closed cohort
–Membership is defined at outset
–Numbers can only get smaller as study progresses
Open cohort (dynamic cohort)
–Can take on new members as study progresses
–Usually defined geographically

SJS SDI_139 Confounding
Confounding is the major problem of observational studies
We fear that the presence of hidden variables (confounders), rather than the variable under study, may explain the results
In the extreme case we have a complete reversal, known as Simpson's Paradox

SJS SDI_1310 Simpson's Paradox Obs_3 Berkeley Example
Case of graduate admissions to the University of California, Berkeley in the early 1970s
Data by sex show that a lower proportion of females are admitted (second line of each cell: proportion within sex)

status  |sex
        |Female |Male   |RowTotl|
accept  | 628   |1198   |1826   |
        |0.34   |0.45   |       |
reject  |1207   |1493   |2700   |
        |0.66   |0.55   |       |
ColTotl |1835   |2691   |4526   |

SJS SDI_1311 Logistic Regression
Call: glm(formula = p.accepted ~ sex, family = binomial, weights = n.applied)
Coefficients: Value Std. Error t value
(Intercept)
sex
Males have a significantly higher admission rate
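The conclusion of this fit can be checked by hand from the Berkeley admissions counts (628 of 1835 females and 1198 of 2691 males accepted). The sketch below is in Python rather than the S-Plus of the slides, and it relies on a standard fact: with a single binary covariate the logistic model is saturated, so the fitted sex effect corresponds to the sample log odds ratio. The coding of `sex` in the slide's fit is not shown, so the printed coefficient there may be a rescaled version of this quantity; the variable names here are illustrative.

```python
import math

# Counts from the Berkeley admissions table (accepted / rejected by sex).
accept = {"Female": 628, "Male": 1198}
reject = {"Female": 1207, "Male": 1493}

def log_odds(sex):
    """Log odds of admission for one sex."""
    return math.log(accept[sex] / reject[sex])

# Sample log odds ratio, male versus female.
log_or = log_odds("Male") - log_odds("Female")

# Large-sample (Wald) standard error of a log odds ratio from a 2x2 table:
# square root of the sum of reciprocal cell counts.
se = math.sqrt(sum(1 / n for n in (*accept.values(), *reject.values())))

odds_ratio = math.exp(log_or)
z = log_or / se
ci = (math.exp(log_or - 1.96 * se), math.exp(log_or + 1.96 * se))

print(f"OR (male vs female) = {odds_ratio:.2f}, z = {z:.1f}")
print(f"95% CI = ({ci[0]:.2f}, {ci[1]:.2f})")
```

The odds ratio is about 1.5 with an interval well above 1, which agrees with the slide's reading that, ignoring faculty, males have the higher admission rate.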

SJS SDI_1313 Logistic Regression 2
Call: glm(formula = p.accepted ~ sex + faculty, family = binomial, weights = n.applied)
Coefficients: Value Std. Error t value
(Intercept)
sex
faculty
faculty
faculty
faculty
faculty
Males have a significantly lower admission rate

SJS SDI_1314 A Paradox?
If we do not take faculty into account, admission is more difficult for females
If we allow for faculty, the reverse is the case
In the extreme case (not quite here), when the trend in each and every stratum is the opposite of the overall trend, we have Simpson's paradox.
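The extreme case can be written in terms of conditional probabilities. In this sketch the symbols are illustrative and not taken from the slides: $A$ denotes admission, $F$ and $M$ denote female and male applicants, and $k$ indexes faculties.

```latex
% Within every faculty k, females do at least as well as males:
P(A \mid F, k) \ge P(A \mid M, k) \quad \text{for all } k,
% and yet the comparison ignoring faculty goes the other way:
P(A \mid F) < P(A \mid M).
% Both can hold because each overall rate is a weighted average of the
% faculty-specific rates, with weights that differ between the sexes:
P(A \mid F) = \sum_k P(A \mid F, k)\, P(k \mid F), \qquad
P(A \mid M) = \sum_k P(A \mid M, k)\, P(k \mid M).
```

If one sex applies disproportionately to faculties with low admission rates, its weights $P(k \mid \cdot)$ fall mostly on the small terms, so its overall rate can be lower even when it is favoured within every faculty.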

SJS SDI_1315 Simpson's Paradox?
Given some information we come to one conclusion
Given further information we come to the opposite conclusion
This is worrying because, given yet further information, we might restore the original conclusion.
But is this a paradox? Consider the following story.

SJS SDI_1316 Reversal of Opinion: An Illustrative Story
In the Welsh legend, the returning Llewelyn is met by his hound Gelert at the castle door. Its muzzle is flecked with blood. In the nursery the scene is one of savage disorder and the infant son is missing. Only once the hound has been put to the sword is the child heard to cry and discovered safe and sound by the body of a dead wolf. The additional evidence reverses everything: Llewelyn and not his hound is revealed as a faithless killer.
(From chapter 1 of Senn, SJ, Dicing with Death)
So reversal of opinion is not a purely statistical phenomenon. It is a human one, we accept. So why do we regard this as being a paradox?

SJS SDI_1317 Obs_4 Poole Diabetic Cohort (Julious and Mullee)

SJS SDI_1319
Suppose that the numbers in the table remain the same but refer now to a clinical trial in some life-threatening condition, and we replace "Type of Diabetes" by "Treatment", "non-insulin dependent" by "A", "insulin-dependent" by "B" and "Subjects" by "Patients". An incautious interpretation of the table would then lead us to a truly paradoxical conclusion. Treating young patients with A rather than B is beneficial (or at least not harmful – the numbers of deaths, 0 in the one case and 1 in the other, are very small). Treating older patients with A rather than B is beneficial. However, the overall effect of switching patients from B to A would be to increase deaths overall.
From Dicing with Death

SJS SDI_1320
In his brilliant book, Causality (1), Judea Pearl gives Simpson's paradox pride of place. Many statisticians have taken Simpson's paradox to mean that judgements of causality based on observational studies are ultimately doomed. We could never guarantee that further refined observation would not lead to a change in opinion. Pearl points out, however, that we are capable of distinguishing causality from association because there is a difference between seeing and doing.
In the case of the trial above we may have seen that the trial is badly imbalanced, but we know that the treatment given cannot affect the age of the patient at baseline, that is to say before the trial starts. However, age very plausibly will affect outcome, and so it is a factor that should be taken account of when judging the effect of treatment. If in future we change a patient's treatment we will not (at the moment we change it) change their age. So there is no paradox. We can improve the survival of both the young and the old and will not, in acting in this way, adversely affect the survival of the population as a whole.
From Dicing with Death
(1) Pearl, J. (2000) Causality. Cambridge University Press, New York.

SJS SDI_1321 Lessons
Confounders can be a problem for cohort studies
We may need to measure many potential confounders
We will almost certainly need to include them in our models
Interpretation may have to be cautious

SJS SDI_1322 Questions
A survey of women in Wickham, England in , with 20-year follow-up gave the results recorded in the following slide.
Do the results show smoking to be dangerous?
What explanation can you think of for the result?
What further data would you like to see?

SJS SDI_1323 Obs_5 Wickham Wonders

status  |smoker?
        |no     |yes    |RowTotl|
alive   |502    |443    |945    |
        |0.69   |0.76   |       |
dead    |230    |139    |369    |
        |0.31   |0.24   |       |
ColTotl |732    |582    |1314   |
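The crude comparison in the Wickham table (230 of 732 non-smokers and 139 of 582 smokers dead at follow-up) can be summarised as a risk ratio. A minimal Python sketch; the names are illustrative:

```python
# Counts from the Wickham table: 20-year status by smoking status.
alive = {"no": 502, "yes": 443}
dead = {"no": 230, "yes": 139}

def risk_of_death(smoker):
    """Crude proportion dead at follow-up for one smoking group."""
    return dead[smoker] / (dead[smoker] + alive[smoker])

risk_non = risk_of_death("no")
risk_smk = risk_of_death("yes")

# Crude risk ratio, smokers relative to non-smokers.
crude_rr = risk_smk / risk_non

print(f"Non-smokers: {risk_non:.2f}, smokers: {risk_smk:.2f}")
print(f"Crude risk ratio (smokers vs non-smokers): {crude_rr:.2f}")
```

The crude ratio is below 1, which is what makes the result surprising. It takes no account of any third variable, so by itself it cannot answer whether smoking is dangerous.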

SJS SDI_1324 More Questions
What sort of interaction is described?
What explanations can you think of?
What further information would you like to have?
Look at the study by Best et al, described on the next slide

SJS SDI_1325 Obs_6 Best, et al
1. The relationship between blood cyclosporin concentration (CyACb) and a patient's risk of organ rejection following heart-lung (HL) transplantation was investigated.
2. Longitudinal data were collected for 90 days post-operation for 31 HL transplant recipients. Following exploratory analysis, a multiple logistic regression model with a binary outcome variable representing presence or absence of lung rejection (as defined on biopsy findings and/or intention to treat) in the next 5 days was fitted to the data.
3. A significant interaction between time post-transplant and CyACb was found. During weeks 1-3, the relative risk (RR) of rejection per unit increase in log(e) (5-day mean CyACb) was reduced: RR = 0.29, 95% confidence interval (CI) = (0.12, 0.72). After 3 post-operative weeks, this trend was reversed: RR = 1.61, 95% CI = (0.96, 2.70).
Best, Trull, Tan, Hue, Spiegelhalter, Gore, Wallwork, Brit J Clin Pharm, 1992
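The two reported relative risks can be put on the log scale to see what the interaction amounts to. The sketch below assumes the intervals are symmetric on the log scale (Wald-type), which the abstract does not state; the function and variable names are illustrative.

```python
import math

def log_rr_and_se(rr, lower, upper, z=1.96):
    """Log relative risk and the standard error implied by a 95% CI,
    assuming the interval is symmetric on the log scale."""
    se = (math.log(upper) - math.log(lower)) / (2 * z)
    return math.log(rr), se

early_log_rr, early_se = log_rr_and_se(0.29, 0.12, 0.72)  # weeks 1-3
late_log_rr, late_se = log_rr_and_se(1.61, 0.96, 2.70)    # after week 3

# Ratio of the two relative risks: how much the per-unit effect of
# log(CyACb) changes between the two periods.
ratio_of_rrs = math.exp(late_log_rr - early_log_rr)

# A 95% interval excludes RR = 1 exactly when |log RR| > 1.96 * SE.
early_excludes_1 = abs(early_log_rr) > 1.96 * early_se
late_excludes_1 = abs(late_log_rr) > 1.96 * late_se

print(f"Ratio of RRs (late / early): {ratio_of_rrs:.2f}")
print(f"Early CI excludes 1: {early_excludes_1}")
print(f"Late CI excludes 1: {late_excludes_1}")
```

The early interval excludes 1 while the late one does not, so the evidence for a protective association in weeks 1-3 is stronger than the evidence for a harmful one afterwards. The interaction concerns the change in the effect between the two periods, not either effect taken alone.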