Cohort studies: Statistical analysis Jan Wohlfahrt Department of Epidemiology Research Statens Serum Institut.

Slides:



Advertisements
Similar presentations
The analysis of survival data in nephrology. Basic concepts and methods of Cox regression Paul C. van Dijk 1-2, Kitty J. Jager 1, Aeilko H. Zwinderman.
Advertisements

Survival of Births During Preceding Year Method (SBPY) to Estimate q(1) By Salih Hamza Abu-El-Yamen Central Bureau of Statistics - Sudan.
Line Efficiency     Percentage Month Today’s Date
April 25 Exam April 27 (bring calculator with exp) Cox-Regression
Some Terms Y =  o +  1 X Regression of Y on X Regress Y on X X called independent variable or predictor variable or covariate or factor Which factors.
1 Experimental design and analyses of experimental data Lesson 6 Logistic regression Generalized Linear Models (GENMOD)
Chapter 11 Survival Analysis Part 3. 2 Considering Interactions Adapted from "Anderson" leukemia data as presented in Survival Analysis: A Self-Learning.
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
EPI 809/Spring Multiple Logistic Regression.
1 Modeling Ordinal Associations Section 9.4 Roanna Gee.
Linear statistical models 2008 Count data, contingency tables and log-linear models Expected frequency: Log-linear models are linear models of the log.
Linear statistical models 2009 Count data  Contingency tables and log-linear models  Poisson regression.
Log-linear analysis Summary. Focus on data analysis Focus on underlying process Focus on model specification Focus on likelihood approach Focus on ‘complete-data.
1 B. The log-rate model Statistical analysis of occurrence-exposure rates.
Assessing Survival: Cox Proportional Hazards Model Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
Survival Analysis: From Square One to Square Two
Logistic Regression II Simple 2x2 Table (courtesy Hosmer and Lemeshow) Exposure=1Exposure=0 Disease = 1 Disease = 0.
7 Regression & Correlation: Rates Basic Medical Statistics Course October 2010 W. Heemsbergen.
Logistic Regression III: Advanced topics Conditional Logistic Regression for Matched Data Conditional Logistic Regression for Matched Data.
Measuring Associations Between Exposure and Outcomes.
1 Why are more children diagnosed with ADHD and ASD? Erik Parner Sektion for Biostatistics Aarhus University A change in registration/recognition or a.
1 Journal Club Alcohol, Other Drugs, and Health: Current Evidence January–February 2014.
Survival Data John Kornak March 29, 2011
Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 4: Taking Risks and Playing the Odds: OR vs.
Economics and Statistics Administration U.S. CENSUS BUREAU U.S. Department of Commerce Research on Estimating International Migration of the Foreign-Born.
April 11 Logistic Regression –Modeling interactions –Analysis of case-control studies –Data presentation.
23-1 Analysis of Covariance (Chapter 16) A procedure for comparing treatment means that incorporates information on a quantitative explanatory variable,
Assessing Survival: Cox Proportional Hazards Model
Design and Analysis of Clinical Study 11. Analysis of Cohort Study Dr. Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia.
April 6 Logistic Regression –Estimating probability based on logistic model –Testing differences among multiple groups –Assumptions for model.
2 December 2004PubH8420: Parametric Regression Models Slide 1 Applications - SAS Parametric Regression in SAS –PROC LIFEREG –PROC GENMOD –PROC LOGISTIC.
HSRP 734: Advanced Statistical Methods July 17, 2008.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Assessing Binary Outcomes: Logistic Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
1 EPI 5240: Introduction to Epidemiology Measures used to compare groups October 5, 2009 Dr. N. Birkett, Department of Epidemiology & Community Medicine,
Describing the risk of an event and identifying risk factors Caroline Sabin Professor of Medical Statistics and Epidemiology, Research Department of Infection.
1 STA 617 – Chp9 Loglinear/Logit Models 9.7 Poisson regressions for rates  In Section 4.3 we introduced Poisson regression for modeling counts. When outcomes.
Applied Statistics Week 4 Exercise 3 Tick bites and suspicion of Borrelia Mihaela Frincu
Log-linear Models HRP /03/04 Log-Linear Models for Multi-way Contingency Tables 1. GLM for Poisson-distributed data with log-link (see Agresti.
Measures of Disease Frequency
ProjectImpactResourcesDeadlineResourcesDeadline Forecast Plan Time Resources Risk 001xx 002xx 003xx 004xx 005xx 006xx 007xx TotalXX Example 1: Portfolio.
Applied Epidemiologic Analysis - P8400 Fall 2002 Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.
Analysis of matched data Analysis of matched data.
Methodology of estimating the annual number of usual resident population in Latvia Baiba Zukula Deputy Director of Social Statistics Department Central.
EPI 5344: Survival Analysis in Epidemiology Week 6 Dr. N. Birkett, School of Epidemiology, Public Health & Preventive Medicine, University of Ottawa 03/2016.
Transportation-related Injuries among US Immigrants: Findings from National Health Interview Survey.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: Multiple, Logistic and Proportional Hazards Regression.
03/20161 EPI 5344: Survival Analysis in Epidemiology Estimating S(t) from Cox models March 29, 2016 Dr. N. Birkett, School of Epidemiology, Public Health.
S T A T E N S S E R U M I N S T I T U T Michael Howitz 1, Jacob Brunbjerg Simonsen 2, Tyra Grove Krause 1, John Robbins 3, Rachel Scheerson.
Measures of disease frequency Simon Thornley. Measures of Effect and Disease Frequency Aims – To define and describe the uses of common epidemiological.
Lesson 10 - Topics SAS Procedures for Standard Statistical Tests and Analyses Programs 19 and 20 LSB 8:16-17.
Jan 2016 Solar Lunar Data.
Average Monthly Temperature and Rainfall
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Gantt Chart Enter Year Here Activities Jan Feb Mar Apr May Jun Jul Aug
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Plasma 25-Hydroxyvitamin D and Risk of Non-Melanoma and Melanoma Skin Cancer: A Prospective Cohort Study  Shoaib Afzal, Børge G. Nordestgaard, Stig E.
Text for section 1 1 Text for section 2 2 Text for section 3 3
Text for section 1 1 Text for section 2 2 Text for section 3 3
Text for section 1 1 Text for section 2 2 Text for section 3 3
Births as per Civil Registration System,
Text for section 1 1 Text for section 2 2 Text for section 3 3
Text for section 1 1 Text for section 2 2 Text for section 3 3
Text for section 1 1 Text for section 2 2 Text for section 3 3
Text for section 1 1 Text for section 2 2 Text for section 3 3
Text for section 1 1 Text for section 2 2 Text for section 3 3
Text for section 1 1 Text for section 2 2 Text for section 3 3
Text for section 1 1 Text for section 2 2 Text for section 3 3
Summary of Measures and Design
Presentation transcript:

Cohort studies: Statistical analysis Jan Wohlfahrt Department of Epidemiology Research Statens Serum Institut

Contents 1.A research question and a wrong answer 2.What kind of data is needed 3.How to analyse data 4.Confounder adjustment 5.Poisson regression 6.Cox regression Danish Epidemiology Science Centre, Copenhagen, Denmark

1.1. A research question Do MMR vaccination increase the risk of autistic disorder ? Danish Epidemiology Science Centre, Copenhagen, Denmark

1.1. Material All children born 1991 to 98 ( children). Registerbased information on MMR vacn. ( ) and autistic disorder (412 cases). Danish Epidemiology Science Centre, Copenhagen, Denmark Inform. on autisme: Danish Psychiatric Central Register Danish Civil Registration System Inform. on MMR Danish National Board of Health Cohort

1.2. A wrong answer I Danish Epidemiology Science Centre, Copenhagen, Denmark -autism+autism - MMR vacn (0.064%) + MMR vacn (0.079%) Relative risk = 0.079/0.064= What is the proportion of children with autism in vacn and non-vacn in the cohort before end of 2000(end of follow-up)?

1.2. A wrong answer II Danish Epidemiology Science Centre, Copenhagen, Denmark The simple comparison of proportion is not correct, because: 1)autism may be diagnosed before MMR 2)no age-adjustment, time under risk not taken into account, Conclusion: Compare person-time under risk, not the number of persons under risk.

2. What kind of data is needed

2.1. Information on time Danish Epidemiology Science Centre, Copenhagen, Denmark Time of study-entrance (1yr birthdate) Time of status-change (date of vaccination) Time of outcome (date of autism) Time of study-exit (date of autism, death, emigration, disappearance, end of study)

2.2 Datalines Who1yr birth dateVacn.autismdeath/emig. 111sep199504apr199717oct dec jan jan jul199304nov199501jan nov jun199703apr apr may nov1995. …..……….. ……………….. more than datalines. Not before end of 2000

2.3 Data as livelines

3. How to analyse data

3.1 Cox vs. Poisson regresssion Poisson regression in large datasets with time-dependent variables Cox regression in small datasets

3.2 Livelines

3.3 Contribution of pyrs Vacn. ??Person yearsAutisme ?? 1 No1.56 yrNo 1Yes0.54 yrYes 2No5.11 yrNo yrNo 4Yes2.16 yrYes 5No0.13 yrNo yrNo 6Yes2.03 yrNo yrYes

3.4 Contribution of pyrs Vacn. ??Person yearsAutisme ?? 1 No1.56 yrNo 1Yes0.54 yrYes 2No5.11 yrNo yrNo 4Yes2.16 yrYes 5No0.13 yrNo yrNo 6Yes2.03 yrNo yrYes

3.5 Data reduction Vacn. ??casesperson years (pyrs) - vacn = vacn =4.73

3.6 Rate ratio calculation Vacn.casesperson yearsRate (per ) - vacn vacn (Incidence) rate = number of new autistic cases per year = cases/pyrs Rate ratio = RR +vacn vs –vacn = rate +vacn /rate -vacn = 1.40

4. Confounder Adjustment

4.1 The lexis-diagram

4.2 Person-years by age and period Nr.Vacn.AgePeriodPyrsAutism … No No No No No No No No No Yes …..

4.3 Person-years by age and period (9 ages) x (9 periods) x (two vacn.) =162 groups e.g. Age period vacn pyrs cases yes

4.4 Relative rates by age and period ageperiodvacn.casespyrs 1 Rate 2 RR vacn vacn vacn vacn vacn05-- +vacn vacn vacn in thousands, 2 per yr

5. Poisson Regression

5.1 Regression analysis of the rates log(rate) = const + a  I(vacn) + b  I(5-9) + c  I(96-00) I(vacn) = 1 if vacn, 0 otherwise I(5-9) = 1 if 5-9 years, 0 otherwise I(96-00) = 1 in period , 0 otherwise For non-vacn. children in 1997 aged 6 log(rate) is modelled by: const+b+c.

5.2 Log-linear Poisson regression (I) log(rate) = log((nr of cases)/pyrs) = log(nr of cases) - log(pyrs) i.e. log(nr of cases) = log(pyrs) + log(rate) log(rate) = const + a  I(vacn) + b  I(5-9) + c  I(96-00) log(nr of cases) = log(pyrs) + const + a  I(vacn) + b  I(5-9) + c  I(96-00)

5.3 Log-linear Poisson regression (II) log(nr of cases) = log(pyrs) + const + a  I(vacn) + b  I(5-9) + c  I(96-00) The number of case is Poisson-distributed. log of the number of cases is modelled with a linear- function log(pyrs) is considered known for every cell and is called an offset

5.4 Parameters and rate ratios log(rate) = k + a  I(vacn) + b  I(5-9) + c  I(96-00) rate = exp(k + a  I(vacn) + b  I(5-9) + c  I(96-00)) = exp(k)  exp(a  I(vacn)  exp(b  I(5-9))  exp(c  I(96-00)). For children 5-9 yr in the period : RR +vacn vs -vacn = rate +vacn /rate -vacn = (exp(k)  exp(a)  exp(b)  exp(c)) (exp(k)  exp(b)  exp(c)) = exp(a)

5.5 A more complicated model log(rate) = k + a  I(vacn) + b 1  I(1yr) + b 2  I(2yr) + b 3  I(3yr) + b 4  I(4yr) + b 5  I(5yr) + b 6  I(6yr) + b 7  I(7yr) + b 8  I(8yr) + c 1  I(92-93) + c 2  I(94) + c 3  I(95) + c 4  I(96) + c 5  I(97) + c 6  I(98) + c 7  I(99) + with non-vacn as the vacn-reference, age=9yr as the age- reference, and period=2000 as the period-reference.

5.6 SAS-dataset to Poisson regression data mmrdata; input age period vacn cases pyrs; logpyrs=log(pyrs); datalines; ; run;

5.7 SAS-procedure to Poisson regression proc genmod data=mmrdata; class age period; model cases=age period vacn/ dist=poisson link=log offset= logpyrs ; run;

5.8 SAS-output Parameter DF Estimate Std Err ChiSquare Pr>Chi INTERCEPT AGE AGE AGE AGE AGE AGE AGE AGE AGE PERIOD PERIOD PERIOD PERIOD PERIOD PERIOD PERIOD PERIOD VACN VACN

5.9 Confidence-interval RR +vacn vs –vacn = exp( ) = 0.89 Confidence-interval: RR lower = exp(estimate  StdErr) RR upper = exp(estimate  StdErr) RR +vacn vs -vacn = 0.89 ( )

X.X Time since vaccination 5.9 years0.8 years - vacn0.8yr + vacn5.9yr -vacn0.8 yr V:<1 yr1 yr V:1-2 yr2 yr V:3-4 yr2 yr V:5+ yr0.9 yr

6. Cox Regression

6.1 Cox regression

log(rate) = k + a  I(vacn) + b 1  I(1yr) + b 2  I(2yr) + b 3  I(3yr) b 4  I(4yr) + b 5  I(5yr) + b 6  I(6yr) + b 7  I(7yr) + b 8  I(8yr) + c 1  I(92-93) + c 2  I(94) + c 3  I(95) + c 4  I(96) + c 5  I(97) + c 6  I(98) + c 7  I(99) Cox regression (age)

6.3 Live-lines

6.4 Data to Cox-regression data coxdata; intime vactime auttime othtime date7.; datalines; 11sep95 04apr97 17oct97. 13dec jan00 27jan jul93 04nov95 01jan98. 15nov jun97 03apr99. 15apr01 03may92. 06nov run;

6.5 Cox SAS-program data coxdata2; set coxdata; outtime=min(auttime,othtime,"31dec2000"d); time=(outtime-intime); if auttime=outtime then status=1; else status=0; run; proc phreg; model time*status(0)=vacn; if (vactime=. or time<(vactime-intime)) then vacn=0; else vacn=1; run;