Essentials of survival analysis How to practice evidence based oncology European School of Oncology July 2004 Antwerp, Belgium Dr. Iztok Hozo Professor.

Slides:



Advertisements
Similar presentations
Residuals Residuals are used to investigate the lack of fit of a model to a given subject. For Cox regression, there’s no easy analog to the usual “observed.
Advertisements

Surviving Survival Analysis
Survival Analysis. Key variable = time until some event time from treatment to death time for a fracture to heal time from surgery to relapse.
Survival Analysis In many medical studies, the primary endpoint is time until an event occurs (e.g. death, remission) Data are typically subject to censoring.
If we use a logistic model, we do not have the problem of suggesting risks greater than 1 or less than 0 for some values of X: E[1{outcome = 1} ] = exp(a+bX)/
Survival Analysis-1 In Survival Analysis the outcome of interest is time to an event In Survival Analysis the outcome of interest is time to an event The.
KRUSKAL-WALIS ANOVA BY RANK (Nonparametric test)
Survival Analysis. Statistical methods for analyzing longitudinal data on the occurrence of events. Events may include death, injury, onset of illness,
Introduction to Survival Analysis October 19, 2004 Brian F. Gage, MD, MSc with thanks to Bing Ho, MD, MPH Division of General Medical Sciences.
Departments of Medicine and Biostatistics
April 25 Exam April 27 (bring calculator with exp) Cox-Regression
بسم الله الرحمن الرحیم. Generally,survival analysis is a collection of statistical procedures for data analysis for which the outcome variable of.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Intermediate methods in observational epidemiology 2008 Instructor: Moyses Szklo Measures of Disease Frequency.
Main Points to be Covered
Chapter 11 Survival Analysis Part 3. 2 Considering Interactions Adapted from "Anderson" leukemia data as presented in Survival Analysis: A Self-Learning.
Biostatistics in Research Practice Time to event data Martin Bland Professor of Health Statistics University of York
Introduction to Survival Analysis
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
Introduction to Survival Analysis Seminar in Statistics 1 Presented by: Stefan Bauer, Stephan Hemri
Main Points to be Covered Cumulative incidence using life table method Difference between cumulative incidence based on proportion of persons at risk and.
Today Concepts underlying inferential statistics
Measures of disease frequency (I). MEASURES OF DISEASE FREQUENCY Absolute measures of disease frequency: –Incidence –Prevalence –Odds Measures of association:
BIOST 536 Lecture 4 1 Lecture 4 – Logistic regression: estimation and confounding Linear model.
Assessing Survival: Cox Proportional Hazards Model Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Cox Proportional Hazards Regression Model Mai Zhou Department of Statistics University of Kentucky.
Survival Analysis A Brief Introduction Survival Function, Hazard Function In many medical studies, the primary endpoint is time until an event.
Analysis of Complex Survey Data
Survival Analysis: From Square One to Square Two
Survival analysis Brian Healy, PhD. Previous classes Regression Regression –Linear regression –Multiple regression –Logistic regression.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 10: Survival Curves Marshall University Genomics Core.
Introduction to Survival Analysis August 3 and 5, 2004.
1 Survival Analysis Biomedical Applications Halifax SAS User Group April 29/2011.
NASSER DAVARZANI DEPARTMENT OF KNOWLEDGE ENGINEERING MAASTRICHT UNIVERSITY, 6200 MAASTRICHT, THE NETHERLANDS 22 OCTOBER 2012 Introduction to Survival Analysis.
HSRP 734: Advanced Statistical Methods July 10, 2008.
Dr Laura Bonnett Department of Biostatistics. UNDERSTANDING SURVIVAL ANALYSIS.
Statistical approaches to analyse interval-censored data in a confirmatory trial Margareta Puu, AstraZeneca Mölndal 26 April 2006.
1 Introduction to medical survival analysis John Pearson Biostatistics consultant University of Otago Canterbury 7 October 2008.
Assessing Survival: Cox Proportional Hazards Model
Design and Analysis of Clinical Study 11. Analysis of Cohort Study Dr. Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia.
INTRODUCTION TO SURVIVAL ANALYSIS
01/20151 EPI 5344: Survival Analysis in Epidemiology Survival curve comparison (non-regression methods) March 3, 2015 Dr. N. Birkett, School of Epidemiology,
Linear correlation and linear regression + summary of tests
HSRP 734: Advanced Statistical Methods July 17, 2008.
Introduction to Survival Analysis Utah State University January 28, 2008 Bill Welbourn.
Assessing Binary Outcomes: Logistic Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Pro gradu –thesis Tuija Hevonkorpi.  Basic of survival analysis  Weibull model  Frailty models  Accelerated failure time model  Case study.
Statistical Inference for more than two groups Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Lecture 9: Analysis of intervention studies Randomized trial - categorical outcome Measures of risk: –incidence rate of an adverse event (death, etc) It.
Survival Analysis 1 Always be contented, be grateful, be understanding and be compassionate.
Lecture 12: Cox Proportional Hazards Model
1 Lecture 6: Descriptive follow-up studies Natural history of disease and prognosis Survival analysis: Kaplan-Meier survival curves Cox proportional hazards.
Survival Analysis approach in evaluating the efficacy of ARV treatment in HIV patients at the Dr GM Hospital in Tshwane, GP of S. Africa Marcus Motshwane.
01/20151 EPI 5344: Survival Analysis in Epidemiology Cox regression: Introduction March 17, 2015 Dr. N. Birkett, School of Epidemiology, Public Health.
Satistics 2621 Statistics 262: Intermediate Biostatistics Jonathan Taylor and Kristin Cobb April 20, 2004: Introduction to Survival Analysis.
Logistic regression. Recall the simple linear regression model: y =  0 +  1 x +  where we are trying to predict a continuous dependent variable y from.
01/20151 EPI 5344: Survival Analysis in Epidemiology Hazard March 3, 2015 Dr. N. Birkett, School of Epidemiology, Public Health & Preventive Medicine,
INTRODUCTION TO CLINICAL RESEARCH Survival Analysis – Getting Started Karen Bandeen-Roche, Ph.D. July 20, 2010.
Nonparametric Statistics
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
SURVIVAL ANALYSIS PRESENTED BY: DR SANJAYA KUMAR SAHOO PGT,AIIH&PH,KOLKATA.
Nonparametric Statistics
April 18 Intro to survival analysis Le 11.1 – 11.2
Survival curves We know how to compute survival curves if everyone reaches the endpoint so there is no “censored” data. Survival at t = S(t) = number still.
Statistics 103 Monday, July 10, 2017.
Nonparametric Statistics
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Presentation transcript:

Essentials of survival analysis How to practice evidence based oncology European School of Oncology July 2004 Antwerp, Belgium Dr. Iztok Hozo Professor of Mathematics Indiana University NW

Time-to-Event Time-to-event data are generated when the measure of interest is the amount of time to occurrence of an event of interest. For Example: – Time from randomization to death in clinical trial – Time from randomization to recurrence in a cancer clinical trial – Time from diagnosis of cancer to death due to the cancer – Time from diagnosis of cancer to death due to any causes – Time from remission to relapse of leukemia – Time from HIV infection to AIDS – Time from exposure to cancer incidence in an epidemiological cohort study

Censoring Censoring occurs when we have some information, but we don’t know the exact time-to-event measure. For example, patients typically enter a clinical study at the time randomization (or the time of diagnosis, or treatment) and are followed up until the event of interest is observed. However, censoring may occur for the following reasons: a person does not experience the event before the study ends; death due to a cause not considered to be the event of interest (traffic accident, adverse drug reaction,…); and loss to follow-up, for example, if the person moves. We say that the survival time is censored. These are examples of right censoring, which is the most common form of censoring in medical studies. For these patients, the complete time-to-event measure is unknown; we only know that the true time-to-event measure is greater than the observed measurement.

Example: X means an event occurred; O means that the subject was censored.

Example 2 (from Kleinbaum: “Survival Analysis”) PatientTime (t) Censor (  ) Consider data from a retrospective study of 13 women who had surgery for breast cancer. The survival times are: 23, 47, 69, 70+, 71+, 100+, 101+, 148, 181, 198+, 208+, 212+, 224+ (the “+” means that that particular patient was censored)

Survival Curve - Calculus S(t) = cumulative survival function = proportion that survive until time t f(t) = frequency distribution of age at death h(t) = hazard function (i.e. death rate at age t) = event rate Relationships:

Distribution Function, Survival Function and Density Function Probability Distribution function Probability Density function Survival function

Creating a Kaplan-Meier curve For each non-censored failure time t j (time-to-event time) evaluate: n j = number at risk before time t j d j = number of deaths from t j-1 to t j Fraction = estimated probability of surviving past t j-1 given that you are at risk at time The Product Limit Formula:

Kaplan-Meier Product Limit Estimate Consider data from a retrospective study of 45 women who had surgery for breast cancer. The survival times are: 23, 47, 69, 70+, 71+, 100+, 101+, 148, 181, 198+, 208+, 212+, 224+

Survival Curves – more examples

Log-Rank test for two groups Suppose we have two groups, each with a different treatment. Usually, we represent this kind of situation in a 2x2 table. EventNo Event Intervention45198 Control52203 or # at Risk# Events Interventionn1 = 243m1 = 45 Controln2 = 255m2 = 52 TOTAL:N = 498M = 97 Expected number of events: Intervention Control Observed- Expected:-2.33 Variance: 19.55

If the data are given through time, we have a series of 2x2 tables. Expected number of events If the two groups were the same – what would the expected number of events be? Observed minus expected This is a measure of deviation of one treatment from their average (the expected) Log-rank statistic measures whether the data in the two groups are statistically “different”. Log-Rank test for two groups

Comparing Survival Functions Question: Did the treatment make a difference in the survival experience of the two groups? Hypothesis: H 0 : S 1 (t)=S 2 (t) for all t ≥ 0. Three often used tests: 1. Log-rank test (aka Mantel-Haenszel Test); 2. Wilcoxon Test; 3. Likelihood ratio test.

Log-rank example (from Kleinbaum: “Survival Analysis”)

Survival data vs. two-by-two table = different

Log-Rank test for several groups The null hypothesis is that all the survival curves are the same. Log-rank statistic is given by the sum: This statistic has Chi-square distribution with (# of groups – 1) degrees of freedom.

Cox Proportional Hazards Regression Most interesting survival-analysis research examines the relationship between survival — typically in the form of the hazard function — and one or more explanatory variables (or covariates). Most common are linear-like models for the log hazard.  For example, a parametric regression model based on the exponential distribution, Needed to assess effect of multiple covariates on survival Cox-proportional hazards is the most commonly used multivariate survival method  Easy to implement in SPSS, Stata, or SAS  Parametric approaches are an alternative, but they require stronger assumptions about h(t).

Assumes multiplicative risk—this is the proportional hazard assumption Conveniently separates baseline hazard function from covariates  Baseline hazard function over time  Covariates are time independent Nonparametric Can handle both continuous and categorical predictor variables (think: logistic, linear regression) Without knowing baseline hazard h o (t), can still calculate coefficients for each covariate, and therefore hazard ratio Multivariate methods: Cox proportional hazards

Limitations of Cox PH model Covariates normally do not vary over time  True with respect to gender, ethnicity, or congenital condition One can program time-dependent variables  Baseline hazard function, h o (t), is never specified, but Cox PH models known hazard functions  You can estimate h o (t) accurately if you need to estimate S(t).

Hazard Ratio Interesting to interpret For example, if HR = 0.70, we can deduce the following:  Relative effect on survival is or 30% reduction of the risk of death  Absolute Difference in survival is given as so, if S = 60%, which represents a 10% difference.  Difference in median survival is given as the difference between the median/HR and the median. For example, if the median is months, then the difference is given as or months increase in median survival.