1 Borgan and Henderson: Event History Methodology Lancaster, September 2006 Session 6.1: Recurrent event data Intensity processes and rate functions Robust.

Slides:



Advertisements
Similar presentations
Autocorrelation Functions and ARIMA Modelling
Advertisements

Topic: Several Approaches to Modeling Recurrent Event Data Presenter: Yu Wang.
Qualitative predictor variables
Lecture 11 (Chapter 9).
Probability Distributions CSLU 2850.Lo1 Spring 2008 Cameron McInally Fordham University May contain work from the Creative Commons.
4.3 Confidence Intervals -Using our CLM assumptions, we can construct CONFIDENCE INTERVALS or CONFIDENCE INTERVAL ESTIMATES of the form: -Given a significance.
What role should probabilistic sensitivity analysis play in SMC decision making? Andrew Briggs, DPhil University of Oxford.
Cox Model With Intermitten and Error-Prone Covariate Observation Yury Gubman PhD thesis in Statistics Supervisors: Prof. David Zucker, Prof. Orly Manor.
Categorical Data. To identify any association between two categorical data. Example: 1,073 subjects of both genders were recruited for a study where the.
Goodness of Fit of a Joint Model for Event Time and Nonignorable Missing Longitudinal Quality of Life Data – A Study by Sneh Gulati* *with Jean-Francois.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
SUMS OF RANDOM VARIABLES Changfei Chen. Sums of Random Variables Let be a sequence of random variables, and let be their sum:
Point and Confidence Interval Estimation of a Population Proportion, p
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
Visual Recognition Tutorial
Probability theory 2010 Conditional distributions  Conditional probability:  Conditional probability mass function: Discrete case  Conditional probability.
Modeling clustered survival data The different approaches.
Accelerated Failure Time (AFT) Model As An Alternative to Cox Model
Standard error of estimate & Confidence interval.
Review of normal distribution. Exercise Solution.
Objectives of Multiple Regression
Estimating cancer survival and clinical outcome based on genetic tumor progression scores Jörg Rahnenführer 1,*, Niko Beerenwinkel 1,, Wolfgang A. Schulz.
Economics 173 Business Statistics Lecture 2 Fall, 2001 Professor J. Petry
Prediction concerning Y variable. Three different research questions What is the mean response, E(Y h ), for a given level, X h, of the predictor variable?
1 Borgan and Henderson: Event History Methodology Lancaster, September 2006 Session 1: Event history data and counting processes.
The horseshoe estimator for sparse signals CARLOS M. CARVALHO NICHOLAS G. POLSON JAMES G. SCOTT Biometrika (2010) Presented by Eric Wang 10/14/2010.
Random variables Petter Mostad Repetition Sample space, set theory, events, probability Conditional probability, Bayes theorem, independence,
Correlation.
Statistical Decision Theory
Model Inference and Averaging
Inferences in Regression and Correlation Analysis Ayona Chatterjee Spring 2008 Math 4803/5803.
Scientific question: Does the lunch intervention impact cognitive ability? The data consists of 4 measures of cognitive ability including:Raven’s score.
1 Using martingale residuals to assess goodness of fit for sampled risk set data Ørnulf Borgan Department of Mathematics University of Oslo Based on joint.
ALISON BOWLING THE GENERAL LINEAR MODEL. ALTERNATIVE EXPRESSION OF THE MODEL.
Excepted from HSRP 734: Advanced Statistical Methods June 5, 2008.
Bayesian Analysis and Applications of A Cure Rate Model.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Review of Measures of Central Tendency, Dispersion & Association
Chap 14-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 14 Additional Topics in Regression Analysis Statistics for Business.
Borgan and Henderson:. Event History Methodology
1 An Interim Monitoring Approach for a Small Sample Size Incidence Density Problem By: Shane Rosanbalm Co-author: Dennis Wallace.
Different Distributions David Purdie. Topics Application of GEE to: Binary outcomes: – logistic regression Events over time (rate): –Poisson regression.
Pro gradu –thesis Tuija Hevonkorpi.  Basic of survival analysis  Weibull model  Frailty models  Accelerated failure time model  Case study.
Agresti/Franklin Statistics, 1 of 87  Section 7.2 How Can We Construct a Confidence Interval to Estimate a Population Proportion?
Lecture 12: Cox Proportional Hazards Model
Statistical Decision Theory Bayes’ theorem: For discrete events For probability density functions.
Single-Factor Studies KNNL – Chapter 16. Single-Factor Models Independent Variable can be qualitative or quantitative If Quantitative, we typically assume.
Summarizing Risk Analysis Results To quantify the risk of an output variable, 3 properties must be estimated: A measure of central tendency (e.g. µ ) A.
Lecture 4: Likelihoods and Inference Likelihood function for censored data.
A first order model with one binary and one quantitative predictor variable.
1 Using dynamic path analysis to estimate direct and indirect effects of treatment and other fixed covariates in the presence of an internal time-dependent.
Introduction to Frailty Models
CHAPTER- 3.2 ERROR ANALYSIS. 3.3 SPECIFIC ERROR FORMULAS  The expressions of Equations (3.13) and (3.14) were derived for the general relationship of.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 12 Analyzing the Association Between Quantitative Variables: Regression Analysis Section.
Regression Analysis: A statistical procedure used to find relations among a set of variables B. Klinkenberg G
Hypothesis Testing and Statistical Significance
1 Ka-fu Wong University of Hong Kong A Brief Review of Probability, Statistics, and Regression for Forecasting.
Week 21 Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
1 Borgan and Henderson: Event History Methodology Lancaster, September 2006 Session 8.1: Cohort sampling for the Cox model.
Class Six Turn In: Chapter 15: 30, 32, 38, 44, 48, 50 Chapter 17: 28, 38, 44 For Class Seven: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 Read.
The simple linear regression model and parameter estimation
Sampling Distributions
Comparing Cox Model with a Surviving Fraction with regular Cox model
Model validation and prediction
Inference about the Slope and Intercept
The regression model in matrix form
Inference about the Slope and Intercept
LIMITED DEPENDENT VARIABLE REGRESSION MODELS
Multiple Regression Chapter 14.
Parametric Methods Berlin Chen, 2005 References:
Presentation transcript:

1 Borgan and Henderson: Event History Methodology Lancaster, September 2006 Session 6.1: Recurrent event data Intensity processes and rate functions Robust variance estimation in Cox regression

2 Example: The bladder cancer study: –86 patients with superficial bladder tumours –Tumours were removed, and the patients randomized to placebo or treatment by thiotepa –Patients were followed up, and the recurrence of tumours were registered (truncated at 4) Recurrent event data

3

4 Choice of time-scale Time measured from an initial event (e.g. birth, onset of disease) –Poisson processes and their generalizations Time measured between events –Renewal processes and their generalizations Important distinction between models is the choice of time scale:

5 Let N i (t) count the recurrent events for individual i as a function of (study) time t A Cox-Poisson regression model Assume that N i (t) has intensity process The events for individual i occur according to a Poisson process with intensity intensity for individual i depending on fixed covariates at risk indicator

6 Data file for analysis of bladder data: id trt fu no size intno start stop rec ···· ···· ···· Note: One line for each time interval between events

7 Coding: – idpatient number – trt treatment (1=placebo; 2=thiotepa) – fu total time of follow-up – no initial number of tumors (8 denotes 8+) – size initial size of largest tumor (in cm) – intno number of the time interval – start start of the time interval – stop end of the time interval – recrecurrence at end of the time interval (0=no; 1=yes)

8 R code: coxph(Surv(start,stop,rec)~trt, data=bladder) coef se(coef)p trt Standard error and p-value are valid provided we have an appropriate model for the intensity processes. Looking at the data, there seems to be more variation among the individuals than can be described by the fixed covariates, so the Cox-Poisson model may be too simplistic.

9 In addition to the Cox-Poisson model, we have the following modelling options for recurrent event data: Modelling options for recurrent event data Frailty model Marginal model Dynamic model (cf. session 6.2)

10 Frailty model for recurrent events Model the differences between individuals (not accounted for by the fixed covariates) by frailties Z i (e.g. gamma distributed). Given Z i assume that the recurrent events for individual i occur according to a Poisson process with intensity May integrate out Z i to obtain the intensity process of N i (t).

11 R code for gamma frailty model: coxph(Surv(start,stop,rec)~trt+frailty(id), data=bladder) coef se(coef)p trt Note that the standard error for the frailty model becomes substantially larger (and more appropriate) than the one for the Cox-Poisson model

12 Marginal model for recurrent events The intensity process of N i (t) is given by Note that the intensity process depends on all information available "just before" time t, i.e. fixed covariates and events in [0,t). One may try to model this dependence using information on previous events as dynamic covariates (cf. session 6.2)

13 If the main interest is on the effect of the fixed covariates, another option is to adopt a marginal model. A marginal model is defined by means of the rate function r i (t) given by Note that we only condition on fixed covariates and at risk status. Thus we average (marginalize) over all possible event patterns in [0,t).

14 We assume a proportional effect of the fixed covariates on the rate functions: Note: The interpretation of the regression coefficients are not the same for the frailty model and the marginal model. For the frailty model they measure the effects of the covatiates for a given individual (i.e. a given value of the frailty), while for the marginal model they measure the effects of the covariates averaged over all individuals at risk.

15 For the marginal model we may still estimate the regression coefficients using Cox's partial likelihood (even though this is not a proper partial likelihood for marginal models) The usual estimator for the covariance matrix based on the observed information does not apply, however, and has to be replaced by a robust "sandwich estimator" (cf. below)

16 R code for marginal model: coxph(Surv(start,stop,rec)~trt+cluster(id),data=bladder) coef se(coef) robust se p trt Note that the robust standard error (which is the one to be used for the marginal model) becomes substantially larger than for the Cox-Poisson model.

17 For the marginal model we may write but is not a martingale increment. Therefore the martigale based proofs of the large sample properties (cf. session 3) no longer apply, and the usual likelihood based methods cannot be used. Robust variance estimation

18 By the argument in session 3, we have with the observed information and

19 By the (ordinary) cental limit theorem is approximately normally distributed with mean zero and a covariance matrix that may be estimated by Assuming an iid model, is a sum of iid variables. It follows by (*) that is approximately normally distributed around with a covariance matrix that may be estimated by the (robust) "sandwich estimator"