In the name of God, the Most Gracious, the Most Merciful. Generally, survival analysis is a collection of statistical procedures for data analysis for which the outcome variable of.


1 In the name of God, the Most Gracious, the Most Merciful

2

3

4 Generally, survival analysis is a collection of statistical procedures for data analysis for which the outcome variable of interest is time until an event occurs.

5

6 4

7 Comparison of Two Treatments. Thirty melanoma patients (stages 2 to 4) were studied to compare the immunotherapies BCG (Bacillus Calmette-Guerin) and Corynebacterium parvum for their abilities to prolong remission duration and survival time. The age, gender, disease stage, treatment received, remission duration, and survival time are given in Table 3.1. The usual objective with this type of data is to determine the length of remission and survival and to compare the distributions of remission and survival time in each group.

8

9

10 Comparison of Three Diets. A laboratory investigator interested in the relationship between diet and the development of tumors divided 90 rats into three groups and fed them low-fat, saturated fat, and unsaturated fat diets, respectively (King et al., 1979). The rats were of the same age and species and were in similar physical condition. An identical number of tumor cells was injected into a foot pad of each rat. The rats were observed for 200 days.

11

12

13

14

15

16

17 Types of censored data: right censored, left censored, interval censored.

18

19

20

21

22

23 The survivor function

24

25 The hazard function. This mathematical formula is difficult to explain in practical terms.
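The formula itself did not survive in the transcript. As a reference point, the standard textbook definitions of the survivor and hazard functions can be written as below. The symbols T (a subject's survival time) and Δt (a small time increment) are notation assumed here, not taken from the slide.

```latex
% T: nonnegative random variable giving a subject's survival time (assumed notation)
\[
S(t) = P(T > t)
\]
\[
h(t) = \lim_{\Delta t \to 0} \frac{P(t \le T < t + \Delta t \mid T \ge t)}{\Delta t}
\]
```

Read this way, h(t) is an instantaneous failure rate per unit time among subjects still at risk at time t. It is a rate rather than a probability, so it can exceed 1.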

26

27

28

29

30 h1(t): patients with acute leukemia who do not respond to treatment have an increasing hazard rate; h2(t): indicates the risk of soldiers wounded by bullets who undergo surgery; h3(t): is the risk of healthy persons between 18 and 40 years of age whose main risks of death are accidents; h4(t): describes the process of human life; h5(t): patients with tuberculosis have risks that increase initially, then decrease after treatment.

31

32 Goals of Survival Analysis

33

34 Data Layout

35

36 The estimated survivor curves for the treatment and placebo groups.

37

38 The possible confounding effect. In this case, we would say that the treatment effect is confounded by the effect of log WBC. We need to adjust for the imbalance in the distribution of log WBC.

39 Interaction. What we mean by interaction is that the effect of the treatment may be different, depending on the level of log WBC. When there is a strong treatment-by-log WBC interaction, we have to qualify the effect of the treatment as depending on the level of log WBC.

40 Two ways to control for log WBC: 1) stratify on log WBC and compare survival curves for different strata, or 2) use mathematical modeling procedures such as the proportional hazards or other survival models.

41 How to estimate and graph survival curves? Use the Kaplan-Meier (KM) method.

42 Introduction to Kaplan-Meier. Non-parametric estimate of the survival function: no distributional assumptions (either about the shape of the underlying hazard function or about proportional hazards). Simply, the empirical probability of surviving past certain times in the sample (taking into account censoring).

43 Introduction to Kaplan-Meier. Non-parametric estimate of the survival function. Commonly used to describe survivorship of study populations. Commonly used to compare two study populations. Intuitive graphical presentation.

44 Survival Data (right-censored). [Figure: follow-up timelines for Subjects A to E between the beginning and end of the study; horizontal axis: time in months.] 1. Subject E dies at 4 months.

45 Corresponding Kaplan-Meier Curve. [Figure: survival curve starting at 100%; horizontal axis: time in months.] Subject E dies at 4 months. Probability of surviving to 4 months is 100% = 5/5. Fraction surviving this death = 4/5.

46 Survival Data. [Figure: follow-up timelines for Subjects A to E between the beginning and end of the study; horizontal axis: time in months.] 1. Subject E dies at 4 months. 2. Subject A drops out after 6 months. 3. Subject C dies at 7 months.

47 Corresponding Kaplan-Meier Curve. [Figure: survival curve starting at 100%; horizontal axis: time in months.] Subject C dies at 7 months. Fraction surviving this death = 2/3.

48 Survival Data. [Figure: follow-up timelines for Subjects A to E between the beginning and end of the study; horizontal axis: time in months.] 1. Subject E dies at 4 months. 2. Subject A drops out after 6 months. 3. Subject C dies at 7 months. 4. Subjects B and D survive for the whole year-long study period.

49 Corresponding Kaplan-Meier Curve. [Figure: survival curve starting at 100%; horizontal axis: time in months.] Rule from probability theory: P(A&B) = P(A)*P(B) if A and B are independent. In survival analysis, intervals are defined by failures (2 intervals leading to failures here): P(surviving intervals 1 and 2) = P(surviving interval 1) * P(surviving interval 2). Product limit estimate of survival = P(surviving interval 1 | at risk up to failure 1) * P(surviving interval 2 | at risk up to failure 2) = 4/5 * 2/3 = 0.5333.

50 The product limit estimate. The probability of surviving the entire year, taking into account censoring, is (4/5)(2/3) = 53%. NOTE: >40% (2/5) because the one drop-out survived at least a portion of the year, AND <60% (3/5) because we don't know if the one drop-out would have survived until the end of the year.
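The five-subject example can be reproduced with a short sketch of the product limit calculation. The function name `kaplan_meier` and the 0/1 event coding (1 = death, 0 = censored) are choices made for this illustration. The follow-up times are the ones stated on the slides, with Subjects B and D treated as censored at 12 months.

```python
def kaplan_meier(times, events):
    """Product limit estimate.

    times  : follow-up time for each subject
    events : 1 if the subject failed (died) at that time, 0 if censored
    Returns a list of (failure_time, n_at_risk, n_failures, survival_estimate).
    """
    # Only distinct failure times create a step in the curve.
    failure_times = sorted({t for t, e in zip(times, events) if e == 1})
    surv = 1.0
    rows = []
    for tf in failure_times:
        # Risk set: everyone still under observation just before tf.
        n_at_risk = sum(1 for t in times if t >= tf)
        n_failures = sum(1 for t, e in zip(times, events) if t == tf and e == 1)
        # Multiply by the fraction surviving this failure time.
        surv *= (n_at_risk - n_failures) / n_at_risk
        rows.append((tf, n_at_risk, n_failures, surv))
    return rows


# Subjects from the slides: (months of follow-up, event indicator).
subjects = {"A": (6, 0), "B": (12, 0), "C": (7, 1), "D": (12, 0), "E": (4, 1)}
times = [t for t, _ in subjects.values()]
events = [e for _, e in subjects.values()]

km = kaplan_meier(times, events)
for tf, n, m, s in km:
    print(f"t={tf}: at risk={n}, failures={m}, S(t)={s:.4f}")
```

Subject A's drop-out at 6 months never produces a step of its own. It only shrinks the risk set from 4 to 3 before the death at 7 months, which is how censoring enters the estimate.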

51 n(f): the number of subjects in the risk set at the start of the interval; t(f): failure time; q(f): the number of censored subjects; m(f): the number of failures.

52 KM formula = product limit formula when there are censored subjects
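The formula on this slide was an image and is not in the transcript. A standard way to write the product limit formula with the deck's symbols is shown below, where t(f) is the f-th ordered failure time, n(i) the size of the risk set at t(i), and m(i) the number of failures at t(i). The subscripted typesetting is an assumption about how the slide displayed them.

```latex
\[
\hat{S}\bigl(t_{(f)}\bigr)
  = \prod_{i=1}^{f} \hat{P}\bigl(T > t_{(i)} \mid T \ge t_{(i)}\bigr)
  = \prod_{i=1}^{f} \frac{n_{(i)} - m_{(i)}}{n_{(i)}}
\]
```

Censored subjects do not appear as factors in the product. They affect the estimate only by leaving the risk set, which reduces n(i) at later failure times.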

53

54 How to evaluate whether or not KM curves for two or more groups are statistically equivalent? The most popular testing method is called the log-rank test.
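A minimal sketch of the two-group log-rank statistic follows, assuming the usual observed-minus-expected form with the hypergeometric variance. The function name `logrank_two_groups` and the toy data are invented for illustration and do not come from the deck.

```python
def logrank_two_groups(times1, events1, times2, events2):
    """Two-group log-rank test.

    Returns (O1 - E1, variance, chi-square statistic with 1 df).
    events: 1 = failure observed, 0 = censored.
    """
    data = [(t, e, 0) for t, e in zip(times1, events1)]
    data += [(t, e, 1) for t, e in zip(times2, events2)]
    failure_times = sorted({t for t, e, _ in data if e == 1})

    o_minus_e = 0.0
    variance = 0.0
    for tf in failure_times:
        # Risk sets and failure counts in each group at this failure time.
        n1 = sum(1 for t, _, g in data if g == 0 and t >= tf)
        n2 = sum(1 for t, _, g in data if g == 1 and t >= tf)
        m1 = sum(1 for t, e, g in data if g == 0 and t == tf and e == 1)
        m2 = sum(1 for t, e, g in data if g == 1 and t == tf and e == 1)
        n, m = n1 + n2, m1 + m2
        # Expected failures in group 1 if both groups share one survival curve.
        o_minus_e += m1 - n1 * m / n
        if n > 1:
            variance += n1 * n2 * m * (n - m) / (n ** 2 * (n - 1))

    chi2 = o_minus_e ** 2 / variance if variance > 0 else 0.0
    return o_minus_e, variance, chi2


# Hypothetical toy data (illustration only, not from the presentation).
g1_times, g1_events = [6, 7, 10, 15, 19], [1, 0, 1, 1, 0]
g2_times, g2_events = [1, 2, 3, 4, 8], [1, 1, 1, 0, 1]

diff, var, chi2 = logrank_two_groups(g1_times, g1_events, g2_times, g2_events)
print(f"O1 - E1 = {diff:.3f}, variance = {var:.3f}, chi-square = {chi2:.3f}")
```

The statistic is compared with a chi-square distribution with 1 degree of freedom. Large values suggest that the two survival curves differ.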

55

56 The Log-Rank Test for Several Groups

57

58 Alternatives to the Log Rank Test

59 Wilcoxon Test. The Wilcoxon test (called the Breslow test in SPSS).

60 All the test results are highly significant, leading to the same conclusion: reject the null hypothesis.

61 Choosing a Test

62 Confidence intervals for KM curves

63

64

65

66 Edited Output From Stata:

67 Time-independent variable: values for a given individual do not change over time; e.g., SEX and smoking status (SMK).

68 Why Is the Cox PH Model Popular? 1) Semiparametric property. 2) The Cox PH model is “robust”: even though the baseline hazard is not specified, reasonably good estimates of regression coefficients, hazard ratios of interest, and adjusted survival curves can be obtained for a wide variety of data situations.

69 Maximum Likelihood (ML) Estimation of the Cox PH Model. What we need are estimates of the b’s to assess the effect of explanatory variables of interest. The measure of effect is called a hazard ratio (HR).

70 Statistical inferences for hazard ratios

71 1) Test for treatment effect: Wald statistic, P < 0.001 (highly significant). Conclusion: the treatment effect is significant. 2) Point estimate: HR = 4.523. Conclusion: the hazard for the placebo group is 4.5 times the hazard for the treatment group. 3) 95% confidence interval for the HR: (2.027, 10.094).
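The three results on this slide are linked on the log scale. The sketch below assumes the interval is a Wald-type interval, exp(b ± 1.96·SE), which the slide does not state. Under that assumption, the standard error of the log hazard ratio can be recovered from the reported limits and used to check the point estimate and the P-value. The helper names are hypothetical.

```python
import math

Z_95 = 1.96  # normal quantile for a 95% interval (assumed Wald construction)


def se_from_ci(lower, upper, z=Z_95):
    """Standard error of ln(HR) implied by a Wald interval for the HR."""
    return (math.log(upper) - math.log(lower)) / (2 * z)


def wald_ci(hr, se_log_hr, z=Z_95):
    """Wald interval for the HR, built on the log scale and exponentiated."""
    b = math.log(hr)
    return math.exp(b - z * se_log_hr), math.exp(b + z * se_log_hr)


def wald_p_value(hr, se_log_hr):
    """Two-sided P-value for H0: HR = 1, using a normal approximation."""
    z = math.log(hr) / se_log_hr
    return math.erfc(abs(z) / math.sqrt(2))


# Values reported on the slide.
hr, ci_lower, ci_upper = 4.523, 2.027, 10.094

se = se_from_ci(ci_lower, ci_upper)
lower, upper = wald_ci(hr, se)
p = wald_p_value(hr, se)

print(f"implied SE of ln(HR): {se:.3f}")
print(f"rebuilt 95% CI: ({lower:.3f}, {upper:.3f})")
print(f"two-sided Wald P-value: {p:.5f}")
```

The interval is symmetric around ln(HR), not around HR itself, which is why the upper limit lies much farther from 4.523 than the lower limit does.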

72 The potential confounding effect. The HR for model 1 (4.523) is higher than the HR for model 2 (3.648). Confounding: the crude and adjusted HRs are meaningfully different. Confounding due to log WBC means we must control for log WBC, i.e., prefer model 2 to model 1.

73 Interaction in model

74 The Meaning of the PH Assumption. The PH assumption requires that the HR is constant over time: the hazard for one individual is proportional to the hazard for any other individual, where the proportionality constant is independent of time.
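The statement can be made concrete with the usual form of the Cox PH model. The notation below is assumed for illustration: h_0(t) is the unspecified baseline hazard, X and X* are the covariate vectors of two individuals, and the β's are the regression coefficients (the "b's" of the earlier slide).

```latex
\[
h(t, X) = h_0(t)\,\exp\!\Bigl(\sum_{i=1}^{p} \beta_i X_i\Bigr)
\]
\[
\mathrm{HR}
  = \frac{h(t, X^{*})}{h(t, X)}
  = \frac{h_0(t)\,\exp\bigl(\sum_{i=1}^{p} \beta_i X_i^{*}\bigr)}
         {h_0(t)\,\exp\bigl(\sum_{i=1}^{p} \beta_i X_i\bigr)}
  = \exp\!\Bigl(\sum_{i=1}^{p} \beta_i \,(X_i^{*} - X_i)\Bigr)
\]
```

The baseline hazard cancels, so the ratio contains no t. For a single 0/1 covariate such as treatment group, the ratio reduces to exp(β).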

75

76 PH Not Satisfied. General rule: if the hazards cross, then a Cox PH model is not appropriate.

77 If the Cox PH model is inappropriate, how should we carry out the analysis?

78 Evaluating the Proportional Hazards Assumption. Checking the proportional hazards assumption:

79 There are two types of graphical techniques available: 1) comparing estimated -ln(-ln) survivor curves, and 2) comparing observed with predicted survivor curves.
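The reasoning behind the first technique can be sketched numerically. Under proportional hazards with a constant hazard ratio HR, the survivor functions satisfy S1(t) = S0(t)^HR, so the -ln(-ln) curves differ by the constant -ln(HR) at every time point and appear parallel. The survival probabilities and the hazard ratio below are hypothetical values chosen for illustration.

```python
import math


def neg_log_neg_log(s):
    """-ln(-ln S) transform of a survival probability with 0 < s < 1."""
    return -math.log(-math.log(s))


# Hypothetical baseline survival probabilities at successive times (illustration only).
s0 = [0.95, 0.80, 0.60, 0.35, 0.10]
hr = 2.5  # hypothetical constant hazard ratio

# Proportional hazards implies S1(t) = S0(t) ** HR.
s1 = [s ** hr for s in s0]

# Vertical gap between the two transformed curves at each time point.
gaps = [neg_log_neg_log(a) - neg_log_neg_log(b) for a, b in zip(s1, s0)]
for s_base, gap in zip(s0, gaps):
    print(f"S0={s_base:.2f}: gap={gap:.4f}")
print(f"-ln(HR) = {-math.log(hr):.4f}")
```

With estimated curves the gaps are never exactly constant. The check is visual: curves that cross or clearly converge or diverge suggest the PH assumption is violated.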

80 Goodness-Of-Fit (GOF) tests

