Presentation is loading. Please wait.

Presentation is loading. Please wait.

3 rd Summer School in Computational Biology September 10, 2014 Frank Emmert-Streib & Salissou Moutari Computational Biology and Machine Learning Laboratory.

Similar presentations


Presentation on theme: "3 rd Summer School in Computational Biology September 10, 2014 Frank Emmert-Streib & Salissou Moutari Computational Biology and Machine Learning Laboratory."— Presentation transcript:

1 3 rd Summer School in Computational Biology September 10, 2014 Frank Emmert-Streib & Salissou Moutari Computational Biology and Machine Learning Laboratory Center for Cancer Research and Cell Biology Queen’s University Belfast, UK

2 Exercise – Survival Analysis Homework ~ 1.5 hours

3 1. Kaplan-Meier Survival Curves 3

4 Result: Survival Curve 4 S(t)

5 Goal: estimate S(t) from data A survival curve shows S(t) as a function of t. – S(t): survival function (survivor function) – t: time S(t) gives the probability that the random variable T is larger than a specified time t, i.e., S(t) = Pr(T>t) T: is the event 5 Problem: censoring

6 Small example: Leukemia 6 Chemotherapy (we use this info later) censoring Acute Myelogenous Leukemia (AML) survival time Only 5 patients

7 Small example: Leukemia 7 censoring Number in riskNumber of events event ???

8 Kaplan-Meier estimator for S(t) Estimator: n i : number of subjects at time t i d i : number of events at time t i 8 Kaplan & Meier 1958

9 Kaplan-Meier estimator for S(t) Estimator: n i : number of subjects at time t i d i : number of events at time t i 9

10 Check S(t) till t 10

11 Kaplan-Meier estimator for S(t) Estimator: n i : number of subjects at time t i d i : number of events at time t i 11

12 Check S(t) till t 12

13 Kaplan-Meier estimator for S(t) Estimator: n i : number of subjects at time t i d i : number of events at time t i 13 Last time seen, still alive at that time

14 Check S(t) till t 14

15 Kaplan-Meier estimator for S(t) Estimator: n i : number of subjects at time t i d i : number of events at time t i 15

16 Check S(t) till t 16

17 Kaplan-Meier estimator for S(t) Estimator: n i : number of subjects at time t i d i : number of events at time t i 17

18 Check S(t) till t 18

19 Full data set: Leukemia 19 23 patients

20 R code 20

21 2. Comparing Survival Curves 21

22 Reasons for comparing survival curves (SC) Treatment vs no treatment: – Compare a SC for patients that have been treated with a certain medication with the SC for patient that have not been treated. – Result: Has the treatment an effect on the survival of the patients? 22

23 Reasons for comparing survival curves Chemotherapy vs no chemotherapy : – Compare a SC for patients that had chemotherapy with the SC for patient that have not had chemotherapy. – Result: Has the chemotherapy an effect on the survival of the patients? 23 Survival Analysis has a big practical relevance

24 Data: Leukemia 24 11 patients with chemo 12 patients without Goal: compare the two SCs statistically Group 1 Group 2

25 R code 25

26 Log-rank test (Mantel-Haenszel) Hypothesis: Null hypothesis H 0 : No difference in survival between (group 1) and (group 2). Alternative hypothesis H 1 : Difference in survival between (group 1) and (group 2). 26 Mantel and Haenszel 1959

27 Idea of the test For each time t, estimate the expected number of events for (group 1) and (group 2). 27 Number in risk at t in i Number of events at t in i

28 28 The e it are obtained assuming H 0 is true. Hence, m it – e it is a measure for the deviation of the data from H 0. sum E2E2 E1E1 O 1 - E 1 O 2 – E 2

29 Wrapping up Test statistic: Sampling distribution: s follows a chi-square distribution with one degree of freedom 29

30 R code Back to our leukemia data set: 30

31 Data: Leukemia 31 11 patients with chemo 12 patients without Goal: compare the two SCs statistically Group 1 Group 2

32 Survival Analysis & Biomarkers

33 NIH Definition of Biomarker A characteristic that is objectively measured and evaluated as an indicator of normal biological processes, pathogenic processes, or pharmacologic responses to therapeutic intervention.

34 FDA Definition of Biomarker Any measurable diagnostic indicator that is used to assess the risk or presence of disease

35 What is a biomarker? These definitions are very broad and do not help in finding practical implementations for a particular disease.

36 Our “definition” Remark: We do not want to address all possible problems that can involve biomarkers but focus on a particular application. Application: Identify a set of genes that can be used for a prognostic analysis. …that are good!

37 Definition of ‘prognosis’ A prognosis is a medical term denoting the prediction of how a patient will progress over time. For instance, a patient with a diagnosed disease can have: – Long time survival – Short time survival

38 Our “definition” Remark: We do not want to address all possible problems that can involve biomarkers but focus on a particular application. Application: Identify a set of genes that can be used for a prognostic analysis. Set of genes: we call biomarkers Use biomarkers to predict the prognostic outcome of a patient to classify survival

39

40 Underlying idea to identify biomarkers The identification of biomarkers is a composite approach (or a procedure) that is based on a couple of other methods. In the previous example: 1.Survival analysis 2.Differential expression of genes 3.Classification

41

42 Underlying idea to identify biomarkers The identification of biomarkers is a composite approach (or a procedure) that is based on a couple of other methods. In the previous example: 1.Clustering 2.Survival analysis 3.Differential expression of genes 4.Classification

43 Our “definition” Remark: We do not want to address all possible problems that can involve biomarkers but focus on a particular application. Application: Identify a set of genes that can be used for a prognostic analysis. Structured patient groups vs unstructured patient groups Statistics: Feature selection problem

44 Underlying idea to identify biomarkers The identification of biomarkers is a composite approach (or a procedure) that is based on a couple of other methods. The definition of the procedure is part of the experimental design of the whole experiment. Yes, the experimental design includes the analysis of the data!

45 Summary & Outlook to Genome and Network Medicine Almost there!

46 Schedule 17 lectures

47 Interdisciplinary summer school

48 Vision of the VC Universities require interdisciplinary engagement in the educational and research effort Professor Patrick Johnston of President and Vice-Chancellor (VC) of Queen’s University

49 A look 5 years ahead

50 1. Single cell experiments Experimental measurements of – DNA – Gene expression (mRNA) – Protein binding within single cells. What do the other high-throughput data provide information for? Populations of cells. NGS

51 1. Single cell experiments Experimental measurements of – DNA – Gene expression (mRNA) – Protein binding within single cells. What do the other high-throughput data provide information for? Populations of cells. NGS Study the heterogeneity of cancer tumors.

52 1. Single cell experiments PacBio (Pacific Biosciences) SMRT: Single molecule real time sequencing

53 2. Personalized Medicine The idea behind Personalized medicine is to provide a customization of healthcare using molecular analysis - with medical decisions, practices etc, which are tailored to the needs of the individual patient. One drug for all customized treatment.

54 2. Personalized Medicine 2012

55 What does this all mean?

56 It means first of all more data!

57 What does this all mean? It means first of all more data!

58 Survey Please participate in the survey about the summer school in order to help us to improve. We will send it early next week.

59 Thank you to everyone for participating! We hope you enjoyed the summer school.


Download ppt "3 rd Summer School in Computational Biology September 10, 2014 Frank Emmert-Streib & Salissou Moutari Computational Biology and Machine Learning Laboratory."

Similar presentations


Ads by Google