Personalized Medicine: Analytics for Cancer Survival Curves Ran Qi, Shujia Zhou, Yelena Yesha June 13, 2013 IAB Meeting Research Report.


Introduction: Cancer Staging (1)
- Cancer stage is an anatomic description of the character and extent of cancer spread (usually I to IV).
- Prognostic factors:
  - Tumor (T): size, location, local extent
  - Nodes (N): number and location of nodal metastases
  - Metastasis (M): presence of distant organ spread

Lung cancer staging (bin model)
Stage   T        N            M
I       T1       N0           M0
IIA     T1       N1           M0
        T2       N0           M0
IIB     T2       N1           M0
        T3       N0           M0
IIIA    T1, 2    N2           M0
        T3       N1, 2        M0
IIIB    T4       N0, 1, 2     M0
IIIC    Any T    N3           M0
IV      Any T    Any N        M1
Each T/N/M combination in the table is one bin.

Lung cancer survival curves

A Bin Model
- Breast cancer: 5 T's, 4 N's, 2 M's give 40 bins (5x4x2)
- Adding grade (3 levels): 120 bins (5x4x2x3)
- Adding ER (hormonal status, 2 levels): 240 bins (5x4x2x3x2)
- With each additional variable the number of bins multiplies, so the number of bins to assign to each stage becomes enormous and collapsing them into stages becomes impractical.
- A "bin" is also called a "combination".
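
The bin arithmetic on this slide can be checked with a few lines of Python. This is a minimal sketch: only the number of levels per factor comes from the slide, while the level labels (`T0`, `G1`, `ER+`, and so on) and the helper name `count_bins` are illustrative placeholders.

```python
from itertools import product
from math import prod

# Number of levels per factor follows the slide; the labels are placeholders.
factors = {
    "T": ["T0", "T1", "T2", "T3", "T4"],  # 5 levels
    "N": ["N0", "N1", "N2", "N3"],        # 4 levels
    "M": ["M0", "M1"],                    # 2 levels
}


def count_bins(factors):
    """Number of bins = product of the number of levels of every factor."""
    return prod(len(levels) for levels in factors.values())


tnm_bins = count_bins(factors)            # T x N x M

factors["grade"] = ["G1", "G2", "G3"]     # add grade (3 levels)
grade_bins = count_bins(factors)

factors["ER"] = ["ER+", "ER-"]            # add ER status (2 levels)
er_bins = count_bins(factors)

# Each bin is one concrete combination of factor levels.
all_bins = list(product(*factors.values()))

print(tnm_bins, grade_bins, er_bins)      # 40 120 240
```

Every added factor multiplies the bin count by its number of levels, which is why manual collapsing into stages stops scaling.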

Problems
- How to combine the growing number of prognostic factors into a small number of stages
  - Since the TNM staging system was introduced in the 1950s, many new prognostic factors have been identified.
  - By 1995, 76 predictive factors had been identified for breast cancer.
  - By 2002, 150 factors had been identified for lung cancer.
- Different prognostic factors have different levels of impact on the survival curves.

Objectives
- Reduce the number of bins by grouping similar patients
- Find the relationship between prognostic factors and survival curves

Approaches
- Group cancer patients according to their similarity
  - Ensemble Algorithm for Clustering Cancer Data (EACCD)
  - Grouping Algorithm for Cancer Data (GACD)

The GACD workflow (figure)
- Labels shown in the diagram, which is divided into Step 1 to Step 4:
  - Cancer patient dataset (200,000 patients)
  - Initialize groups of patients (combinations) with cutoff
  - Log-rank test, dissimilarity matrix
  - Partitioning clustering + statistical calculations, learnt dissimilarity matrix
  - Hierarchical clustering with dendrogram, new groups of patients
  - Kaplan-Meier estimator, survival curves
- Annotations:
  - MCMC: jump over local minima
  - Weight: increase efficiency
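
The workflow names a log-rank test feeding a dissimilarity matrix. The sketch below shows one way such a matrix could be built in plain Python. It assumes the two-sample log-rank chi-square statistic itself is used as the pairwise dissimilarity, which the slides do not confirm, and the function names and toy groups are hypothetical.

```python
def logrank_chi2(times_a, events_a, times_b, events_b):
    """Two-sample log-rank chi-square statistic (1 degree of freedom).

    times_*  : follow-up times
    events_* : 1 if the event (death) was observed, 0 if censored
    """
    event_times = sorted(
        {t for t, e in zip(times_a, events_a) if e}
        | {t for t, e in zip(times_b, events_b) if e}
    )
    obs_minus_exp = 0.0
    variance = 0.0
    for t in event_times:
        n_a = sum(1 for x in times_a if x >= t)   # at risk in group A
        n_b = sum(1 for x in times_b if x >= t)   # at risk in group B
        d_a = sum(1 for x, e in zip(times_a, events_a) if x == t and e)
        d_b = sum(1 for x, e in zip(times_b, events_b) if x == t and e)
        n, d = n_a + n_b, d_a + d_b
        obs_minus_exp += d_a - d * n_a / n        # observed minus expected in A
        if n > 1:
            variance += d * (n_a / n) * (n_b / n) * (n - d) / (n - 1)
    return obs_minus_exp ** 2 / variance if variance > 0 else 0.0


def dissimilarity_matrix(groups):
    """Symmetric matrix of pairwise log-rank statistics (assumed dissimilarity)."""
    k = len(groups)
    matrix = [[0.0] * k for _ in range(k)]
    for i in range(k):
        for j in range(i + 1, k):
            (ta, ea), (tb, eb) = groups[i], groups[j]
            matrix[i][j] = matrix[j][i] = logrank_chi2(ta, ea, tb, eb)
    return matrix


# Toy (times, events) data for three combinations; not taken from the study.
groups = [
    ([2, 4, 6, 8], [1, 1, 1, 1]),
    ([2, 4, 6, 8], [1, 1, 1, 1]),
    ([10, 12, 14, 16], [1, 1, 0, 1]),
]
matrix = dissimilarity_matrix(groups)
```

Groups with identical survival experience get a dissimilarity of zero, while groups whose curves separate get a large value, so the matrix can be passed to a clustering step.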

GACD
- Features
  - A deterministic grouping method
  - Uses weighted dissimilarity to improve grouping efficiency
  - Uses MCMC to avoid local minima
- Results
  - Grouping results are sensitive to the partitioning algorithm (e.g., PAM and Fuzzy).
  - Grouping results differ between local-minimum and global-minimum partitioning algorithms.
  - Weighted dissimilarity has been implemented.

- Prognostic factors: size, node, age, race
- Number of combinations: 59
- Reduce 59 curves to 3
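
Reducing many combination curves to a few groups amounts to cutting a dendrogram at a chosen number of clusters. The following is a minimal sketch of that idea, assuming average linkage (the slides do not state which linkage GACD uses) and a hypothetical 6x6 toy dissimilarity matrix rather than the 59 combinations of the study.

```python
def agglomerate(dissim, n_groups):
    """Average-linkage agglomerative clustering on a square dissimilarity matrix.

    Returns (clusters, merges): the final clusters as lists of item indices,
    and the merge history as (merged_members, merge_height) in merging order.
    """
    clusters = [[i] for i in range(len(dissim))]
    merges = []

    def avg_link(a, b):
        # Mean pairwise dissimilarity between two clusters.
        return sum(dissim[i][j] for i in a for j in b) / (len(a) * len(b))

    while len(clusters) > n_groups:
        best = None
        for x in range(len(clusters)):
            for y in range(x + 1, len(clusters)):
                height = avg_link(clusters[x], clusters[y])
                if best is None or height < best[0]:
                    best = (height, x, y)
        height, x, y = best
        merges.append((sorted(clusters[x] + clusters[y]), height))
        clusters[x] = clusters[x] + clusters[y]
        del clusters[y]
    return [sorted(c) for c in clusters], merges


# Toy dissimilarities: items (0,1), (2,3), (4,5) are close; everything else is far.
toy = [
    [0, 1, 10, 10, 10, 10],
    [1, 0, 10, 10, 10, 10],
    [10, 10, 0, 1.5, 10, 10],
    [10, 10, 1.5, 0, 10, 10],
    [10, 10, 10, 10, 0, 2],
    [10, 10, 10, 10, 2, 0],
]
final_groups, merge_history = agglomerate(toy, n_groups=3)
print(final_groups)  # [[0, 1], [2, 3], [4, 5]]
```

The merge history records the order and height of each merge, which is the information a dendrogram displays.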

Evaluation Metrics for Grouping Results
- The area enclosed by two Kaplan-Meier curves
- The linear correlation coefficient between the merging order of the dendrogram and the area between the Kaplan-Meier curves
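
Both metrics can be sketched in plain Python. The example assumes the enclosed area is the integral of the absolute difference between the two step functions up to a chosen time horizon, and that the linear correlation is the Pearson coefficient. Neither detail is spelled out on the slide, and the function names and toy data are hypothetical.

```python
from math import sqrt


def kaplan_meier(times, events):
    """Kaplan-Meier estimate as [(t, S(t))] at each distinct event time."""
    survival, steps = 1.0, []
    for t in sorted({t for t, e in zip(times, events) if e}):
        at_risk = sum(1 for x in times if x >= t)
        deaths = sum(1 for x, e in zip(times, events) if x == t and e)
        survival *= 1 - deaths / at_risk
        steps.append((t, survival))
    return steps


def step_value(steps, t):
    """Value of the right-continuous KM step function at time t."""
    value = 1.0
    for time, survival in steps:
        if time > t:
            break
        value = survival
    return value


def area_between(steps_a, steps_b, horizon):
    """Integral of |S_a(t) - S_b(t)| over [0, horizon]."""
    grid = sorted({0.0, horizon} | {t for t, _ in steps_a + steps_b if t < horizon})
    area = 0.0
    for left, right in zip(grid, grid[1:]):
        gap = abs(step_value(steps_a, left) - step_value(steps_b, left))
        area += gap * (right - left)
    return area


def pearson(xs, ys):
    """Pearson linear correlation coefficient."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    return cov / sqrt(sum((x - mx) ** 2 for x in xs) * sum((y - my) ** 2 for y in ys))


# Toy data (all events observed); not taken from the study.
km_a = kaplan_meier([1, 2, 3, 4], [1, 1, 1, 1])
km_b = kaplan_meier([2, 4, 6, 8], [1, 1, 1, 1])
area = area_between(km_a, km_b, horizon=8)
print(area)  # approximately 2.5
```

For the second metric, `pearson` would be applied to the sequence of merge positions in the dendrogram and the corresponding areas between the merged groups' curves. A strong positive correlation would suggest that curves merged early are also close in area.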

Conclusion
- The expanded TNM system (e.g., EACCD and GACD) can analyze cancer survival with more prognostic factors.
- GACD improves the efficiency of the grouping algorithm by using weights.
- The area enclosed by two Kaplan-Meier curves appears to be useful for evaluating grouping results.

Acknowledgement
This project is sponsored by NIST through NSF CHMPR. We would like to thank D. Chen, D. Henson, A. Schwartz, A. Dima, and M. Brady for the helpful discussions.