Eigengenes as biological signatures Dr. Habil Zare, PhD PI of Oncinfo Lab Assistant Professor, Department of Computer Science Texas State University 5.

Slides:



Advertisements
Similar presentations
Original Figures for "Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring"
Advertisements

Functional Organization of the Transcriptome in Human Brain Michael C. Oldham Laboratory of Daniel H. Geschwind, UCLA BIOCOMP ‘08, Las Vegas, NV July 15,
By Russell Armstrong Supervisor Mrs Wei Ji Diagnosis Analysis of Lung Cancer by Genome Expression Profiles.
Gene Co-expression Network Analysis BMI 730 Kun Huang Department of Biomedical Informatics Ohio State University.
Predictive Automatic Relevance Determination by Expectation Propagation Yuan (Alan) Qi Thomas P. Minka Rosalind W. Picard Zoubin Ghahramani.
Microarrays and Cancer Segal et al. CS 466 Saurabh Sinha.
ONCOMINE: A Bioinformatics Infrastructure for Cancer Genomics
Multidimensional Analysis If you are comparing more than two conditions (for example 10 types of cancer) or if you are looking at a time series (cell cycle.
---- Mark Borodovsky a short intro Position open: Scientist - Pathway Informatics (June 2009) THE POSITION The successful candidate will join the Computational.
Comprehensive Gene Expression Analysis of Prostate Cancer Reveals Distinct Transcriptional Programs Associated With Metastatic Disease Kevin Paiz-Ramirez.
Introduction The goal of translational bioinformatics is to enable the transformation of increasingly voluminous genomic and biological data into diagnostics.
1 Harvard Medical School Transcriptional Diagnosis by Bayesian Network Hsun-Hsien Chang and Marco F. Ramoni Children’s Hospital Informatics Program Harvard-MIT.
“An Extension of Weighted Gene Co-Expression Network Analysis to Include Signed Interactions” Michael Mason Department of Statistics, UCLA.
Evaluation of Supervised Learning Algorithms on Gene Expression Data CSCI 6505 – Machine Learning Adan Cosgaya Winter 2006 Dalhousie University.
University of Washington Institute of Technology Tacoma, WA, USA Ecole des Hautes Etudes en Santé Publique Département Infobiostat Rennes, France Isabelle.
Verna Vu & Timothy Abreo
Huidi Liu, M.D. & Ph.D Genomics Research Centre Harbin Medical University, China Reduced expression of SOX7 in ovarian cancer: a novel.
Transcriptomic Analysis of Peripheral Blood Mononuclear Cells in Rapid Progressors in Early HIV Infection Identifies a Signature Closely Correlated with.
Computational biology of cancer cell pathways Modelling of cancer cell function and response to therapy.
Construction of cancer pathways for personalized medicine | Presented By Date Construction of cancer pathways for personalized medicine Predictive, Preventive.
Apostolos Zaravinos and Constantinos C Deltas Molecular Medicine Research Center and Laboratory of Molecular and Medical Genetics, Department of Biological.
Class Prediction and Discovery Using Gene Expression Data Donna K. Slonim, Pablo Tamayo, Jill P. Mesirov, Todd R. Golub, Eric S. Lander 발표자 : 이인희.
A quick introduction to Oncinfo Lab Dr. Habil Zare, PhD PI of Oncinfo Lab Department of Computer Science Texas State University 18 September 2015.
Differential analysis of Eigengene Networks: Finding And Analyzing Shared Modules Across Multiple Microarray Datasets Peter Langfelder and Steve Horvath.
Application of Class Discovery and Class Prediction Methods to Microarray Data Kellie J. Archer, Ph.D. Assistant Professor Department of Biostatistics.
By: Amira Djebbari and John Quackenbush BMC Systems Biology 2008, 2: 57 Presented by: Garron Wright April 20, 2009 CSCE 582.
Gene set analyses of genomic datasets Andreas Schlicker Jelle ten Hoeve Lodewyk Wessels.
Guest lecture: Feature Selection Alan Qi Dec 2, 2004.
A comparative study of survival models for breast cancer prognostication based on microarray data: a single gene beat them all? B. Haibe-Kains, C. Desmedt,
Blood and Tissue Based Molecular Signatures in Predicting Prostate Cancer Progression Tarek A. Bismar, MD Professor, University of Calgary Departments.
Bioinformatics for biologists Dr. Habil Zare, PhD PI of Oncinfo Lab Assistant Professor, Department of Computer Science Texas State University Presented.
Prof. Yechiam Yemini (YY) Computer Science Department Columbia University (c)Copyrights; Yechiam Yemini; Lecture 2: Introduction to Paradigms 2.3.
Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring T.R. Golub et al., Science 286, 531 (1999)
Bioinformatics for biologists
Pan-cancer analysis of prognostic genes Jordan Anaya Omnes Res, In this study I have used publicly available clinical and.
Learning disjunctions in Geronimo’s regression trees Felix Sanchez Garcia supervised by Prof. Dana Pe’er.
CBioPortal Web resource for exploring, visualizing, and analyzing multidimentional cancer genomics data.
Case Study: Characterizing Diseased States from Expression/Regulation Data Tuck et al., BMC Bioinformatics, 2006.
Consensus modules: modules present across multiple data sets Peter Langfelder and Steve Horvath Eigengene networks for studying the relationships between.
(1) Genotype-Tissue Expression (GTEx) Largest systematic study of genetic regulation in multiple tissues to date 53 tissues, 500+ donors, 9K samples, 180M.
Eigengenes as biological signatures Dr. Habil Zare, PhD PI of Oncinfo Lab Assistant Professor, Department of Computer Science Texas State University 3.
Bioinformatics for biologists (2) Dr. Habil Zare, PhD PI of Oncinfo Lab Assistant Professor, Department of Computer Science Texas State University Presented.
Raphael Sandaltzopoulos, PhD, MBA Professor at MBG (Molecular Biology) Lab. of Gene Expression, Molecular Diagnosis and Modern Therapeutics,
Classifiers!!! BCH364C/391L Systems Biology / Bioinformatics – Spring 2015 Edward Marcotte, Univ of Texas at Austin.
Show & Tell Limsoon Wong Kent Ridge Digital Labs Singapore Role of Bioinformatics in the Genomic Era.
Evolution-informed Modeling discover biomarkers for precision oncology Li Liu, M.D. August 22, 2016.
Graph clustering to detect network modules
David Amar, Tom Hait, and Ron Shamir
Areas of Research Xia Jiang Associate Professor of
An Artificial Intelligence Approach to Precision Oncology
Network analysis for AML data
Statistical Applications in Biology and Genetics
Classifiers!!! BCH339N Systems Biology / Bioinformatics – Spring 2016
Gene expression.
Bioinformatics for biologists
Eigengenes as biological signatures
Hallett, et al., - Supplementary Figure 1
Gene Expression Classification
Molecular Classification of Cancer
Dept of Biomedical Informatics University of Pittsburgh
Areas of Research Xia Jiang Assistant Professor
Loyola Marymount University
Grövdal M et al. Blood 2008;112:Abstract 223.
Somi Jacob and Christian Bach
Single Sample Expression-Anchored Mechanisms Predict Survival in Head and Neck Cancer Yang et al Presented by Yves A. Lussier MD PhD The University.
Loyola Marymount University
Loyola Marymount University
Loyola Marymount University
Loyola Marymount University
Claudio Lottaz and Rainer Spang
Presentation transcript:

Eigengenes as biological signatures Dr. Habil Zare, PhD PI of Oncinfo Lab Assistant Professor, Department of Computer Science Texas State University 5 February 2016

Outline -Large-scale gene network analysis reveals the role of extracellular matrix pathway and homeobox genes in acute myeloid leukemia. -Similar approaches are useful in identifying low-risk breast cancer cases.

Eigengenes as biological signature, Dr. Habil Zare, Oncinfo Lab Texas State University, 5 Feb AML Diagram by A. Rad Acute myeloid leukemia (AML) is an aggressive type of blood cancer, which can cause death within months after diagnosis. Cancer here

Eigengenes as biological signature, Dr. Habil Zare, Oncinfo Lab Texas State University, 5 Feb MDS Diagram by Cazzola Myelodysplastic syndromes (MDS) is less aggressive than AML but it can transform to AML with a risk probability of 30%.

Eigengenes as biological signature, Dr. Habil Zare, Oncinfo Lab Texas State University, 5 Feb Hypothesis Network analysis can reveal the biological differences between AML and MDS.

Eigengenes as biological signature, Dr. Habil Zare, Oncinfo Lab Texas State University, 5 Feb Overview of the methodology

Eigengenes as biological signature, Dr. Habil Zare, Oncinfo Lab Texas State University, 5 Feb Expression data

Eigengenes as biological signature, Dr. Habil Zare, Oncinfo Lab Texas State University, 5 Feb Expression data -Discovery dataset: Microarray gene expression data of 202 AML-NK and 164 MDS cases from MILE study. -Validation dataset: RNA-seq data of 52 AML-NK and 22 MDS cases from BCCA.

Eigengenes as biological signature, Dr. Habil Zare, Oncinfo Lab Texas State University, 5 Feb Identifying gene modules

Eigengenes as biological signature, Dr. Habil Zare, Oncinfo Lab Texas State University, 5 Feb Identifying gene modules - We analyzed 9,166 differentially expressed genes in AML vs. MDS. - We considered a module as a set of highly correlated genes in AML, and identified 33 such modules.

Eigengenes as biological signature, Dr. Habil Zare, Oncinfo Lab Texas State University, 5 Feb Computing eigengenes

Eigengenes as biological signature, Dr. Habil Zare, Oncinfo Lab Texas State University, 5 Feb Computing eigengenes -An eigengene summarizes a module. It is a weighted sum (linear combination) of expression of all genes in the corresponding module. -We applied PCA on each module separately to compute its corresponding eigengene.

Eigengenes as biological signature, Dr. Habil Zare, Oncinfo Lab Texas State University, 5 Feb Computing eigengenes Eigengenes are differentially expressed in AML compared to MDS.

Eigengenes as biological signature, Dr. Habil Zare, Oncinfo Lab Texas State University, 5 Feb The Bayesian network

Eigengenes as biological signature, Dr. Habil Zare, Oncinfo Lab Texas State University, 5 Feb Bayesian network The Bayesian network shows the probabilistic dependencies between the modules and the disease type.

Eigengenes as biological signature, Dr. Habil Zare, Oncinfo Lab Texas State University, 5 Feb The disease associated modules The children of the “Disease” node are enriched in genes associated with AML.

Eigengenes as biological signature, Dr. Habil Zare, Oncinfo Lab Texas State University, 5 Feb The decision tree

Eigengenes as biological signature, Dr. Habil Zare, Oncinfo Lab Texas State University, 5 Feb ECM and HOXA&B eigengenes were automatically selected from the set of children of the Disease node to build a predictive model. The decision tree Average expression of 113 genes Average expression of 42 genes

Eigengenes as biological signature, Dr. Habil Zare, Oncinfo Lab Texas State University, 5 Feb Validation in an independent dataset

Eigengenes as biological signature, Dr. Habil Zare, Oncinfo Lab Texas State University, 5 Feb We inferred the expression of eigengenes in 52 AML and 22 MDS cases from BCCA dataset. Qualitative validation MILE BCCA

Eigengenes as biological signature, Dr. Habil Zare, Oncinfo Lab Texas State University, 5 Feb Some of the eigengenes showed expression patterns similar to MILE dataset. Qualitative validation MILE BCCA

Eigengenes as biological signature, Dr. Habil Zare, Oncinfo Lab Texas State University, 5 Feb With the same thresholds, the tree classifies cases from both datasets. Quantitative validation

Eigengenes as biological signature, Dr. Habil Zare, Oncinfo Lab Texas State University, 5 Feb We trained our model on MILE microarray dataset, and validated its performance on BCCA RNA-seq dataset. Although the platforms differ, performances are comparable indicating the robustness of our approach. Quantitative validation

Eigengenes as biological signature, Dr. Habil Zare, Oncinfo Lab Texas State University, 5 Feb Validation using epigenetics Among all genes in ECM pathway, MMP9 has the highest weight in the eigengene.

Eigengenes as biological signature, Dr. Habil Zare, Oncinfo Lab Texas State University, 5 Feb Validation using epigenetics These 3 genes from matrix metalloproteinase (MMP) family are methylated in AML, which can explain their relatively lower expression.

Eigengenes as biological signature, Dr. Habil Zare, Oncinfo Lab Texas State University, 5 Feb Validation at the protein level Is the expression of MMP9 protein product different in AML compared to MDS? Work in progress…

Eigengenes as biological signature, Dr. Habil Zare, Oncinfo Lab Texas State University, 5 Feb Because an eigengene is based on the average expression of several genes, our approach is robust with respect to noise in expression profiles. Robustness to noise

Eigengenes as biological signature, Dr. Habil Zare, Oncinfo Lab Texas State University, 5 Feb Even when 30% entries of the expression profile are replaced with noise, the accuracy drops only by 2%. Robustness to noise {

Eigengenes as biological signature, Dr. Habil Zare, Oncinfo Lab Texas State University, 5 Feb Two modules were automatically selected: -A cell cycle associated module with 319 genes. -A mysterious module with 26 genes, 24 in 9q34. Breast cancer risk factors METABRIC discovery dataset METABRIC validation dataset MILLER dataset

Eigengenes as biological signature, Dr. Habil Zare, Oncinfo Lab Texas State University, 5 Feb Using a similar approach, we could identify low-risk ER+ breast cancer cases with precision > 88% in 3 datasets. Breast cancer risk assessment

Dr. Habil Zare, PhD The PI Computational Biologist Dr. Amir Forpushani, PhD Postdoc, Computational Biologist Rupesh Agrihari Grad student, Computer Science Acknowledgments Oncinfo Lab Members 31 Dr. Aly Karsan, MD Hematopathologist Rod Docking Grad student, BCCA & UBC In collaboration with British Columbia Cancer Agency

Eigengenes as biological signature, Dr. Habil Zare, Oncinfo Lab Texas State University, 5 Feb We can apply a similar approach on fish RNA-seq data. 1.Identify gene modules using all available expression data including normal samples. 2.Compute the eigengenes for each module. 3.Investigate which eigengenes are associated with experiment conditions like dosage or wavelength. 4.Perform overrepresentation analysis on the corresponding modules to determine the most relevant biological processes. Future work Eigengene 5 Dosage

References: Cazzola, Mario. "IDH1 and IDH2 mutations in myeloid neoplasms–Novel paradigms and clinical implications." Haematologica (2010): Haferlach, Torsten, et al. "Clinical utility of microarray-based gene expression profiling in the diagnosis and subclassification of leukemia: report from the International Microarray Innovations in Leukemia Study Group." Journal of Clinical Oncology (2010): Langfelder, Peter, and Steve Horvath. "WGCNA: an R package for weighted correlation network analysis." BMC bioinformatics 9.1 (2008): 1. Curtis, Christina, et al. "The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups." Nature (2012): Miller, Lance D., et al. "An expression signature for p53 status in human breast cancer predicts mutation status, transcriptional effects, and patient survival." Proceedings of the National Academy of Sciences of the United States of America (2005): Instaling BioLinux using VM, Dr. Habil Zare 27 Oct