While gene expression data is widely available describing mRNA levels in different cancer cells lines, the molecular regulatory mechanisms responsible.

Slides:



Advertisements
Similar presentations
Statistical methods and tools for integrative analysis of perturbation signatures Mario Medvedovic Laboratory for Statistical Genomics and Systems Biology.
Advertisements

CSE Fall. Summary Goal: infer models of transcriptional regulation with annotated molecular interaction graphs The attributes in the model.
D ISCOVERING REGULATORY AND SIGNALLING CIRCUITS IN MOLECULAR INTERACTION NETWORK Ideker Bioinformatics 2002 Presented by: Omrit Zemach April Seminar.
Open Day 2006 From Expression, Through Annotation, to Function Ohad Manor & Tali Goren.
Work Process Using Enrich Load biological data Check enrichment of crossed data sets Extract statistically significant results Multiple hypothesis correction.
Teresa Przytycka NIH / NLM / NCBI RECOMB 2010 Bridging the genotype and phenotype.
CISC667, F05, Lec26, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) Genetic networks and gene expression data.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Computational Molecular Biology (Spring’03) Chitta Baral Professor of Computer Science & Engg.
Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break 14:45 – 15:15Regulatory pathways lecture 15:15 – 15:45Exercise.
Microarrays and Cancer Segal et al. CS 466 Saurabh Sinha.
ONCOMINE: A Bioinformatics Infrastructure for Cancer Genomics
Pathways Analysis using Protein Expression Data Venkatesh Jitender Dr. Vanathi Gopalakrishnan Center for Biomedical Informatics, UPMC.
Predicting protein functions from redundancies in large-scale protein interaction networks Speaker: Chun-hui CAI
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Class Projects. Future Work and Possible Project Topic in Gene Regulatory network Learning from multiple data sources; Learning causality in Motifs; Learning.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
ICA-based Clustering of Genes from Microarray Expression Data Su-In Lee 1, Serafim Batzoglou 2 1 Department.
Biomedical research methods. What are biomedical research methods? An integrated approach using chemical, mathematical and computer simulations, in vitro.
DEMO CSE fall. What is GeneMANIA GeneMANIA finds other genes that are related to a set of input genes, using a very large set of functional.
B IOMEDICAL T EXT M INING AND ITS A PPLICATION IN C ANCER R ESEARCH Henry Ikediego
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Introduction The goal of translational bioinformatics is to enable the transformation of increasingly voluminous genomic and biological data into diagnostics.
Microarray Gene Expression Data Analysis A.Venkatesh CBBL Functional Genomics Chapter: 07.
EnrichNet: network-based gene set enrichment analysis Presenter: Lu Liu.
MATISSE - Modular Analysis for Topology of Interactions and Similarity SEts Igor Ulitsky and Ron Shamir Identification.
Gene Set Enrichment Analysis (GSEA)
From motif search to gene expression analysis
A systems biology approach to the identification and analysis of transcriptional regulatory networks in osteocytes Angela K. Dean, Stephen E. Harris, Jianhua.
Using Bayesian Networks to Analyze Expression Data N. Friedman, M. Linial, I. Nachman, D. Hebrew University.
Genetic Regulatory Network Inference Russell Schwartz Department of Biological Sciences Carnegie Mellon University.
Chapter 13. The Impact of Genomics on Antimicrobial Drug Discovery and Toxicology CBBL - Young-sik Sohn-
Conclusions As empirical data about sets of related entities accrues, there are more constraints on possible network realizations that can fit the data;
Detecting binding sites for transcription factors by correlating sequence data with expression. Erik Aurell Adam Ameur Jakub Orzechowski Westholm in collaboration.
Gene Regulatory Network Inference. Progress in Disease Treatment  Personalized medicine is becoming more prevalent for several kinds of cancer treatment.
Finish up array applications Move on to proteomics Protein microarrays.
Reconstruction of Transcriptional Regulatory Networks
Epigenetic Analysis BIOS Statistics for Systems Biology Spring 2008.
Unraveling condition specific gene transcriptional regulatory networks in Saccharomyces cerevisiae Speaker: Chunhui Cai.
Construction of cancer pathways for personalized medicine | Presented By Date Construction of cancer pathways for personalized medicine Predictive, Preventive.
Tutorial session 3 Network analysis Exploring PPI networks using Cytoscape EMBO Practical Course Session 8 Nadezhda Doncheva and Piet Molenaar.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
COMPUTATIONAL ANALYSIS OF MULTILEVEL OMICS DATA FOR THE ELUCIDATION OF MOLECULAR MECHANISMS OF CANCER Presented by Azeez Ayomide Fatai Supervisor: Junaid.
Bioinformatics MEDC601 Lecture by Brad Windle Ph# Office: Massey Cancer Center, Goodwin Labs Room 319 Web site for lecture:
W e present BiNoM [1,2], a Cytoscape plugin, developed to facilitate the manipulation of biological networks represented in standard systems biology formats.
EB3233 Bioinformatics Introduction to Bioinformatics.
Bioinformatics and Computational Biology
Introduction to biological molecular networks
Cluster validation Integration ICES Bioinformatics.
Getting the story – biological model based on microarray data Once the differentially expressed genes are identified (sometimes hundreds of them), we need.
GeWorkbench Overview Support Team Molecular Analysis Tools Knowledge Center Columbia University and The Broad Institute of MIT and Harvard.
By Jay Krishnan. Introduction Information gathered from Proteomic techniques + neuroscientific research = Information on protein composition and function.
GMT files and Network are Related Content of GMT files can be converted to networks based on co-occurrence.
A Yeast Synthetic Network for In Vivo Assessment of Reverse-Engineering and Modeling Approaches Cantone, I., Marucci, L., Iorio, F., Ricci, M., Belcastro,
Nonlinear differential equation model for quantification of transcriptional regulation applied to microarray data of Saccharomyces cerevisiae Vu, T. T.,
Regulatory Signatures Inferred From Gene Expression Data Jayanth (Jay) Krishnan SBCNY Fellow Mahopac High School Mount Sinai School of Medicine Bio-Engineering/Bioinformatics.
Network Motifs See some examples of motifs and their functionality Discuss a study that showed how a miRNA also can be integrated into motifs Today’s plan.
Different microarray applications Rita Holdhus Introduction to microarrays September 2010 microarray.no Aim of lecture: To get some basic knowledge about.
Simultaneous identification of causal genes and dys-regulated pathways in complex diseases Yoo-Ah Kim, Stefan Wuchty and Teresa M Przytycka Paper to be.
Integrating Genomic Datasets to Identify Host-Pathogen Interactions
1. SELECTION OF THE KEY GENE SET 2. BIOLOGICAL NETWORK SELECTION
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
CISC 841 Bioinformatics (Spring 2006) Inference of Biological Networks
1 Department of Engineering, 2 Department of Mathematics,
Loyola Marymount University
Volume 29, Issue 5, Pages (May 2016)
Loyola Marymount University
Loyola Marymount University
Loyola Marymount University
Presentation transcript:

While gene expression data is widely available describing mRNA levels in different cancer cells lines, the molecular regulatory mechanisms responsible for these changes are still poorly understood. Here we developed a rationale approach to infer regulatory mechanisms governing changes in gene expression by integrating datasets of protein/DNA interactions, protein- protein interactions and kinase-substrate interactions collected from prior biological knowledge. We first utilize data obtained from genome-wide ChIP-on-chip and ChIP-Seq experiments to connect mRNA expression levels of the NCI-60 cancer cell lines to the transcription factors most likely regulating them. These identified transcription factors are then “connected”, using known protein-protein interactions, to form cancer specific sub-networks. Within these sub-networks we assess the enrichment for protein kinase substrates to infer the protein kinases likely regulating these complexes. Finally, using quantitative comparison of the up and down regulated genes for each cancer cell line, and genes affected by FDA approved drugs applied to cancer cells, we predict the mechanisms of action of these drugs. Following this path, from changes in gene expression to transcription factors to protein kinases we can provide a more thorough understanding of the regulatory mechanisms behind the observed mRNA levels in the NCI-60 cancer cell lines and other cancer cells. This approach proposes mechanisms of action for drugs. Wet lab experimental validation of this approach is still necessary, it can be done using single drugs or combinations of them. ChEA Genes2Networks KEA This research was supported by NIH Grant No. 5P50GM Regulatory Signatures of Cancer Cell Lines Inferred from Expression Data Jayanth (Jay) Krishnan 1,2, Avi Ma’ayan 2 1 Mahopac High School, Mahopac, NY Mahopac High School, Mahopac, NY Systems Biology Center New York and Department of Pharmacology and Systems 2 2 Systems Biology Center New York and Department of Pharmacology and Systems Therapeutics, Mount Sinai School of Medicine, New York NY Regulatory Signatures of Cancer Cell Lines Inferred from Expression Data Jayanth (Jay) Krishnan 1,2, Avi Ma’ayan 2 1 Mahopac High School, Mahopac, NY Mahopac High School, Mahopac, NY Systems Biology Center New York and Department of Pharmacology and Systems 2 2 Systems Biology Center New York and Department of Pharmacology and Systems Therapeutics, Mount Sinai School of Medicine, New York NY Abstract Acknowledgements Introduction Workflow The NCI-60 database provides mRNA profiles from microarray experiments of 60 commonly studies cancer cell lines Although analyzing these mRNA values is a reliable method to measure the mRNA level of many genes within a cell, this method offers little clues about how cells are regulated While mRNA profiles indicates changes caused by cancer, understanding the underlying regulatory mechanisms disregulated in different cancers will bring us closer to therapeutics In this project we aim to identify the transcription factors, protein complexes and protein kinases responsible for the aberrant expression of genes in the various types of cancer cell lines Differentially expressed gene lists from the various NCI-60 Cancer Cell Lines are used as input. Over expressed and under expressed genes are identified for specific cancer cell lines The following algorithm was implemented: The NCI-60 database was parsed and 18,133 unique genes were identified The population mean for the expression of each of the genes across all the 60 cancer cell lines was calculated The sample mean and sigma for each (gene, cancer cell line) pair was calculated The two-sided T-test statistic was applied for each (gene, cancer cell line) pair. Whether the gene was over expressed or under expressed was calculated by checking whether the test statistic exceeded a critical T score or was a less than a critical T score determined based on a particular P value. A list of genes which are over/under expressed for multiple cancer cell lines was developed Analyzing the mRNA profile from the NCI-60 database Statistical Methods: ChEA, Genes2Networks, and KEA are all web-based tools developed at the Ma’ayan lab to allow users to predict which transcription factors, protein sub- networks, and protein kinases are most correlated with their inputted seed list By using the identified up and down regulated genes for each cancer cell line as an input for ChEA; the top ranked transcription factors (based on p-value from Fisher’s Exact Test) that most likely influence the input seed list are given as the output Future Research Future research involves further analyzing other cancer datasets Cluster analysis will be done to groups transcription factors or kinases that were identified Additionally, by combining such data with data collected for drug perturbation of these cells, we may be able to suggest which drugs can reverse the observed changes Genes2Networks The transcription factor output for each cancer cell line from ChEA is used as an input to Genes2Networks Genes2Networks connects lists of transcription factors with other protein intermediates from mammalian protein interactions databases KEA The unique protein sub-networks outputted by Genes2Networks can then be inputted into KEA which identifies protein kinases most likely regulating the proteins from the subnetwork using the Fisher’s Exact Test. At this stage top regulating transcription factors, protein sub-networks and kinases have been identified for each of the NCI-60 cancer cell lines An integrated matrix can now be created in order to holistically compare the data by displaying the top regulating elements and their putative effects on the different cell lines Example of Process Microarray Analyze mRNA profile from NCI -60 database by using statistical techniques to compute over/under expressed genes Identify protein sub-networks that “connect” the transcription factors through additional proteins Wet lab experimental validation Future Research Identify protein sub-networks that “connect” the transcription factors through additional proteins Wet lab experimental validation Top ranked protein kinases most likely regulating the protein sub-networks Top 222 over expressed genes for cancer cell line MDA_N (melanoma) With gene input, ChEA identified the top ranked transcription factors Genes2Networks output of protein sub- networks when top 10 transcription factors from ChEA were given as an input Top ranked kinase proteins identified from KEA