MATISSE - Modular Analysis for Topology of Interactions and Similarity SEts Igor Ulitsky and Ron Shamir Identification.

Slides:



Advertisements
Similar presentations
Basic Gene Expression Data Analysis--Clustering
Advertisements

The multi-layered organization of information in living systems
CSE Fall. Summary Goal: infer models of transcriptional regulation with annotated molecular interaction graphs The attributes in the model.
D ISCOVERING REGULATORY AND SIGNALLING CIRCUITS IN MOLECULAR INTERACTION NETWORK Ideker Bioinformatics 2002 Presented by: Omrit Zemach April Seminar.
Effective Heuristics for NP-Hard Problems Arising in Molecular Biology Richard M. Karp Bangalore, January 5, 2011.
Pathways & Networks analysis COST Functional Modeling Workshop April, Helsinki.
EXPression ANalyzer and DisplayER Adi Maron-Katz Igor Ulitsky Chaim Linhart Amos Tanay Rani Elkon Seagull Shavit Dorit Sagir Eyal David Roded Sharan Israel.
A hub-attachment based method to detect functional modules from confidence-scored protein interactions and expression profiles Authors: Chia-Hao Chin 1,4,
Automatic Identification of ROIs (Regions of interest) in fMRI data.
APO-SYS workshop on data analysis and pathway charting Igor Ulitsky Ron Shamir ’ s Computational Genomics Group.
Using Contiguous Bi-Clustering for data driven temporal analysis of fMRI based functional connectivity.
Threshold selection in gene co- expression networks using spectral graph theory techniques Andy D Perkins*,Michael A Langston BMC Bioinformatics 1.
Clustering short time series gene expression data Jason Ernst, Gerard J. Nau and Ziv Bar-Joseph BIOINFORMATICS, vol
Comparison of Networks Across Species CS374 Presentation October 26, 2006 Chuan Sheng Foo.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Gene Co-expression Network Analysis BMI 730 Kun Huang Department of Biomedical Informatics Ohio State University.
Cs726 Modeling regulatory networks in cells using Bayesian networks Golan Yona Department of Computer Science Cornell University.
Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break 14:45 – 15:15Regulatory pathways lecture 15:15 – 15:45Exercise.
衛資所 生物資訊組 陳俊宇 April 07, 03. graph nodeedge Chromosomegenepositional correlations Pathwayenzymefunctional correlations Gene expression genecoexpressed.
Modularity in Biological networks.  Hypothesis: Biological function are carried by discrete functional modules.  Hartwell, L.-H., Hopfield, J. J., Leibler,
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Data Mining Presentation Learning Patterns in the Dynamics of Biological Networks Chang hun You, Lawrence B. Holder, Diane J. Cook.
Fuzzy K means.
Regulatory element detection using correlation with expression (REDUCE) Literature search WANG Chao Sept 14, 2004.
Joint analysis of genetic and physical interactions in S. cerevisiae Igor Ulitsky Ron Shamir lab School of Computer Science Tel Aviv University.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Comparative Expression Moran Yassour +=. Goal Build a multi-species gene-coexpression network Find functions of unknown genes Discover how the genes.
Protein Interactions and Disease Audry Kang 7/15/2013.
DEMO CSE fall. What is GeneMANIA GeneMANIA finds other genes that are related to a set of input genes, using a very large set of functional.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
341: Introduction to Bioinformatics Dr. Natasa Przulj Deaprtment of Computing Imperial College London
Inferring Cellular Networks Using Probabilistic Graphical Models Jianlin Cheng, PhD University of Missouri 2009.
The Relative Vertex-to-Vertex Clustering Value 1 A New Criterion for the Fast Detection of Functional Modules in Protein Interaction Networks Zina Mohamed.
Cytoscape A powerful bioinformatic tool Mathieu Michaud
Network Analysis and Application Yao Fu
1 Joint analysis of regulatory networks and expression profiles Ron Shamir School of Computer Science Tel Aviv University April Sources: Igor Ulitsky.
Genetic network inference: from co-expression clustering to reverse engineering Patrik D’haeseleer,Shoudan Liang and Roland Somogyi.
A systems biology approach to the identification and analysis of transcriptional regulatory networks in osteocytes Angela K. Dean, Stephen E. Harris, Jianhua.
Beyond the Human Genome Project Future goals and projects based on findings from the HGP.
Gene Regulatory Network Inference. Progress in Disease Treatment  Personalized medicine is becoming more prevalent for several kinds of cancer treatment.
Networks and Interactions Boo Virk v1.0.
Kristen Horstmann, Tessa Morris, and Lucia Ramirez Loyola Marymount University March 24, 2015 BIOL398-04: Biomathematical Modeling Lee, T. I., Rinaldi,
EXPression ANalyzer and DisplayER Adi Maron-Katz Igor Ulitsky Chaim Linhart Amos Tanay Rani Elkon Seagull Shavit Dorit Sagir Eyal David Roded Sharan Israel.
EXPression ANalyzer and DisplayER Adi Maron-Katz Igor Ulitsky Chaim Linhart Amos Tanay Rani Elkon Seagull Shavit Dorit Sagir Eyal David Roded Sharan Israel.
HUMAN-MOUSE CONSERVED COEXPRESSION NETWORKS PREDICT CANDIDATE DISEASE GENES Ala U., Piro R., Grassi E., Damasco C., Silengo L., Brunner H., Provero P.
Using Bayesian Networks to Analyze Whole-Genome Expression Data Nir Friedman Iftach Nachman Dana Pe’er Institute of Computer Science, The Hebrew University.
Supplementary Figure S1 eQTL prior model modified from previous approaches to Bayesian gene regulatory network modeling. Detailed description is provided.
GRNmap and GRNsight June 24, Systems Biology Workflow DNA microarray data: wet lab-generated or published Generate gene regulatory network Modeling.
Clustering What is clustering? Also called “unsupervised learning”Also called “unsupervised learning”
Tutorial session 3 Network analysis Exploring PPI networks using Cytoscape EMBO Practical Course Session 8 Nadezhda Doncheva and Piet Molenaar.
Anis Karimpour-Fard ‡, Ryan T. Gill †,
CSCE555 Bioinformatics Lecture 18 Network Biology: Comparison of Networks Across Species Meeting: MW 4:00PM-5:15PM SWGN2A21 Instructor: Dr. Jianjun Hu.
Understanding Network Concepts in Modules Dong J, Horvath S (2007) BMC Systems Biology 2007, 1:24.
While gene expression data is widely available describing mRNA levels in different cancer cells lines, the molecular regulatory mechanisms responsible.
Introduction to biological molecular networks
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Bioinformatics support at School of Biological Sciences
Shortest Path Analysis and 2nd-Order Analysis Ming-Chih Kao U of M Medical School
Discovering functional interaction patterns in Protein-Protein Interactions Networks   Authors: Mehmet E Turnalp Tolga Can Presented By: Sandeep Kumar.
Elucidating regulatory mechanisms downstream of a signaling pathway using informative experiments Discussion leader: Navneet Scribe: James Computational.
Computer Science and Engineering PhD in Computer Science Monday, November 07, :00 a.m. – 11:00 a.m. Swearingen Conference Room 3A75 Network Based.
Computational methods for inferring cellular networks II Stat 877 Apr 17 th, 2014 Sushmita Roy.
Network applications Sushmita Roy BMI/CS 576 Dec 9 th, 2014.
1 Survey of Biodata Analysis from a Data Mining Perspective Peter Bajcsy Jiawei Han Lei Liu Jiong Yang.
Simultaneous identification of causal genes and dys-regulated pathways in complex diseases Yoo-Ah Kim, Stefan Wuchty and Teresa M Przytycka Paper to be.
Comparative Network Analysis BMI/CS 776 Spring 2013 Colin Dewey
Identifying submodules of cellular regulatory networks Guido Sanguinetti Joint work with N.D. Lawrence and M. Rattray.
Mining Coherent Dense Subgraphs across Multiple Biological Networks Vahid Mirjalili CSE 891.
Presentation transcript:

MATISSE - Modular Analysis for Topology of Interactions and Similarity SEts Igor Ulitsky and Ron Shamir Identification of Functional Modules using Network Topology and High-Throughput Data. BMC Systems Biology 1:8 (2007).

Microarray data analysis Input: expression levels of (all) genes in several conditions Analysis methods: Clustering (CLICK) Biclustering (SAMBA) Extraction of regulatory networks

Protein interaction network analysis Input: Network with nodes=proteins/genes edges=interactions Analysis methods: Global properties Motif content analysis Complex extraction Cross-species comparison

Integrated analysis Combined support for low quality data Joint visualization Statistics of known pathways Detection of “ hot spots ”

MATISSE Identify sets of genes (modules) that Have highly correlated expression patterns Induce connected subgraphs in the interaction network Interaction High Similarity

The Probabilistic Model Formulates module finding as a hypothesis testing problem The likelihood ratio is decomposed into pairwise weights Allows incorporation of gene-specific priors Parameters learned by an EM algorithm

Computational aspects Finding a single module without connectivity constraints Reduces to finding the heaviest subgraph (+/- edge weights) An NP-Hard problem Heuristics inspired by maximum density approximation algorithms

MATISSE workflow Seed generation Greedy optimization Significance filtering

Finding seeds Three seeding alternatives Building small seeds around single nodes: Best neighbors All neighbors Approximating the heaviest subgraph

Greedy optimization Simultaneous optimization of all the seeds The following steps are considered: Node addition Node removal Assignment change Module merge

Advantages of MATISSE No need for confidence estimation on individual measurements Works even when only a fraction of the genes have expression patterns Can handle any similarity data, not only expression Produces connected modules No need to specify the number of modules

Osmotic shock response of S. cerevisiae Network of 6,246 genes and 65,990 protein-protein and protein-DNA interactions 133 experimental conditions – response of perturbed strains to osmotic shock (O’Rourke and Herskowitz, 2004) 2,000 genes filtered based on variation criterion

GO and promoter analysis

Pheromone response subnetwork Back Front

Proteolysis subnetwork Back Front

Performance comparison % of modules % of modules with category enrichment at p< 10 -3

Performance comparison (2) % of annotations % annotations w enrichment at p<10 -3 in modules

Human cell cycle Constructed a network with 6,000 nodes, 25,000 edges HPRD BIND Y2H studies SPIKE HeLa cell cycle time series (Whitfield ’02) Produced subnetworks enriched with all the phases of the cell cycle

M phase subnetwork

Extensions of MATISSE CEZANNE Utilizes confidence-based networks Extracts subnetworks that are connected with high confidence and co-expressed Applied to 11 studies of gene expression in the blood Not yet implemented in the MATISSE application

Extensions of MATISSE DEGAS Utilizes case-control expression data Identifies disregulated pathways – areas in the network in which many genes are dysregulated in most of the cases Beta version implemented in the MATISSE software Ulitsky, Karp and Shamir RECOMB 2008

Difficulties with prior approaches In case-control data, gene pattern correlation can be due to diverse non-disease related factors Patients are different Genetic background Other diseases/confounding factors Disease grade Current methods assume that the same genes are dysregulated in all the patients A weaker assumption – a lot of dysregulated genes appear in the same dysregulated pathway

HD down-regulated The pathway down-regulated in Huntington’s disease (HD) Enriched with: HD modifiers HD relevant genes Calcium signalling Huntingtin Clear outlier

Extensions of MATISSE Identification of modules correlated with external parameters Numerical parameters: Age, tumor grade etc. Logical parameters: Gender, tumor type Identifies subnetworks with genes that are both Correlated with the clinical parameter Correlated with one another

MATISSE tool capabilities MATISSE algorithm execution Dynamic subnetwork layout Customized node/edge highlighting Dynamic expression matrix viewer Module annotation TANGO – Gene Ontology Annotations with custom datasets Calculation of different coefficients based on network/expression