. Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman.

Slides:



Advertisements
Similar presentations
Weighing Evidence in the Absence of a Gold Standard Phil Long Genome Institute of Singapore (joint work with K.R.K. “Krish” Murthy, Vinsensius Vega, Nir.
Advertisements

Inferring Quantitative Models of Regulatory Networks From Expression Data Iftach Nachman Hebrew University Aviv Regev Harvard Nir Friedman Hebrew University.
Computational discovery of gene modules and regulatory networks Ziv Bar-Joseph et al (2003) Presented By: Dan Baluta.
Putting genetic interactions in context through a global modular decomposition Jamal.
CSE Fall. Summary Goal: infer models of transcriptional regulation with annotated molecular interaction graphs The attributes in the model.
D ISCOVERING REGULATORY AND SIGNALLING CIRCUITS IN MOLECULAR INTERACTION NETWORK Ideker Bioinformatics 2002 Presented by: Omrit Zemach April Seminar.
A Probabilistic Dynamical Model for Quantitative Inference of the Regulatory Mechanism of Transcription Guido Sanguinetti, Magnus Rattray and Neil D. Lawrence.
Rich Probabilistic Models for Gene Expression Eran Segal (Stanford) Ben Taskar (Stanford) Audrey Gasch (Berkeley) Nir Friedman (Hebrew University) Daphne.
Learning Module Networks Eran Segal Stanford University Joint work with: Dana Pe’er (Hebrew U.) Daphne Koller (Stanford) Aviv Regev (Harvard) Nir Friedman.
Genome-wide prediction and characterization of interactions between transcription factors in S. cerevisiae Speaker: Chunhui Cai.
Regulatory Network (Part II) 11/05/07. Methods Linear –PCA (Raychaudhuri et al. 2000) –NIR (Gardner et al. 2003) Nonlinear –Bayesian network (Friedman.
Regulatory networks 10/29/07. Definition of a module Module here has broader meanings than before. A functional module is a discrete entity whose function.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Cs726 Modeling regulatory networks in cells using Bayesian networks Golan Yona Department of Computer Science Cornell University.
Functional genomics and inferring regulatory pathways with gene expression data.
Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break 14:45 – 15:15Regulatory pathways lecture 15:15 – 15:45Exercise.
Functional annotation and network reconstruction through cross-platform integration of microarray data X. J. Zhou et al
Goal: Reconstruct Cellular Networks Biocarta. Conditions Genes.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Microarray analysis 2 Golan Yona. 2) Analysis of co-expression Search for similarly expressed genes experiment1 experiment2 experiment3 ……….. Gene i:
6. Gene Regulatory Networks
Artificial Intelligence Term Project #3 Kyu-Baek Hwang Biointelligence Lab School of Computer Science and Engineering Seoul National University
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Epistasis Analysis Using Microarrays Chris Workman.
Cristina Manfredotti D.I.S.Co. Università di Milano - Bicocca An Introduction to the Use of Bayesian Network to Analyze Gene Expression Data Cristina Manfredotti.
Comparative Expression Moran Yassour +=. Goal Build a multi-species gene-coexpression network Find functions of unknown genes Discover how the genes.
Inferring subnetworks from perturbed expression profiles Dana Pe’er, Aviv Regev, Gal Elidan and Nir Friedman Bioinformatics, Vol.17 Suppl
Efficient Algorithms for Detecting Signaling Pathways in Protein Interaction Networks Jacob Scott, Trey Ideker, Richard M. Karp, Roded Sharan RECOMB 2005.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Inferring Cellular Networks Using Probabilistic Graphical Models Jianlin Cheng, PhD University of Missouri 2009.
Bayes Net Perspectives on Causation and Causal Inference
Bayesian integration of biological prior knowledge into the reconstruction of gene regulatory networks Dirk Husmeier Adriano V. Werhli.
Whole Genome Expression Analysis
Cis-regulation Trans-regulation 5 Objective: pathway reconstruction.
Genetic network inference: from co-expression clustering to reverse engineering Patrik D’haeseleer,Shoudan Liang and Roland Somogyi.
What Is a Gene Network?. Gene Regulatory Systems “Programs built into the DNA of every animal.” Eric H. Davidson.
Using Bayesian Networks to Analyze Expression Data N. Friedman, M. Linial, I. Nachman, D. Hebrew University.
Learning Structure in Bayes Nets (Typically also learn CPTs here) Given the set of random variables (features), the space of all possible networks.
Using Bayesian Networks to Analyze Expression Data By Friedman Nir, Linial Michal, Nachman Iftach, Pe'er Dana (2000) Presented by Nikolaos Aravanis Lysimachos.
Networks and Interactions Boo Virk v1.0.
Microarrays to Functional Genomics: Generation of Transcriptional Networks from Microarray experiments Joshua Stender December 3, 2002 Department of Biochemistry.
Data Analysis with Bayesian Networks: A Bootstrap Approach Nir Friedman, Moises Goldszmidt, and Abraham Wyner, UAI99.
Reverse engineering gene regulatory networks Dirk Husmeier Adriano Werhli Marco Grzegorczyk.
Reconstructing gene networks Analysing the properties of gene networks Gene Networks Using gene expression data to reconstruct gene networks.
Using Bayesian Networks to Analyze Whole-Genome Expression Data Nir Friedman Iftach Nachman Dana Pe’er Institute of Computer Science, The Hebrew University.
Identification of cell cycle-related regulatory motifs using a kernel canonical correlation analysis Presented by Rhee, Je-Keun Graduate Program in Bioinformatics.
Learning Linear Causal Models Oksana Kohutyuk ComS 673 Spring 2005 Department of Computer Science Iowa State University.
Module networks Sushmita Roy BMI/CS 576 Nov 18 th & 20th, 2014.
Hierarchical Bayesian Model Specification Model is specified by the Directed Acyclic Network (DAG) and the conditional probability distributions of all.
Problem Limited number of experimental replications. Postgenomic data intrinsically noisy. Poor network reconstruction.
Metabolic Network Inference from Multiple Types of Genomic Data Yoshihiro Yamanishi Centre de Bio-informatique, Ecole des Mines de Paris.
IMPROVED RECONSTRUCTION OF IN SILICO GENE REGULATORY NETWORKS BY INTEGRATING KNOCKOUT AND PERTURBATION DATA Yip, K. Y., Alexander, R. P., Yan, K. K., &
Introduction to biological molecular networks
Reverse engineering of regulatory networks Dirk Husmeier & Adriano Werhli.
Shortest Path Analysis and 2nd-Order Analysis Ming-Chih Kao U of M Medical School
Dynamic Networks: How Networks Change with Time? Vahid Mirjalili CSE 891.
04/21/2005 CS673 1 Being Bayesian About Network Structure A Bayesian Approach to Structure Discovery in Bayesian Networks Nir Friedman and Daphne Koller.
Computational methods for inferring cellular networks II Stat 877 Apr 17 th, 2014 Sushmita Roy.
Network applications Sushmita Roy BMI/CS 576 Dec 9 th, 2014.
Inferring Regulatory Networks from Gene Expression Data BMI/CS 776 Mark Craven April 2002.
Learning gene regulatory networks in Arabidopsis thaliana
Dynamics and context-specificity in biological networks
Building and Analyzing Genome-Wide Gene Disruption Networks
CSCI2950-C Lecture 13 Network Motifs; Network Integration
Schedule for the Afternoon
Network Inference Chris Holmes Oxford Centre for Gene Function, &,
SEG5010 Presentation Zhou Lanjun.
Principle of Epistasis Analysis
Predicting Gene Expression from Sequence
Presentation transcript:

. Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman

Expression Profiling u An Expression Profile is a simultaneous measurement of the level of all mRNAs in a cell population u Experimental design: Measure profiles of mutated or treated cultures u Goal: infer regulatory and molecular interactions Wild-TypeMutant Profile Compare

Common Approaches u Comparative Analysis (Holstage et al. 1998) u Clustering (Hughes et al. 2000) u Limitations: l Cannot distinguish between direct and indirect interactions l Limited to pair-wise relations l Can not infer a finer context

Bayesian Network Framework Friedman, Linial, Nachman,Pe’er (JCB 2000) u Probabilistic: Characterize statistical relationships between expression patterns of different genes u Multi-variable interactions (beyond pair-wise): l Identify intermediate interactions l Handle combinatorial regulation by several gene-products u Statistical confidence: Asses the statistical significance of interactions found

Our Contributions u Modeling of mutations and treatments into the Bayesian network framework u Novel data discretization based on guided k- means clustering u New features: Mediator and Regulator u Automatic reconstruction of statistically significant sub-networks

Modeling Gene Expression Gene 1 Expression level of each gene = Random variable Gene 3 Gene 4 Gene 5 Gene 2 Gene interaction = Probabilistic dependency Directed Acyclic graph Models dependency structure of distribution P(3 | 1,2) Each node has a probabilistic function Conditioned on its parents in the graph Activator Inhibitor Graph structure + local probability Define a unique multivariate distribution

Mutational Assay Wild-Type Measurements pgk pgk1 P(rap1|pgk1) Equivalence: Two models explain correlation between RAP1 & PGK1 RAP1PGK1 RAP1PGK1 Mutated pgk1 Measurements 0.5 pgk1 0.5 pgk1 P(rap1|pgk1) Note causality into mutated variable

Compendium Dataset ( Hughes et al., 2000) u 300 samples of yeast deletion mutants and other treatments u Deleted genes are from various functional families u A rich variety of profiles, but… u There is only one sample from each mutation

Guided K-means Discretization Guided K-means Discretization Expression data Markov Separator Edge Regulator Bayesian Network Learning Algorithm + Bootstrap Reconstruct SubNetworks Visualize Using Pathway Explorer Visualize Using Pathway Explorer Preprocess Learn model Feature extraction Feature assembly Visualization E R B A C S

Resulting PDAG

Confidence Estimates: Bootstrap D resample D1D1 D2D2 DmDm... Learn E R B A C E R B A C E R B A C Estimate: Bootstrap approach [FGW, UAI99]

Estimating Confidence Common Practice: Pick a single top scoring model Problem: Insufficient information!! In gene expression data: only few hundred experiments => many high scoring models Answer based on one model useless Solution: Search for features common to many likely models! Sample models from posterior distribution P(Model|Data) Confidence of feature : Feature of G, e.g., X  Y

Guided K-means Discretization Guided K-means Discretization Expression data Markov Separator Edge Regulator Bayesian Network Learning Algorithm + Bootstrap Reconstruct SubNetworks Visualize Using Pathway Explorer Visualize Using Pathway Explorer Preprocess Learn model Feature extraction Feature assembly Visualization

Markov Relations Question: Do X and Y directly interact? l Parent-child (one gene regulating the other) u Hidden Parent (two genes co-regulated by a hidden factor) (0.91,0.67) SST2STE6 SST2STE6 Mating pathway regulator Exporter of mating factor ARG5 ARG3 (0.84,0.79) ARG3 ARG5 GCN4 Arginine Biosynthesis Transcription factor

Low Correlation Relations u Previously unknown link strongly supported by evidence in the literature u High confidence, Low correlation l Processes occur under specific conditions l Captured by our context specific score ESC4KU70 (0.91, 0.16) DNA ds break repair Chromatin silencing

Separators  Question: Given that X and Y are indirectly dependant, who mediates this dependence? u Separator relation: l X affects Z who in turn affects Z l Z regulates both X and Y AGA1FUS1 KAR4 Mating transcriptional regulator of nuclear fusion Cell fusion

Separators: Intra-cluster Context CRH1YPS3 SLT2 Cell wall protein MAPK of cell wall integrity pathway Cell wall protein YPS1 Cell wall protein SLR3 Protein of unknown function + + u All gene pairs have high correlation, l clustering groups them together u assigned putative function to SLR3 - cell wall protein u We can assign regulatory role to SLT2 u Many other signaling and regulatory proteins were identified as direct and indirect separators

Guided K-means Discretization Guided K-means Discretization Expression data Markov Separator Edge Regulator Bayesian Network Learning Algorithm + Bootstrap Reconstruct SubNetworks Visualize Using Pathway Explorer Visualize Using Pathway Explorer Preprocess Learn model Feature extraction Feature assembly Visualization

Sub-Networks u Reconstruct a Conserved sub-network l Provides a more global picture l Allows to include features with lower-confidence l Preserved in most networks with high posterior l Probably reflects a real biological process u Automatic algorithm l Score: high concentration of pairwise features l Greedy search for high scoring subgraphs

Increased Confidence (simulated data) Percent of False positives Confidence Entire network Subnetwork

Guided K-means Discretization Guided K-means Discretization Expression data Markov Separator Edge Regulator Bayesian Network Learning Algorithm + Bootstrap Reconstruct SubNetworks Visualize Using Pathway Explorer Visualize Using Pathway Explorer Preprocess Learn model Feature extraction Feature assembly Visualization

Rosetta networks in Pathway Explorer u

Summary u Primary contribution: automated methodology for finding patterns of interactions among genes l Clear semantics l Principled handing of mutations and interventions u Built in handling of statistical significance l Feature confidence l Extracts significant sub-networks u Differs from clustering l Inter-cluster relations l Finer intra-cluster structure u Provides biologist with promising hypothesis