BAYESIAN INFERENCE OF SIGNALING NETWORK TOPOLOGY IN A CANCER CELL LINE Steven M. Hill, Yiling Lu, Jennifer Molina, Laura M. Heiser, Paul T. Spellman, Terence.

Slides:



Advertisements
Similar presentations
Pattern Recognition and Machine Learning
Advertisements

Yinyin Yuan and Chang-Tsun Li Computer Science Department
3.3 Hypothesis Testing in Multiple Linear Regression
Original Figures for "Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring"
1 Parametric Empirical Bayes Methods for Microarrays 3/7/2011 Copyright © 2011 Dan Nettleton.
1 Some Comments on Sebastiani et al Nature Genetics 37(4)2005.
Probabilistic modelling in computational biology Dirk Husmeier Biomathematics & Statistics Scotland.
Pattern Recognition and Machine Learning
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Fast Bayesian Matching Pursuit Presenter: Changchun Zhang ECE / CMR Tennessee Technological University November 12, 2010 Reading Group (Authors: Philip.
Dynamic Bayesian Networks (DBNs)
CSE Fall. Summary Goal: infer models of transcriptional regulation with annotated molecular interaction graphs The attributes in the model.
Principal Component Analysis (PCA) for Clustering Gene Expression Data K. Y. Yeung and W. L. Ruzzo.
Model Assessment, Selection and Averaging
Introduction of Probabilistic Reasoning and Bayesian Networks
EE462 MLCV Lecture Introduction of Graphical Models Markov Random Fields Segmentation Tae-Kyun Kim 1.
Breast cancer is a complex and heterogeneous disease Tumor samples Protein expression Clinical features Mutational status Adapted from TCGA, Nature 2012.
Regulatory Network (Part II) 11/05/07. Methods Linear –PCA (Raychaudhuri et al. 2000) –NIR (Gardner et al. 2003) Nonlinear –Bayesian network (Friedman.
Predictive Automatic Relevance Determination by Expectation Propagation Yuan (Alan) Qi Thomas P. Minka Rosalind W. Picard Zoubin Ghahramani.
6. Gene Regulatory Networks
Modeling Gene Interactions in Disease CS 686 Bioinformatics.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Cristina Manfredotti D.I.S.Co. Università di Milano - Bicocca An Introduction to the Use of Bayesian Network to Analyze Gene Expression Data Cristina Manfredotti.
Lorelei Howard and Nick Wright MfD 2008
Gaussian Processes for Transcription Factor Protein Inference Neil D. Lawrence, Guido Sanguinetti and Magnus Rattray.
Principal Component Analysis (PCA) for Clustering Gene Expression Data K. Y. Yeung and W. L. Ruzzo.
Bayesian integration of biological prior knowledge into the reconstruction of gene regulatory networks Dirk Husmeier Adriano V. Werhli.
The horseshoe estimator for sparse signals CARLOS M. CARVALHO NICHOLAS G. POLSON JAMES G. SCOTT Biometrika (2010) Presented by Eric Wang 10/14/2010.
1 A Presentation of ‘Bayesian Models for Gene Expression With DNA Microarray Data’ by Ibrahim, Chen, and Gray Presentation By Lara DePadilla.
Using Bayesian Networks to Analyze Expression Data N. Friedman, M. Linial, I. Nachman, D. Hebrew University.
Estimating parameters in a statistical model Likelihood and Maximum likelihood estimation Bayesian point estimates Maximum a posteriori point.
Reverse engineering gene regulatory networks Dirk Husmeier Adriano Werhli Marco Grzegorczyk.
Learning regulatory networks from postgenomic data and prior knowledge Dirk Husmeier 1) Biomathematics & Statistics Scotland 2) Centre for Systems Biology.
Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland.
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 3: LINEAR MODELS FOR REGRESSION.
CS 782 – Machine Learning Lecture 4 Linear Models for Classification  Probabilistic generative models  Probabilistic discriminative models.
Center for Radiative Shock Hydrodynamics Fall 2011 Review Assessment of predictive capability Derek Bingham 1.
Learning the Structure of Related Tasks Presented by Lihan He Machine Learning Reading Group Duke University 02/03/2006 A. Niculescu-Mizil, R. Caruana.
Problem Limited number of experimental replications. Postgenomic data intrinsically noisy. Poor network reconstruction.
Inferring gene regulatory networks with non-stationary dynamic Bayesian networks Dirk Husmeier Frank Dondelinger Sophie Lebre Biomathematics & Statistics.
INTRODUCTION TO Machine Learning 3rd Edition
CS Statistical Machine learning Lecture 24
Learning Bayesian networks from postgenomic data with an improved structure MCMC sampling scheme Dirk Husmeier Marco Grzegorczyk 1) Biomathematics & Statistics.
Dependency networks Sushmita Roy BMI/CS 576 Nov 25 th, 2014.
Guest lecture: Feature Selection Alan Qi Dec 2, 2004.
Lecture 2: Statistical learning primer for biologists
Bayes Theorem. Prior Probabilities On way to party, you ask “Has Karl already had too many beers?” Your prior probabilities are 20% yes, 80% no.
GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function Sara Mostafavi, Debajyoti Ray, David Warde-Farley,
Reverse engineering of regulatory networks Dirk Husmeier & Adriano Werhli.
CS Statistical Machine learning Lecture 12 Yuan (Alan) Qi Purdue CS Oct
6. Population Codes Presented by Rhee, Je-Keun © 2008, SNU Biointelligence Lab,
Nonlinear differential equation model for quantification of transcriptional regulation applied to microarray data of Saccharomyces cerevisiae Vu, T. T.,
Inferring gene regulatory networks from multiple microarray datasets (Wang 2006) Tiffany Ko ELE571 Spring 2009.
A comparative approach for gene network inference using time-series gene expression data Guillaume Bourque* and David Sankoff *Centre de Recherches Mathématiques,
Multi-label Prediction via Sparse Infinite CCA Piyush Rai and Hal Daume III NIPS 2009 Presented by Lingbo Li ECE, Duke University July 16th, 2010 Note:
Statistical Methods. 2 Concepts and Notations Sample unit – the basic landscape unit at which we wish to establish the presence/absence of the species.
Computational methods for inferring cellular networks II Stat 877 Apr 17 th, 2014 Sushmita Roy.
Institute of Statistics and Decision Sciences In Defense of a Dissertation Submitted for the Degree of Doctor of Philosophy 26 July 2005 Regression Model.
Incorporating graph priors in Bayesian networks
Multi-task learning approaches to modeling context-specific networks
Dynamic Bayesian Networks
Alan Qi Thomas P. Minka Rosalind W. Picard Zoubin Ghahramani
Bud Mishra Professor of Computer Science and Mathematics 12 ¦ 3 ¦ 2001
Dynamic Causal Model for evoked responses in M/EEG Rosalyn Moran.
Volume 6, Issue 5, Pages e13 (May 2018)
Incorporating graph priors in Bayesian networks
Network Inference Chris Holmes Oxford Centre for Gene Function, &,
Volume 4, Issue 1, Pages e10 (January 2017)
Machine Learning: Lecture 6
Label propagation algorithm
Presentation transcript:

BAYESIAN INFERENCE OF SIGNALING NETWORK TOPOLOGY IN A CANCER CELL LINE Steven M. Hill, Yiling Lu, Jennifer Molina, Laura M. Heiser, Paul T. Spellman, Terence P. Speed, Joe W. Gray, Gordon B. Mills and Sach Mukherjee Discussion Leader: Ashwin Scribe: Matthew Computational Network Biology BMI 826/Computer Sciences 838

Problem Overview Data-driven characterization of signaling networks, specific to a context of interest (such as a cell line or tissue type) Requires the ability to probe post-translational modification states in multiple proteins through time and across samples. Proteomic analyses on large scale are challenging Intrinsic and experimental noise Find trade-off between “fit-to-data” and “model parsimony”

Approach and its Motivation Learn network structure through a Dynamic Bayesian Network (DBN) What? Nodes represent proteins at different time points Edges represent probability relationships between the nodes DBNs are capable of modelling feedback loops, while these are not allowed in static BNs Why? Due to computational constraints, statistical inference approaches are limited to exploring only a few hypothesized networks, while this approach can explore a large number of candidate networks.

Fig.1: Data-driven characterization of signaling networks. Reverse-phase protein arrays interrogate signaling dynamics in samples of interest. Network structure is inferred using DBNs, with primary phospho-proteomic data integrated with existing biology, using informative priors objectively weighted by an empirical Bayes approach. Edge probabilities then allow the generation and prioritization of hypotheses for experimental validation

Method: Preview Biological signaling information is incorporated by assigning prior distributions to links Priors assigned based on frequency of links appearing in high-scoring topologies Bayesian Model Averaging carried out Maximal marginal likelihood analysis Obtain a closed form score for network topologies using ‘g- prior’

Assumptions Specific Context: breast cancer cell line MDA-MB-468 First-order Markov and Stationarity assumption Dependence may be sparse, with each node at time t depending only on a subset of nodes at time t-1 Edges are directed forward in time, hence no cycle- checking required

Joint Probability Distribution in a Bayesian Network where X = (X 1,...,X T ) is all data, π G ( i ) ⊆ {1,..., p} is an index set for parents of protein i according to graph G X t π G(i) = { X t j | j ∈ π G ( i ) } is data for the parents of protein i at time t θ i ⊆ are parameters for the conditional distribution of X t i ψ i ⊆ are parameters for X 1 i Joint Probability Distribution can be factorized into a product of conditional distributions :

Regression model The conditionals are taken to be Gaussian and describe the dependence of child nodes on parents and can be thought of as regression models S For ease of notation, X + i = ( X 2 i X 3 i … X T i ) and X - i = ( X 1 i X 2 i … X T-1 i )

Parameters Dependence of a node on products of parents as well as parents themselves. For example, if π G ( i ) = {1, 2, 3} then the mean for variable X t i is a linear combination of the three parents X t-1 j, the three possible pairwise products of parents X t-1 j X t-1 k and the product of all parents X t-1 1 X t-1 2 X t-1 3. For each protein i, let B i denote a [ n x 2 |π G ( i )| - 1 ] design matrix, with columns being the regression coefficients corresponding to all possible combinations of parents, pairwise combinations,…,product of all parents. The 2 |π G ( i )| - 1 regression coefficients, forming a vector β i and variance σ 2 i constitute parameters θ i

Marginal Likelihood Integrating out the prior parameters from the expression for marginal likelihood a closed form expression for the same is obtained: Requires inverse of B i T B i, which may be ill-conditioned or singular, especially when n << 2 |π G ( i )| - 1 Thus the in-degree |π G ( i )| < d max is assumed Ridge regularization done for nodes with large |π G ( i )| by adding a ‘ + ɑ I ’ term to B i T B i

Network Priors Priors are assumed as P(G) exp( λ f(G) ), where λ is a strength parameter for f(G), which is the score of each network according to existing biological knowledge from literature λ chosen by empirically maximizing marginal likelihood P(data | λ)

Posterior Probabilities of Edges Posterior probability of an edge is given by: where P(G|X) is a posterior distribution over graphs Instead of averaging over full graph, score subsets of potential parents and average over them

Results Simulation study. Average ROC curves. True- positive rate (for network edges) plotted against false-positive rate across a range of edge probability thresholds. Simulated data were generated from known graph structures by ancestral sampling. Graph structures were created to be in only partial agreement with the network prior (Supplementary Fig. S3). Results shown are averages obtained from 25 iterations. See text for full details of simulation and for description of methods shown. (For ‘Lasso’, curve produced by thresholding absolute regression coefficients, while marker ‘X’ is single graph obtained by taking non-zero coefficients to be edges)

Results Table 1. Synthetic yeast network study. Inference methods assessed on time- series gene expression data generated from a synthetically constructed gene regulatory network in yeast (Cantone et al., 2009). Results shown are area under the ROC curve (AUC). See text for description of methods shown. [The regimes using a network prior are mean AUCSD over 25 prior network structures. Prior networks were generated to be in partial agreement with the true, underlying network structure (see text for details).]

Results

Fig. 4. Validation of predictions by targeted inhibition in breast cancer cell line MDA-MB-468. (a) MAPK-STAT3 crosstalk. Network inference (Fig. 3a) predicted an unexpected link between phospho-MAPK (MAPKp) and STAT3p(S727) in the breast cancer cell line MDA-MB The hypothesis of MAPK-STAT3 crosstalk was tested by MEK inhibition: this successfully reduced MAPK phosphorylation and resulted in a corresponding decrease in STAT3p(S727). (b) AKTp → p70S6Kp, AKT- MAPK crosstalk and AKT-JNK/JUN crosstalk. AKTp is linked to p70S6kp, MEKp and cJUNp. In line with these model predictions, use of an AKT inhibitor reduced both p70S6K and MEK phosphorylation and increased JNK phosphorylation. (RPPA data; MEK inhibitor GSK and AKT inhibitor GSK690693B at 0 uM, uM, 2.5 uMand 10 uM; measurements taken 0, 5, 15, 30, 60, 90, 120 and 180min after EGF stimulation; average values over 3 replicates shown, error bars indicate SEM)

Discussion Advantages: Efficient and fast for moderate datasets Takes advantage of existing information in the form of priors Exact calculation of Posterior edge probabilities Variable selection approach factorizes the problem and enables computations to be parallelized Limitations: Parameter prior used (g-prior) can suffer from matrix ill-conditioning Assumes homogeneity of parameters and network structure through time, which can be unrealistic Does not take into account ‘latent variables’