Predicting Gene Expression using Logic Modeling and Optimization Abhimanyu Krishna New Challenges in the European Area: Young Scientist’s 1st International.

Slides:

Advertisements

Similar presentations

Systems biology SAMSI Opening Workshop Algebraic Methods in Systems Biology and Statistics September 14, 2008 Reinhard Laubenbacher Virginia Bioinformatics.

Advertisements

DREAM4 Puzzle – inferring network structure from microarray data Qiong Cheng.

Polynomial dynamical systems over finite fields, with applications to modeling and simulation of biological networks. IMA Workshop on Applications of.

Network biology Wang Jie Shanghai Institutes of Biological Sciences.

Inferring Quantitative Models of Regulatory Networks From Expression Data Iftach Nachman Hebrew University Aviv Regev Harvard Nir Friedman Hebrew University.

An Intro To Systems Biology: Design Principles of Biological Circuits Uri Alon Presented by: Sharon Harel.

Le Song Joint work with Mladen Kolar and Eric Xing KELLER: Estimating Time Evolving Interactions Between Genes.

DYNAMICS OF RANDOM BOOLEAN NETWORKS James F. Lynch Clarkson University.

Deterministic Global Parameter Estimation for a Budding Yeast Model T.D Panning*, L.T. Watson*, N.A. Allen*, C.A. Shaffer*, and J.J Tyson + Departments.

Decision Making: An Introduction 1. 2 Decision Making Decision Making is a process of choosing among two or more alternative courses of action for the.

1 Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science Carnegie Mellon University joint work with Tzu-Kuo Huang, Le.

Systems Biology of Inflammation A systems biology approach is an iterative process that includes: identification of component parts and interactions. integration.

Integrating Bayesian Networks and Simpson’s Paradox in Data Mining Alex Freitas University of Kent Ken McGarry University of Sunderland.

Date:2011/06/08 吳昕澧 BOA: The Bayesian Optimization Algorithm.

Regulatory Network (Part II) 11/05/07. Methods Linear –PCA (Raychaudhuri et al. 2000) –NIR (Gardner et al. 2003) Nonlinear –Bayesian network (Friedman.

CISC667, F05, Lec26, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) Genetic networks and gene expression data.

AQM for Congestion Control1 A Study of Active Queue Management for Congestion Control Victor Firoiu Marty Borden.

1. Elements of the Genetic Algorithm  Genome: A finite dynamical system model as a set of d polynomials over  2 (finite field of 2 elements)  Fitness.

Dynamic Modeling Of Biological Systems. Why Model? When it’s a simple, constrained path we can easily go from experimental measurements to intuitive understanding.

Carla P. Gomes CS4700 CS 4700: Foundations of Artificial Intelligence Prof. Carla P. Gomes Module: Neural Networks: Concepts (Reading:

EE 685 presentation Optimization Flow Control, I: Basic Algorithm and Convergence By Steven Low and David Lapsley Asynchronous Distributed Algorithm Proof.

6. Gene Regulatory Networks

Discrete models of biochemical networks Algebraic Biology 2007 RISC Linz, Austria July 3, 2007 Reinhard Laubenbacher Virginia Bioinformatics Institute.

Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.

Bryan Heck Tong Ihn Lee et al Transcriptional Regulatory Networks in Saccharomyces cerevisiae.

REGRESSION Predict future scores on Y based on measured scores on X Predictions are based on a correlation from a sample where both X and Y were measured.

On Self Adaptive Routing in Dynamic Environments -- A probabilistic routing scheme Haiyong Xie, Lili Qiu, Yang Richard Yang and Yin Yale, MR and.

Bayes Net Perspectives on Causation and Causal Inference

What Is a Gene Network?. Gene Regulatory Systems “Programs built into the DNA of every animal.” Eric H. Davidson.

Genetic Regulatory Network Inference Russell Schwartz Department of Biological Sciences Carnegie Mellon University.

Reconstructing Gene Networks Presented by Andrew Darling Based on article  “Research Towards Reconstruction of Gene Networks from Expression Data by Supervised.

A Framework for Distributed Model Predictive Control

ResponseNet revealing signaling and regulatory networks linking genetic and transcriptomic screening data CSE Fall.

Reconstructing gene networks Analysing the properties of gene networks Gene Networks Using gene expression data to reconstruct gene networks.

Reconstruction of Transcriptional Regulatory Networks

Using Bayesian Networks to Analyze Whole-Genome Expression Data Nir Friedman Iftach Nachman Dana Pe’er Institute of Computer Science, The Hebrew University.

Unraveling condition specific gene transcriptional regulatory networks in Saccharomyces cerevisiae Speaker: Chunhui Cai.

Learning Linear Causal Models Oksana Kohutyuk ComS 673 Spring 2005 Department of Computer Science Iowa State University.

Intel Confidential – Internal Only Co-clustering of biological networks and gene expression data Hanisch et al. This paper appears in: bioinformatics 2002.

Systems Biology ___ Toward System-level Understanding of Biological Systems Hou-Haifeng.

Learning the Structure of Related Tasks Presented by Lihan He Machine Learning Reading Group Duke University 02/03/2006 A. Niculescu-Mizil, R. Caruana.

Modeling Genetic Network: Boolean Network Yongyeol Ahn KAIST.

A Passive Approach to Sensor Network Localization Rahul Biswas and Sebastian Thrun International Conference on Intelligent Robots and Systems 2004 Presented.

Synchronization in complex network topologies

EE 685 presentation Optimization Flow Control, I: Basic Algorithm and Convergence By Steven Low and David Lapsley.

Steady-state Analysis of Gene Regulatory Networks via G-networks Intelligent Systems & Networks Group Dept. Electrical and Electronic Engineering Haseong.

IMPROVED RECONSTRUCTION OF IN SILICO GENE REGULATORY NETWORKS BY INTEGRATING KNOCKOUT AND PERTURBATION DATA Yip, K. Y., Alexander, R. P., Yan, K. K., &

Nonlinear differential equation model for quantification of transcriptional regulation applied to microarray data of Saccharomyces cerevisiae Vu, T. T.,

Speaker : Yu-Hui Chen Authors : Dinuka A. Soysa, Denis Guangyin Chen, Oscar C. Au, and Amine Bermak From : 2013 IEEE Symposium on Computational Intelligence.

Constructing and Analyzing a Gene Regulatory Network Siobhan Brady UC Davis.

Sporadic model building for efficiency enhancement of the hierarchical BOA Genetic Programming and Evolvable Machines (2008) 9: Martin Pelikan, Kumara.

G LOBAL S IMILARITY B ETWEEN M ULTIPLE B IONETWORKS Yunkai Liu Computer Science Department University of South Dakota.

Gaussian Process Networks Nir Friedman and Iftach Nachman UAI-2K.

Computational methods for inferring cellular networks II Stat 877 Apr 17 th, 2014 Sushmita Roy.

Network Motifs See some examples of motifs and their functionality Discuss a study that showed how a miRNA also can be integrated into motifs Today’s plan.

Haploid-Diploid Evolutionary Algorithms

BT8118 – Adv. Topics in Systems Biology

Chaos Control (Part III)

T.W. Scholten, C. de Persis, P. Tesi

Haploid-Diploid Evolutionary Algorithms

Ahnert, S. E., & Fink, T. M. A. (2016). Form and function in gene regulatory networks: the structure of network motifs determines fundamental properties.

Optimization Based Design of Robust Synthetic

1 Department of Engineering, 2 Department of Mathematics,

1 Department of Engineering, 2 Department of Mathematics,

Estimating Networks With Jumps

CISC 841 Bioinformatics (Spring 2006) Inference of Biological Networks

GENERAL VIEW OF KRATOS MULTIPHYSICS

1 Department of Engineering, 2 Department of Mathematics,

Volume 7, Issue 3, Pages (September 2016)

CISC 667 Intro to Bioinformatics (Spring 2007) Genetic networks and gene expression data CISC667, S07, Lec24, Liao.

Presentation transcript:

Predicting Gene Expression using Logic Modeling and Optimization Abhimanyu Krishna New Challenges in the European Area: Young Scientist’s 1st International Baku Forum

Gene Regulatory Network reconstruction R A TR B TR C p A p A p A BC Input Stimuli C R C B p What is Gene Expression? -> Regulation? -> Gene Regulatory Network? Introduction:

Literature based Gene Regulatory Network Experimental expression data + Missing expression values in grey How to contextualize literature to our experimental conditions Objective

4 Stable state Unstable transient state Biological processes represented as transitions in a landscape “Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states” Introduction: Networks of interactions

5 Why these predictions are not trivial? Noisy network reconstruction process “Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states”

6 Problem: Inconsistency between network and experimental expression data Solution: Contextualize the Network using experimental expression data “Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states”

7 Why is this an optimization problem? “Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states”

8 Why is this an optimization problem? Local consistency “Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states”

9 Why is this an optimization problem? Local consistency Edge removal “Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states”

10 Why is this an optimization problem? Local consistency Global consistency “Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states”

11 Stable state Unstable transient state “Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states” Which property are we going to use in the optimization? Network stability

12 Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) Iterative network pruning

Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) Iterative network pruning

14 Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) Iterative network pruning

15 Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) Iterative network pruning

16 Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) Iterative network pruning

17 Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) Iterative network pruning

18 Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) Iterative network pruning

19 Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) Iterative network pruning

20 But the contribution of interactions to the network stability it is not linearly independent. The evaluation of one specific link is highly dependent of the links already removed or, in other words, the order of removal. We are going to capture interdependencies between variables considering sequentially both the probability distribution of positive circuits and separated edges. Positive circuit Negative circuit “Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states” Thomas R, Thieffry D, Kaufman M: DYNAMICAL BEHAVIOR OF BIOLOGICAL REGULATORY NETWORKS.1. BIOLOGICAL ROLE OF FEEDBACK LOOPS AND PRACTICAL USE OF THE CONCEPT OF THE LOOP-CHARACTERISTIC STATE. Bulletin of Mathematical Biology 1995, 57: Positive circuits are necessary condition to have several fixed points

21 Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) Iterative network pruning Positive Circuit 1

22 Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) Iterative network pruning Positive Circuit 2

23 Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) Iterative network pruning Positive Circuit 3

24 Which property are we going to use in the optimization? Network stability “Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states”

25 Biological scope targeted by this approach: transitions between long term expression patterns or stable states Epithelial-mesenchymal transition Epithelial Mesenchymal “Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states” Example:

26 Computing attractors in a discrete dynamical system (Boolean) Based on logic functions and the assumption of only 2 possible gene states: active (ON or 1) and inactive (OFF or 0). Logic functions: The state of the node x i at time t+1 depends on the state of its regulators at time t. Updating scheme: Synchronous Types of attractors: fixed points and limit cycles Fixed point Limit cycle “Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states”

27 Consistency between expression data and network stable states “Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states”

28 Optimization of h(x) (objective function) h(x) = X 1 +X 2 +X 3 +X 4 +X 5 + x 6 X i = 0 or 1 Network topology optimized using an Estimation of Distribution Algorithm (EDA) Toy example: Iterative network pruning “Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states”

29 Top 10 solutions Initial population Next population EDA: toy example

30 EDA: toy example Top 10 solutions Initial population Next population

31 EDA: toy example Top 10 solutions Initial population Next population

32 EDA: toy example Top 10 solutions Initial population Next population

33 EDA: toy example Top 10 solutions Initial population Next population 0.7

34 EDA: toy example Top 10 solutions Initial population Next population

35 EDA: toy example Top 10 solutions Initial population Next population

36 EDA: toy example Top 10 solutions Initial population Next population

37 EDA: toy example Top 10 solutions Initial population Next population

38 EDA: toy example Top 10 solutions Initial population Next population

39 EDA: toy example Top 10 solutions Initial population Next population

40 EDA: toy example Top 10 solutions Initial population Next population STOP CRITERIA

41 Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) Iterative network pruning

Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) Iterative network pruning

43 Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) Iterative network pruning

44 Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) Iterative network pruning

45 Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) Iterative network pruning

46 Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) Iterative network pruning

47 Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) Iterative network pruning

48 Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) Iterative network pruning

49 But the contribution of interactions to the network stability it is not linearly independent. The evaluation of one specific link is highly dependent of the links already removed or, in other words, the order of removal. We are going to capture interdependencies between variables considering sequentially both the probability distribution of positive circuits and separated edges. Positive circuit Negative circuit “Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states” Thomas R, Thieffry D, Kaufman M: DYNAMICAL BEHAVIOR OF BIOLOGICAL REGULATORY NETWORKS.1. BIOLOGICAL ROLE OF FEEDBACK LOOPS AND PRACTICAL USE OF THE CONCEPT OF THE LOOP-CHARACTERISTIC STATE. Bulletin of Mathematical Biology 1995, 57: Positive circuits are necessary condition to have several fixed points

50 Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) Iterative network pruning Positive Circuit 1

51 Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) Iterative network pruning Positive Circuit 2

52 Objective function: This score S uses the normalized Hamming distance (h) to compare N Boolean gene expression values (σ) between all calculated steady states (α) of a pruned network and the two known phenotypes (φ1 and φ2) defined by the expression data, in order to identify the two best-matching phenotype/steady state couples (φα1 and φα2) Iterative network pruning Positive Circuit 3

53 “Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states” Algorithm:

54 Predictions based on the consensus between the familiy of alternative solutions “Predicting missing expression values in gene regulatory networks using a discrete logic modeling optimization guided by network stable states”

012/08/30/nar.gks785.full Software Paper Availability:

Isaac Crespo Computational Biology Unit (LCSB) Abhimanyu Krishna Bioinformatic core (LCSB) Antony Le Béchec Antonio del Sol Head of Computational Biology Unit (LCSB) Life sciences research unit (LSRU) Vital-IT (SIB) Thank you! Questions?

57