Functionality & Speciation in Boolean Networks

Slides:



Advertisements
Similar presentations
Hidden Metric Spaces and Navigability of Complex Networks
Advertisements

Design of Experiments Questions Network Inference Working Group October 8, 2008.
Mobile Communication Networks Vahid Mirjalili Department of Mechanical Engineering Department of Biochemistry & Molecular Biology.
DREAM4 Puzzle – inferring network structure from microarray data Qiong Cheng.
Design Principles in Biology: a consequence of evolution and natural selection Rui Alves University of Lleida
An Intro To Systems Biology: Design Principles of Biological Circuits Uri Alon Presented by: Sharon Harel.
DYNAMICS OF RANDOM BOOLEAN NETWORKS James F. Lynch Clarkson University.
The multi-layered organization of information in living systems
Max-Planck-Institut für molekulare Genetik Workshop „Systems Biology“ Berlin, Robustness and Entropy of Biological Networks Thomas Manke Max.
Highly designable phenotypes and mutational buffers emerge from a systematic mapping between network topology and dynamic output Nochomovitz.
Weighted networks: analysis, modeling A. Barrat, LPT, Université Paris-Sud, France M. Barthélemy (CEA, France) R. Pastor-Satorras (Barcelona, Spain) A.
Bayesian Robust Principal Component Analysis Presenter: Raghu Ranganathan ECE / CMR Tennessee Technological University January 21, 2011 Reading Group (Xinghao.
1 Reasoning Under Uncertainty Over Time CS 486/686: Introduction to Artificial Intelligence Fall 2013.
The Cat and The Mouse -- The Case of Mobile Sensors and Targets David K. Y. Yau Lab for Advanced Network Systems Dept of Computer Science Purdue University.
Scale Free Networks Robin Coope April Abert-László Barabási, Linked (Perseus, Cambridge, 2002). Réka Albert and AL Barabási,Statistical Mechanics.
Networks are useful for describing systems of interacting objects, where the nodes represent the objects and the edges represent the interactions between.
Isometry invariant similarity
Genomic analysis of regulatory network dynamics reveals large topological changes Paper Study Speaker: Cai Chunhui Sep 21, 2004.
Common Properties of Real Networks. Erdős-Rényi Random Graphs.
Reduced Support Vector Machine
1 Introduction to Kernels Max Welling October (chapters 1,2,3,4)
Modularity in Biological networks.  Hypothesis: Biological function are carried by discrete functional modules.  Hartwell, L.-H., Hopfield, J. J., Leibler,
Similar Sequence Similar Function Charles Yan Spring 2006.
ITIS 6010/8010 Wireless Network Security Dr. Weichao Wang.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Robust estimation Problem: we want to determine the displacement (u,v) between pairs of images. We are given 100 points with a correlation score computed.
Navigability of Networks Dmitri Krioukov CAIDA/UCSD M. Boguñá, M. Á. Serrano, F. Papadopoulos, M. Kitsak, A. Vahdat, kc claffy May, 2010.
國立陽明大學生資學程 陳虹瑋. Genetic Algorithm Background Fitness function ……. population selection Cross over mutation Fitness values Random cross over.
Protein Classification A comparison of function inference techniques.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Graph-based consensus clustering for class discovery from gene expression data Zhiwen Yum, Hau-San Wong and Hongqiang Wang Bioinformatics, 2007.
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
The Erdös-Rényi models
A biologist and a matrix The matrix will follow. Did you know that the 20 th century scientist who lay the foundation to the estimation of signals in.
“An Extension of Weighted Gene Co-Expression Network Analysis to Include Signed Interactions” Michael Mason Department of Statistics, UCLA.
Using Bayesian Networks to Analyze Expression Data N. Friedman, M. Linial, I. Nachman, D. Hebrew University.
Genetic Regulatory Network Inference Russell Schwartz Department of Biological Sciences Carnegie Mellon University.
Popularity versus Similarity in Growing Networks Fragiskos Papadopoulos Cyprus University of Technology M. Kitsak, M. Á. Serrano, M. Boguñá, and Dmitri.
Part 1: Biological Networks 1.Protein-protein interaction networks 2.Regulatory networks 3.Expression networks 4.Metabolic networks 5.… more biological.
Class Prediction and Discovery Using Gene Expression Data Donna K. Slonim, Pablo Tamayo, Jill P. Mesirov, Todd R. Golub, Eric S. Lander 발표자 : 이인희.
Enumerating, Sampling and Counting: some illustrative cases in biology Enumerating discrete structures Sampling and searching Nested sampling for counting.
1 Departament of Bioengineering, University of California 2 Harvard Medical School Department of Genetics Metabolic Flux Balance Analysis and the in Silico.
A Maximum Principle for Single-Input Boolean Control Networks Michael Margaliot School of Electrical Engineering Tel Aviv University, Israel Joint work.
Systems Biology ___ Toward System-level Understanding of Biological Systems Hou-Haifeng.
Modeling Genetic Network: Boolean Network Yongyeol Ahn KAIST.
Data Mining the Yeast Genome Expression and Sequence Data Alvis Brazma European Bioinformatics Institute.
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
Percolation and diffusion in network models Shai Carmi, Department of Physics, Bar-Ilan University Networks Percolation Diffusion Background picture: The.
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Developmental Constraints, Genetic Correlations & Natural Selection.
De novo discovery of mutated driver pathways in cancer Discussion leader: Matthew Bernstein Scribe: Kun-Chieh Wang Computational Network Biology BMI 826/Computer.
Classification Ensemble Methods 1
Asymptotic behaviour of blinking (stochastically switched) dynamical systems Vladimir Belykh Mathematics Department Volga State Academy Nizhny Novgorod.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
1 Using Graph Theory to Analyze Gene Network Coherence José A. Lagares Jesús S. Aguilar Norberto Díaz-Díaz Francisco A. Gómez-Vela
Computational methods for inferring cellular networks II Stat 877 Apr 17 th, 2014 Sushmita Roy.
Biological robustness
Network applications Sushmita Roy BMI/CS 576 Dec 9 th, 2014.
Tutorial on "GRID Computing“ EMBnet Conference 2008 CNR - ITB GRID distribution supporting chaotic map clustering on large mixed microarray.
Cmpe 588- Modeling of Internet Emergence of Scale-Free Network with Chaotic Units Pulin Gong, Cees van Leeuwen by Oya Ünlü Instructor: Haluk Bingöl.
Structures of Networks
Dr. Kenneth Stanley September 25, 2006
Building and Analyzing Genome-Wide Gene Disruption Networks
Hidden Markov Models Part 2: Algorithms
Regulation Analysis using Restricted Boltzmann Machines
Modelling Structure and Function in Complex Networks
Dr. Kenneth Stanley February 6, 2006
Topology and Dynamics of Biological Networks Alfredo BENSO, Stefano DI CARLO, Gianfranco POLITANO, Alessandro SAVINO, Hafeez UR REHMAN Politecnico di Torino,
Using Quotient Graphs to Model Neutrality in Evolutionary Search Dominic Wilson Devinder Kaur University of Toledo.
Inferring Cellular Processes from Coexpressing Genes
Presentation transcript:

Functionality & Speciation in Boolean Networks Jamie Luo Warwick Complexity DTC Dr Matthew Turner Warwick Physics & Systems Biology Model Genetic Regulatory Networks using Boolean Networks. Motivation + Approach + Model Previous Work – A. Wagner and others – results + functionality definition Results – based on reformulated definition of functionality (insilico) Discuss Outlook

Gene Regulatory Networks http://www.cs.uiuc.edu/homes/sinhas/work.html

Gene Regulatory Networks Not our Afghanistan Strategy. Non-linear – lots of feedback noise. http://www.pnas.org/cgi/content-nw/full/104/31/12890/F2 Angiogenic signalling network. A gene regulatory network constructed from inversely regulated proangiogenic genes. All presented genes are down-regulated after endostatin although up-regulated after VEGF/bFGF treatment (except APC gene; arrow demonstrates opposite regulation). The direction of gene regulation and the high degree of cooperative networking between the selected genes point to a switchable angiogenic network. The concerted up-regulation of the network genes indicates the proangiogenic state (On). Highlighted are gene interactions based on promotor-binding site (green connection lines), protein modification (yellow connection lines), protein–protein binding (violet connection lines), gene expression (blue connection lines), and gene regulation (black connection lines). Two signalling pathways, STAT3 (yellow circles) and PPAR/-catenin (red shadows), are highlighted and demonstrate the interconnectedness of the pathways within the angiogenic network. http://www.pnas.org/cgi/content-nw/full/104/31/12890/F2

Why Study Boolean Networks? How does the Topology influence the Dynamics? Construct Predictive Models of Complex Biological Systems. Network Inference. How Dynamical Function Influences Topology? Design and Shaping Intuition. Why use BNs to model GRNs ? - GRNs have many components, non-linear, abstraction of BNs allows for qualitative insights Classical RBNs Kauffman in his 1963 book ‘The Origins of Order’ RBNs – Markovian system over {0 1}^N – Finite space and -> attractor Q: how does topology influence the distribution of attractors etc. Q: Degree or number of edges influence dynamics – all update rules. Derrida results (analytical) k=2 is CRITICAL. People still do this work – simulation based. Fangting Li (Beijing) et al 2004 – yeast cell cycle; Expanded - Hao Ge – stochastic variation; Stefan Bornholdt – built similar models for fission yeast; Konstantin Klemm (Gunnar Boldhaus) Drosophila (model variant) – dynamics of early developmental genes & in predicting mutant phenotypes Boolean Networks as a basis for Network Inference Models Inverse Question Andreas Wagner et al (Klemm) Intuition in Biology is Tricky – Scale, Experimental Limitations (observations & questions one can ask)

Threshold Dynamics N-size (N genes) Threshold Boolean Network is a Markovian dynamical system over the state space S = {0,1}N. Defined by an interaction matrix A ∈ {-1, 0, 1}N . For any v(t) ∈ S , let h(t) = Av(t). Genes are on/off. Matrix – edges - +ve/-ve – up/down regulating Non-linearity Abstract, simple, reflection of GRNs. Stripped out more complex downstream interactions, all produced proteins are TFs or are discarded.

Example GRN p53 – Mdm2 network: Example path through the state space: p53 (also known as protein 53 or tumor protein 53), is a tumor suppressor protein that in humans is encoded by the TP53 gene Explain matrix + graph Actual p53 pathway / GRN is more complex but this is a minimalistic model for it. (0,0) is an attracting state – always. Hao Ge – stochastic version of the above.

Biological Functionality Define a biological function or cell process. Start – end point (v(0), v∞) definition of a function [1]. Find all matrices A ∈ {-1, 0, 1}N which attain this function. Investigate the resulting space of matrices which map v(0) to the fixed point v∞. Defn proposed by Andreas Wagner – Phenotype – Genotype mapping. Finding A’s complete enumeration + sampling or monte carlo [1] Ciliberti S, Martin OC, Wagner A (2007) PLoS Comput Biol 3(2): e15.

Metagraph (Neutral Network) For A , B ∈ {-1, 0, 1}N define a distance: Metagraph where A and B are connected if d(A , B) = 1. Start-end point (v(0), v∞) approach results in a single large connected component dominating the metagraph [1]. d = the number of different entries d(A,B)=1 – point mutations Connectedness – point mutations – neutral – allow any network to find another for most accessible genotypes Core insilico result from which a variety of conclusions are built upon. Used to conclude that robustness is an evolvable quality (through phenotype ‘neutral’ evolutions) What does one mean by robustness? [1] Ciliberti S, Martin OC, Wagner A (2007) PLoS Comput Biol 3(2): e15.

Robustness Mutational Robustness (Md) of a network is its metagraph degree. Noise Robustness (Rn) can be defined as the probability that a change in one gene’s initial expression pattern in v(0) leaves the resulting steady state v∞ unchanged Start-end point approach finds that Mutational Robustness and Noise Robustness are highly correlated. Furthermore Mutational robustness is found to have a broad distribution. Point Mutations. Four defns of noise robustness all highly correlated. Spearman’s s = 0.7, p<10^-15 10^3 sampled matrices. Add Figure to supplements.

Intuition Shaping Robustness is an evolvable property [1]. The metagraph being connected and evolvability of robust networks may be a general organizational principle [1]. Long-term innovation can only emerge in the presence of the robustness caused by a connected metagraph [2]. Above conclusions rely on a largely connected metagraph. Metagraph Islands [3]. Metagraph is connected by gradual mutations. General organizational principle – applies to RNA and protein structures. Innovation – phenotype change. Need to travel long distances to access all phenotypes and this can only be done on a large metagraph. Large Diameter of Neutral Networks is crucial. Gunnar Boldhaus’ work on the Yeast Cell Cycle path. Li – identified a main path through the state space = cell cycle (T=12/13). Metagraph was found for this space and it was not connected – comprised O(10^8) components of size 10^24 – 10^26 GB – hypothesised that increasing number of constraints may be responsible for Metagraph disconnectedness. 22mins [1] Ciliberti S, Martin OC, Wagner A (2007) PLoS Comput Biol 3(2): e15. [2] Ciliberti S, Martin OC, Wagner A (2007) PNAS vol. 104 no. 34 13591-13596 [3] G Boldhaus, K Klemm (2010), Regulatory networks and connected components of the neutral space. Eur. Phys. J. B (2010),

Example GRN Revisited p53 – Mdm2 network: Example path through the state space: Mdm2 p53 How suitable is the start-end defn. Consider the example path. p53 (also known as protein 53 or tumor protein 53), is a tumor suppressor protein that in humans is encoded by the TP53 gene Explain matrix + graph Actual p53 pathway / GRN is more complex but this is a minimalistic model for it. (0,0) is an attracting state – always. Hao Ge – stochastic version of the above.

Redefining a Biological Function Any start-end point function (v(0), v∞) encompasses the ensemble of all paths from v(0) to v∞. Unrepresentative of many cellular processes (cell cycle, p53). We propose using a path {v(t)}t=0,1,...,T to define a function. Crucially distinguish paths by duration T (complexity). Duration analogous to biological complexity of the function.

Which Path to Take? Large number of paths for any given N. How to sample? Method 1 (speed θ): Choose a θ ∈ [0 1]. Randomly sample an initial condition v(0)∈ S. Then vi(t +1) = vi(t) with a probability 1- θ for all t ≥ 0. Method 2 (matrix sampling): Randomly sample an initial condition v(0)∈ S. Then for each t ≥ 0 randomly sample a matrix A to map v(t) to v(t+1) and so on. (2^N)!/(2^N-k)! Method 1 – analogous to setting a speed or binomial distribution for the number of flips for any time step (more true sampling method) Method 2 – natural method defined by the space and is useful for generating longer paths.

Attainability of a Function Increasing duration T exponentially constrains the topology. Method 2 – also true for method 1 Number of Matrices – proxy measurement for how biologically realisable a function is. Not an unexpected result as any loss in the number of rows leads to an exponential loss

Speed Kills? Mean path duration Tend depends non-monotonically on θ. T_end exists because one eventually makes a move which has no solutions. Number of flips is not the only factor in determining whether a solution is attainable. Refractory period positively correlated with T_end. N= 5 - 8 (1000trajs) N=9, 10 (100trajs) O(N) or O(N^2) increase in T_end with N. Return to the Metagraph

T=1 => Connected Metagraph For any path {v(t)}t=0,1,...,T of duration T = 1 the corresponding metagraph is connected. Proof: Fix a path of the form {v(0), v(1)} Let {r : rj ∈{-1, 0, 1}}i be all the row solutions for gene i. Suppose vi(0) = 0 and vi(1) = 1, then hi(0) >0. Therefore 1 = [1 1 , . . . , 1] is always a valid row solution. Furthermore any other solution r can be mapped to 1 by point mutations (changing an entry to rj 1). Other cases are similarly accounted for (-1 = [-1 , . . . , -1]). Also applies to {v(0), v(1), v(1)}

The Metagraph & Speciation Method 2 (matrix) – generates longer trajectories (T inconsistent) 1000 for N=5 +6, 100 for N=7 trajs. Mc number of metagraph components. Mc can vary over orders of magnitudes – log10. But size of components is comparable [Figure for that]. Speciation effect. Reconcile these two results from different functional definitions? – Ensemble versus path effect

Complexity to Speciation Increasing Complexity as measured by duration T leads to a speciation effect. T > 1 T = 1 More complex organisms speciate. Opposite result implied by Wagner’s work. Where does this leave robustness? Is robustness evolvable? 35 mins

Robustness Complexity Trade-off Mutational Robustness decreases with increasing T. Mutational Robustness is inherently constrained by the path complexity/duration. Mean Variance in robustness is not much more than 10.

T vs. ρ(Md,Rn) Mutational Robustness and Noise Robustness are positively correlated but the strength of this correlation is T dependent. Path recovery definition of noise.

Ensemble vs. Path The start-end point definition of a biological function includes the ensemble of all paths from v(0) to the fixed point v∞. Our definition isolates a single path. v∞ v(0) Metagraph: We know the T=1 path is connected – backbone for other connections. Also shorter paths have exponentially more solutions and are also much more probabilistically likely to be attainable than longer ones. Mutational Robustness: Sample over entire ensemble then a broad distribution of robustness is likely. T vs p(Md,Rn): Not so sure. v(T) v(0)

Summary A path definition of functionality leads to contrasting conclusions from the start – end point one. Conclusions based on the existence of a largely connected metagraph are not applicable under a functional path definition. Metagraph connectivity, mutational robustness, ρ(Md,Rn) and the number of solutions all depend on path complexity. The breakup of the metagraph with increasing complexity is analogous to a speciation effect. Robustness is an evolvable property [REF]. The metagraph being connected and evolvability of robust networks may be a general organizational principle [REF]. Long-term innovation can only emerge in the presence of the robustness caused by a connected metagraph [REF]. Conclusions rely on a largely connected metagraph. Metagraph Islands [REF]. Metagraph is connected by gradual mutations. General organizational principle – applies to RNA and protein structures. Innovation – phenotype change. Need to travel long distances to access all phenotypes and this can only be done on a large metagraph. Large Diameter of Neutral Networks is crucial. Gunnar Boldhaus’ work on the Yeast Cell Cycle trajectory.

Future Work & Design Multi-functionality. Paths with Features. Genetic Sensors. Parallels with bi-functionality in Wagner’s work. Path duration is a better variable. Our results are consistent with sum of path lengths. L.c.s, stars etc... Design : Wagner – design of a robust gene network Genetic sensors – sensors built from the same components as the underlying dynamical system

Acknowledgements Matthew Turner Complexity DTC EPSRC Questions? Thank You all for listening