Network Evolution What do models of network evolution do?: 12 34 5 12 34 5 12 34 5 0 t1t1 t2t2 T Overview of today’s lecture: General considerations in.

Slides:



Advertisements
Similar presentations
Hidden Markov Model in Biological Sequence Analysis – Part 2
Advertisements

CSE Fall. Summary Goal: infer models of transcriptional regulation with annotated molecular interaction graphs The attributes in the model.
Bayesian Methods with Monte Carlo Markov Chains III
Markov Chains Lecture #5
Scale Free Networks Robin Coope April Abert-László Barabási, Linked (Perseus, Cambridge, 2002). Réka Albert and AL Barabási,Statistical Mechanics.
Proteome Network Evolution by Gene Duplication S. Cenk Şahinalp Simon Fraser University.
Dynamic Network Inference Most statistical work is done on gene regulatory networks, while inference of metabolic pathways and signaling networks are done.
Computational statistics 2009 Random walk. Computational statistics 2009 Random walk with absorbing barrier.
Hidden Markov Models I Biology 162 Computational Genetics Todd Vision 14 Sep 2004.
Comparison of Networks Across Species CS374 Presentation October 26, 2006 Chuan Sheng Foo.
Regulatory networks 10/29/07. Definition of a module Module here has broader meanings than before. A functional module is a discrete entity whose function.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Cs726 Modeling regulatory networks in cells using Bayesian networks Golan Yona Department of Computer Science Cornell University.
Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break 14:45 – 15:15Regulatory pathways lecture 15:15 – 15:45Exercise.
Global topological properties of biological networks.
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
Genome evolution: a sequence-centric approach Lecture 3: From Trees to HMMs.
Statistical Alignment: Computational Properties, Homology Testing and Goodness-of-Fit J. Hein, C. Wiuf, B. Knudsen, M.B. Moller and G. Wibling.
Dynamic Network Inference Most statistical work is done on gene regulatory networks, while inference of metabolic pathways and signaling networks are done.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Probabilistic methods for phylogenetic trees (Part 2)
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Phylogeny Estimation: Traditional and Bayesian Approaches Molecular Evolution, 2003
Comparative Biology observable Parameters:time rates, selection Unobservable Evolutionary Path observable Most Recent Common Ancestor ? ATTGCGTATATAT….CAG.
Gene Regulatory Network Inference. Progress in Disease Treatment  Personalized medicine is becoming more prevalent for several kinds of cancer treatment.
Network Evolution Yeast Protein Interaction Network from Knowledge and Model Organisms KangarooHuman Mouse Rat Comparative.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Boltzmann Machine (BM) (§6.4) Hopfield model + hidden nodes + simulated annealing BM Architecture –a set of visible nodes: nodes can be accessed from outside.
Analysis of the yeast transcriptional regulatory network.
Intel Confidential – Internal Only Co-clustering of biological networks and gene expression data Hanisch et al. This paper appears in: bioinformatics 2002.
More statistical stuff CS 394C Feb 6, Today Review of material from Jan 31 Calculating pattern probabilities Why maximum parsimony and UPGMA are.
Randomized Algorithms for Bayesian Hierarchical Clustering
Problem Limited number of experimental replications. Postgenomic data intrinsically noisy. Poor network reconstruction.
CSCE555 Bioinformatics Lecture 18 Network Biology: Comparison of Networks Across Species Meeting: MW 4:00PM-5:15PM SWGN2A21 Instructor: Dr. Jianjun Hu.
The famous “sprinkler” example (J. Pearl, Probabilistic Reasoning in Intelligent Systems, 1988)
Learning Bayesian networks from postgenomic data with an improved structure MCMC sampling scheme Dirk Husmeier Marco Grzegorczyk 1) Biomathematics & Statistics.
Network Evolution Statistics of Networks Comparing Networks Networks in Cellular Biology A. Metabolic Pathways B. Regulatory Networks C. Signaling Pathways.
Structure, evolution and dynamics of transcriptional regulatory networks M. Madan Babu, PhD National Institutes of Health.
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
Introduction to biological molecular networks
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Selecting Genomes for Reconstruction of Ancestral Genomes Louxin Zhang Department of Mathematics National University of Singapore.
Reverse engineering of regulatory networks Dirk Husmeier & Adriano Werhli.
Network Evolution ( min.) Networks in Cellular Biology A. Metabolic Pathways B. Regulatory Networks C. Signaling Pathways D. Protein Interaction.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Predicting Protein Function Annotation using Protein- Protein Interaction Networks By Tamar Eldad Advisor: Dr. Yanay Ofran Computational Biology.
Robustness, clustering & evolutionary conservation Stefan Wuchty Center of Network Research Department of Physics University of Notre Dame title.
04/21/2005 CS673 1 Being Bayesian About Network Structure A Bayesian Approach to Structure Discovery in Bayesian Networks Nir Friedman and Daphne Koller.
CS Statistical Machine learning Lecture 25 Yuan (Alan) Qi Purdue CS Nov
Building Phylogenies Maximum Likelihood. Methods Distance-based Parsimony Maximum likelihood.
Computational methods for inferring cellular networks II Stat 877 Apr 17 th, 2014 Sushmita Roy.
HW7: Evolutionarily conserved segments ENCODE region 009 (beta-globin locus) Multiple alignment of human, dog, and mouse 2 states: neutral (fast-evolving),
Recombination and Pedigrees Genealogies and Recombination: The ARG Recombination Parsimony The ARG and Data Pedigrees: Models and Data Pedigrees & ARGs.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Comparative Network Analysis BMI/CS 776 Spring 2013 Colin Dewey
Structures of Networks
CSCI2950-C Lecture 12 Networks
Distance based phylogenetics
CSCI2950-C Genomes, Networks, and Cancer
Recovering Temporally Rewiring Networks: A Model-based Approach
Biological Networks Analysis Degree Distribution and Network Motifs
Building and Analyzing Genome-Wide Gene Disruption Networks
Hidden Markov Models in Bioinformatics min
CSCI2950-C Lecture 13 Network Motifs; Network Integration
Schedule for the Afternoon
Modelling Structure and Function in Complex Networks
Network Inference Chris Holmes Oxford Centre for Gene Function, &,
Boltzmann Machine (BM) (§6.4)
Presentation transcript:

Network Evolution What do models of network evolution do?: t1t1 t2t2 T Overview of today’s lecture: General considerations in transforming one network into another Facts and Models for the major networks Metabolism Regulatory Signal Transduction Protein Interaction Combining Inference and Evolution Test models Estimate Parameters in the Evolutionary Process Ancestral Analysis Framework for Knowledge Transfer Only topology of networks will be considered. I.e. dynamics and continuous parameters often ignored. Protein Interaction Networks (PINs) are graphs where the nodes are labeled with protein names. Two nodes are connected if the proteins stick to each other. Yeast Protein Interaction Network from (PINs) do not have a temporal dynamic

Likelihood of Homologous Pathways Number of Metabolisms: symmetrical versions P  (, )=P  ( )P  ( -> ) Eleni Giannoulatou Approaches: Continuous Time Markov Chains with computational tricks. MCMC Importance Sampling

t1t1 t2t2 T Evolving Networks: Integration Koskinen,J. (2004) Bayesian Inference for Longitudinal Social Networks. Research Report, number 2004:4, Stockholm University, Department of Statistics. Koskinen,J. and Snijders,T. (2007) Bayesian inference for dynamic social network data, Journal of Statistical Planning and Inference, 137, R. Sharan, T. Ideker, Modeling cellular machinery through biological network comparison, Nature Biotechnology, 24, 427 (2006). Snijders, T. (2001) “Statistical evaluation of social networks dynamics” in Sociological Methodology By Michael Sobel Snijders, T. et al. (2008) “Maximum Likelihood Evaluation for Social Network Dynamics” In press I. Miklos, G.A. Lunter and I. Holmes (2004) A "long indel" model for evolutionary sequence alignment. Mol. Biol. Evol. 21(3): Appendix A Sum over i state assignments gives probability of paths of length i. Integrate of all waiting times (t 1,..,t i ) and state assignments of length i gives probability of specific trajectory Sum over all path lengths gives probability of N turning into N’ The above expression can be shown to be of the form And recursions O(N 2 ) exists to calculate coefficients.

Evolving Networks: MCMC Metropolis-Hasting integrating of all paths - Green (1995) version: Present pathway: Insertion of an edge pair Toggle edge k Deletion of an edge pair Moving of a pair or singles Green, P. J. (1995) Reversible jump Markov chain Monte Carlo computation and Bayesian model determination, Biometrika, 82, Set of paths: Likelihood - L( ) Probability of going from to - q(, ) J - Jacobian Acceptance ratio

P(N 1 -->N 2 ) and Corner Cutting How many networks could be visited on “almost shortest” paths? If d(N1,N2) = k, then there are 2 k networks are visitable on shortest paths. If 2  additional steps are allowed, then 2 k (L +L(L-1)/2 +(L(L-1)..(L-  +1)/  !) are visitable. Can be found in P(  ) at appropriate rows. In general not very useful (number of metabolisms). Forward with symmetries could be used in specific cases. Backward (coupling from the past) Simulations How can P( ) be evaluated?  Olle Haegstroem (2002) Finite Markov Chains and Algorithmic Applications, Cambridge University Press Lyngsø R, Y.S.Song and J.J.Hein (2008) “Accurate Computation of Likelihoods in the Coalescent with Recombination via Parsimony “ In press Recomb Example. 15 nodes, L=105, t=  t=0.05,  =2, d=4. P(4)= e /4!~.003 P(6)= e /6!<10 -4

A Model for the Evolution of Metabolisms A core metabolism: A given set of metabolites: A given set of possible reactions - arrows not shown. A set of present reactions - M black and red arrows Let  be the rate of deletion  the rate of insertion  Then Restriction R: A metabolism must define a connected graph M + R defines 1. a set of deletable (dashed) edges D(M): 2. and a set of addable edges A(M):

A Toy Example (by Aziz Mithani) Metabolic Universe 12 possible edges 1i 1u 3 1i 2u 3 2u 1i 3 2i 2u 3 Equilibrium Probability Full Exponentiation (2 12 states 4096) Exponentiation with corner cutting , 384, 960, 1280,960, 384, 64 MCMC Integration Transition Probability dist=6 Transition Probability: Adding Connectedness Favouring insertions connecting The proportion present:

Regulatory Network Evolution R.Somogyi & CA Sniegoski (1996) Modelling the Complexity of Genetic Networks Complexity Metabolic stability and epigenesis in randomly constructed genetic netsJournal of Theoretical Biology, ハ Volume 22, Issue 3, ハ March 1969, Pages S. A. KauffmanTorsten Reil: Dynamics of Gene Expression in an Artificial Genome - Implications for Biological and Artificial Ontogeny. ECAL 1999: Quayle, A. and Bullock, S. (2006) Modelling the evolution of genetic regulatory networks. Journal of Theoretical Biology, 238 (4). pp. 737 Babu, M.M., Luscombe, N.M., Aravind, L., Gerstein, M. & Teichmann, S.A. (2004) Structure and evolution of transcriptional regulatory networks. Curr. Op. Struc. Biol., 14, Nat Rev Genet Oct ;8 (10): The evolution of genetic networks by non-adaptive processes. Michael Lynch Artificial Genome Riel, 1999: Regulatory control according to rules Proteins can bind the regulatory regions Evolving Artificial Genome Quant & Bullocks, 2007: Evolve Selection will influence final dynamics

Networks: Signal Transduction Pathways Reverse Engineering Biological Networks: Opportunities and Challenges in Computational Methods for Pathway Inference Ann. N.Y. Acad. Sci. 1115: (2007). ARNOLD J. LEVINE, WENWEI, ZHAOHUI FENG AND GERMAN GIL Soyer, OS, Pfeiffer, T, Bonhoeffer, S Simulating the evolution of signal transduction pathways JOURNAL OF THEORETICAL BIOLOGY (2006) 241: R. Albert, B. DasGupta, R. Dondi, S. Kachalo, E.D. Sontag, A. Zelikovsky, and K. Westbrooks. A novel method for signal transduction network inference from indirect experimental evidence. Journal of Computational Biology, 14: , Soyer OS, Bonhoeffer S. Evolution of complexity in signaling pathways. Proc Natl Acad Sci USA. 2006;103:16337 ミ Mutational Process: recruitment/loss + change of interactions Evolution Dynamics One protein is receptor, one effector. Activating receptor creates cascade effect described by simple equation system. Fitness n - number of proteins, c - fitness cost per protein, a - functionality criteria

Models of Protein Interaction Networks Evolution Barabasi & Oltvai, 2004 & Berg et al.,2004; Wiuf etal., 2006 A gene duplicates Inherits it connections The connections can change Berg et al.,2004: Gene duplication slow ~10 -9 /year Connection evolution fast ~10 -6 /year Observed networks can be modeled as if node number was fixed.

Likelihood of PINs Irreducible (and isomorphic) Data de-DAing De-connecting 2386 nodes and 7221 links 735 nodes Can only handle 1 graph. Limited Evolution Model

Inference and Evolution Observe (data) Evolve HumanMouse A’A’ B’ C’ D’ A B C D Infer network

Suggestion: Evolving Dynamical Systems Goal: a time reversible model with sparse mass action system of order three!! Adding/Deleting components (TKF91): Delete rate: k  Add rate: (k+1) Adding reactions with birth of component: There are 3k(k-1) possible reactions involving a new-born Reaction Coefficients: Continuous Time Continuous States Markov Process - specifically Diffusion. For instance Ornstein-Uhlenbeck, which has Gausssian equilibrium distribution

Network Example: Cell Cycle Evolve! What is the edit distance? Which properties are conserved? As N1 starts to evolve, you can only add reactions. Isn’t that strange? If you only knew Budding Yeast, how much would you know about Fission Yeast? On a path from N1 to N2 how close to the minimal has evolution travelled? What is the number of equation systems possible for N1?