Biological Networks Feng Luo.

Slides:



Advertisements
Similar presentations
Course Evaluation Form About The Course -Go more slowly (||) -More lectures (||) -Problem Sets, Class Projects (|||) -Software tools About The Instructor.
Advertisements

Network analysis Sushmita Roy BMI/CS 576
The Architecture of Complexity: Structure and Modularity in Cellular Networks Albert-László Barabási University of Notre Dame title.
An Intro To Systems Biology: Design Principles of Biological Circuits Uri Alon Presented by: Sharon Harel.
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
Weighted networks: analysis, modeling A. Barrat, LPT, Université Paris-Sud, France M. Barthélemy (CEA, France) R. Pastor-Satorras (Barcelona, Spain) A.
1 Evolution of Networks Notes from Lectures of J.Mendes CNR, Pisa, Italy, December 2007 Eva Jaho Advanced Networking Research Group National and Kapodistrian.
Complex Networks Third Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Emergence of Scaling in Random Networks Barabasi & Albert Science, 1999 Routing map of the internet
Transcription Networks And The Cell’s Functional Organization Presenter: Roni Sharf.
Scale-free networks Péter Kómár Statistical physics seminar 07/10/2008.
Exp. vs. Scale-Free Poisson distribution Exponential Network Power-law distribution Scale-free Network.
The Barabási-Albert [BA] model (1999) ER Model Look at the distribution of degrees ER ModelWS Model actorspower grid www The probability of finding a highly.
The structure of the Internet. How are routers connected? Why should we care? –While communication protocols will work correctly on ANY topology –….they.
Scale Free Networks Robin Coope April Abert-László Barabási, Linked (Perseus, Cambridge, 2002). Réka Albert and AL Barabási,Statistical Mechanics.
A Real-life Application of Barabasi’s Scale-Free Power-Law Presentation for ENGS 112 Doug Madory Wed, 1 JUN 05 Fri, 27 MAY 05.
1 Complex systems Made of many non-identical elements connected by diverse interactions. NETWORK New York Times Slides: thanks to A-L Barabasi.
From Complex Networks to Human Travel Patterns Albert-László Barabási Center for Complex Networks Research Northeastern University Department of Medicine.
Regulatory networks 10/29/07. Definition of a module Module here has broader meanings than before. A functional module is a discrete entity whose function.
Sedgewick & Wayne (2004); Chazelle (2005) Sedgewick & Wayne (2004); Chazelle (2005)
Modularity in Biological networks.  Hypothesis: Biological function are carried by discrete functional modules.  Hartwell, L.-H., Hopfield, J. J., Leibler,
Global topological properties of biological networks.
Network Motifs: simple Building Blocks of Complex Networks R. Milo et. al. Science 298, 824 (2002) Y. Lahini.
Modular Organization of Protein Interaction Network Feng Luo, Ph.D. Department of Computer Science Clemson University.
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
The structure of the Internet. How are routers connected? Why should we care? –While communication protocols will work correctly on ANY topology –….they.
The structure of the Internet. The Internet as a graph Remember: the Internet is a collection of networks called autonomous systems (ASs) The Internet.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
Optimization Based Modeling of Social Network Yong-Yeol Ahn, Hawoong Jeong.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
Models and Algorithms for Complex Networks Networks and Measurements Lecture 3.
Gene Regulatory Networks slides adapted from Shalev Itzkovitz’s talk given at IPAM UCLA on July 2005.
Topological Analysis in PPI Networks & Network Motif Discovery Jin Chen MSU CSE Fall 1.
School of Information University of Michigan SI 614 Network subgraphs (motifs) Biological networks Lecture 11 Instructor: Lada Adamic.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Stefano Boccaletti Complex networks in science and society *Istituto Nazionale di Ottica Applicata - Largo E. Fermi, Florence, ITALY *CNR-Istituto.
Part 1: Biological Networks 1.Protein-protein interaction networks 2.Regulatory networks 3.Expression networks 4.Metabolic networks 5.… more biological.
Social Network Analysis Prof. Dr. Daning Hu Department of Informatics University of Zurich Mar 5th, 2013.
Analysis of biological networks Part III Shalev Itzkovitz Shalev Itzkovitz Uri Alon’s group Uri Alon’s group July 2005 July 2005.
Biological Networks & Network Evolution Eric Xing
School of Information University of Michigan Unless otherwise noted, the content of this course material is licensed under a Creative Commons Attribution.
Class 9: Barabasi-Albert Model-Part I
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Network Evolution Statistics of Networks Comparing Networks Networks in Cellular Biology A. Metabolic Pathways B. Regulatory Networks C. Signaling Pathways.
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
LECTURE 2 1.Complex Network Models 2.Properties of Protein-Protein Interaction Networks.
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Bioinformatics Center Institute for Chemical Research Kyoto University
Network resilience.
Introduction to complex networks Part I: Structure
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Robustness, clustering & evolutionary conservation Stefan Wuchty Center of Network Research Department of Physics University of Notre Dame title.
1 Lesson 12 Networks / Systems Biology. 2 Systems biology  Not only understanding components! 1.System structures: the network of gene interactions and.
Hierarchical Organization in Complex Networks by Ravasz and Barabasi İlhan Kaya Boğaziçi University.
Response network emerging from simple perturbation Seung-Woo Son Complex System and Statistical Physics Lab., Dept. Physics, KAIST, Daejeon , Korea.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Lecture II Introduction to complex networks Santo Fortunato.
Comparative Network Analysis BMI/CS 776 Spring 2013 Colin Dewey
Scale-free and Hierarchical Structures in Complex Networks L. Barabasi, Z. Dezso, E. Ravasz, S.H. Yook and Z. Oltvai Presented by Arzucan Özgür.
Cmpe 588- Modeling of Internet Emergence of Scale-Free Network with Chaotic Units Pulin Gong, Cees van Leeuwen by Oya Ünlü Instructor: Haluk Bingöl.
Structures of Networks
Bioinformatics 3 V6 – Biological Networks are Scale- free, aren't they? Fri, Nov 2, 2012.
Biological networks CS 5263 Bioinformatics.
Assessing Hierarchical Modularity in Protein Interaction Networks
Biological Networks Analysis Degree Distribution and Network Motifs
Social Network Analysis
Modelling Structure and Function in Complex Networks
Presentation transcript:

Biological Networks Feng Luo

Copyright notice Many of the images in this power point presentation of other people. The Copyright belong to the original authors. Thanks!

Biological Networks Biological Systems Biological Networks Made of many non-identical elements interact each other with diverse ways. Biological Networks Biological networks as framework for the study of biological systems

Why Study Networks? It is increasingly recognized that complex systems cannot be described in a reductionist view. Understanding the behavior of such systems starts with understanding the topology of the corresponding network. Topological information is fundamental in constructing realistic models for the function of the network. We saw the complexity and volume of data sets we are dealing with. It is impossible to analysis or manage their properties and underlying principles due to their complexity.

Shortest Path/Geodesic distance Graph Terminology Node Edge Directed/Undirected Degree Shortest Path/Geodesic distance Neighborhood Subgraph Complete Graph Clique Degree Distribution Hubs

Type of Biological Networks Protein interaction networks Gene regulatory networks Metabolism networks Gene co-expression networks Signal transduction networks Genetic interaction networks

P. Uetz, et al. Nature, 2000; Ito et al., PNAS, 2001; … Protein Interactions Y2H method to detect interaction Bait and Prey protein P. Uetz, et al. Nature, 2000; Ito et al., PNAS, 2001; …

Protein Interaction Network Nodes: proteins Links: physical interactions (Jeong et al., 2001)

Metabolic network (KEGG) Graph Node: Object e.g. Chemical compound Edge: Relation between objects e.g. Chemical reaction

Nodes: chemicals (substrates) Links: chem. reaction Metabolic Network

Metabolic Network Nodes: chemicals (substrates) Links: chemistry reactions (Ravasz et al., 2002)

Gene Regulation Proteins are encoded by the DNA of the organism. Proteins regulate expression of other proteins by interacting with the DNA protein protein protein Inducer (external signal) DNA promoter region ACCGTTGCAT Coding region

Activators increase gene production X Y Activator X No transcription gene Y X binding site Y Y Y Y Sx X X* INCREASED TRANSCRIPTION X* Bound activator

Repressors decrease gene production Bound repressor X Y Sx X X* No transcription X* Bound repressor Y Y Unbound repressor Y Y X

Gene Regulatory Networks Nodes are proteins (or the genes that encode them) X Y

The gene regulatory network of E. coli Shen-Orr et. al. Nature Genetics 2002 shallow network, few long cascades. modular compact in-degree, scale free outdegree (promoter size limitation)

Gene regulatory networks

CoExpression Network Revealed from Yeast Cell Cycle Data 1. Protein fate 2. Amino acid synthesis 3. Galactose metabolism 4. Protein glycosylation and transport Cell wall organization 5. Amino acid metabolism 6. Mating 7. Glucogenesis 8. unknown 9. Cell cycle regulation Y’-cluster Histone 11. Cell differentiation 12. Protein synthesis 10. Stress response 13. Cell wall 14. Energy transport 15. Ribosomal biogenesis Ribosomal proteins Mitochondrion Protein degradation Yeast cell cycle microarray data (Spellman et al., 1998)

Signal transduction networks (BD BioScience) Elements inside same module often involved in same biological process. This separation can originate from spatial localization or from chemical specificity. Insulation allows the cell to carry out many diverse reactions without cross-talk that would harm the cell. Connectivity allows one function to influence another. Functional modules reflect the critical level of biological organization. A modular system can reuse existing, well-tested modules.

Properties of Biological Networks Scale Free Small world Hierarchical Modular Robust Motif

Scale-Free Network Degree of a node P(k) Scale-free network The number of adjacent nodes P(k) Degree distribution Frequency of nodes with degree k Scale-free network P(k) follows power law Different from random networks

Connect with probability p Erdös-Rényi model (1960) Connect with probability p p=1/6 N=10 k ~ 1.5 Pál Erdös (1913-1996) Poisson distribution - Democratic - Random

SCALE-FREE NETWORKS (1) The number of nodes (N) is NOT fixed. Networks continuously expand by the addition of new nodes Examples: WWW : addition of new documents Citation : publication of new papers (2) The attachment is NOT uniform. A node is linked with higher probability to a node that already has a large number of links. Examples : WWW : new documents link to well known sites (CNN, YAHOO, NewYork Times, etc) Citation : well cited papers are more likely to be cited again

Scale-free model P(k) ~k-3 (1) GROWTH : At every timestep we add a new node with m edges (connected to the nodes already present in the system). (2) PREFERENTIAL ATTACHMENT : The probability Π that a new node will be connected to node i depends on the connectivity ki of that node P(k) ~k-3 A.-L.Barabási & R. Albert, Science, 1999

Metabolic network Archaea Bacteria Eukaryotes Organisms from all three domains of life are scale-free networks! H. Jeong, B. Tombor, R. Albert, Z.N. Oltvai, and A.L. Barabasi, Nature, 2000

Topology of the protein network H. Jeong, S.P. Mason, A.-L. Barabasi & Z.N. Oltvai, Nature, 2001

Nature 408 307 (2000)

p53 network (mammals)

Local clustering Clustering: My friends will likely know each other! Networks are clustered [large C]

Clustering Coefficient The density of the network surrounding node I, characterized as the number of triangles through I. Related to network modularity k: neighbors of I nI: edges between node I’s neighbors The center node has 8 (grey) neighbors There are 4 edges between the neighbors C = 4 /((8*(8-1)) /2)= 4/28 = 1/7

Shortest-Path between nodes

Shortest-Path between nodes

Small-world Network Every node can be reached from every other by a small number of hops or steps High clustering coefficient and low mean-shortest path length Random graphs don’t necessarily have high clustering coefficients Social networks, the Internet, and biological networks all exhibit small-world network characteristics

Modularity in Cellular Networks Hypothesis: Biological function are carried by discrete functional modules. Hartwell, L.-H., Hopfield, J. J., Leibler, S., & Murray, A. W., Nature, 1999. Traditional view of modularity:

Modular vs. Scale-free Topology (b)

How do we know that metabolic networks are modular? clustering coefficient is the same across metabolic networks in different species with the same substrate corresponding randomized scale free network: C(N) ~ N-0.75 (simulation, no analytical result) bacteria archaea (extreme-environment single cell organisms) eukaryotes (plants, animals, fungi, protists) scale free network of the same size

Real Networks Have a Hierarchical Topology What does it mean? Many highly connected small clusters combine into few larger but less connected clusters even larger and even less connected clusters The degree of clustering follows:

2. Clustering coefficient 3. Clustering coefficient scales Properties of hierarchical networks 1. Scale-free 2. Clustering coefficient independent of N 3. Clustering coefficient scales

Hierarchy in biological systems Metabolic networks Protein networks

Can we identify the modules? topological overlap J(i,j): # of nodes both i and j link to; +1 if there is a direct (i,j) link

Modules in the E. coli metabolism E. Ravasz et al., Science, 2002

Fraction of removed nodes, f Robustness Complex systems maintain their basic functions even under errors and failures (cell  mutations; Internet  router breakdowns) fc 1 Fraction of removed nodes, f S node failure

Robustness of scale-free networks   3 : fc=1 (R. Cohen et. al., PRL, 2000) Failures Topological error tolerance 1 fc Attacks R. Albert et.al. Nature, 2000 S f 1

Attack Tolerance Max Cluster Size Path Length Max Cluster size changes accoeding to removing nodes Random network is vulnerable for random attacks, power law net is not. Random attack vs targeted atack

- lethality and topological position - Yeast protein network - lethality and topological position - Highly connected proteins are more essential (lethal)... H. Jeong, S.P. Mason, A.-L. Barabasi &Z.N. Oltvai, Nature, 2001

Network Motifs

Network motifs Comparable to electronic circuit types (i.e., logic gates) The notion of motif, widely used for sequence analysis, is generalizable to the level of networks. Network Motifs are defined as recurring patterns of interconnections found within networks at frequencies much higher than those found in randomized networks.

Random vs designed/evolved features Large networks may contain information about design principles and/or evolution of the complex system Which features are there for a reason? Design principles (e.g. feed-forward loops) Constraints (e.g. the all nodes on the Internet must be connected to each other) Evolution, growth dynamics (e.g. network growth is mainly due to gene duplication) All proteins are probably interconnected thru at least one part of the DNA – protein – metabolite network

Network motifs Uri Alon et al : “Network Motifs: Simple building Blocks of Complex Networks”; Science, 2002. Different networks were found to have different motif abundances. The motifs reflect the underlying processes that generate each type of network.

Motifs in the network motif to be found graph motif matches in the target graph http://mavisto.ipk-gatersleben.de/frequency_concepts.html

Detecting network motifs There are three main tasks in detecting network motifs: (1) Generating an ensemble of proper random networks (2) Counting the subgraphs in the real network and in random networks (3) Search for graphs that appear disproportionately in one list vs. the other

All 3-node connected subgraphs 13 different isomorphic types of 3-node connected subgraph There are: 199 4-node subgraphs, 9,364 5-node subgraphs, etc…… In order to detect network motifs, one needs to count the number of appearances of all types of n-node subgraphs in the network as well as in an ensemble of randomized networks. There are many isomorphic types of subgraphs with a given number of nodes

Motifs detected Two significant motifs appearing numerous times in non-homologous gene systems that perform diverse biological functions

S. Wuchty, Z. Oltvai & A.-L. Barabasi, Nature Genetics, 2003 Motifs II S. Wuchty, Z. Oltvai & A.-L. Barabasi, Nature Genetics, 2003

Probabilistic algorithm for subgraph sampling The problem : Exhaustive subgraph enumeration complexity scales as # of subgraphs Exponential in subgraph size Infeasible for large networks with hubs Solution : An efficient sampling algorithm

Probabilistic algorithm for subgraph sampling Instead of examining absolute subgraph counts we define subgraph concentration : Sampling algorithm :

Different probabilities of sampling different subgraphs

Weight of each sample corrects for its sampling probability P=0.14 W=7 P=0.33 W=3 4 5 1 2 3 6 7

Rapid convergence to real concentration Kashtan et. al. Bioinformatics 2004

Runtime almost independent of network size Kashtan et. al. Bioinformatics 2004