Elhanan Borenstein Spring 2011 Complex (Biological) Networks Some slides are based on slides from courses given by Roded Sharan and Tomer Shlomi Today:

Slides:



Advertisements
Similar presentations
Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein Biological Networks Analysis Introduction and Dijkstras algorithm.
Advertisements

Network analysis Sushmita Roy BMI/CS 576
Biological Networks Analysis Degree Distribution and Network Motifs
The Architecture of Complexity: Structure and Modularity in Cellular Networks Albert-László Barabási University of Notre Dame title.
Emergence of Scaling in Random Networks Albert-Laszlo Barabsi & Reka Albert.
Analysis and Modeling of Social Networks Foudalis Ilias.
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
Advanced Topics in Data Mining Special focus: Social Networks.
CS 599: Social Media Analysis University of Southern California1 The Basics of Network Analysis Kristina Lerman University of Southern California.
1 Evolution of Networks Notes from Lectures of J.Mendes CNR, Pisa, Italy, December 2007 Eva Jaho Advanced Networking Research Group National and Kapodistrian.
Complex Networks Third Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Emergence of Scaling in Random Networks Barabasi & Albert Science, 1999 Routing map of the internet
Scale-free networks Péter Kómár Statistical physics seminar 07/10/2008.
Mining and Searching Massive Graphs (Networks)
A Real-life Application of Barabasi’s Scale-Free Power-Law Presentation for ENGS 112 Doug Madory Wed, 1 JUN 05 Fri, 27 MAY 05.
Biological Networks Feng Luo.
Regulatory networks 10/29/07. Definition of a module Module here has broader meanings than before. A functional module is a discrete entity whose function.
Global topological properties of biological networks.
Advanced Topics in Data Mining Special focus: Social Networks.
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
Optimization Based Modeling of Social Network Yong-Yeol Ahn, Hawoong Jeong.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
Models and Algorithms for Complex Networks Networks and Measurements Lecture 3.
Biological Networks Lectures 6-7 : February 02, 2010 Graph Algorithms Review Global Network Properties Local Network Properties 1.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Stefano Boccaletti Complex networks in science and society *Istituto Nazionale di Ottica Applicata - Largo E. Fermi, Florence, ITALY *CNR-Istituto.
Complex Networks First Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Social Network Analysis Prof. Dr. Daning Hu Department of Informatics University of Zurich Mar 5th, 2013.
Analysis of biological networks Part III Shalev Itzkovitz Shalev Itzkovitz Uri Alon’s group Uri Alon’s group July 2005 July 2005.
Biological Networks & Network Evolution Eric Xing
Algorithms for Biological Networks Prof. Tijana Milenković Computer Science and Engineering University of Notre Dame Fall 2010.
Networks Igor Segota Statistical physics presentation.
Complex Networks: Models Lecture 2 Slides by Panayiotis TsaparasPanayiotis Tsaparas.
Class 9: Barabasi-Albert Model-Part I
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Network Evolution Statistics of Networks Comparing Networks Networks in Cellular Biology A. Metabolic Pathways B. Regulatory Networks C. Signaling Pathways.
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
LECTURE 2 1.Complex Network Models 2.Properties of Protein-Protein Interaction Networks.
Most of contents are provided by the website Network Models TJTSD66: Advanced Topics in Social Media (Social.
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Bioinformatics Center Institute for Chemical Research Kyoto University
341: Introduction to Bioinformatics
March 3, 2009 Network Analysis Valerie Cardenas Nicolson Assistant Adjunct Professor Department of Radiology and Biomedical Imaging.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Constructing and Analyzing a Gene Regulatory Network Siobhan Brady UC Davis.
Class 2: Graph Theory IST402. Can one walk across the seven bridges and never cross the same bridge twice? Network Science: Graph Theory THE BRIDGES OF.
Class 2: Graph Theory IST402.
1 Lesson 12 Networks / Systems Biology. 2 Systems biology  Not only understanding components! 1.System structures: the network of gene interactions and.
Information Retrieval Search Engine Technology (10) Prof. Dragomir R. Radev.
Netlogo demo. Complexity and Networks Melanie Mitchell Portland State University and Santa Fe Institute.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Lecture II Introduction to complex networks Santo Fortunato.
Comparative Network Analysis BMI/CS 776 Spring 2013 Colin Dewey
Cmpe 588- Modeling of Internet Emergence of Scale-Free Network with Chaotic Units Pulin Gong, Cees van Leeuwen by Oya Ünlü Instructor: Haluk Bingöl.
Network (graph) Models
Lecture 23: Structure of Networks
Structures of Networks
Bioinformatics 3 V6 – Biological Networks are Scale- free, aren't they? Fri, Nov 2, 2012.
Hiroki Sayama NECSI Summer School 2008 Week 2: Complex Systems Modeling and Networks Network Models Hiroki Sayama
Biological networks CS 5263 Bioinformatics.
Lecture 23: Structure of Networks
Biological Networks Analysis Degree Distribution and Network Motifs
CSCI2950-C Lecture 13 Network Motifs; Network Integration
Modelling Structure and Function in Complex Networks
Lecture 23: Structure of Networks
Presentation transcript:

Elhanan Borenstein Spring 2011 Complex (Biological) Networks Some slides are based on slides from courses given by Roded Sharan and Tomer Shlomi Today: Measuring Network Topology Thursday: Analyzing Metabolic Networks

Measuring Network Topology  Introduction to network theory  Global Measures of Network Topology  Degree Distribution  Clustering Coefficient  Average Distance  Random Network Models  Network Motifs

What is a Network?  A map of interactions or relationships  A collection of nodes and links (edges)

What is a Network?  A map of interactions or relationships  A collection of nodes and links (edges)

Why Networks?  Focus on the organization of the system (rather than on its components)  Simple representation  Visualization of complex systems  Networks as tools  Underlying diffusion model (e.g. evolution on networks)  The structure and topology of the system affect (determine) its function

Networks vs. Graphs  Graph Theory  Definition of a graph: G=(V,E)  V is the set of nodes/vertices (elements)  |V|=N  E is the set of edges (relations)  One of the most well studied objects in CS  Subgraph finding (e.g., clique, spanning tree) and alignment  Graph coloring and graph covering  Route finding (Hamiltonian path, traveling salesman, etc.)  Many problems are proven to be NP-complete

The Seven Bridges of Königsberg  Published by Leonhard Euler, 1736  Considered the first paper in graph theory

Types of Graphs/Networks  Directed/undirected  Weighted/non-weighted  Directed Acyclic Graphs (DAG) / Trees  Bipartite Graphs  Hypergraphs

 Which is the most useful representation? Computational Representation of Networks B C A D ABCD A0010 B0000 C0100 D0110 Connectivity Matrix List/set of edges: (ordered) pairs of nodes { (A,C), (C,B), (D,B), (D,C) } Object Oriented Name:A ngr: p1 Name:B ngr: Name:C ngr: p1 Name:D ngr: p1p2

Network Visualization Cytoscape VisualComplexity.com  Art? Science?

Networks in Biology  Molecular networks:  Protein-Protein Interaction (PPI) networks  Metabolic Networks  Regulatory Network  Synthetic lethality Network  Gene Interaction Network  More …

Metabolic Networks  Reflect the set of biochemical reactions in a cell  Nodes: metabolites  Edges: biochemical reactions  Additional representations!  Derived through:  Knowledge of biochemistry  Metabolic flux measurements S. Cerevisiae 1062 metabolites 1149 reactions

 Reflect the cell’s molecular interactions and signaling pathways (interactome)  Nodes: proteins  Edges: interactions(?)  High-throughput experiments:  Protein Complex-IP (Co-IP)  Yeast two-hybrid Protein-Protein Interaction (PPI) Networks S. Cerevisiae 4389 proteins interactions

Transcriptional Regulatory Network  Reflect the cell’s genetic regulatory circuitry  Nodes: transcription factors (TFs) and genes;  Edges (directed): from TF to the genes it regulates  Derived through:  Chromatin IP  Microarrays

Other Networks in Biology/Medicine

Non-Biological Networks  Computer related networks:  WWW; Internet backbone  Communication and IP  Social networks:  Friendship (facebook; clubs)  Citations / information flow  Co-authorships (papers); Co-occurrence (movies; Jazz)  Transportation:  Highway system; Airline routes  Electronic/Logic circuits  Many more…

Global Measures of Network Topology

Node Degree / Rank  Degree = Number of neighbors  Local characterization!  Node degree in PPI networks correlates with:  Gene essentiality  Conservation rate  Likelihood to cause human disease

Degree Distribution  Degree distribution P(k): probability that a node has degree k  For directed graphs, two distributions:  In-degree  out-degree  Average degree:  Number of edges: Nd/2

Common Distributions  Poisson:  Exponential:  Power-law:

The Power-Law Distribution  Fat or heavy tail!  Leads to a “scale-free” network  Characterized by a small number of highly connected nodes, known as hubs  Hubs are crucial:  Affect error and attack tolerance of complex networks (Albert et al. Nature, 2000)  ‘party’ hubs and ‘date’ hubs

The Internet  Nodes – 150,000 routers  Edges – physical links  P(k) ~ k -2.3 Govindan and Tangmunarunkit, 2000

Movie Actor Collaboration Network  Nodes – 212,250 actors  Edges – co-appearance in a movie  ( = 28.78)  P(k) ~ k -2.3 Barabasi and Albert, Science, 1999 Tropic Thunder (2008)

Protein Interaction Networks Yook et al, Proteomics, 2004  Nodes – Proteins  Edges – Interactions (yeast)  P(k) ~ k -2.5

Metabolic Networks C.Elegans (eukaryote) E. Coli (bacterium) Averaged (43 organisms) A.Fulgidus (archae) Jeong et al., Nature, 2000  Nodes – Metabolites  Edges – Reactions  P(k) ~ k -2.2±2  Metabolic networks across all kingdoms of life are scale-free

Network Clustering Costanzo et al., Nature, 2010

 Characterizes tendency of nodes to cluster  “triangles density”  “How often do my (facebook) friends know each other (if d i = 0 or 1 then C i is defined to be 0) Clustering Coefficient (Watts & Strogatz)

Clustering Coefficient: Example C i =10/10=1 C i =3/10=0.3C i =0/10=0  Lies in [0,1]  For cliques: C=1  For triangle-free graphs: C=0

Average Distance  Distance: Length of shortest (geodesic) path between two nodes  Average distance: average over all connected pairs

Small World Networks  Despite their often large size, in most (real) networks there is a relatively short path between any two nodes  “Six degrees of separation” (Stanley Milgram;1967)  Collaborative distance:  Erdös number  Bacon number Danica McKellar: 6 Natalie Portman: 6 Daniel Kleitman: 3

Network Structure in Real Networks

Additional Measures  Network Modularity  Giant component  Betweenness centrality  Current information flow  Bridging centrality  Spectral density

Random Network Models 1.Random Graphs (Erdös/Rényi) 2.Generalized Random Graphs 3.Geometric Random Graphs 4.The Small World Model (WS) 5.Preferential Attachment

Random Graphs (Erdös/Rényi)  N nodes  Every pair of nodes is connected with probability p  Mean degree: d = (N-1)p ~ Np

Random Graphs: Properties  Mean degree: d = (N-1)p ~ Np  Degree distribution is binomial  Asymptotically Poisson:  Clustering Coefficient:  The probability of connecting two nodes at random is p  Clustering coefficient is C=p  In many large networks p ~ 1/n  C is lower than observed  Average distance:  l~ln(N)/ln(d) …. (think why?)  Small world! (and fast spread of information)

Generalized Random Graphs  A generalized random graph with a specified degree sequence (Bender & Canfield ’78)  Creating such a graph: 1.Prepare k copies of each degree-k node 2.Randomly assign node copies to edges 3.[Reject if the graph is not simple] This algorithm samples uniformly from the collection of all graphs with the specified degree sequence!

Geometric Random Graphs  G=(V,r)  V – set of points in a metric space (e.g. 2D)  E – all pairs of points with distance ≤ r  Captures spatial relationships  Poisson degree distribution

 Generate graphs with high clustering coefficients C and small distance l  Rooted in social systems 1.Start with order (every node is connected to its K neighbors) 2.Randomize (rewire each edge with probability p)  Degree distribution is similar to that of a random graph! The Small World Model (WS) Watts and Strogatz, Nature, 1998 Varying p leads to transition between order (p=0) and randomness (p=1)

 A generative model (dynamics)  Growth: degree-m nodes are constantly added  Preferential attachment: the probability that a new node connects to an existing one is proportional to its degree  “The rich get richer” principle The Scale Free Model: Preferential Attachment Albert and Barabasi, 2002

Preferential Attachment: Clustering Coefficient C ~ N -01 C ~ N -0.75

Preferential Attachment: Empirical Evidence  Highly connected proteins in a PPI network are more likely to evolve new interactions Wagner, A. Proc. R. Soc. Lond. B, 2003

Model Problems  Degree distribution is fixed (although there are generalizations of this method that handle various distributions)  Clustering coefficient approaches 0 with network size, unlike real networks  Issues involving biological network growth:  Ignores local events shaping real networks (e.g., insertions/deletions of edges)  Ignores growth constraints (e.g., max degree) and aging (a node is active in a limited period)

Conclusions  No single best model!  Models differ in various network measures  Different models capture different attributes of real networks  In literature, “random graphs” and “generalized random graphs” are most commonly used

Network Motifs

 Going beyond degree distribution …  Generalization of sequence motifs  Basic building blocks  Evolutionary design principles R. Milo et al. Network motifs: simple building blocks of complex networks. Science, 2002

What are Network Motifs?  Recurring patterns of interactions (subgraphs) that are significantly overrepresented (w.r.t. a background model) R. Milo et al. Network motifs: simple building blocks of complex networks. Science, possible 3-nodes subgraphs

Finding motifs in the Network 1.Generate randomized networks 2a.Scan for all n-node subgraphs in the real network 2b.Record number of appearances of each subgraph (consider isomorphic architectures) 3a.Scan for all n-node sub graphs in rand’ networks 3b.Record number of appearances of each sub graph 4.Compare each subgraph’s data and choose motifs

Finding motifs in the Network

Network Randomization  Preserve in-degree, out-degree and mutual degree  For motifs with n>3 also preserve distribution of smaller sub-motifs (simulated annealing)

Generation of Randomized Networks  Algorithm A (Markov-chain algorithm):  Start with the real network and repeatedly swap randomly chosen pairs of connections (X1  Y1, X2  Y2 is replaced by X1  Y2, X2  Y1)  Repeat until the network is well randomized  Switching is prohibited if the either of the connections X1  Y2 or X2  Y1 already exist X1 X2Y2 Y1X1 X2Y2 Y1

Generation of Randomized Networks  Algorithm B (Generative):  Record marginal weights of original network  Start with an empty connectivity matrix M  Choose a row n & a column m according to marginal weights  If M nm = 0, set M nm = 1; Update marginal weights  Repeat until all marginal weights are 0  If no solution is found, start from scratch B C A D ABCD A00101 B00000 C01002 D ABCD A00001 B00000 C00002 D ABCD A00001 B00000 C00002 D ABCD A00001 B00000 C01001 D

Criteria for Network Motifs  Subgraphs that meet the following criteria: 1.The probability that it appears in a randomized network an equal or greater number of times than in the real network is smaller than P = The number of times it appears in the real network with distinct sets of nodes is at least 4 3.The number of appearances in the real network is significantly larger than in the randomized networks: (N real –N rand > 0.1N rand )

 E. Coli network  424 operons (116 TFs)  577 interactions  Significant enrichment of FFLs  Coherent FFLs:  The direct effect of x on z has the same sign as the net indirect effect through y  85% of FFLs are coherent Feed-Forward Loops in Transcriptional Regulatory Networks S. Shen-Orr et al. Nature Genetics 2002 X Y Z General TF Specific TF Effector operon

What’s So Cool about FFLs A simple cascade has slower shutdown Boolean Kinetics A coherent feed-forward loop can act as a circuit that rejects transient activation signals from the general transcription factor and responds only to persistent signals, while allowing a rapid system shutdown.

Network Motifs in Biological Networks FFL motif is under-represented!

Information Flow vs. Energy Flow FFL motif is under-represented!

Network Motifs in Technological Networks

 An incomplete null model?  Local clustering:  Neighboring neurons have a greater chance of forming a connection than distant neurons  Similar motifs are obtained in random graphs devoid of any selection rule  Gaussian toy network  Preferential-attachment rule Criticism of the Randomization Approach Y. Artzy-Randrup et al. Comment on “Network motifs: simple building blocks of complex networks”. Gaussian “toy network"

Network Comparison: Motif-Based Network Superfamilies R. Milo et al. Superfamilies of evolved and designed networks. Science, 2004

Evolutionary Conservation of Motif Elements Wuchty et al. Nature Genetics, 2003