Bioinformatics 3 V6 – Biological Networks are Scale- free, aren't they? Fri, Nov 2, 2012.

Slides:



Advertisements
Similar presentations
Peer-to-Peer and Social Networks Power law graphs Small world graphs.
Advertisements

Complex Networks Advanced Computer Networks: Part1.
Network analysis Sushmita Roy BMI/CS 576
Scale Free Networks.
Bioinformatics 3 – WS 14/15V 6 – Bioinformatics 3 V6 – Biological PPI Networks - are they really scale-free? - network growth - functional annotation in.
The Architecture of Complexity: Structure and Modularity in Cellular Networks Albert-László Barabási University of Notre Dame title.
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
Information Networks Generative processes for Power Laws and Scale-Free networks Lecture 4.
School of Information University of Michigan SI 614 Random graphs & power law networks preferential attachment Lecture 7 Instructor: Lada Adamic.
1 Evolution of Networks Notes from Lectures of J.Mendes CNR, Pisa, Italy, December 2007 Eva Jaho Advanced Networking Research Group National and Kapodistrian.
Complex Networks Third Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Networks. Graphs (undirected, unweighted) has a set of vertices V has a set of undirected, unweighted edges E graph G = (V, E), where.
The Barabási-Albert [BA] model (1999) ER Model Look at the distribution of degrees ER ModelWS Model actorspower grid www The probability of finding a highly.
Scale Free Networks Robin Coope April Abert-László Barabási, Linked (Perseus, Cambridge, 2002). Réka Albert and AL Barabási,Statistical Mechanics.
Biological Networks Feng Luo.
Sedgewick & Wayne (2004); Chazelle (2005) Sedgewick & Wayne (2004); Chazelle (2005)
Global topological properties of biological networks.
Advanced Topics in Data Mining Special focus: Social Networks.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 7 May 14, 2006
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Summary from Previous Lecture Real networks: –AS-level N= 12709, M=27384 (Jan 02 data) route-views.oregon-ix.net, hhtp://abroude.ripe.net/ris/rawdata –
Computer Science 1 Web as a graph Anna Karpovsky.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Optimization Based Modeling of Social Network Yong-Yeol Ahn, Hawoong Jeong.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
九大数理集中講義 Comparison, Analysis, and Control of Biological Networks (3) Domain-Based Mathematical Models for Protein Evolution Tatsuya Akutsu Bioinformatics.
8. Lecture WS 2004/05Bioinformatics III1 Protein protein interaction networks Most ideas on biological networks based on initial HT data for Sacharomyces.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Stefano Boccaletti Complex networks in science and society *Istituto Nazionale di Ottica Applicata - Largo E. Fermi, Florence, ITALY *CNR-Istituto.
V 1 – Introduction Mon, Oct 15, 2012 Bioinformatics 3 — Volkhard Helms.
Complex Networks First Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
2. Lecture WS 2008/09Bioinformatics III1 V2 – network topologies Content of today‘s lecture 1 some definitions on mathematical graphs 2Dijkstra‘s algorithm.
Part 1: Biological Networks 1.Protein-protein interaction networks 2.Regulatory networks 3.Expression networks 4.Metabolic networks 5.… more biological.
Intel Confidential – Internal Only Co-clustering of biological networks and gene expression data Hanisch et al. This paper appears in: bioinformatics 2002.
10 December, 2008 CIMCA2008 (Vienna) 1 Statistical Inferences by Gaussian Markov Random Fields on Complex Networks Kazuyuki Tanaka, Takafumi Usui, Muneki.
Class 9: Barabasi-Albert Model-Part I
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
LECTURE 2 1.Complex Network Models 2.Properties of Protein-Protein Interaction Networks.
2. Lecture WS 2006/07Bioinformatics III1 V2 – network topologies - Random graphs: classical field in graph theory. Well studied analytically and numerically.
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Bioinformatics Center Institute for Chemical Research Kyoto University
Performance Evaluation Lecture 1: Complex Networks Giovanni Neglia INRIA – EPI Maestro 10 December 2012.
Bioinformatics 3 – WS 15/16V 6 – V6 – Biological PPI Networks - are they really scale-free? - network growth - functional annotation in the network Mon,
Robustness, clustering & evolutionary conservation Stefan Wuchty Center of Network Research Department of Physics University of Notre Dame title.
Response network emerging from simple perturbation Seung-Woo Son Complex System and Statistical Physics Lab., Dept. Physics, KAIST, Daejeon , Korea.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Comparative Network Analysis BMI/CS 776 Spring 2013 Colin Dewey
Cmpe 588- Modeling of Internet Emergence of Scale-Free Network with Chaotic Units Pulin Gong, Cees van Leeuwen by Oya Ünlü Instructor: Haluk Bingöl.
The simultaneous evolution of author and paper networks
Network (graph) Models
Structures of Networks
V6 – Biological PPI Networks - are they really scale-free
Lecture 1: Complex Networks
Topics In Social Computing (67810)
Biological networks CS 5263 Bioinformatics.
Learning to Generate Networks
On Growth of Limited Scale-free Overlay Network Topologies
Network biology : protein – protein interactions
Models of networks (synthetic networks or generative models): Random, Small-world, Scale-free, Configuration model and Random geometric model By: Ralucca.
Section 8.6: Clustering Coefficients
Network Analysis of Protein-Protein Interactions
Assessing Hierarchical Modularity in Protein Interaction Networks
Biological Networks Analysis Degree Distribution and Network Motifs
Section 8.6 of Newman’s book: Clustering Coefficients
Social Network Analysis
Clustering Coefficients
Bioinformatics 3 V6 – Biological PPI Networks - are they really scale-free? - network growth - functional annotation in the network Fri, Nov 8, 2013.
Modelling Structure and Function in Complex Networks
Advanced Topics in Data Mining Special focus: Social Networks
Presentation transcript:

Bioinformatics 3 V6 – Biological Networks are Scale- free, aren't they? Fri, Nov 2, 2012

"Do they look like" scale-free??? Jeong, Mason, Barabási, Oltvai, Nature 411 (2001) 41 → "PPI networks apparently are scale-free…" "Are" they scale-free or "Do they look like" scale-free??? largest cluster of the yeast proteome (at 2001)

Partial Sampling Estimated for yeast: 6000 proteins, 30000 interactions Y2H covers only 3…9% of the complete interactome! Han et al, Nature Biotech 23 (2005) 839

Nature Biotech 23 (2005) 839 Generate networks of various types, sample sparsely from them → degree distribution? • Random (ER) → P(k) = Poisson • Exponential → P(k) ~ exp[-k] • scale-free → P(k) ~ k–γ • P(k) = truncated normal distribution

Sparsely Sampled ER Network resulting P(k) for different coverages linearity between P(k) and power law → for sparse sampling, even an ER networks "looks" scale-free (when only P(k) is considered) Han et al, Nature Biotech 23 (2005) 839

Anything Goes Han et al, Nature Biotech 23 (2005) 839

Compare to Uetz et al. Data Sampling density affects observed degree distribution → true underlying network cannot be identified from available data Han et al, Nature Biotech 23 (2005) 839

Which Network Type? Biochem. Soc. Trans. 31 (2001) 1491

Protein Association Network Proteins interact (bind) via complementary domains → randomly distribute 2m domains onto n proteins with prob. p → on avg. λ = 2mp domains per protein Typical numbers (yeast): n = 6000, m = 1000, λ = 1…2 Central network sub-structure: complete bi-partite graphs

Human Bipartite Graphs Parts of the human interactome from the Pronet database (www.myriad-pronet.com) Thomas et al., Biochem. Soc. Trans. 31 (2001) 1491

Partial Sampling P(k) of the modeled interactome: n = 6000, m = 1000, λ = 1, 2 all nodes and vertices 450 proteins with avg 5 neighbors power law Ito Uetz simulated γ ≈ 2.2 Sparsely sampled protein-domain-interaction network fits very well → is this the correct mechanism? Thomas et al., Biochem. Soc. Trans. 31 (2001) 1491

Network Growth Mechanisms Given: an observed PPI network → how did it grow (evolve)? PNAS 102 (2005) 3192 Look at network motifs (local connectivity): compare motif distributions from various network prototypes to fly network Idea: each growth mechanism leads to a typical motif distribution, even if global measures are equal

The Fly Network Y2H PPI network for D. melanogaster from Giot et al. [Science 302 (2003) 1727] Confidence score [0, 1] for every observed interaction → use only data with p > 0.65 (0.5) → remove self-interactions and isolated nodes percolation events for p > 0.65 High confidence network with 3359 (4625) nodes and 2795 (4683) edges Use prototype networks of same size for training Middendorf et al, PNAS 102 (2005) 3192

Network Motives All non-isomorphic subgraphs that can be generated with a walk of length 8 Middendorf et al, PNAS 102 (2005) 3192

Growth Mechanisms Generate 1000 networks, each, of the following seven types (Same size as fly network, undefined parameters were scanned) DMC Duplication-mutation, preserving complementarity DMR Duplication with random mutations RDS Random static networks RDG Random growing network LPA Linear preferential attachment network AGV Aging vertices network SMW Small world network

Growth Type 1: DMC "Duplication – mutation with preserved complementarity" Evolutionary idea: gene duplication, followed by a partial loss of function of one of the copies, making the other copy essential Algorithm: Start from two connected nodes, repeat N - 2 times: • duplicate existing node with all interactions • for all neighbors: delete with probability qdel either link from original node or from copy

Growth Type 2: DMR "Duplication with random mutations" Gene duplication, but no correlation between original and copy (original unaffected by copy) Algorithm: Start from five-vertex cycle, repeat N - 5 times: • duplicate existing node with all interactions • for all neighbors: delete with probability qdel link from copy • add new links to non-neighbors with probability qnew/n

Growth Types 3–5: RDS, RDG, and LPA RDS = static random network Start from N nodes, add L links randomly RDG = growing random network Start from small random network, add nodes, then edges between all existing nodes LPA = linear preferential attachment Add new nodes similar to Barabási-Albert algorithm, but with preference according to (ki + α), α = 0…5 (BA for α = 0) For larger α: preference only for larger hubs, no difference for lower ki

Growth Types 6-7: AGV and SMW AGV = aging vertices network Like growing random network, but preference decreases with age of the node → citation network: more recent publications are cited more likely SMW = small world networks (Watts, Strogatz, Nature 363 (1998) 202) Randomly rewire regular ring lattice

Alternating Decision Tree Classifier Trained with the motif counts from 1000 networks of each of the seven types → prototypes are well separated and reliably classified Prediction accuracy for networks similar to fly network with p = 0.5: Part of a trained ADT Middendorf et al, PNAS 102 (2005) 3192

Are They Different? Example DMR vs. RDG: Similar global parameters, but different counts of the network motifs Middendorf et al, PNAS 102 (2005) 3192

How Did the Fly Evolve? → Best overlap with DMC (Duplication-mutation, preserved complementarity) → Scale-free or random networks are very unlikely → what about protein-domain-interaction network of Thomas et al? Middendorf et al, PNAS 102 (2005) 3192

Motif Count Frequencies rank score: fraction of test networks with a higher count than Drosophila (50% = same count as fly on avg.) Middendorf et al, PNAS 102 (2005) 3192

Experimental Errors? Randomly replace edges in fly network and classify again: → Classification unchanged for ≤ 30% incorrect edges

Suggested Reading Molecular BioSystems 5 (2009)1482

Summary What you learned today: Sampling matters! → "Scale-free" P(k) by sparse sampling from many network types Test different hypotheses for • global features → depends on unknown parameters and sampling → no clear statement possible • local features (motifs) → are better preserved → DMC best among tested prototypes Next lecture: • Functional annotation of proteins • Gene regulation networks: how causality spreads