The architecture of complexity: From the topology of the www to the cell's genetic network Albert-László Barabási University of Notre Dame Zoltán N. Oltvai.

Slides:



Advertisements
Similar presentations
Course Evaluation Form About The Course -Go more slowly (||) -More lectures (||) -Problem Sets, Class Projects (|||) -Software tools About The Instructor.
Advertisements

Albert-László Barabási
The Architecture of Complexity: Structure and Modularity in Cellular Networks Albert-László Barabási University of Notre Dame title.
Marc Barthélemy CEA, France Architecture of Complex Weighted Networks.
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
Synopsis of “Emergence of Scaling in Random Networks”* *Albert-Laszlo Barabasi and Reka Albert, Science, Vol 286, 15 October 1999 Presentation for ENGS.
Advanced Topics in Data Mining Special focus: Social Networks.
4. PREFERENTIAL ATTACHMENT The rich gets richer. Empirical evidences Many large networks are scale free The degree distribution has a power-law behavior.
Weighted networks: analysis, modeling A. Barrat, LPT, Université Paris-Sud, France M. Barthélemy (CEA, France) R. Pastor-Satorras (Barcelona, Spain) A.
Hierarchy in networks Peter Náther, Mária Markošová, Boris Rudolf Vyjde : Physica A, dec
UC Davis, May 18 th 2006 Introduction to Biological Networks Eivind Almaas Microbial Systems Division.
Complex Networks Third Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Erzsébet Ravasz, Zoltán Dezsö
Emergence of Scaling in Random Networks Barabasi & Albert Science, 1999 Routing map of the internet
Scale-free networks Péter Kómár Statistical physics seminar 07/10/2008.
Exp. vs. Scale-Free Poisson distribution Exponential Network Power-law distribution Scale-free Network.
The Barabási-Albert [BA] model (1999) ER Model Look at the distribution of degrees ER ModelWS Model actorspower grid www The probability of finding a highly.
The structure of the Internet. How are routers connected? Why should we care? –While communication protocols will work correctly on ANY topology –….they.
Biological Networks Feng Luo.
1 Complex systems Made of many non-identical elements connected by diverse interactions. NETWORK New York Times Slides: thanks to A-L Barabasi.
From Complex Networks to Human Travel Patterns Albert-László Barabási Center for Complex Networks Research Northeastern University Department of Medicine.
Regulatory networks 10/29/07. Definition of a module Module here has broader meanings than before. A functional module is a discrete entity whose function.
Sedgewick & Wayne (2004); Chazelle (2005) Sedgewick & Wayne (2004); Chazelle (2005)
Global topological properties of biological networks.
The structure of the Internet. How are routers connected? Why should we care? –While communication protocols will work correctly on ANY topology –….they.
The structure of the Internet. The Internet as a graph Remember: the Internet is a collection of networks called autonomous systems (ASs) The Internet.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Error and Attack Tolerance of Complex Networks Albert, Jeong, Barabási (presented by Walfredo)
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
Optimization Based Modeling of Social Network Yong-Yeol Ahn, Hawoong Jeong.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
Models and Algorithms for Complex Networks Networks and Measurements Lecture 3.
Complex networks A. Barrat, LPT, Université Paris-Sud, France I. Alvarez-Hamelin (LPT, Orsay, France) M. Barthélemy (CEA, France) L. Dall’Asta (LPT, Orsay,
Dynamics of Complex Networks I: Networks II: Percolation Panos Argyrakis Department of Physics University of Thessaloniki.
Complex networks in nature PHYSBIO 2007 Imre Derényi Dept. of Biological Physics, Eötvös University, Budapest Complex systems are often made of many non-identical.
Class 9: Barabasi-Albert Model Network Science: Evolving Network Models February 2015 Prof. Boleslaw Szymanski Prof. Albert-László Barabási Dr. Baruch.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Stefano Boccaletti Complex networks in science and society *Istituto Nazionale di Ottica Applicata - Largo E. Fermi, Florence, ITALY *CNR-Istituto.
Weighted networks: analysis, modeling A. Barrat, LPT, Université Paris-Sud, France M. Barthélemy (CEA, France) R. Pastor-Satorras (Barcelona, Spain) A.
Complex Networks First Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
The Architecture of Complexity: From the WWW to network biology title.
Social Network Analysis Prof. Dr. Daning Hu Department of Informatics University of Zurich Mar 5th, 2013.
Complex Network Theory – An Introduction Niloy Ganguly.
Class 9: Barabasi-Albert Model-Part I
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Complex Network Theory – An Introduction Niloy Ganguly.
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
LECTURE 2 1.Complex Network Models 2.Properties of Protein-Protein Interaction Networks.
Class 12: Barabasi-Albert Model-Part II Prof. Albert-László Barabási Dr. Baruch Barzel, Dr. Mauro Martino Network Science: Evolving Network Models February.
Bioinformatics Center Institute for Chemical Research Kyoto University
Class 19: Degree Correlations PartII Assortativity and hierarchy
Network resilience.
Introduction to complex networks Part I: Structure
Class 7: Evolving Network Models Network Science: Evolving Network Models February 2012 Prof. Albert-László Barabási Dr. Baruch Barzel, Dr. Mauro Martino.
Robustness, clustering & evolutionary conservation Stefan Wuchty Center of Network Research Department of Physics University of Notre Dame title.
Hierarchical Organization in Complex Networks by Ravasz and Barabasi İlhan Kaya Boğaziçi University.
Netlogo demo. Complexity and Networks Melanie Mitchell Portland State University and Santa Fe Institute.
Response network emerging from simple perturbation Seung-Woo Son Complex System and Statistical Physics Lab., Dept. Physics, KAIST, Daejeon , Korea.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Lecture II Introduction to complex networks Santo Fortunato.
Weighted Networks IST402 – Network Science Acknowledgement: Roberta Sinatra Laszlo Barabasi.
Scale-free and Hierarchical Structures in Complex Networks L. Barabasi, Z. Dezso, E. Ravasz, S.H. Yook and Z. Oltvai Presented by Arzucan Özgür.
Cmpe 588- Modeling of Internet Emergence of Scale-Free Network with Chaotic Units Pulin Gong, Cees van Leeuwen by Oya Ünlü Instructor: Haluk Bingöl.
Lecture 23: Structure of Networks
Structures of Networks
Bioinformatics 3 V6 – Biological Networks are Scale- free, aren't they? Fri, Nov 2, 2012.
Frontiers of Network Science Class 15: Degree Correlations II
Lecture 23: Structure of Networks
Social Network Analysis
Lecture 23: Structure of Networks
Presentation transcript:

The architecture of complexity: From the topology of the www to the cell's genetic network Albert-László Barabási University of Notre Dame Zoltán N. Oltvai Northwestern Univ., Medical School H. Jeong, R. Albert, E. Ravasz, G. Bianconi, E. Almaas E. Almaas title

Complex systems Made of many non-identical elements connected by diverse interactions. NETWORK New York Times

Erdös-Rényi model (1960) - Democratic - Random Pál Erdös Pál Erdös ( ) Connect with probability p p=1/6 N=10  k  ~ 1.5 Poisson distribution

World Wide Web Over 3 billion documents ROBOT: collects all URL’s found in a document and follows them recursively Nodes: WWW documents Links: URL links R. Albert, H. Jeong, A-L Barabasi, Nature, (1999). WWW Expected P(k) ~ k -  Found Scale-free Network Exponential Network

INTERNET BACKBONE (Faloutsos, Faloutsos and Faloutsos, 1999) Nodes: computers, routers Links: physical lines Internet

Internet-Map

Coauthorship Nodes: scientist (authors) Links: write paper together (Newman, 2000, A.-L. B. et al 2001) SCIENCE COAUTHORSHIP

SCIENCE CITATION INDEX (  = 3) Nodes: papers Links: citations (S. Redner, 1998) P(k) ~k -  H.E. Stanley, PRL papers (1988) Citation

Swedish sex-web Nodes: people (Females; Males) Links: sexual relationships Liljeros et al. Nature Swedes; 18-74; 59% response rate.

Many real world networks have a similar architecture: Scale-free networks WWW, Internet (routers and domains), electronic circuits, computer software, movie actors, coauthorship networks, sexual web, instant messaging, web, citations, phone calls, metabolic, protein interaction, protein domains, brain function web, linguistic networks, comic book characters, international trade, bank system, encryption trust net, energy landscapes, earthquakes, astrophysical network…

Scale-free model Barabási & Albert, Science 286, 509 (1999) P(k) ~k -3 BA model (1) Networks continuously expand by the addition of new nodes WWW : addition of new documents Citation : publication of new papers GROWTH: add a new node with m links PREFERENTIAL ATTACHMENT: the probability that a node connects to a node with k links is proportional to k. (2) New nodes prefer to link to highly connected nodes. WWW : linking to well known sites Citation : citing again highly cited papers

Mean Field Theory γ = 3, with initial condition A.-L.Barabási, R. Albert and H. Jeong, Physica A 272, 173 (1999) MFT

Can Latecomers Make It? Fitness Model SF model: k(t)~t ½ (f irst mover advantage) Real systems: nodes compete for links -- fitness Fitness Model: fitness (   k( ,t)~t  where  =  C G. Bianconi and A.-L. Barabási, Europhyics Letters. 54, 436 (2001).

Bose-Einstein Condensation in Evolving Networks G. Bianconi and A.-L. Barabási, Physical Review Letters 2001; Europhys. Lett NetworkBose gas Fit-gets-richBose-Einstein condensation

protein-gene interactions protein-protein interactions PROTEOME GENOME Citrate Cycle METABOLISM Bio-chemical reactions Bio-Map

Citrate Cycle METABOLISM Bio-chemical reactions

Boehring-Mennheim

Metabolic Network Nodes : chemicals (substrates) Links : bio-chemical reactions Metab-movie

Metabolic network Organisms from all three domains of life are scale-free networks! H. Jeong, B. Tombor, R. Albert, Z.N. Oltvai, and A.L. Barabasi, Nature, (2000) ArchaeaBacteriaEukaryotes Meta-P(k)

protein-gene interactions protein-protein interactions PROTEOME GENOME Citrate Cycle METABOLISM Bio-chemical reactions Bio-Map

protein-protein interactions PROTEOME

Topology of the protein network H. Jeong, S.P. Mason, A.-L. Barabasi, Z.N. Oltvai, Nature 411, (2001) Prot P(k) Nodes : proteins Links : physical interactions (binding)

Robustness Complex systems maintain their basic functions even under errors and failures (cell  mutations; Internet  router breakdowns) node failure fcfc 01 Fraction of removed nodes, f 1 S Robustness

Robustness of scale-free networks 1 S 0 1 f fcfc Attacks   3 : f c =1 (R. Cohen et al PRL, 2000) Failures Robust-SF Albert, Jeong, Barabasi, Nature (2000) C

Achilles’ Heel of complex networks Internet failure attack Achilles Heel R. Albert, H. Jeong, A.L. Barabasi, Nature (2000)

 Real networks are fragmented into group or modules  Society: Granovetter, M. S. (1973) ; Girvan, M., & Newman, M.E.J. (2001); Watts, D. J., Dodds, P. S., & Newman, M. E. J. (2002).  WWW: Flake, G. W., Lawrence, S., & Giles. C. L. (2000).  Biology: Hartwell, L.-H., Hopfield, J. J., Leibler, S., & Murray, A. W. (1999).  Internet: Vasquez, Pastor-Satorras, Vespignani(2001). Modularity  Traditional view of modularity: Ravasz, Somera, Mongru, Oltvai, A-L. B, Science 297, 1551 (2002).

Modular vs. Scale-free Topology Scale-free (a) Modular (b)

Hierarchical Networks 3. Clustering coefficient scales C(k)= # links between k neighbors k(k-1)/2

Real Networks HollywoodLanguage Internet (AS) Vaquez et al,'01 WWW Eckmann & Moses, ‘02

Hierarchy in biological systems Metabolic networks Protein networks

Characterizing the links Metabolism: Flux Balance Analysis (Palsson) Metabolic flux for each reaction Edwards, J. S. & Palsson, B. O, PNAS 97, 5528 (2000). Edwards, J. S., Ibarra, R. U. & Palsson, B. O. Nat Biotechnol 19, 125 (2001). Ibarra, R. U., Edwards, J. S. & Palsson, B. O. Nature 420, 186 (2002).

Global flux organization in the E. coli metabolic network E. Almaas, B. Kovács, T. Vicsek, Z. N. Oltvai, A.-L. B. Nature, 2004; Goh et al, PRL SUCC: Succinate uptake GLU : Glutamate uptake Central Metabolism, Emmerling et. al, J Bacteriol 184, 152 (2002)

Scale-free Science collaborationWWW Internet CellCitation pattern Language SUMMARY Hierarchical Networks Where do we go from here?…  How topology affects function?  Dynamics on networks: Are there universal properties?

There may be a postdoctoral position open in my research group. For more details see

Traditional modeling: Network as a static graph Given a network with N nodes and L links Create a graph with statistically identical topology RESULT: model the static network topology PROBLEM: Real networks are dynamical systems!  Evolving networks OBJECTIVE: capture the network dynamics METHOD : identify the processes that contribute to the network topology develop dynamical models that capture these processes BONUS: get the topology correctly. 

Bonus: Why Kevin Bacon? Measure the average distance between Kevin Bacon and all other actors. No. of movies : 46 No. of actors : 1811 Average separation: 2.79 Kevin Bacon Is Kevin Bacon the most connected actor? NO! 876 Kevin Bacon Bacon-list

Rod Steiger Martin Sheen Donald Pleasence #1 #2 #3 #876 Kevin Bacon Bacon-map

Protein network Nodes : proteins Links : physical interaction (binding) Proteomics : identify and determine the properties of the proteins. (related to structure of proteins)

Properties of the protein network Highly connected proteins are more essential (lethal) than less connected proteins.

Metabolic Network Nodes : chemicals (substrates) Links : chem. reaction

Metabolic network Organisms from all three domains of life are scale-free networks! H. Jeong, B. Tombor, R. Albert, Z.N. Oltvai, and A.L. Barabasi, Nature, (2000) ArchaeaBacteriaEukaryotes

Whole cellular network

Properties of metabolic networks Average distances are independent of organisms!  by making more links between nodes.  based on “design principles” of the cell through evolution. cf. Other scale-free network: D~log(N)

Taxonomy using networks A: Archaea B: Bacteria E: Eukaryotes

Watts-Strogatz (Nature 393, 440 (1998)) N nodes forms a regular lattice. With probability p, each edge is rewired randomly. Clustering: My friends will know each other with high probability! Probability to be connected C » p C = # of links between 1,2,…n neighbors n(n-1)/2

Modularity in the metabolism  Metabolic network (43 organisms)  Scale-free model Clustering Coefficient: C(k)= # links between k neighbors k(k-1)/2

Internet-Map

Population density Router density Spatial Distributions

Spatial Distribution of Routers Fractal set Box counting: N( )  No. of boxes of size that contain routers N( ) ~ -D f D f =1.5

Preferential Attachment Compare maps taken at different times (  t = 6 months) Measure  k ( k ), increase in No. of links for a node with k links Preferential Attachment:  k ( k ) ~ k

INTERNET N ( ) ~ -D f D f =1.5  k(k) ~ k   =1 P(d) ~ d -   =1

Nature (2000) … “One way to understand the p53 network is to compare it to the Internet. The cell, like the Internet, appears to be a ‘scale-free network’.” P53

p53 network (mammals) P53 P(k)

Preferential Attachment Citation network Internet k vs.  k : increase in the No. of links in a unit time For given  t,  k   (k) (cond-mat/ )

What is the topology of cellular networks? Argument 2: Cellular networks are exponential! Reason: They have been streamlined by evolution... Argument 1: Cellular networks are scale-free! Reason: They formed one node at a time… Cells-SF or ER?

Combining Modularity and the Scale-free Property Deterministic Scale-Free Networks Barabási, A.-L., Ravasz, E., & Vicsek, T. (2001) Physica A 299, 559. Dorogovtsev, S. N., Goltsev, A. V., & Mendes, J. F. F. (2001) cond-mat/ (DGM)

3. Scaling clustering coefficient (DGM) 2. Clustering coefficient independent of N Properties of hierarchical networks 1. Scale-free

Hierarchical Networks

What does it mean? Real Networks Have a Hierarchical Topology Many highly connected small clusters combine into few larger but less connected clusters combine into even larger and even less connected clusters  The degree of clustering follows:

Is the hierarchical exponent β universal?   For most systems: Connect a p fraction of nodes to the central module using preferential attachment

Stochastic Hierarchical Model

Is hierarchy present in network models? NO: -Scale-free model (alb& Albert,1999) -Erdos-Renyi model (1959) -Watts-Strogatz (1998) YES: Dorogovtsev, Goltsev, Mendes, 2001 (determ.) -Klemm and Eguiluz, Vasquez, Pastor-Satorras,Vespignani (2001)*  Bianconi & alb (fitnesss model) (2001)

Exceptions: Geographically Organized Networks: Common feature: economic pressures towards shorter links Internet (router), Vazquez et al, ‘01 Power Grid

Traditional modeling: Network as a static graph Given a network with N nodes and L links Create a graph with statistically identical topology RESULT: model the static network topology PROBLEM: Real networks are dynamical systems!  Evolving networks OBJECTIVE: capture the network dynamics METHOD : identify the processes that contribute to the network topology develop dynamical models that capture these processes BONUS: get the topology correctly. 

Society Internet

Node-node distance in metabolic networks D 15 =2 [1  2  5] D 17 =4 [1  3  4  6  7] … D = ?? Scale-free networks: D~log(N) Larger organisms are expected to have a larger diameter! Meta-diameter

What is Complexity? Main Entry: 1 com·plex Function: noun Etymology: Late Latin complexus totality, from Latin, embrace, from complecti Date: : a whole made up of complicated or interrelated parts  non-linear systems  chaos  fractals A popular paradigm: Simple systems display complex behavior 3 Body Problem Earth( ) Jupiter ( ) Sun ( ) Complexity

Universality? P(k) ~ (k+  (p,q,m) ) -  (p,q,m)   [1,  ) Predict the network topology from microscopic processes with parameters (p,q,m) Scaling but no universality Extended Model p=0.937 m=1  =  = 3.07 Actor network prob. p : internal links prob. q : link deletion prob. 1-p-q : add node

Yeast protein network Nodes : proteins Links : physical interactions (binding) P. Uetz, et al. Nature 403, (2000). Prot Interaction map

A Few Good Man Robert Wagner Austin Powers: The spy who shagged me Wild Things Let’s make it legal Barry Norton What Price Glory Monsieur Verdoux Bacon 1

ARE COMPLEX NETWORKS REALLY RANDOM?

ACTOR CONNECTIVITIES Nodes: actors Links: cast jointly N = 212,250 actors  k  = P(k) ~k -  Days of Thunder (1990) Far and Away (1992) Eyes Wide Shut (1999)  =2.3 Actors

Society Nodes: individuals Links: social relationship (family/work/friendship/etc.) S. Milgram (1967) John Guare, Six Degrees of Separation 1929, Frigyes Karinthy “we could name any person among earth’s one and a half billion inhabitants and through at most five acquaintances, one of which he knew personally, he could link to the chosen one”

 Finite size scaling: create a network with N nodes with P in (k) and P out (k) = log(N) 19 degrees of separation l 15 =2 [1  2  5] l 17 =4 [1  3  4  6  7] … = ?? nd.edu 19 degrees of separation R. Albert et al Nature (99) based on 800 million webpages [S. Lawrence et al Nature (99)] A. Broder et al WWW9 (00) IBM 19 degrees

What is Complexity? Main Entry: 1 com·plex Function: noun Etymology: Late Latin complexus totality, from Latin, embrace, from complecti Date: : a whole made up of complicated or interrelated parts A popular paradigm: Simple systems display complex behavior Complexity

Origin of the scale-free topology: Gene Duplication Perfect copy Mistake: gene duplication Wagner (2001); Vazquez et al. 2003; Sole et al. 2001; Rzhetsky & Gomez (2001); Qian et al. (2001); Bhan et al. (2002). Proteins with more interactions are more likely to get a new link: Π(k)~k (preferential attachment).

World Wide Web Over 3 billion documents ROBOT: collects all URL’s found in a document and follows them recursively Nodes: WWW documents Links: URL links R. Albert, H. Jeong, A-L Barabasi, Nature, (1999). WWW Expected P(k) ~ k -  Found γ out =2.5 γ in =2.1

What does it mean? Poisson distribution Exponential Network Power-law distribution Scale-free Network Airlines

Yeast protein network - lethality and topological position - Highly connected proteins are more essential (lethal)... Prot- robustness H. Jeong, S.P. Mason, A.-L. Barabasi, Z.N. Oltvai, Nature 411, (2001)

Inhomogeneity in the local flux distribution ~ k Mass flows along linear pathways Glutamate rich substrate Succinate rich substrate Mass flows along linear pathways

Pyramid Life’s Complexity Pyramid Z.N. Oltvai and A.-L. B. Science, 2002.