NETWORKS BASICS Danail Bonchev Center for the Study of Biological Complexity Virginia Commonwealth University Singapore, July 9-17, 2007.

Slides:



Advertisements
Similar presentations
Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein Biological Networks Analysis Introduction and Dijkstras algorithm.
Advertisements

Network biology Wang Jie Shanghai Institutes of Biological Sciences.
The Architecture of Complexity: Structure and Modularity in Cellular Networks Albert-László Barabási University of Notre Dame title.
Analysis and Modeling of Social Networks Foudalis Ilias.
报告人: 林 苑 指导老师:章忠志 副教授 复旦大学  Introduction about random walks  Concepts  Applications  Our works  Fixed-trap problem  Multi-trap problem.
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
Overview of part 1 Historical perspective
Weighted networks: analysis, modeling A. Barrat, LPT, Université Paris-Sud, France M. Barthélemy (CEA, France) R. Pastor-Satorras (Barcelona, Spain) A.
Hierarchy in networks Peter Náther, Mária Markošová, Boris Rudolf Vyjde : Physica A, dec
1 Evolution of Networks Notes from Lectures of J.Mendes CNR, Pisa, Italy, December 2007 Eva Jaho Advanced Networking Research Group National and Kapodistrian.
Emergence of Scaling in Random Networks Barabasi & Albert Science, 1999 Routing map of the internet
Scale-free networks Péter Kómár Statistical physics seminar 07/10/2008.
University at BuffaloThe State University of New York Young-Rae Cho Department of Computer Science and Engineering State University of New York at Buffalo.
Exp. vs. Scale-Free Poisson distribution Exponential Network Power-law distribution Scale-free Network.
Mining and Searching Massive Graphs (Networks)
A Real-life Application of Barabasi’s Scale-Free Power-Law Presentation for ENGS 112 Doug Madory Wed, 1 JUN 05 Fri, 27 MAY 05.
Networks FIAS Summer School 6th August 2008 Complex Networks 1.
Network Statistics Gesine Reinert. Yeast protein interactions.
Regulatory networks 10/29/07. Definition of a module Module here has broader meanings than before. A functional module is a discrete entity whose function.
I corsi vengono integrati e conterranno grosso modo due moduli:
Modularity in Biological networks.  Hypothesis: Biological function are carried by discrete functional modules.  Hartwell, L.-H., Hopfield, J. J., Leibler,
Global topological properties of biological networks.
CSE 222 Systems Programming Graph Theory Basics Dr. Jim Holten.
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
Graph Theory in 50 minutes. This Graph has 6 nodes (also called vertices) and 7 edges (also called links)
Biological Pathways & Networks
The importance of enzymes and their occurrences: from the perspective of a network W.C. Liu 1, W.H. Lin 1, S.T. Yang 1, F. Jordan 2 and A.J. Davis 3, M.J.
Class 2: Graph theory and basic terminology Learning the language Network Science: Graph Theory 2012 Prof. Albert-László Barabási Dr. Baruch Barzel, Dr.
Social Network Analysis: A Non- Technical Introduction José Luis Molina Universitat Autònoma de Barcelona
School of Information University of Michigan SI 614 Network subgraphs (motifs) Biological networks Lecture 11 Instructor: Lada Adamic.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
“Topological Index Calculator” A JavaScript application to introduce quantitative structure-property relationships (QSPR) in undergraduate organic chemistry.
COM1721: Freshman Honors Seminar A Random Walk Through Computing Lecture 2: Structure of the Web October 1, 2002.
Complex Networks First Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Hubba: Hub Objects Analyzer—A Framework of Interactome Hubs Identification for Network Biology 吳 信 宏, Hsin-Hung Wu Laboratory.
Part 1: Biological Networks 1.Protein-protein interaction networks 2.Regulatory networks 3.Expression networks 4.Metabolic networks 5.… more biological.
Social Network Analysis Prof. Dr. Daning Hu Department of Informatics University of Zurich Mar 5th, 2013.
Algorithms for Biological Networks Prof. Tijana Milenković Computer Science and Engineering University of Notre Dame Fall 2010.
Cellular Automata Modeling of Signaling and Metabolic Pathways Danail Bonchev Lemont B. Kier Chao-Kun Cheng.
COSC 2007 Data Structures II Chapter 14 Graphs I.
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
Introduction to biological molecular networks
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Clusters Recognition from Large Small World Graph Igor Kanovsky, Lilach Prego Emek Yezreel College, Israel University of Haifa, Israel.
Bioinformatics Center Institute for Chemical Research Kyoto University
GENERAL NETWORK PATTERNS Danail Bonchev Center for the Study of Biological Complexity Virginia Commonwealth University Singapore, July 9-17, 2007.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Class 2: Graph Theory IST402. Can one walk across the seven bridges and never cross the same bridge twice? Network Science: Graph Theory THE BRIDGES OF.
Chapter 9: Graphs.
Class 2: Graph Theory IST402.
Network Partition –Finding modules of the network. Graph Clustering –Partition graphs according to the connectivity. –Nodes within a cluster is highly.
1 Lesson 12 Networks / Systems Biology. 2 Systems biology  Not only understanding components! 1.System structures: the network of gene interactions and.
Netlogo demo. Complexity and Networks Melanie Mitchell Portland State University and Santa Fe Institute.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Lecture II Introduction to complex networks Santo Fortunato.
1 Graph theory Outline A graph is an abstract data type for storing adjacency relations –We start with definitions: Vertices, edges, degree and sub-graphs.
Cmpe 588- Modeling of Internet Emergence of Scale-Free Network with Chaotic Units Pulin Gong, Cees van Leeuwen by Oya Ünlü Instructor: Haluk Bingöl.
Structures of Networks
CSCI2950-C Lecture 12 Networks
Hiroki Sayama NECSI Summer School 2008 Week 2: Complex Systems Modeling and Networks Network Models Hiroki Sayama
Lecture 1: Introduction CS 765: Complex Networks
Biological networks CS 5263 Bioinformatics.
Social Network Analysis
Department of Computer Science University of York
Presentation transcript:

NETWORKS BASICS Danail Bonchev Center for the Study of Biological Complexity Virginia Commonwealth University Singapore, July 9-17, 2007

Recommended Literature 1. Linked: The New Science of Networks. Albert-László Barabási. Perseus Publisher, ISBN: , 304 pp., Price: $ The Structure and Dynamics of Networks. Mark Newman, Albert-László Barabási, and Duncan J. Watts, Princeton University Press, 2006 | $49.50 / ISBN: ; 624 pp. 3. Evolution of Networks. From Biological Nets to the Internet and WWW. Serguei N Dorogovtsev, Jose Fernando Ferreira Mendes, and A F Ioffe Oxford University Press, 2003, ISBN: , $95.00, 344 pp. 4. An introduction to Systems Biology: Design Principles of Biological Circuits, Uri Alon, Chapman & Hall/CRC, Taylor and Francis Group, 2006, ISBN:

 A branch of science that seeks to integrate different levels of information to understand how biological systems function.  It is not the number and properties of system elements but their relations!!  L. Hood: “Systems biology defines and analyses the interrelationships of all of the elements in a functioning system in order to understand how the system works.” Systems Biology. What Is It?

More on Systems Biology Essence of living systems is flow of mass, energy, and information in space and time. The flow occurs along specific networks  Flow of mass and energy (metabolic networks)  Flow of information involving DNA (transcriptional regulation networks)  Flow of information not involving DNA (signaling networks) The Goal of Systems Biology: To understand the flow of mass, energy, and information in living systems.

Networks and the Core Concepts of Systems Biology (i) Complexity emerges at all levels of the hierarchy of life (ii) System properties emerge from interactions of components (iii) The whole is more than the sum of the parts. (iv) Applied mathematics provides approaches to modeling biological systems.

How to Describe a System As a Whole? Networks - The Language of Complex Systems

What is a Network? Network is a mathematical structure composed of points connected by lines Network Theory Graph Theory Network  Graph Nodes  Vertices (points) Links  Edges (Lines) A network can be build for any functional system F. Harary, Graph Theory, Addison Wesley, Reading, MA, 1969 Gross & Yellen, Handbook of Graph Theory, CRC Press, Boca Raton, FL, 2004 System vs. Parts = Networks vs. Nodes

Networks As Graphs  Networks can be undirected or directed, depending on whether the interaction between two neighboring nodes proceeds in both directions or in only one of them, respectively.  The specificity of network nodes and links can be quantitatively characterized by weights Vertex-WeightedEdge-Weighted 

Networks As Graphs - 2 Networks having no cycles are termed trees. The more cycles the network has, the more complex it is. A network can be connected (presented by a single component) or disconnected (presented by several disjoint components). connecteddisconnected trees cyclic graphs

Networks As Graphs - 3 Some Basic Types of Graphs Paths Stars Cycles Complete Graphs Bipartite Graphs

Air Transportation Network

The World Wide Web

Fragment of a Social Network (Melburn, 2004)

Biological Networks A. Intra-Cellular Networks Protein interaction networks Metabolic Networks Signaling Networks Gene Regulatory Networks Composite networks Networks of Modules, Functional Networks Disease networks B. Inter-Cellular Networks Neural Networks C. Organ and Tissue Networks D. Ecological Networks E. Evolution Network

protein-gene interactions protein-protein interactions PROTEOME GENOME Citrate Cycle METABOLISM Bio-chemical reactions Bio-Map L-A Barabasi miRNA regulation? _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

The Protein Network of Drosophila CuraGen Corporation Science, 2003

Source: ExPASy Metabolic Networks

FAS-L FAS-R FADD CASP10 CASP8 CASP6 CASP3 CASP7 DFF45 DFF40 Death activator DISC Death-Inducing Signaling Complex Heterodimer DFF Initiator Caspases Executor Caspases Start DNA Fragmentation Cleavage of Caspase Substrates Membrane protein Apoptosis Pathway - 1 D. Bonchev, L.B. Kier, C. Cheng, Lecture Series on Computer and Computational Sciences 6, (2006). Apoptosis is a mechanism of controlled cell death critically important in many biological processes

Gene Regulation Networks

The Longevity Gene-Protein Network (LGPN) T. Witten, D. Bonchev, in press C. elegans

Network of Interacting Pathways (NIP) A.Mazurie D.Bonchev G.A. Buck, organisms

Functional Networks Number of shared proteins Cell Cycle Cell Polarity & Structure Intermediate and Energy Metabolism Protein Synthesis and Turnover Protein RNA / Transport RNA Metabolism Signaling Transcription/DNA Maintenance/Chromatin Structure Number of protein complexes Number of proteins Membrane Biogenesis & Turnover Yeast: 1400 proteins, 232 complexes, nine functional groups of complexes (Data A.-M. Gavin et al. (2002) Nature 415, ) D. Bonchev, Chemistry & Biodiversity 1(2004)

Summary  All complex networks in nature and technology have common features.  They differ considerably from random networks of the same size  By studying network structure and dynamics, and by using comparative network analysis, one can get answers of important biological questions.

Fundamental biological questions to answer: (i) Which interactions and groups of interactions are likely to have equivalent functions across species? (ii) Based on these similarities, can we predict new functional information about proteins and interactions that are poorly characterized? (iii) What do these relationships tell us about the evolution of proteins, networks and whole species? (iv) How to reduce the noise in biological data: Which interactions represent true binding events? False-positive interaction is unlikely to be reproduced across the interaction maps of multiple species. Fundamental Biological Questions to Answer (i) Which interactions and groups of interactions are likely to have equivalent functions across species? (ii) Based on these similarities, can we predict new functional information about proteins and interactions that are poorly characterized? (iii) What do these relationships tell us about the evolution of proteins, networks and whole species? (iv) How to reduce the noise in biological data: Which interactions represent true binding events? False-positive interaction is unlikely to be reproduced across the interaction maps of multiple species.

All Complex Dynamic Networks Have Similar Structure and Common Properties  Scale-Freeness  Small-Worldness  Centrality  Robustness/Fragility  Hubs

How To Characterize a Network?

Quantifying Networks A1. Connectivity-based: A. Graph-Theoretical (Topological) Descriptors A2. Distance-based B. Information-Theoretic Descriptors B2. Structural C. Complexity Measures C3. Walk Count B1, Compositional C1. Subgraph Count C2. Overall Connectivity C4. Small-World Connectivity

Connectivity-Based Topological Descriptors Adjacency Matrix Adjacency relation, a ij a ij = 1 ( neighbors ) a ij = 0 ( otherwise ) Adjacency Matrix a i A(G) = a i – node degree V = 5 E = 4 G random node numbering node degrees

Connectivity Descriptors Local (node) descriptors: vertex (node) degrees, a i Global (Network) descriptors: total adjacency, A

Average and Normalized Descriptors Average vertex (node) degree: Network connectedness (density) Connectivity Descriptors = 8/5 = 1.6 A = = 8 Conn = 8/5.4 = 0.4 = 40% Example

Adjacency in Directed Graphs Adjacency relation, a ij a ij = -1 (incoming edge(arc)) a ij = +1 (outgoing edge(arc)) a ij = 0 (otherwise) In-degree = -1 Out-degree = a i (in) a i (out) A(DG) = -1,1 -3,1 0,1-1,1 0,1

Distance-Based Topological Descriptors Distance Matrix Distance relation : d ij = 1 for i,j - neighbors The distance between two non-neighboring nodes is equal to the number of edges along the shortest path that connects them d i D(G) = d i – node distance (node distance degree) d 26 = ? d 57 =?

Distance Descriptors Node descriptors : Network descriptors: Network distance, D(G) Network diameter, Diam(G ) Node eccentricity, e i e i = Max(d ij ) node distance, d i Network radius, Rad(G)

Distance Descriptors-2 Average and Normalized Descriptors Average node distance, Average network distance, (average degree of separation, average path length) = 36/5 = 7.2 D = = 36 = 36/5.4 = 1.8 Example

Distances in Directed Networks  Some distances in directed graphs are equal to infinity !! How to calculate D and ?  In-distances and out-distances 0, 2 -4, 0 0,3 -2, d 21 =? d 13 =? D(in) = D(out) = 6 = 6/(4x3) = 0.5 ???

 Network Node Accessibility Acc (G) = N d (DG)/N d Distances in Directed Networks - 2  Adjusted Average Network Distance Example: G D = 52, = 52/(6x5) = DG D = 34, = 34/20 = 1.70 < D(G)?? AC = 20/30 = 0.667AD = 1.70/0.667 = 2.55 > D(G)

Shannon’s Information Theory References 1. Shannon, C.; Weaver, W. Mathematical Theory of Communications.University of Illinois Press: Urbana, MI, Bonchev, D. Information ‑ Theoretic Indices for Characterization of Chemical Structures. Research Studies Press: Chichester, UK,1983. The more diverse the distribution of system elements, the larger its information content. Information is a measure of system’s diversity How to Measure Information? What Is Information? Wiener: Information is neither matter, nor energy. Forget about meaning! Information is contained in any system, the elements of which can be grouped according to one or more criteria. The more complex the system, the larger its information content. Information is a measure of system complexity

Shannon’s Information Theory Basic Equations Mean Information:, bits/element Total Information:, bits Finite Probability Scheme: System of N elements and k equivalence classes with equivalence criterion α: class number of elements probability 1 N 1 p 1 2 N 2 p 2 ………………………………………………………………... k N k p k where p i = N i / N, and Σ p i = 1. Normalized Information:

Network Information Descriptors Information on the system elements equivalence, e I distances degrees e I tot (deg) = 6log 2 6 – 3log 2 3 – 3x1log1 = bits Vertex degree equivalence distribution: 6{3, 1, 1, 1} e I(deg) = -(3/6) 2 log(3/6) – 3x(1/6)log 2 (1/6) = 1.79 bits/node e I norm (deg) = 10.75/(6log 2 6) = e I tot (dist) = 6log 2 6 – 2x2log 2 2 – 2x1log1 = bits e I(deg) = - 2x(2/6)log 2 (2/6) – 2x(1/6)log 2 (1/6) = 1.92 bits/node e I (deg) = 11.51/(6log 2 6) = Vertex distance equivalence distribution: 6{ 2, 2, 1, 1} Composition distribution: 6{2,2,1,1}

Information on the system elements weight (or magnitude), m I weighted information descriptors (indices) distances D = 52 degrees A = 14 Network Information Descriptors - 2 m I(dist) = 52log 2 52–12log 2 12–2x9log 2 9–2x8log 2 8-6log 2 6 = bits m I(deg) = -(12/52)log 2 (12/52) – 2x(9/52)log 2 (9/52) – 2x(8/52)log 2 (8/52) – (6/52)log 2 (6/52) = 2.55 bits/node m I norm (deg) = /(52log 2 52) = m I tot (deg) = 14log 2 14 – 4log 2 4 – 3log x2log log 2 1 = bits m I(deg) = -(4/14)log 2 (4/14) –(3/14)log 2 (3/14) – 3x(2/14)log 2 (2/14) – (1/14)log 2 (1/14) = 2.47 bits/node m I norm (deg) = 34.55/(14log 2 14) = Distance magnitude distribution: 52 {12, 2x9, 2x8, 6} Vertex degree distribution: 14 {4, 3, 3x2, 1}

Network Complexity Descriptors - 1 Subgraph Count, e SC SC = 17 (5, 4, 4, 3, 1) OC = 76 ( 8, 16, 23, 21, 8) V = 5, E = 4 e=0 e= e= e= e= SC = 5 0 OC = 8 1 SC = 4 1 OC = 16 2 SC = 4 2 OC = 23 3 SC = 3 3 OC = 21 4 SC=1 4 OC = 8 Example e = number of edges Overall Connectivity, e OC

Network Complexity Descriptors - 2 Walk Count, WC Example WC = 106 ( 8, 16, 28, 54) 1 3 l = 1 3 l=2 1 The three complexity measures, SC, OC, and WC, can discriminate very subtle complexity features. 1 2 SC 28(5,8,9,5,1) 30(5,9,10,5,1) OC(in) 111(12,28,41,25,5) 135(16,40,49,25,5) WC 15(5,5,5) 21(5,7,9) l=2 For networks use only complexity measures with e = 1, 2, and 3!!

Small-World Connectivity Network Complexity Descriptors - 3 Network complexity increases with connectivity Network complexity increases with the decrease in its radius Can one unite the two patterns into a single complexity measure? D. Bonchev and G. A. Buck, Quantitative Measures of Network Complexity. In: Complexity in Chemistry, Biology and Ecology, D. Bonchev and D. H. Rouvray, Eds., Springer, New York, 2005, p b i is a measure for node centrality

SC = OC = WC = B1 = B2 = SC = OC = WC = B1 = B2 = SC = OC = WC = A/D = B2 = Examples of Increasing Complexity: N = 5

Thank You for Your Attention!!!