I corsi vengono integrati e conterranno grosso modo due moduli:

Slides:



Advertisements
Similar presentations
Complex Networks Advanced Computer Networks: Part1.
Advertisements

Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein Biological Networks Analysis Introduction and Dijkstras algorithm.
Network biology Wang Jie Shanghai Institutes of Biological Sciences.
The Architecture of Complexity: Structure and Modularity in Cellular Networks Albert-László Barabási University of Notre Dame title.
Midwestern State University Department of Computer Science Dr. Ranette Halverson CMPS 2433 – CHAPTER 4 GRAPHS 1.
Introduction to Graphs
Analysis and Modeling of Social Networks Foudalis Ilias.
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
Hierarchy in networks Peter Náther, Mária Markošová, Boris Rudolf Vyjde : Physica A, dec
1 Evolution of Networks Notes from Lectures of J.Mendes CNR, Pisa, Italy, December 2007 Eva Jaho Advanced Networking Research Group National and Kapodistrian.
Complex Networks Third Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Emergence of Scaling in Random Networks Barabasi & Albert Science, 1999 Routing map of the internet
Transcription Networks And The Cell’s Functional Organization Presenter: Roni Sharf.
Scale-free networks Péter Kómár Statistical physics seminar 07/10/2008.
C++ Programming: Program Design Including Data Structures, Third Edition Chapter 21: Graphs.
Exp. vs. Scale-Free Poisson distribution Exponential Network Power-law distribution Scale-free Network.
A Real-life Application of Barabasi’s Scale-Free Power-Law Presentation for ENGS 112 Doug Madory Wed, 1 JUN 05 Fri, 27 MAY 05.
Network Statistics Gesine Reinert. Yeast protein interactions.
Regulatory networks 10/29/07. Definition of a module Module here has broader meanings than before. A functional module is a discrete entity whose function.
Peer-to-Peer and Grid Computing Exercise Session 3 (TUD Student Use Only) ‏
Sedgewick & Wayne (2004); Chazelle (2005) Sedgewick & Wayne (2004); Chazelle (2005)
Global topological properties of biological networks.
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 7 May 14, 2006
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Computer Science 1 Web as a graph Anna Karpovsky.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Peer-to-Peer and Social Networks Random Graphs. Random graphs E RDÖS -R ENYI MODEL One of several models … Presents a theory of how social webs are formed.
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
Optimization Based Modeling of Social Network Yong-Yeol Ahn, Hawoong Jeong.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
Class 2: Graph theory and basic terminology Learning the language Network Science: Graph Theory 2012 Prof. Albert-László Barabási Dr. Baruch Barzel, Dr.
NETWORKS BASICS Danail Bonchev Center for the Study of Biological Complexity Virginia Commonwealth University Singapore, July 9-17, 2007.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
V 1 – Introduction Mon, Oct 15, 2012 Bioinformatics 3 — Volkhard Helms.
Complex Networks First Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Part 1: Biological Networks 1.Protein-protein interaction networks 2.Regulatory networks 3.Expression networks 4.Metabolic networks 5.… more biological.
Social Network Analysis Prof. Dr. Daning Hu Department of Informatics University of Zurich Mar 5th, 2013.
Most of contents are provided by the website Graph Essentials TJTSD66: Advanced Topics in Social Media.
Class 9: Barabasi-Albert Model-Part I
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Complex Network Theory – An Introduction Niloy Ganguly.
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
LECTURE 2 1.Complex Network Models 2.Properties of Protein-Protein Interaction Networks.
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Bioinformatics Center Institute for Chemical Research Kyoto University
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Class 2: Graph Theory IST402. Can one walk across the seven bridges and never cross the same bridge twice? Network Science: Graph Theory THE BRIDGES OF.
Class 2: Graph Theory IST402.
Chapter 20: Graphs. Objectives In this chapter, you will: – Learn about graphs – Become familiar with the basic terminology of graph theory – Discover.
Network Partition –Finding modules of the network. Graph Clustering –Partition graphs according to the connectivity. –Nodes within a cluster is highly.
1 Lesson 12 Networks / Systems Biology. 2 Systems biology  Not only understanding components! 1.System structures: the network of gene interactions and.
Netlogo demo. Complexity and Networks Melanie Mitchell Portland State University and Santa Fe Institute.
Course Name: Comparative Genomics Conducted by- Shigehiko kanaya & Md. Altaf-Ul-Amin.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Lecture II Introduction to complex networks Santo Fortunato.
Comparative Network Analysis BMI/CS 776 Spring 2013 Colin Dewey
Cmpe 588- Modeling of Internet Emergence of Scale-Free Network with Chaotic Units Pulin Gong, Cees van Leeuwen by Oya Ünlü Instructor: Haluk Bingöl.
Network (graph) Models
Structures of Networks
Bioinformatics 3 V6 – Biological Networks are Scale- free, aren't they? Fri, Nov 2, 2012.
Hiroki Sayama NECSI Summer School 2008 Week 2: Complex Systems Modeling and Networks Network Models Hiroki Sayama
Biological networks CS 5263 Bioinformatics.
Applications of graph theory in complex systems research
Network biology : protein – protein interactions
Assessing Hierarchical Modularity in Protein Interaction Networks
Social Network Analysis
Peer-to-Peer and Social Networks Fall 2017
Presentation transcript:

I corsi vengono integrati e conterranno grosso modo due moduli: SB: systems biology ML: machine learning Alternativa 1 Alternativa 2 Mattina (Fariselli) Pomeriggio (Martelli) Lun 1 Ott SVM (ML) Grafi (SB) Mar 2 Ott System Biology (SB) Probabilita (ML) Mer 3 Ott HMM (ML) Ven 5 Ott Lun 8 Ott Mar 9 Ott Linux Linux e python Mer 10 Ott Esercitazione su SVM Gio 11 Ott Esercitazione su HMM

Systems Biology. What Is It? A branch of science that seeks to integrate different levels of information to understand how biological systems function. L. Hood: “Systems biology defines and analyses the interrelationships of all of the elements in a functioning system in order to understand how the system works.” It is not (only) the number and properties of system elements but their relations!!

More on Systems Biology Essence of living systems is flow of mass, energy, and information in space and time. The flow occurs along specific networks Flow of mass and energy (metabolic networks) Flow of information involving DNA (transcriptional regulation networks) Flow of information not involving DNA (signaling networks) The Goal of Systems Biology: To understand the flow of mass, energy, and information in living systems.

Networks and the Core Concepts of Systems Biology Complexity emerges at all levels of the hierarchy of life System properties emerge from interactions of components (iii) The whole is more than the sum of the parts. (iv) Applied mathematics provides approaches to modeling biological systems.

How to Describe a System As a Whole? Networks - The Language of Complex Systems

Air Transportation Network

The World Wide Web

Fragment of a Social Network (Melburn, 2004) Friendship among 450 people in Canberra

Biological Networks A. Intra-Cellular Networks Protein interaction networks Metabolic Networks Signaling Networks Gene Regulatory Networks Composite networks Networks of Modules, Functional Networks Disease networks B. Inter-Cellular Networks Neural Networks C. Organ and Tissue Networks D. Ecological Networks E. Evolution Network

The Protein Interaction Network of Yeast Yeast two hybrid Uetz et al, Nature 2000

Metabolic Networks Source: ExPASy

Gene Regulation Networks

_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ L-A Barabasi GENOME miRNA regulation? _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ PROTEOME - - protein-gene interactions protein-protein interactions Citrate Cycle METABOLISM Bio-chemical reactions

Functional Networks Yeast: 1400 proteins, 232 complexes, nine functional groups of complexes Number of shared proteins 20 125 55 740 43 221 33 419 28 692 12 160 147 7 83 65 41 49 172 75 321 24 260 11 596 35 103 Cell Cycle Cell Polarity & Structure Intermediate and Energy Metabolism Protein Synthesis and Turnover Protein RNA / Transport RNA Signaling Transcription/DNA Maintenance/Chromatin Structure Number of protein complexes Number of proteins 13 111 8 61 25 40 77 19 14 97 30 16 27 299 53 37 15 22 187 73 94 Membrane Biogenesis & Turnover (Data A.-M. Gavin et al. (2002) Nature 415,141-147) D. Bonchev, Chemistry & Biodiversity 1(2004)312-326

What is a Network? Network is a mathematical structure composed of points connected by lines Network Theory <-> Graph Theory Network  Graph Nodes  Vertices (points) Links  Edges (Lines) A network can be build for any functional system System vs. Parts = Networks vs. Nodes

The 7 bridges of Königsberg The question is whether it is possible to walk with a route that crosses each bridge exactly once.

The representation of Euler In 1736 Leonhard Euler formulated the problem in terms of abstracted the case of Königsberg: by eliminating all features except the landmasses and the bridges connecting them; by replacing each landmass with a dot (vertex) and each bridge with a line (edge). The shape of a graph may be distorted in any way without changing the graph itself, so long as the links between nodes are unchanged. It does not matter whether the links are straight or curved, or whether one node is to the left or right of another.

The solution depends on the node degree 3 In a continuous path crossing the edges exactly once, each visited node requires an edge for entering and a different edge for exiting (except for the start and the end nodes). 5 3 3 A path crossing once each edge is called Eulerian path. It possible IF AND ONLY IF there are exactly two or zero nodes of odd degree. Since the graph corresponding to Königsberg has four nodes of odd degree, it cannot have an Eulerian path.

The solution depends on the node degree 3 6 End 2 4 5 1 Start If there are two nodes of odd degree, those must be the starting and ending points of an Eulerian path.

Hamiltonian paths Find a path visiting each node exactly one Conditions of existence for Hamiltonian paths are not simple

Hamiltonian paths

Graph nomenclature Graphs can be simple or multigraphs, depending on whether the interaction between two neighboring nodes is unique or can be multiple, respectively. A node can have or not self loops

Graph nomenclature Networks can be undirected or directed, depending on whether the interaction between two neighboring nodes proceeds in both directions or in only one of them, respectively.  1 2 3 4 5 6 The specificity of network nodes and links can be quantitatively characterized by weights 2.5 7.3 3.3 12.7 5.4 8.1 2.5 Vertex-Weighted Edge-Weighted

Graph nomenclature A network can be connected (presented by a single component) or disconnected (presented by several disjoint components). connected disconnected Networks having no cycles are termed trees. The more cycles thenetwork has, the more complex it is. trees cyclic graphs

Graph nomenclature Paths Stars Cycles Complete Graphs

Large graphs = Networks

Statistical features of networks Vertex degree distribution (the degree of a vertex is the number of vertices connected with it via an edge)

Statistical features of networks Clustering coefficient: the average proportion of neighbours of a vertex that are themselves neighbours Node 4 Neighbours (N) 2 Connections among the Neighbours 6 possible connections among the Neighbours (Nx(N-1)/2) Clustering for the node = 2/6 Clustering coefficient: Average over all the nodes

Statistical features of networks Clustering coefficient: the average proportion of neighbours of a vertex that are themselves neighbours C=0 C=0 C=0 C=1

Statistical features of networks Given a pair of nodes, compute the shortest path between them Average shortest distance between two vertices Diameter: maximal shortest distance How many degrees of separation are they between two random people in the world, when friendship networks are considered?

How to compute the shortest path between home and work? Edge-weighted Graph The exaustive search can be too much time-consuming

The Dijkstra’s algorithm Fixed nodes NON –fixed nodes Initialization: Fix the distance between “Casa” and “Casa” equal to 0 Compute the distance between “Casa” and its neighbours Set the distance between “Casa” and its NON-neighbours equal to ∞

The Dijkstra’s algorithm Fixed nodes NON –fixed nodes Iteration (1): Search the node with the minimum distance among the NON-fixed nodes and Fix its distance, memorizing the incoming direction

The Dijkstra’s algorithm 4 Iteration (2): Update the distance of NON-fixed nodes, starting from the fixed distances Fixed nodes NON –fixed nodes

The Dijkstra’s algorithm Fixed nodes NON –fixed nodes The updated distance is different from the previous one Iteration: Fix the NON-fixed nodes with minimum distance Update the distance of NON-fixed nodes, starting from the fixed distances.

The Dijkstra’s algorithm Fixed nodes NON –fixed nodes Iteration: Fix the NON-fixed nodes with minimum distance Update the distance of NON-fixed nodes, starting from the fixed distances.

The Dijkstra’s algorithm Fixed nodes NON –fixed nodes Iteration: Fix the NON-fixed nodes with minimum distance Update the distance of NON-fixed nodes, starting from the fixed distances.

The Dijkstra’s algorithm Fixed nodes NON –fixed nodes Iteration: Fix the NON-fixed nodes with minimum distance Update the distance of NON-fixed nodes, starting from the fixed distances.

The Dijkstra’s algorithm Fixed nodes NON –fixed nodes Iteration: Fix the NON-fixed nodes with minimum distance Update the distance of NON-fixed nodes, starting from the fixed distances.

The Dijkstra’s algorithm Fixed nodes NON –fixed nodes Conclusion: The label of each node represents the minimal distance from the starting node The minimal path can be reconstructed with a back-tracing procedure

Statistical features of networks Vertex degree distribution Clustering coefficient Average shortest distance between two vertices Diameter: maximal shortest distance

Two reference models for networks Regular network (lattice) Random network (Erdös+Renyi, 1959) Regular connections Each edge is randomly set with probability p

Two reference models for networks Comparing networks with the same number of nodes (N) and edges Poisson distribution Degree distribution Exp decay Average shortest path ≈ N ≈ log (N) Average connectivity high low

Some examples for real networks size vertex degree shortest path Shortest path in fitted random graph Clustering Clustering in random graph Film actors 225,226 61 3.65 2.99 0.79 0.00027 MEDLINE coauthorship 1,520,251 18.1 4.6 4.91 0.43 1.8 x 10-4 E.Coli substrate graph 282 7.35 2.9 3.04 0.32 0.026 C.Elegans neuron network 14 2.65 2.25 0.28 0.05 Real networks are not regular (low shortest path) Real networks are not random (high clustering)

Adding randomness in a regular network Random changes in edges OR Addition of random links

Adding randomness in a regular network (rewiring) Networks with high clustering (like regular ones) and low path length (like random ones) can be obtained: SMALL WORLD NETWORKS (Strogatz and Watts, 1999)

Small World Networks A small amount of random shortcuts can decrease the path length, still maintaining a high clustering: this model “explains” the 6-degrees of separations in human friendship network

What about the degree distribution in real networks? Both random and small world models predict an approximate Poisson distribution: most of the values are near the mean; Exponential decay when k gets higher: P(k) ≈ e-k, for large k.

What about the degree distribution in real networks? In 1999, modelling the WWW (pages: nodes; link: edges), Barabasi and Albert discover a slower than exponential decay: P(k) ≈ k-a with 2 < a < 3, for large k

Scale-free networks Networks that are characterized by a power-law degree distribution are highly non-uniform: most of the nodes have only a few links. A few nodes with a very large number of links, which are often called hubs, hold these nodes together. Networks with a power degree distribution are called scale-free hubs It is the same distribution of wealth following Pareto’s 20-80 law: Few people (20%) possess most of the wealth (80%), most of the people (80%) possess the rest (20%)

Hubs Attacks to hubs can rapidly destroy the network

Three non biological scale-free networks Note the log-log scale LINEAR PLOT Albert and Barabasi, Science 1999

How can a scale-free network emerge? Network growth models: start with one vertex.

How can a scale-free network emerge? Network growth models: new vertex attaches to existing vertices by preferential attachment: vertex tends choose vertex according to vertex degree he In economy this is called Matthew’s effect: The rich get richer This explain the Pareto’s distribution of wealth

How can a scale-free network emerge? Network growth models: hubs emerge (in the WWW: new pages tend to link to existing, well linked pages)

Metabolic pathways are scale-free Hubs are pyruvate, coenzyme A….

Protein interaction networks are scale-free Degree is in some measure related to phenotypic effect upon gene knock-out Red : lethal Green: non lethal Yellow: Unknown

Caveat: different experiments give different results Titz et al, Exp Review Proteomics, 2004

How can a scale-free network emerge? Gene duplication (and differentiation): duplicated genes give origin to a protein that interacts with the same proteins as the original protein (and then specializes its functions)

Caveat on the use of the scale-free theory The same noisy data can be fitted in different ways A sub-net of a non-free-scale network can have a scale-free behaviour Finding a scale-free behaviour do NOT imply the growth with preferential attachment mechanism Keller, BioEssays 2006

Hierarchical networks Standard free scale models have low clustering: a modular hierarchical model accounts for high clustering, low average path and scale-freeness

Modules Sub-graphs more represented than expected 209 bi-fan motifs found in the E.coli regulatory network

Summary Many complex networks in nature and technology have common features. They differ considerably from random networks of the same size By studying network structure and dynamics, and by using comparative network analysis, one can get answers of important biological questions.

Fundamental Biological Questions to Answer (i) Which interactions and groups of interactions are likely to have equivalent functions across species? (ii) Based on these similarities, can we predict new functional information about proteins and interactions that are poorly characterized? (iii) What do these relationships tell us about the evolution of proteins, networks and whole species? (iv) How to reduce the noise in biological data: Which interactions represent true binding events? False-positive interaction is unlikely to be reproduced across the interaction maps of multiple species. Fundamental Biological Questions to Answer (i) Which interactions and groups of interactions are likely to have equivalent functions across species? (ii) Based on these similarities, can we predict new functional information about proteins and interactions that are poorly characterized? (iii) What do these relationships tell us about the evolution of proteins, networks and whole species? (iv) How to reduce the noise in biological data: Which interactions represent true binding events? False-positive interaction is unlikely to be reproduced across the interaction maps of multiple species.

Barabasi and Oltvai (2004) Network Biology: understanding the cell’s functional organization. Nature Reviews Genetics 5:101-113 Stogatz (2001) Exploring complex networks. Nature 410:268-276 Hayes (2000) Graph theory in practice. American Scientist 88:9-13/104-109 Mason and Verwoerd (2006) Graph theory and networks in Biology Keller (2005) Revisiting scale-free networks. BioEssays 27.10: 1060-1068