Biological Networks Analysis Degree Distribution and Network Motifs

Slides:



Advertisements
Similar presentations
Motif Mining from Gene Regulatory Networks
Advertisements

Course Evaluation Form About The Course -Go more slowly (||) -More lectures (||) -Problem Sets, Class Projects (|||) -Software tools About The Instructor.
Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein Artificial Neural Networks Some slides adapted from Geoffrey Hinton.
Complex Networks: Complex Networks: Structures and Dynamics Changsong Zhou AGNLD, Institute für Physik Universität Potsdam.
Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein Biological Networks Analysis Introduction and Dijkstras algorithm.
Network analysis Sushmita Roy BMI/CS 576
Gene Set Enrichment Analysis Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
Network biology Wang Jie Shanghai Institutes of Biological Sciences.
The Architecture of Complexity: Structure and Modularity in Cellular Networks Albert-László Barabási University of Notre Dame title.
An Intro To Systems Biology: Design Principles of Biological Circuits Uri Alon Presented by: Sharon Harel.
Analysis and Modeling of Social Networks Foudalis Ilias.
Graphs Graphs are the most general data structures we will study in this course. A graph is a more general version of connected nodes than the tree. Both.
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
Emergence of Scaling in Random Networks Barabasi & Albert Science, 1999 Routing map of the internet
A Real-life Application of Barabasi’s Scale-Free Power-Law Presentation for ENGS 112 Doug Madory Wed, 1 JUN 05 Fri, 27 MAY 05.
Elhanan Borenstein Spring 2011 Complex (Biological) Networks Some slides are based on slides from courses given by Roded Sharan and Tomer Shlomi Today:
Regulatory networks 10/29/07. Definition of a module Module here has broader meanings than before. A functional module is a discrete entity whose function.
1 Lesson 9 Networks & Systems Biology. 2 TOC  Into - Systems biology and Networks  Networks and Sys Biol.  What is a network?  Examples (social,)
Sedgewick & Wayne (2004); Chazelle (2005) Sedgewick & Wayne (2004); Chazelle (2005)
Global topological properties of biological networks.
Network Motifs: simple Building Blocks of Complex Networks R. Milo et. al. Science 298, 824 (2002) Y. Lahini.
Network Motifs Zach Saul CS 289 Network Motifs: Simple Building Blocks of Complex Networks R. Milo et al.
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Models of Influence in Online Social Networks
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
Optimization Based Modeling of Social Network Yong-Yeol Ahn, Hawoong Jeong.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
Models and Algorithms for Complex Networks Networks and Measurements Lecture 3.
ANALYZING PROTEIN NETWORK ROBUSTNESS USING GRAPH SPECTRUM Jingchun Chen The Ohio State University, Columbus, Ohio Institute.
Topological Analysis in PPI Networks & Network Motif Discovery Jin Chen MSU CSE Fall 1.
School of Information University of Michigan SI 614 Network subgraphs (motifs) Biological networks Lecture 11 Instructor: Lada Adamic.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Motif finding with Gibbs sampling CS 466 Saurabh Sinha.
Complex Networks First Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Analysis of biological networks Part III Shalev Itzkovitz Shalev Itzkovitz Uri Alon’s group Uri Alon’s group July 2005 July 2005.
Biological Networks & Network Evolution Eric Xing
Algorithms for Biological Networks Prof. Tijana Milenković Computer Science and Engineering University of Notre Dame Fall 2010.
Networks Igor Segota Statistical physics presentation.
Most of contents are provided by the website Graph Essentials TJTSD66: Advanced Topics in Social Media.
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
LECTURE 2 1.Complex Network Models 2.Properties of Protein-Protein Interaction Networks.
Introduction to biological molecular networks
Bioinformatics Center Institute for Chemical Research Kyoto University
Network resilience.
Constructing and Analyzing a Gene Regulatory Network Siobhan Brady UC Davis.
Class 2: Graph Theory IST402. Can one walk across the seven bridges and never cross the same bridge twice? Network Science: Graph Theory THE BRIDGES OF.
Robustness, clustering & evolutionary conservation Stefan Wuchty Center of Network Research Department of Physics University of Notre Dame title.
Class 2: Graph Theory IST402.
1 Lesson 12 Networks / Systems Biology. 2 Systems biology  Not only understanding components! 1.System structures: the network of gene interactions and.
Network Motifs See some examples of motifs and their functionality Discuss a study that showed how a miRNA also can be integrated into motifs Today’s plan.
Response network emerging from simple perturbation Seung-Woo Son Complex System and Statistical Physics Lab., Dept. Physics, KAIST, Daejeon , Korea.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Lecture II Introduction to complex networks Santo Fortunato.
Cmpe 588- Modeling of Internet Emergence of Scale-Free Network with Chaotic Units Pulin Gong, Cees van Leeuwen by Oya Ünlü Instructor: Haluk Bingöl.
Structures of Networks
CSCI2950-C Lecture 12 Networks
Bioinformatics 3 V6 – Biological Networks are Scale- free, aren't they? Fri, Nov 2, 2012.
Hiroki Sayama NECSI Summer School 2008 Week 2: Complex Systems Modeling and Networks Network Models Hiroki Sayama
Biological networks CS 5263 Bioinformatics.
Biological Networks Analysis Degree Distribution and Network Motifs
Department of Computer Science University of York
CSCI2950-C Lecture 13 Network Motifs; Network Integration
Modelling Structure and Function in Complex Networks
Static properties of transcription factors (TFs) within the hierarchical framework. Static properties of transcription factors (TFs) within the hierarchical.
Presentation transcript:

Biological Networks Analysis Degree Distribution and Network Motifs Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein

A quick review Networks: Many types of biological networks Networks vs. graphs A collection of nodes and links Directed/undirected; weighted/non-weighted, … Networks as models vs. networks as tools Many types of biological networks The shortest path problem Dijkstra’s algorithm Initialize: Assign a distance value, D, to each node. Set D=0 for start node and to infinity for all others. For each unvisited neighbor of the current node: Calculate tentative distance, Dt, through current node and if Dt < D: D Dt. Mark node as visited. Continue with the unvisited node with the smallest distance

Comparing networks We want to find a way to “compare” networks. “Similar” (not identical) topology Common design principles We seek measures of network topology that are: Simple Capture global organization Potentially “important” (equivalent to, for example, GC content for genomes) Summary statistics

Node degree / rank Degree = Number of neighbors Node degree in PPI networks correlates with: Gene essentiality Conservation rate Likelihood to cause human disease

Degree distribution P(k): probability that a node has a degree of exactly k Common distributions: Poisson: Exponential: Power-law:

The power-law distribution Power-law distribution has a “heavy” tail! Characterized by a small number of highly connected nodes, known as hubs A.k.a. “scale-free” network Hubs are crucial: Affect error and attack tolerance of complex networks (Albert et al. Nature, 2000)

The Internet Nodes – 150,000 routers Edges – physical links P(k) ~ k-2.3 Govindan and Tangmunarunkit, 2000

Movie actor collaboration network Tropic Thunder (2008) Nodes – 212,250 actors Edges – co-appearance in a movie P(k) ~ k-2.3 (<k> = 28.78) tom cruise Barabasi and Albert, Science, 1999

Protein protein interaction networks Nodes – Proteins Edges – Interactions (yeast) P(k) ~ k-2.5 Yook et al, Proteomics, 2004

Metabolic networks Nodes – Metabolites Edges – Reactions P(k) ~ k-2.2±2 A.Fulgidus (archae) E. Coli (bacterium) Metabolic networks across all kingdoms of life are scale-free C.Elegans (eukaryote) Averaged (43 organisms) Jeong et al., Nature, 2000

Why do so many real-life networks exhibit a power-law degree distribution? Is it “selected for”? Is it expected by change? Does it have anything to do with the way networks evolve? Does it have functional implications? ?

Network motifs Going beyond degree distribution … Generalization of sequence motifs Basic building blocks Evolutionary design principles?

What are network motifs? Recurring patterns of interaction (sub-graphs) that are significantly overrepresented (w.r.t. a background model) (199 possible 4-node sub-graphs) 13 possible 3-nodes sub-graphs R. Milo et al. Network motifs: simple building blocks of complex networks. Science, 2002

Finding motifs in the network 1a. Scan all n-node sub-graphs in the real network 1b. Record number of appearances of each sub-graph (consider isomorphic architectures) 2. Generate a large set of random networks 3a. Scan for all n-node sub-graphs in random networks 3b. Record number of appearances of each sub-graph 4. Compare each sub-graph’s data and identify motifs

Finding motifs in the network

Network randomization How should the set of random networks be generated? Do we really want “completely random” networks? What constitutes a good null model? Preserve in- and out-degree

Generation of randomized networks Network randomization algorithm : Start with the real network and repeatedly swap randomly chosen pairs of connections (X1Y1, X2Y2 is replaced by X1Y2, X2Y1) (Switching is prohibited if the either of the X1Y2 or X2Y1 already exist) Repeat until the network is “well randomized” X1 Y1 X1 Y1 X2 Y2 X2 Y2 The two algorithms gave identical results for the sub graph statistics.

Feed-Forward Loop (FFL) Motifs in transcriptional regulatory networks E. Coli network 424 operons (116 TFs) 577 interactions Significant enrichment of motif # 5 (40 instances vs. 7±3) X Y Z Master TF Specific TF Target Feed-Forward Loop (FFL) Coherent FFLs: The direct effect of x on z has the same sign as the net indirect effect through y 85% of FFLs are coherent S. Shen-Orr et al. Nature Genetics 2002

A simple cascade has slower shutdown What’s so cool about FFLs Boolean Kinetics A simple cascade has slower shutdown Assume X and Y act on Z as an AND gate T – Threshold, a – degradation rate A coherent feed-forward loop can act as a circuit that rejects transient activation signals from the general transcription factor and responds only to persistent signals, while allowing for a rapid system shutdown.

Why do these networks have similar motifs? Network motifs in biological networks Why do these networks have similar motifs? Why is this network so different? Information flow vs. Energy Flow

Motif-based network super-families R. Milo et al. Superfamilies of evolved and designed networks. Science, 2004

Computational representation of networks B C A D List of edges: (ordered) pairs of nodes [ (A,C) , (C,B) , (D,B) , (D,C) ] Connectivity Matrix Object Oriented Name:A ngr: p1 Name:B ngr: Name:C ngr: Name:D ngr: p2 A B C D 1 Which is the most useful representation?

Generation of randomized networks Algorithm B (Generative): Record marginal weights of original network Start with an empty connectivity matrix M Choose a row n & a column m according to marginal weights If Mnm = 0, set Mnm = 1; Update marginal weights Repeat until all marginal weights are 0 If no solution is found, start from scratch The two algorithms gave identical results for the sub graph statistics. A B C D 1 2 A B C D 1 2 A B C D 1 2 A B C D 1 2 B C A D