The multi-layered organization of information in living systems

Slides:



Advertisements
Similar presentations
Network analysis Sushmita Roy BMI/CS 576
Advertisements

DREAM4 Puzzle – inferring network structure from microarray data Qiong Cheng.
Network biology Wang Jie Shanghai Institutes of Biological Sciences.
Modularity and community structure in networks
A Novel Knowledge Based Method to Predicting Transcription Factor Targets
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
Online Social Networks and Media. Graph partitioning The general problem – Input: a graph G=(V,E) edge (u,v) denotes similarity between u and v weighted.
1 Modularity and Community Structure in Networks* Final project *Based on a paper by M.E.J Newman in PNAS 2006.
Transcription Networks And The Cell’s Functional Organization Presenter: Roni Sharf.
Mining and Searching Massive Graphs (Networks)
Genome-wide prediction and characterization of interactions between transcription factors in S. cerevisiae Speaker: Chunhui Cai.
Genomic analysis of regulatory network dynamics reveals large topological changes Paper Study Speaker: Cai Chunhui Sep 21, 2004.
Regulatory networks 10/29/07. Definition of a module Module here has broader meanings than before. A functional module is a discrete entity whose function.
Modularity in Biological networks.  Hypothesis: Biological function are carried by discrete functional modules.  Hartwell, L.-H., Hopfield, J. J., Leibler,
Centrality Measures These measure a nodes importance or prominence in the network. The more central a node is in a network the more significant it is to.
Face Recognition Using Eigenfaces
Evidence for dynamically organized modularity in the yeast protein- protein interaction network Han, et al
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Assigning Numbers to the Arrows Parameterizing a Gene Regulation Network by using Accurate Expression Kinetics.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Comparative Expression Moran Yassour +=. Goal Build a multi-species gene-coexpression network Find functions of unknown genes Discover how the genes.
Protein Classification A comparison of function inference techniques.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Computational Molecular Biology Biochem 218 – BioMedical Informatics Gene Regulatory.
Inferring Cellular Networks Using Probabilistic Graphical Models Jianlin Cheng, PhD University of Missouri 2009.
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
Spectral coordinate of node u is its location in the k -dimensional spectral space: Spectral coordinates: The i ’th component of the spectral coordinate.
MATISSE - Modular Analysis for Topology of Interactions and Similarity SEts Igor Ulitsky and Ron Shamir Identification.
Network Analysis and Application Yao Fu
A systems biology approach to the identification and analysis of transcriptional regulatory networks in osteocytes Angela K. Dean, Stephen E. Harris, Jianhua.
Using Bayesian Networks to Analyze Expression Data N. Friedman, M. Linial, I. Nachman, D. Hebrew University.
ANALYZING PROTEIN NETWORK ROBUSTNESS USING GRAPH SPECTRUM Jingchun Chen The Ohio State University, Columbus, Ohio Institute.
Lecture 13 - Networks Inference, Analysis, and Applications
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Lectures 6 & 7 Centrality Measures Lectures 6 & 7 Centrality Measures February 2, 2009 Monojit Choudhury
Reconstructing gene networks Analysing the properties of gene networks Gene Networks Using gene expression data to reconstruct gene networks.
Part 1: Biological Networks 1.Protein-protein interaction networks 2.Regulatory networks 3.Expression networks 4.Metabolic networks 5.… more biological.
MicroRNA regulation in Arabidopsis thaliana
Analysis of the yeast transcriptional regulatory network.
Microarrays.
Data Mining the Yeast Genome Expression and Sequence Data Alvis Brazma European Bioinformatics Institute.
Introduction to biological molecular networks
DNAmRNAProtein Small molecules Environment Regulatory RNA How a cell is wired The dynamics of such interactions emerge as cellular processes and functions.
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Network resilience.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Network deconvolution as a general method to distinguish direct dependencies in networks MIT group; Accepted Jun. 2013; Nature Biotechnology Presented.
Community structure in graphs Santo Fortunato. More links “inside” than “outside” Graphs are “sparse” “Communities”
Informatics tools in network science
1 Lesson 12 Networks / Systems Biology. 2 Systems biology  Not only understanding components! 1.System structures: the network of gene interactions and.
Computational methods for inferring cellular networks II Stat 877 Apr 17 th, 2014 Sushmita Roy.
Netlogo demo. Complexity and Networks Melanie Mitchell Portland State University and Santa Fe Institute.
Network Motifs See some examples of motifs and their functionality Discuss a study that showed how a miRNA also can be integrated into motifs Today’s plan.
Advances and challenges in computational modeling and statistical learning of biological systems Qi Liu Department of Biomedical Informatics Vanderbilt.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Comparative Network Analysis BMI/CS 776 Spring 2013 Colin Dewey
Gene Regulation, Part 2 Lecture 15 (cont.) Fall 2008.
Hiroki Sayama NECSI Summer School 2008 Week 2: Complex Systems Modeling and Networks Network Models Hiroki Sayama
Groups of vertices and Core-periphery structure
Biological networks CS 5263 Bioinformatics.
A Simple Approach to Ranking Differentially Expressed Gene Expression Time Courses through Gaussian Process Regression By Alfredo A Kalaitzis and Neil.
Discrete Kernels.
Building and Analyzing Genome-Wide Gene Disruption Networks
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
CSCI2950-C Lecture 13 Network Motifs; Network Integration
CISC 667 Intro to Bioinformatics (Spring 2007) Genetic networks and gene expression data CISC667, S07, Lec24, Liao.
Presentation transcript:

By: Soheil Feizi Final Project Presentation 18.338 MIT Applications of Spectral Matrix Theory in Computational Biology By: Soheil Feizi Final Project Presentation 18.338 MIT

The multi-layered organization of information in living systems EPIGENOME DNA CHROMATIN HISTONES GENOME Genes DNA cis-regulatory elements TRANSCRIPTOME RNA miRNA piRNA mRNA ncRNA PROTEOME R1 R2 S1 S2 M1 M2 PROTEINS Transcription factors Signaling proteins Metabolic Enzymes

Biological networks at all cellular levels Transcriptional gene regulation Post-transcriptional Protein & signaling networks Metabolic Dynamics Modification Proteins Translation RNA Transcription Genome

Matrix Theory applications and Challenges Systems-level views Functional/physical datasets Network inference Regulator (TF) target gene TF binding Expression TF motifs Gene Human Disease datasets: GWAS, OMIM Fly Goals: Global structural properties of regulatory networks How eigenvalues are distributed? Positive and negative eigenvalues Finding modularity structures of regulatory networks using spectral decomposition Finding direct interactions and removing transitive noise using spectral network deconvolution Worm

Structural properties of networks using spectral density function Converges as k-th moment: NMk: number of directed loops of length k Zero odd moments: Tree structures Semi-circle law?

Regulatory networks have heavy-tailed eigenvalue distributions Eigenvalue distribution is asymmetric with heavy tails Scale-free network structures, there are some nodes with large connectivity Modular structures: positive and negative eigenvalues

An example of scale-free network structures Power-law degree distribution High degree nodes Preferential attachment

Key idea: use systems-level information: Key idea: use systems-level information: Network modularity, spectral methods Idea: Represent regulatory networks using regulatory modules Robust and informative compared to edge representation Method: Spectral modularity of networks Highlight modules and discover them

Key idea: use systems-level information: Key idea: use systems-level information: Network modularity, spectral methods Idea: Represent regulatory networks using regulatory modules Robust and informative compared to edge representation Method: Spectral modularity of networks Highlight modules and discover them

Eigen decomposition of the modularity matrix adjacency matrix modularity matrix eigenvector matrix Modularity profile matrix degree vector Total number of edges positive eigenvalues Method: Compute modularity matrix of the network Decompose modularity matrix to its eigenvalues and eigenvectors For modularity profile matrix using eigenvectors with positive eigenvalues Compute pairwise distances among node modularity profiles

Probabilistic background why does it work? Probabilistic definition of network modularity [Newman] Modularity matrix: Probabilistic background model For simplicity, suppose we want to divide network into only two modules characterized by S: Contribution of node 1 in network modularity +1 S= -1 1 k1 k2 kn k3

why does it work? Linear Algebra: Network modularity is maximized if S is parallel to the largest eigenvector of M However, S is binary and with high probability cannot be aligned to the largest eigenvector of M Considering other eigenvectors with positive eigenvalues gives more information about the modularity structure of the network Idea (soft network partitioning): if two nodes have similar modularity profiles, they are more likely to be in the same module adjacency matrix modularity matrix eigenvector matrix modularity profile matrix

Spectral modularity over simulated networks

Observed network: combined direct and indirect effects Transitive Effects Indirect edges may be entirely due to second-order, third-order, and higher-order interactions (e.g. 14) Each edge may contain both direct and indirect components (e.g. 24)

Model indirect flow as power series of direct flow Transitive Closure indirect effects i j k1 k2 kn converges with correct scaling 2nd order 3nd order This model provides information theoretic min-cut flow rates Linear scaling so that max absolute eigenvalue of direct matrix <1 Indirect effects decay exponentially with path length Series converges Inverse problem: Gdir is actually unknown, only Gobs is known

Network deconvolution framework

ND is a nonlinear filter in eigen space Theorem Suppose and are the largest positive and smallest negative eigenvalues of . Then, by having the largest absolute eigenvalue of will be less than or equal to Intuition:

Scalability of spectral methods O(n3) computational complexity for full-rank networks O(n) computational complexity for low-rank networks Local deconvolution of sub-networks of the network Parallelizing network deconvolution

Conclusions Eigenvalue distribution of regulatory networks is similar to scale-free ones and has a heavy positive tail. Regulatory networks have scale-free structures. Eigen decomposition of probabilistic modularity matrix can be used to detect modules in the networks Network deconvolution: a spectral method to infer direct dependencies and removing transitive information flows