CSCI2950-C Lecture 12 Networks

Slides:



Advertisements
Similar presentations
Graph Algorithms: Minimum Spanning Tree We are given a weighted, undirected graph G = (V, E), with weight function w:
Advertisements

Comparison of Networks Across Species CS374 Presentation October 26, 2006 Chuan Sheng Foo.
Regulatory networks 10/29/07. Definition of a module Module here has broader meanings than before. A functional module is a discrete entity whose function.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Structure discovery in PPI networks using pattern-based network decomposition Philip Bachman and Ying Liu BIOINFORMATICS System biology Vol.25 no
Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break 14:45 – 15:15Regulatory pathways lecture 15:15 – 15:45Exercise.
1 Seminar in Bioinformatics An efficient algorithm for detecting frequent subgraphs in biological networks Paper by: M. Koyuturk, A. Grama and W. Szpankowski.
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
CS273a Lecture 9/10, Aut 10, Batzoglou Multiple Sequence Alignment.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Data Mining Presentation Learning Patterns in the Dynamics of Biological Networks Chang hun You, Lawrence B. Holder, Diane J. Cook.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Short Primer on Comparative Genomics Today: Special guest lecture 12pm, Alway M108 Comparative genomics of animals and plants Adam Siepel Assistant Professor.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
ECE669 L10: Graph Applications March 2, 2004 ECE 669 Parallel Computer Architecture Lecture 10 Graph Applications.
Networks of Protein Interactions Network Alignment Antal Novak CS 374 Lecture 6 10/13/2005 Nuke: Scalable and General Pairwise and Multiple Network Alignment.
BCB 570 Spring Protein-Protein Interaction Networks & methods Julie Dickerson Electrical and Computer Engineering.
Efficient Algorithms for Detecting Signaling Pathways in Protein Interaction Networks Jacob Scott, Trey Ideker, Richard M. Karp, Roded Sharan RECOMB 2005.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
341: Introduction to Bioinformatics Dr. Natasa Przulj Deaprtment of Computing Imperial College London
341: Introduction to Bioinformatics
Bayesian integration of biological prior knowledge into the reconstruction of gene regulatory networks Dirk Husmeier Adriano V. Werhli.
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
MCS312: NP-completeness and Approximation Algorithms
Network Analysis and Application Yao Fu
BIBM 2008 Qiong Cheng Georgia State University Joint work with Piotr Berman (Pennstate) Robert Harrison (GSU) Alexander Zelikovsky (GSU) Fast Alignments.
QNET: A tool for querying protein interaction networks Banu Dost +, Tomer Shlomi*, Nitin Gupta +, Eytan Ruppin*, Vineet Bafna +, Roded Sharan* + University.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
ReferencesReferences AcknowledgementsAcknowledgements TORQUE server DefinitionsDefinitions MethodsMethods IntroductionIntroduction Experiments & Results.
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Introduction to Bioinformatics Biological Networks Department of Computing Imperial College London March 18, 2010 Lecture hour 18 Nataša Pržulj
Qiong Cheng, Robert Harrison, Alexander Zelikovsky Computer Science in Georgia State University Oct IEEE 7 th International Conference on BioInformatics.
1 Having genome data allows collection of other ‘omic’ datasets Systems biology takes a different perspective on the entire dataset, often from a Network.
Complementarity of network and sequence information in homologous proteins March, Department of Computing, Imperial College London, London, UK 2.
TORQUE: T OPOLOGY -F REE Q UERYING OF P ROTEIN I NTERACTION N ETWORKS Sharon Bruckner 1, Falk Hüffner 1, Richard M. Karp 2, Ron Shamir 1, and Roded Sharan.
Problem Limited number of experimental replications. Postgenomic data intrinsically noisy. Poor network reconstruction.
CSCE555 Bioinformatics Lecture 18 Network Biology: Comparison of Networks Across Species Meeting: MW 4:00PM-5:15PM SWGN2A21 Instructor: Dr. Jianjun Hu.
Introduction to biological molecular networks
Alternative Splicing (a review by Liliana Florea, 2005) CS 498 SS Saurabh Sinha 11/30/06.
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Construction of Substitution matrices
Discovering functional interaction patterns in Protein-Protein Interactions Networks   Authors: Mehmet E Turnalp Tolga Can Presented By: Sandeep Kumar.
 Signal Transduction transmits signals from outside to the inside of the cell  Integer Linear Programming model is used to unravel STN.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Predicting Protein Function Annotation using Protein- Protein Interaction Networks By Tamar Eldad Advisor: Dr. Yanay Ofran Computational Biology.
1 Having genome data allows collection of other ‘omic’ datasets Systems biology takes a different perspective on the entire dataset, often from a Network.
1 Lesson 12 Networks / Systems Biology. 2 Systems biology  Not only understanding components! 1.System structures: the network of gene interactions and.
Computational methods for inferring cellular networks II Stat 877 Apr 17 th, 2014 Sushmita Roy.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Comparative Network Analysis BMI/CS 776 Spring 2013 Colin Dewey
Mining Coherent Dense Subgraphs across Multiple Biological Networks Vahid Mirjalili CSE 891.
Spectral methods for Global Network Alignment
Representation, Learning and Inference in Models of Cellular Networks
CSCI2950-C Genomes, Networks, and Cancer
TORQUE: Topology-Free Querying of Protein Interaction Networks
Basics of Comparative Genomics
Michael Langberg: Open University of Israel
Discrete Kernels.
Large Scale Metabolic Network Alignments by Compression
Incremental Network Querying in Biological Networks
Intro to Alignment Algorithms: Global and Local
CSCI2950-C Lecture 13 Network Motifs; Network Integration
Schedule for the Afternoon
Spectral methods for Global Network Alignment
SEG5010 Presentation Zhou Lanjun.
TRANSLATED BY: KARUN RAJESH
Modeling cells with protein networks
Basics of Comparative Genomics
Approximate Graph Mining with Label Costs
Presentation transcript:

CSCI2950-C Lecture 12 Networks Ben Raphael November 18, 2008 http://cs.brown.edu/courses/csci2950-c/

Biological Interaction Networks Many types: Protein-DNA (regulatory) Protein-metabolite (metabolic) Protein-protein (signaling) RNA-RNA (regulatory) Genetic interactions (gene knockouts)

Outline Biology of cellular interaction networks Network Alignment Network Motifs Network Integration

Metabolic Networks Nodes = reactants Edges = reactions labeled by enzyme (protein) that catalyzes reaction

Regulatory Networks

Regulatory Networks Protein-DNA interaction network Nodes = genes Edges = regulatory interaction A “activates” B A B A “represses” C C Protein-DNA interaction network

Signaling Networks

Protein Interaction Networks Proteins rarely function in isolation, protein interactions affect all processes in a cell. Forms of protein-protein interactions: Modification, complexation [Cardelli, 2005]. phosphorylation A brief introduction for protein interactions. Proteins in a cell interact with eachother in the forms of activation, binding and inhibition. And their interactions affect all processes in a cell. With the emergence of high-throughput methods, protein complex

Protein Interaction Networks High-throughput methods are available to find all interactions, “PPI network”, of a species. an undirected graph nodes: protein, edges: interactions Edges may have weights Yeast DIP network: ~5K proteins, ~18K interactions Fly DIP network: ~7K proteins, ~20K interactions A brief introduction for protein interactions. The interactions between proteins are important for many biological processes. They may activate/inhibit eachother by modification or they may bind to eachother to make a protein complex. High throughput methods have been available to find all interactions in the cell. This resulting interactome is represented by a Network. A protein protein interaction network is an undirected graph where the nodes are proteins and the edges are the interactions. PPI network

How are protein-protein interaction networks derived? Yeast two-hybrid screens

How are protein-protein interaction networks derived? Protein purification and separation

Computational Problems Classifying Network Topology Finding paths, cliques, dense subnetworks, etc. Comparing Networks Across Species Using networks to explain data Dependencies revealed by network topology Modeling dynamics of networks

Alignment Sequences Networks mouse human Sequences Evolve via substitutions Conservation implies function EFTPPVQAAYQKVVAG DFNPNVQAAFQKVVAG Networks Evolve via gain/loss of proteins or interactions (?)

Motivation By similar intuition, subnetworks conserved across species are likely functional modules

Network Alignment “Conserved” means two subgraphs contain proteins serving similar functions, having similar interaction profiles Key word is similar, not identical mismatch/substitution

Alignment Analogy Sharan and Ideker. Modeling cellular machinery through biological network comparison. Nature Biotechnology 24, pp. 427-433, 2006

Earlier approaches: interologs Interactions conserved in orthologs Orthology (descended from common ancestor) is a fuzzy notion Sequence similarity not necessary for conservation of function

Complications Protein sequence similarity not 1-1 Interaction data: Orthologs Paralogs Interaction data: Noisy Incomplete Dynamic Computational tractability

Network Alignment Sharan and Ideker. Modeling cellular machinery through biological network comparison. Nature Biotechnology 24, pp. 427-433, 2006

The Network Alignment Problem Given: k different interaction networks belonging to different species, Find: Conserved sub-networks within these networks Conserved defined by protein sequence similarity (node similarity) and interaction similarity (network topology similarity)

PathBLAST Goal: identify conserved pathways (chains) Idea: can be done efficiently by dynamic programming if networks are DAGs A A’ Score: match B + gap C X’ + mismatch D D’ + match Kelley et al (2003)

Why paths?

PathBLAST (Kelley, et al. PNAS 2003) Find conserved pathways in protein interaction maps of two species Model & Scoring: See class notes.

PathBLAST Problem: Networks are neither acyclic nor directed Solution: Randomize Impose random ordering on nodes, perform DP; repeat many times On average, highest scoring path preserved in 2/L! subgraphs Finds conserved paths of length L within networks of size n in O(L!n) expected time Drawbacks Computationally expensive Restricts search to specific topology 5 2 1 3 4 2 1 4 5 3 1 4 2 3 5 Kelley et al (2003)

PathBLAST

PathBLAST Scoring

PathBLAST: Computational Formulation Given: Undirected weighted graph G = (V, E, w) Set of start vertices I, and end vertex v, Find: a minimum-weight simple path starting in I and ending at v. NP-hard in general (reduction from TSP) Dynamic Programming formulation (see class notes) Scott, et al. JCB 2006

Color-coding (Alon, Yuster, & Zwick) Assign each vertex random color between 1 and k. Search for path w/ distinct colors. (colorful path) Resulting paths are simple. High-scoring path not discovered when two vertices have same color. Repeat for many random colorings.

Color-coding (Alon, Yuster, & Zwick) Extends to many other cases of subgraph isomorphism problem: Does a graph G have a subgraph isomorphic to graph H? H = simple path of length k. H = simple cycle of length k. H = tree. H = graph of fixed (bounded) tree-width

Additional Problems Efficient querying of a network (e.g. QNET) Find conserved subgraphs Heavy subgraphs in product graph Multiple network alignment