Biological networks: Types and origin Protein-protein interactions, complexes, and network properties Thomas Skøt Jensen Center for Biological Sequence.

Slides:



Advertisements
Similar presentations
Global Mapping of the Yeast Genetic Interaction Network Tong et. al, Science, Feb 2004 Presented by Bowen Cui.
Advertisements

VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
Biological networks: Types and sources Protein-protein interactions, Protein complexes, and network properties.
Biological networks: Types and sources Protein-protein interactions, Protein complexes, and network properties.
University at BuffaloThe State University of New York Young-Rae Cho Department of Computer Science and Engineering State University of New York at Buffalo.
Biological networks Bing Zhang Department of Biomedical Informatics Vanderbilt University
Regulatory networks 10/29/07. Definition of a module Module here has broader meanings than before. A functional module is a discrete entity whose function.
Chip arrays and gene expression data. With the chip array technology, one can measure the expression of 10,000 (~all) genes at once. Can answer questions.
Protein domains vs. structure domains - an example.
Gene and Protein Networks II Monday, April CSCI 4830: Algorithms for Molecular Biology Debra Goldberg.
Introduction to biological networks. protein-gene interactions protein-protein interactions PROTEOME GENOME Citrate Cycle METABOLISM Bio-chemical reactions.
Global topological properties of biological networks.
1 Protein-Protein Interaction Networks MSC Seminar in Computational Biology
Chip arrays and gene expression data. Motivation.
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
Discussion summary Cytoscape introduction ::Systems Biology2CBS, Department of Systems Biology.
Introduction to Systems Biology. Overview of the day Background & Introduction Network analysis methods Case studies Exercises.
Biological networks Construction and Analysis. Recap Gene regulatory networks –Transcription Factors: special proteins that function as “keys” to the.
Biological networks: Types and origin
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Discussion summary Cytoscape introduction Thomas Skøt Jensen Center for Biological Sequence Analysis The Technical University of Denmark.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Protein-Protein Interaction Screens. Bacterial Two-Hybrid System selectable marker RNA polymerase DNA binding protein bait target sequence target.
Affinity chromatography/mass spec Bait protein GST Page 252.
Epistasis Analysis Using Microarrays Chris Workman.
Comparative Expression Moran Yassour +=. Goal Build a multi-species gene-coexpression network Find functions of unknown genes Discover how the genes.
Gene expression and the transcriptome II. SAGE SAGE = Serial Analysis of Gene Expression Based on serial sequencing of 15-bp tags that are unique to each.
Protein Interactions and Disease Audry Kang 7/15/2013.
Interaction Networks in Biology: Interface between Physics and Biology, Shekhar C. Mande, August 24, 2009 Interaction Networks in Biology: Interface between.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
A highly abbreviated introduction to proteomics
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
Overview  Introduction  Biological network data  Text mining  Gene Ontology  Expression data basics  Expression, text mining, and GO  Modules and.
MN-B-WP II (BInf 2) Bioinformatische Datenbanken Kay Hofmann – Protein Evolution Group Woche 4: Interaktionsdatenbanken.
Protein analysis and proteomics (Part 2 of 2). Many of the images in this powerpoint presentation are from Bioinformatics and Functional Genomics by Jonathan.
Biological Pathways & Networks
Interactions and more interactions
Presentation by: Kyle Borge, David Byon, & Jim Hall
ANALYZING PROTEIN NETWORK ROBUSTNESS USING GRAPH SPECTRUM Jingchun Chen The Ohio State University, Columbus, Ohio Institute.
Network Biology Presentation by: Ansuman sahoo 10th semester
School of Information University of Michigan SI 614 Network subgraphs (motifs) Biological networks Lecture 11 Instructor: Lada Adamic.
Finish up array applications Move on to proteomics Protein microarrays.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:
Proteome and interactome Bioinformatics.
Part 1: Biological Networks 1.Protein-protein interaction networks 2.Regulatory networks 3.Expression networks 4.Metabolic networks 5.… more biological.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
1 Having genome data allows collection of other ‘omic’ datasets Systems biology takes a different perspective on the entire dataset, often from a Network.
TAP(Tandem Affinity Purification) Billy Baader Genetics 677.
LECTURE 2 1.Complex Network Models 2.Properties of Protein-Protein Interaction Networks.
CSCE555 Bioinformatics Lecture 18 Network Biology Meeting: MW 4:00PM-5:15PM SWGN2A21 Instructor: Dr. Jianjun Hu Course page:
Introduction to biological molecular networks
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Proteomics, the next step What does each protein do? Where is each protein located? What does each protein interact with, if anything? What role does it.
Biol 729 – Proteome Bioinformatics Dr M. J. Fisher - Protein: Protein Interactions.
How many interactions are there? ~6,200 genes ~6,200 proteins x 2-10 interactions/protein ~12, ,000 interactions Yeast.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Robustness, clustering & evolutionary conservation Stefan Wuchty Center of Network Research Department of Physics University of Notre Dame title.
1 Having genome data allows collection of other ‘omic’ datasets Systems biology takes a different perspective on the entire dataset, often from a Network.
1 Lesson 12 Networks / Systems Biology. 2 Systems biology  Not only understanding components! 1.System structures: the network of gene interactions and.
Network Analysis Goal: to turn a list of genes/proteins/metabolites into a network to capture insights about the biological system 1.Types of high-throughput.
Biological Network Analysis
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Protein-protein Interactions
Comparative Network Analysis BMI/CS 776 Spring 2013 Colin Dewey
CSCI2950-C Lecture 12 Networks
Biological networks CS 5263 Bioinformatics.
Presentation transcript:

Biological networks: Types and origin Protein-protein interactions, complexes, and network properties Thomas Skøt Jensen Center for Biological Sequence Analysis The Technical University of Denmark

Networks in electronics Radio kindly provided by Lazebnik, Cancer Cell, 2002

Model Generation Interactions Radio kindly provided by Lazebnik, Cancer Cell, 2002 Parts List YER001W YBR088C YOL007C YPL127C YNR009W YDR224C YDL003W YBL003C … YDR097C YBR089W YBR054W YMR215W YBR071W YBL002W YNL283C YGR152C … Sequencing Gene knock-out Microarrays etc. Interactions Genetic interactions Protein-Protein interactions Protein-DNA interactions Subcellular Localization Dynamics Microarrays Proteomics Metabolomics

Types of networks

Interaction networks in molecular biology Protein-protein interactions Protein-DNA interactions Genetic interactions Metabolic reactions Co-expression interactions Text mining interactions Association networks

Interaction networks in molecular biology Protein-protein interactions Protein-DNA interactions Genetic interactions Metabolic reactions Co-expression interactions Text mining interactions Association networks

Characterization of physical interactions Obligation –obligate (protomers only found/function together) –non-obligate (protomers can exist/function alone) Time of interaction –permanent (complexes, often obligate) –strong transient (require trigger, e.g. G proteins) –weak transient (dynamic equilibrium)

ol Examples: GPCR obligate, permanent non-obligate, strong transient

Approaches by interaction type Physical Interactions –Yeast two hybrid screens –Affinity purification (mass spec) –Protein-DNA by chIP-chip Other measures of ‘association’ –Genetic interactions (double deletion mutants) –Functional associations (STRING) –Co-expression

Yeast two-hybrid method Y2H assays interactions in vivo. Uses property that transcription factors generally have separable transcriptional activation (AD) and DNA binding (DBD) domains. A functional transcription factor can be created if a separately expressed AD can be made to interact with a DBD. A protein ‘bait’ B is fused to a DBD and screened against a library of protein “preys”, each fused to a AD.

Yeast two-hybrid method Fields and Song

Issues with Y2H Strengths –High sensitivity (transient & permanent PPIs) –Takes place in vivo –Independent of endogenous expression Weaknesses: False positive interactions –Auto-activation –‘sticky’ prey –Detects “possible interactions” that may not take place under real physiological conditions –May identify indirect interactions (A-C-B) Weaknesses: False negatives interactions –Similar studies often reveal very different sets of interacting proteins (i.e. False negatives) –May miss PPIs that require other factors to be present (e.g. ligands, proteins, PTMs)

Protein interactions by immuno-precipitation followed by mass spectrometry Start with affinity purification of a single epitope- tagged protein This enriched sample typically has a low enough complexity to be fractionated on a standard polyacrylamide gel Individual bands can be excised from the gel and identified with mass spectrometry.

Affinity Purification

Strengths High specificity Well suited for detecting permanent or strong transient interactions (complexes) Detects real, physiologically relevant PPIs Weaknesses Less suited for detecting weaker transient interactions (low sensitivity) May miss complexes not present under the given experimental conditions (low sensitivity) May identify indirect interactions (A-C-B)

Protein-protein interaction data growth Error rate may be as high as %

Topology based scoring of interactions Low confidence (4 unshared interaction partners) High confidence (1 unshared interaction partners) ABC Yeast two-hybrid Low confidence (rarely purified together) High confidence (often purified together) Complex pull-downs D de Lichtenberg et al., Science, 2005

Filtering by subcellular localization de Lichtenberg et al., Science, 2005

Reducing the error rate in PPI data Benchmark by measuring the overlap between the curated MIPS complexes and different PPI data sets derived from high-throughput screens. Overlap with curated MIPS complexesEstimated error rate de Lichtenberg et al., Science, 2005

Filtering reduces coverage and increases specificity

Network Properties Graphs, paths, topology

Graphs Graph G=(V,E) is a set of vertices V and edges E A subgraph G’ of G is induced by some V’  V and E’  E Graph properties: –Connectivity (node degree, paths) –Cyclic vs. acyclic –Directed vs. undirected

Sparse vs Dense G(V, E) where |V|=n, |E|=m the number of vertices and edges Graph is sparse if m~n Graph is dense if m~n 2 Complete graph when m=n 2

Connected Components G(V,E) |V| = 69 |E| = 71

Connected Components G(V,E) |V| = 69 |E| = 71 6 connected components

Paths A path is a sequence {x 1, x 2,…, x n } such that (x 1,x 2 ), (x 2,x 3 ), …, (x n-1,x n ) are edges of the graph. A closed path x n =x 1 on a graph is called a graph cycle or circuit.

Shortest-Path between nodes

Longest Shortest-Path

Degree or connectivity

Random vs scale-free networks P(k) is probability of each degree k, i.e fraction of nodes having that degree. For random networks, P(k) is normally distributed. For real networks the distribution is often a power- law: P(k) ~ k  Such networks are said to be scale-free

“The Swedish sex web” Target the ‘hubs’ to have an efficient safe sex education campaign Lewin Bo, et al., Sex i Sverige; Om sexuallivet i Sverige 1996, Folkhälsoinstitutet, 1998

Knock-out lethality and connectivity

Clustering coefficient k: neighbors of I n I : edges between node I’s neighbors The density of the network surrounding node I, characterized as the number of triangles through I. Related to network modularity The center node has 8 (grey) neighbors There are 4 edges between the neighbors C = 2*4 /(8*(8-1)) = 8/56 = 1/7

Protein complexes have a high clustering coefficient Proteins subunits are highly interconnected and thus have a high clustering coefficient There exists algorithms, such as MCODE, for identifying subnetworks (complexes) in large protein-protein interaction networks

Hierarchical Networks

Detecting hierarchical organization

Scale-free networks are robust Complex systems (cell, internet, social networks), are resilient to component failure Network topology plays an important role in this robustness –Even if ~80% of nodes fail, the remaining ~20% still maintain network connectivity Attack vulnerability if hubs are selectively targeted In yeast, only ~20% of proteins are lethal when deleted, and are 5 times more likely to have degree k>15 than k<5.

Other interesting features Cellular networks are assortative, hubs tend not to interact directly with other hubs. Hubs tend to be “older” proteins (so far claimed for protein-protein interaction networks only) Hubs also seem to have more evolutionary pressure—their protein sequences are more conserved than average between species (shown in yeast vs. worm)

Sub-cellular localization coverage

Co-localization of interacting proteins

Tendency to interact with your cousin

Over-representation of highly abundant proteins

Coverage versus Accuracy say a lot, of which most is wrong say a lot, of which most is right say little, of which most is wrong say little, of which most is right Specificity Sensitivity