Joint analysis of genetic and physical interactions in S. cerevisiae Igor Ulitsky Ron Shamir lab School of Computer Science Tel Aviv University.

Slides:



Advertisements
Similar presentations
Whole Genome Duplications (Polyploidy) Made famous by S. Ohno, who suggested WGD can be a route to evolutionary innovation (focusing on neofunctionalization)
Advertisements

CSE Fall. Summary Goal: infer models of transcriptional regulation with annotated molecular interaction graphs The attributes in the model.
D ISCOVERING REGULATORY AND SIGNALLING CIRCUITS IN MOLECULAR INTERACTION NETWORK Ideker Bioinformatics 2002 Presented by: Omrit Zemach April Seminar.
Synthetic lethal analysis of Caenorhabditis elegans posterior embryonic patterning genes identifies conserved genetic interactions L Ryan Baugh, Joanne.
Darwinian Genomics Csaba Pal Biological Research Center Szeged, Hungary.
Metabolic functions of duplicate genes in Saccharomyces cerevisiae Presented by Tony Kuepfer et al
Global Mapping of the Yeast Genetic Interaction Network Tong et. al, Science, Feb 2004 Presented by Bowen Cui.
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
Fault Tolerance in Protein Interaction Networks: Stable Bipartite Subgraphs and Redundant Pathways Lenore Cowen Tufts University.
Genome-wide prediction and characterization of interactions between transcription factors in S. cerevisiae Speaker: Chunhui Cai.
Gene expression analysis summary Where are we now?
CISC667, F05, Lec26, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) Genetic networks and gene expression data.
Comparison of Networks Across Species CS374 Presentation October 26, 2006 Chuan Sheng Foo.
Regulatory networks 10/29/07. Definition of a module Module here has broader meanings than before. A functional module is a discrete entity whose function.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Cs726 Modeling regulatory networks in cells using Bayesian networks Golan Yona Department of Computer Science Cornell University.
Functional genomics and inferring regulatory pathways with gene expression data.
Experimental and computational assessment of conditionally essential genes in E. coli Chao WANG, Oct
Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break 14:45 – 15:15Regulatory pathways lecture 15:15 – 15:45Exercise.
Copyright  2004 limsoon wong Assessing Reliability of Protein- Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research.
Evidence for dynamically organized modularity in the yeast protein- protein interaction network Han, et al
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Transcription control reprogramming in genetic backup circuits Literature search WANG Chao 4/6/2005.
Biological networks: Types and origin
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Epistasis Analysis Using Microarrays Chris Workman.
Comparative Expression Moran Yassour +=. Goal Build a multi-species gene-coexpression network Find functions of unknown genes Discover how the genes.
DEMO CSE fall. What is GeneMANIA GeneMANIA finds other genes that are related to a set of input genes, using a very large set of functional.
Interaction Networks in Biology: Interface between Physics and Biology, Shekhar C. Mande, August 24, 2009 Interaction Networks in Biology: Interface between.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Optimization Based Modeling of Social Network Yong-Yeol Ahn, Hawoong Jeong.
MATISSE - Modular Analysis for Topology of Interactions and Similarity SEts Igor Ulitsky and Ron Shamir Identification.
Network Analysis and Application Yao Fu
Network Biology Presentation by: Ansuman sahoo 10th semester
QNET: A tool for querying protein interaction networks Banu Dost +, Tomer Shlomi*, Nitin Gupta +, Eytan Ruppin*, Vineet Bafna +, Roded Sharan* + University.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Introduction to Bioinformatics Biological Networks Department of Computing Imperial College London March 18, 2010 Lecture hour 18 Nataša Pržulj
Supplementary Figure S1 eQTL prior model modified from previous approaches to Bayesian gene regulatory network modeling. Detailed description is provided.
1 Having genome data allows collection of other ‘omic’ datasets Systems biology takes a different perspective on the entire dataset, often from a Network.
Intel Confidential – Internal Only Co-clustering of biological networks and gene expression data Hanisch et al. This paper appears in: bioinformatics 2002.
CSCE555 Bioinformatics Lecture 18 Network Biology: Comparison of Networks Across Species Meeting: MW 4:00PM-5:15PM SWGN2A21 Instructor: Dr. Jianjun Hu.
IMPROVED RECONSTRUCTION OF IN SILICO GENE REGULATORY NETWORKS BY INTEGRATING KNOCKOUT AND PERTURBATION DATA Yip, K. Y., Alexander, R. P., Yan, K. K., &
Integration of chemical-genetic & genetic interaction data links bioactive compounds to cellular target pathways Parsons et al Nature Biotechnology.
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
Introduction to biological molecular networks
DNAmRNAProtein Small molecules Environment Regulatory RNA How a cell is wired The dynamics of such interactions emerge as cellular processes and functions.
The Bi-Module problem: new algorithms and applications Group meeting January 2013 David Amar.
Integrated Genomic and Proteomic Analyses of a Systematically Perturbed Metabolic Network Science, Vol 292, Issue 5518, , 4 May 2001.
Discovering functional interaction patterns in Protein-Protein Interactions Networks   Authors: Mehmet E Turnalp Tolga Can Presented By: Sandeep Kumar.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
1 Genomics Advances in 1990 ’ s Gene –Expressed sequence tag (EST) –Sequence database Information –Public accessible –Browser-based, user-friendly bioinformatics.
1 Having genome data allows collection of other ‘omic’ datasets Systems biology takes a different perspective on the entire dataset, often from a Network.
1 Lesson 12 Networks / Systems Biology. 2 Systems biology  Not only understanding components! 1.System structures: the network of gene interactions and.
Network Analysis Goal: to turn a list of genes/proteins/metabolites into a network to capture insights about the biological system 1.Types of high-throughput.
1 What forces constrain/drive protein evolution? Looking at all coding sequences across multiple genomes can shed considerable light on which forces contribute.
Network applications Sushmita Roy BMI/CS 576 Dec 9 th, 2014.
Network Motifs See some examples of motifs and their functionality Discuss a study that showed how a miRNA also can be integrated into motifs Today’s plan.
Simultaneous identification of causal genes and dys-regulated pathways in complex diseases Yoo-Ah Kim, Stefan Wuchty and Teresa M Przytycka Paper to be.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Comparative Network Analysis BMI/CS 776 Spring 2013 Colin Dewey
Biological networks CS 5263 Bioinformatics.
System Structures Identification
Quantitative Genetic Interactions Reveal Biological Modularity
Schedule for the Afternoon
SEG5010 Presentation Zhou Lanjun.
Principle of Epistasis Analysis
Presentation transcript:

Joint analysis of genetic and physical interactions in S. cerevisiae Igor Ulitsky Ron Shamir lab School of Computer Science Tel Aviv University

Motivation Only ≈18% percent of S. cerevisiae genes essential in rich medium More are probably essential under other conditions Essentials % probably even smaller in higher organisms Buffering seems abundant

Genetic interactions (GI) “Aggravating” interactions The observed phenotype is worse than what we would “expect” Synthetic lethality: joint deletion of two nonessential genes  lethal phenotype Synthetic sickness: joint deletion of two nonessential genes  slow growth

Assaying YKO strains Yeast knock-out (YKO) collection contains ≈ 4,800 single deletion strains. Each deleted gene is replaced by a short DNA “barcode” (2 x 20bp), flanked by universal primers. The contents of a mixture of strains can be assayed before and after treatment. Presence/absence patterns Presence/absence patterns Under/over-representation Under/over-representation

GI availability Systematically mapped by SGA and dSLAM Available for ≈200 gene queries and ≈4,500 targets in S. cerevisiae GI network: 13,632 interactions, 2,682 genes

Physical interactions (PI) Protein-protein interactions Y2H Y2H TAP TAP Protein-DNA interactions Chip 2 experiments Chip 2 experiments PI network: 68,172 interactions, 6,184 proteins

GI Analysis Spectra Genetically interacting homologs tend to exhibit “compensation” on the expression level R. Kafri et al R. Kafri et al Genetically interacting proteins frequently have a similar fold O. Dror et al O. Dror et al Genetic interactions can be used to delineate regulatory pathways R.P. Onge et al R.P. Onge et al. 2007

Joint analysis of GI and PI Motivation: Identifying pathways Identifying pathways Connect pathways with phenotypes Connect pathways with phenotypes Understand system features (robustness, essentiality…) Understand system features (robustness, essentiality…) Initial analysis: Proteins close in the GI network slightly more likely to physically interact (Tong et al., 2004) Proteins close in the GI network slightly more likely to physically interact (Tong et al., 2004) PI hubs likely to be GI hubs (Ozier et al., 2003) PI hubs likely to be GI hubs (Ozier et al., 2003)

BPMs Kelley and Ideker, 2005: modeling “explanations” for GIs Within-pathway model Between-pathway model GI PI

Kelley & Ideker conclusions 40% of the GIs can be explained by physical models Between-pathway models explain x 3.5 times more GIs than within-pathway models Models capable of predicting protein functions

BPM rationale Between pathway models suggest redundancy Alternative paths Independent KOs  little effect Independent KOs  little effect Joint KOs in both  severe effect Joint KOs in both  severe effect Viable phenotype Lethal phenotype

Density model (Kelley and Ideker) Density scores: Drawbacks A pair of dense pathways with little GIs will score high A pair of dense pathways with little GIs will score high Some pathways may not be dense Some pathways may not be dense PI density within pathways GI density between pathways

DNA damage response example DNA Metabolism Oxydative stress response

BPM Hunting Look for pairs of pathways with GI evidence for buffering A pathway – a connected subnetwork in of PIs Model scoring based on the density of GIs Log-likelihood scores accounting for GI degrees

Connectivity vs. Density Dense pathways models Connected pathway models Average model size GIs between pathways GI density between pathways PI density within pathways Functional enrichment in GO 72.6%71.4%

Algorithmics High-scoring seeds Finding heavy bicliques with connectivity constraints Finding heavy bicliques with connectivity constraints Seed optimization Greedy search maintaining connectivity Greedy search maintaining connectivity Significance filtering

Finding seeds Kelley and Ideker start from single GIs Maximal biclique heuristic: For every edge (u,v) in the PI network For every edge (u,v) in the PI network Identify B – the set of nodes adjacent to both u and v Identify B – the set of nodes adjacent to both u and v Divide B into connected components in PI B 1,..B m Divide B into connected components in PI B 1,..B m For every B i with |B i |>k min For every B i with |B i |>k min Identify A i – the set of nodes adjacent to all the nodes in B i Identify A i – the set of nodes adjacent to all the nodes in B i Divide A i into connected components A i 1,…, A i n, Divide A i into connected components A i 1,…, A i n, If |A i j |>k min add (B i, A i j ) to the seed list If |A i j |>k min add (B i, A i j ) to the seed list Filter for overlaps Filter for overlaps

Adding “pivot proteins” to BPMs Pathway bifurcationRedundant sub-complexes

Pivot proteins Physically connected to both pathways in the BPM Connection significant given the general network degrees

Analysis flow 13,632 genetic interactions 68,172 physical interactions 140 models 124 pivot proteins in 40 models BPM analysis Pivot extraction Physiological characteristics Essentiality Phenotypes Protein Abundance Codon adaptation index mRNA half-life Phoshorylation

BPMs as functional modules Functional enrichment analysis (TANGO): 71.4% enriched for GO “biological process” 71.4% enriched for GO “biological process” 69.3% enriched for GO “cellular compartment” 69.3% enriched for GO “cellular compartment” 46.3% of known complexes enriched in at least one BPM 46.3% of known complexes enriched in at least one BPM

BPMs as phenotypic modules Source data: quantitative patterns of fitness of single deletion mutants in diverse conditions (Brown et al., 2006) Phenotypic pattern homogeneity within a BPM pathway highly significant (p<10 -3 )

Pivots tend to be multifunctional No. of proteins PivotsExpectedSignificance GO complexes p=1.27ּ10 -9 KEGG pathways p=3.68 ּ Curated multi- complexed proteins (Krause et al. 2006) p=7.49 ּ 10 -9

Pivot proteins are essential Q: Are proteins active in multiple partially redundant pathways more essential? 72/124 (58%) pivots essential (p=1.42 · ) Enrichment is not explained by node degrees (p<10 -5 ) Essential pivots closer in function to their BPMs than nonessential pivots Pivots significantly retained during evolution (p=9.79 · )

Evolutionary retention of pivots Essentiality linked to evolutionary retention (Gustafson et al., 2006) Essentiality linked to evolutionary retention (Gustafson et al., 2006) Retention computed using 26 eukaryotic genomes Retention computed using 26 eukaryotic genomes Pivots significantly retained in evolution (Wilcoxon rank-sum test, p=9.79·10 -9 ) Pivots significantly retained in evolution (Wilcoxon rank-sum test, p=9.79·10 -9 ) Enrichment is not explained by essentiality (p = 0.029) Enrichment is not explained by essentiality (p = 0.029)

SWR1 Ino80 SWR1/Ino80 Example Pivots

SAGA/Nuclear pore example Nuclear pore SAGA Pivots

Spt15 example

Smc1 example MIND Ndc80 COMA Mitotic spindle checkpoint

Physiological properties of BPMs Proteins studied: 850 proteins within BPMs 850 proteins within BPMs 120 pivot proteins 120 pivot proteins Focus on mRNA half life mRNA half life # of phosphorylation sites # of phosphorylation sites

BPMs are strongly regulated Genes within BPMs Shorter mRNA half-life (p=1.9·10 -9 ) Shorter mRNA half-life (p=1.9·10 -9 ) More phosphorylation sites (p=6.3·10 -9 ) More phosphorylation sites (p=6.3·10 -9 ) Both properties may represent regulation Redundant pathways  Strict regulation ? Enrichments not explained by degrees, essentiality, any enriched function mRNA half-life : experimental data, phosphosites : predicted Properties generally not correlated

Next generation – quantitative GIs Data on quantitative genetic interactions scores is becoming available Data on quantitative genetic interactions scores is becoming available These include also alleviating interactions These include also alleviating interactions

Alleviating interactions The observed phenotype is better than what we would “expect” “Expect” – multiplicative model: Mutation A – 80% fitness is retained Mutation A – 80% fitness is retained Mutation B – 60% fitness is retained Mutation B – 60% fitness is retained A & B – 48% fitness is retained A & B – 48% fitness is retained The first deletion ruins the functionally of the pathway, and the second one does not have an effect An alleviating interaction is a sign of participation in the same pathway

Extracting BPMs from quantitative data Probabilistic model for the aggravating/alleviating interaction scores Detection of pathways and buffering using quantitative data

Extracting BPMs from quantitative data Next challenge: simultaneously extraction of multiple pathways The output includes: Pathway boundaries Pathway boundaries Pairs of buffering pathways Pairs of buffering pathways

Extracting BPMs from quantitative data

Summary Computational analysis outlines pathways and buffering pathway pairs - BPMs BPM pathways tend to be strictly regulated BPM pivots correspond to essential multifunctional proteins Quantitative GIs carry the promise of a reacher analysis