Predicting essential genes via impact degree on metabolic networks ISSSB’11 Takeyuki Tamura Bioinformatics Center, Institute for Chemical Research Kyoto.

Slides:



Advertisements
Similar presentations
Global States.
Advertisements

Neural Networks  A neural network is a network of simulated neurons that can be used to recognize instances of patterns. NNs learn by searching through.
Darwinian Genomics Csaba Pal Biological Research Center Szeged, Hungary.
Metabolic functions of duplicate genes in Saccharomyces cerevisiae Presented by Tony Kuepfer et al
Global Mapping of the Yeast Genetic Interaction Network Tong et. al, Science, Feb 2004 Presented by Bowen Cui.
Prediction of Therapeutic microRNA based on the Human Metabolic Network Ming Wu, Christina Chan Bioinformatics Advance Access Published January 7, 2014.
Project Selection Three main categories of methods/approaches:  Strategic approach  Analytical approach  Financial methods.
E.coli aerobic/anaerobic switch study Chao Wang, Mar
Integration of enzyme activities into metabolic flux distributions by elementary mode analysis Kyushu Institute of Technology Hiroyuki Kurata, Quanyu Zhao,
1 2 Extreme Pathway Lengths and Reaction Participation in Genome Scale Metabolic Networks Jason A. Papin, Nathan D. Price and Bernhard Ø. Palsson.
The activity reaction core and plasticity of metabolic networks Almaas E., Oltvai Z.N. & Barabasi A.-L. 01/04/2006.
Nature Genetics 37, (2004) Modular epistasis in yeast metabolism Daniel Segr � 1, Alexander DeLuna2, George M Church1 & Roy Kishony2.
Mathematical Representation of Reconstructed Networks The Left Null space The Row and column spaces of S.
Regulatory networks 10/29/07. Definition of a module Module here has broader meanings than before. A functional module is a discrete entity whose function.
In silico aided metaoblic engineering of Saccharomyces cerevisiae for improved bioethanol production Christoffer Bro et al
Experimental and computational assessment of conditionally essential genes in E. coli Chao WANG, Oct
Integrated analysis of regulatory and metabolic networks reveals novel regulatory mechanisms in Saccharomyces cerevisiae Speaker: Zhu YANG 6 th step, 2006.
Gene Regulatory Networks - the Boolean Approach Andrey Zhdanov Based on the papers by Tatsuya Akutsu et al and others.
Introduction to CSE 591: Autonomous agents - theory and practice. Chitta Baral Professor Department of Computer Sc. & Engg. Arizona State University.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
1 Chapter 3 Structuring Decision. 2 Structuring Decisions Learning Objectives Fundamental steps in model creation Identify and structure values and objectives.
Modeling the Gene Expression of Saccharomyces cerevisiae Δcin5 Under Cold Shock Conditions Kevin McKay Laura Terada Department of Biology Loyola Marymount.
1 ECE 453 – CS 447 – SE 465 Software Testing & Quality Assurance Instructor Kostas Kontogiannis.
Gene Set Enrichment Analysis (GSEA)
Lecture #23 Varying Parameters. Outline Varying a single parameter – Robustness analysis – Old core E. coli model – New core E. coli model – Literature.
Functional Genomic Hypothesis Generation and Experimentation by a Robot Scientist King et al, Nature : Presented by Monica C. Sleumer February.
The importance of enzymes and their occurrences: from the perspective of a network W.C. Liu 1, W.H. Lin 1, S.T. Yang 1, F. Jordan 2 and A.J. Davis 3, M.J.
Genetic modification of flux (GMF) for flux prediction of mutants Kyushu Institute of Technology Quanyu Zhao, Hiroyuki Kurata.
Transcriptional Regulation in Constraints-based metabolic Models of E. coli Published by Markus Covert and Bernhard Palsson, 2002.
The Optimal Metabolic Network Identification Paula Jouhten Seminar on Computational Systems Biology
Improving NADPH availability for natural product biosynthesis in Escherichia coli by metabolic engineering 汇报人:刘巧洁.
Comparison of networks in cell biology Jörn Behre, Dept. of Bioinformatics, Friedrich-Schiller-University Jena 4th SFB-Workshop "Gene regulatory networks",
CHAPTER 15 Metabolism: Basic Concepts and Design.
Reconstruction of Transcriptional Regulatory Networks
BIOINFORMATICS ON NETWORKS Nick Sahinidis University of Illinois at Urbana-Champaign Chemical and Biomolecular Engineering.
Human liver rate-limiting enzymes influence metabolic flux via branch points and inhibitors Min Zhao Center for Bioinformatics Peking University.
Clustering Metabolic Networks Using Minimum Cut Trees Ryan Kellogg 1, Allison Heath 2, Lydia Kavraki 2,3 1 Carnegie Mellon University, Department of Electrical.
Intel Confidential – Internal Only Co-clustering of biological networks and gene expression data Hanisch et al. This paper appears in: bioinformatics 2002.
1 Departament of Bioengineering, University of California 2 Harvard Medical School Department of Genetics Metabolic Flux Balance Analysis and the in Silico.
Higher Biology Chapter 16 Gene Mutations. This type of mutation involves a change in one or more of the nucleotides in a strand of DNA. There are four.
Introduction: Acknowledgments Thanks to Department of Biotechnology (DBT), the Indo-US Science and Technology Forum (IUSSTF), University of Wisconsin-Madison.
10. Lecture WS 2014/15 Bioinformatics III1 V10 Metabolic networks - Graph connectivity Graph connectivity is related to analyzing biological networks for.
System Biology (ISA5101) Preliminary Project Group 14 g 李昀 g 何瓊雯.
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
Introduction to biological molecular networks
Metabolic pathway alteration, regulation and control (3) Xi Wang 01/29/2013 Spring 2013 BsysE 595 Biosystems Engineering for Fuels and Chemicals.
In silico gene targeting approach integrating signaling, metabolic, and regulatory networks Bin Song Jan 29, 2009.
Yeast geneticists frequently invoke:
 Signal Transduction transmits signals from outside to the inside of the cell  Integer Linear Programming model is used to unravel STN.
Metabolic inheritance Eörs Szathmáry Collegium Budapest AND Eötvös University.
Nonlinear differential equation model for quantification of transcriptional regulation applied to microarray data of Saccharomyces cerevisiae Vu, T. T.,
Copyright OpenHelix. No use or reproduction without express written consent1 1.
1 Lesson 12 Networks / Systems Biology. 2 Systems biology  Not only understanding components! 1.System structures: the network of gene interactions and.
Chapter 7: The Distribution of Sample Means
Creating a Worksheet Fiscal periods typically begin at the beginning of a month and lasts a specified amount of time. Usually a fiscal period for a business.
RDF based on Integration of Pathway Database and Gene Ontology SNU OOPSLA LAB DongHyuk Im.
By Chris Paine Metabolism (AHL) Essential idea: Metabolic reactions are regulated in response to the cell’s needs.
Structural analysis of metabolic network models
The Pathway Tools FBA Module
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
Analysis of in silico flux changes along the exponential growth phase: (A) In silico flux changes from 24 to 36 h, from 36 to 48 h, and from 48 to 60 h.
Overview of Microbial Pathway and Genome Databases
1 Department of Engineering, 2 Department of Mathematics,
V11 Metabolic networks - Graph connectivity
V11 Metabolic networks - Graph connectivity
1.3.1 Function of Food Why do we need food?.
Characteristics of Life
V11 Metabolic networks - Graph connectivity
Label propagation algorithm
Presentation transcript:

Predicting essential genes via impact degree on metabolic networks ISSSB’11 Takeyuki Tamura Bioinformatics Center, Institute for Chemical Research Kyoto University, Japan

Essential genes, lethal pairs E. coli K12 has more than 4000 coding genes. By checking cell growth rate of single knockout of each gene, only 303 genes are identified as essential for growth in rich medium. (Baba et al. 2006) Screening of cell growth rate of double knockouts are ongoing on E. coli and S. Cerevisiae by some biological groups. Although these experiments will be completed in a few years, reasons why these single (double) knockouts are essential (or lethal) will not be directly revealed.

The aim of this research is to reveal how each single (or double) knockout affects cell growth rates in silico especially on metabolic networks. To do so, some mathematical model for metabolic networks and gene knockouts is necessary. A good model may predict the effect of double knockouts, triple knockouts… As the first step of the study, we extend the impact degree model (Jiang et al. 2009), which is a combination of Boolean model and flux balance model, to asses the effect of gene knockouts on metabolic networks. As a result of computer experiments, it Is seen that genes with high impact degree tend to be essential for single knockouts. Aim of the research

∨ ∨ ∨ ∨ ∧ ∧ ∨ ∨ A B C D reaction 1 reaction 2 E F target compound A B E F C D reaction 1 reaction For each compound, the sum of incoming flow must equal to the sum of outgoing flow. For reaction 1 A + 2B → 2C + D For reaction 2 E + F → D Flux balance model For each reaction, ratio of compounds must be satisfied. (Papin et al. 2003, Stelling et al. 2002) Boolean model (Sridhar et al. 2008) For each compound, amount is represented only by 1(exist) or 0(not exist). For each reaction, state is represented only by 1(occur) or 0(not occur). Model of metabolic network

∨ ∨ ∨ ∨ ∧ ∧ ∨ C D E F G target compound reaction 2 reaction 3 Which reactions should be inactivated so that the target compound becomes non-producible (assigned 0)? ∨ ∨ ∧ A B reaction 1 Source nodes, whose indegrees are 0, are always assigned 1 (exist, producible). Boolean model of metabolic network inactivate Source node

∨ ∨ ∨ ∨ ∧ ∧ ∨ C D E F G target compound reaction 2 reaction 3 ∧ A B reaction 1 inactivate Source nodes whose indegrees are 0 are always assigned 1. Which reactions should be inactivated so that the target compound becomes non-producible (assigned 0)? Source node Boolean model of metabolic network

Impact degree model of metabolic network The impact degree model (Jiang et al. 2009) is a kind of Boolean model focusing on steady states. Different from usual Boolean model, each node is affected by its successors. To be active, not only predecessors but also successors must be active in steady states. C1C1 R1R1 R2R2 R3R3 R4R4 R1R1 C1C1 C2C2 C3C3 C4C4

Impact degree model of metabolic network The impact degree is defined as the number of reactions inactivated by deleting a specified reaction (or a set of specified reactions). (Jiang et al. 2009) Since cycles are not taken into account in their method, we extend the definition of impact degree so that cycles can be treated. Cycles may yield multiple stable states. Assume all nodes are active initially. C1C1 R1R1 R2R2 R3R3 R4R4 R1R1 C1C1 C2C2 C3C3 C4C4

To calculate the impact degree of reaction R 1. Example 1 For compounds For reactions R 1 (1)=0,R 2 (1)=1, R 3 (1)=1, t=1 t=2 A(2)=0,B(2)=1,C(2)=1,D(2)=1, Thus, the impact degree for reaction R1 is 1. A(1)=0,B(1)=1,C(1)=1,D(1)=1, R 1 (2)=0,R 2 (2)=1, R 3 (2)=1,

To calculate the impact degree of reaction R 3, Example 2 For compounds For reactions R 1 (1)=1,R 2 (1)=1, R 3 (1)=0, t=1 t=2 A(2)=1,B(2)=0,C(2)=1,D(2)=0, Then, the states become stable and thus the impact degree for reaction R 3 is 3. R 1 (0)=1,R 2 (0)=1,R 3 (0)=0, A(1)=1,B(1)=0,C(1)=1,D(1)=0, R 1 (1)=0,R 2 (1)=0, R 3 (1)=0, t=3 A(3)=0,B(3)=0,C(3)=0,D(3)=0, R 1 (3)=0,R 2 (3)=0, R 3 (3)=0,

Impact degree by deletion of multiple reaction Deletion of R1 Deletion of R4 Newly inactivated Multiple deletion of (R1,R4)

Relation between essential genes of KEIO collection and top 14 reactions with high impact degree 28 R b3730 Essential R b3176 Non-essential R b3730 Essential 17 R b1849,b2550 Non-essential 15 R b1288 Essential R b1288 Essential R b3804 Essential R b3805 Essential R b0369 Essential R b0154 Essential R b2400 Essential R b1210 Essential R b0421 Essential R b0421 Essential Impact degree Reaction Enzyme gene Avg Calculate the impact degrees of single knockout for all reactions included in E. coli of KEGG database reactions, 831 compounds

Relation between essential genes of KEIO collection and top 14 reactions with high impact degree 28 R b3730 Essential R b3176 Essential in updated version R b3730 Essential 17 R b1849,b2550 Non-essential 15 R b1288 Essential R b1288 Essential R b3804 Essential R b3805 Essential R b0369 Essential R b0154 Essential R b2400 Essential R b1210 Essential R b0421 Essential R b0421 Essential Impact degree Reaction Enzyme gene Avg Calculate the impact degrees of single knockout for all reactions included in E. coli of KEGG database reactions, 831 compounds

12 of the 14 genes are included in the list of essential genes of KEIO collection. 13 of the 14 are essential in the updated version of KEIO collection. (Yamamoto et al. 2009) However, most genes with high impact degree are located outside central metabolism, consisting of Glycolysis, Gluconeogenesis, Citrate cycle and Pentose phosphate pathway. Since the central metabolism is of No.1 interest of most researchers, it is necessary to develop a mathematical model elucidating the relation between knockouts and essential genes. Relation between essential genes of KEIO collection and top 14 reactions with high impact degree

alternative pathways, flux balance, capability of producing important compounds, chemical structure of each compound, error of experiments etc. Should take account of

Summary Introduced mathematical model of metabolic network Flux balance model, Boolean model Impact degree model Combination of flux balance model and Boolean model Focusing on steady state #reactions(genes) impacted by knockout(s) Applied to data of KEGG E. coli, 12 (13 in updated version) of the 14 genes with the highest impact degrees are included in the list of essential genes of KEIO collection. Good prediction outside central metabolism, but not good in central metabolism. Necessary to develop a mathematical model elucidating relation between knockouts and cell growth rate. Should take account of alternative pathways, flux balance, capability of producing important compounds, chemical structure of each compound, error of experiments etc. Analyzing cell growth data of double knockouts is also ongoing.