BiGG: Biochemical, Genetic and Genomic Database Jun Young Park 1, Jan Schellenberger 2, Tom M. Conrad 3, Bernhard Ø. Palsson 1,2 1 Department of Bioengineering,

Slides:



Advertisements
Similar presentations
Three-Step Database Design
Advertisements

Predicting essential genes via impact degree on metabolic networks ISSSB’11 Takeyuki Tamura Bioinformatics Center, Institute for Chemical Research Kyoto.
CellDesigner Tutorial Laurence Calzone, Andrei Zinovyev UMR U900 INSERM/Institut Curie/Ecole des Mines de Paris Wednesday, April 30th.
Interoperation of Molecular Biology Databases Peter D. Karp, Ph.D. Bioinformatics Research Group SRI International Menlo Park, CA
Systems Biology Biological Sequence Analysis
Chapter 12: ADO.NET and ASP.NET Programming with Microsoft Visual Basic.NET, Second Edition.
Introduction to Bioinformatics - Tutorial no. 13 Probe Design Gene Networks.
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
Integrated analysis of regulatory and metabolic networks reveals novel regulatory mechanisms in Saccharomyces cerevisiae Speaker: Zhu YANG 6 th step, 2006.
Systems Biology Biological Sequence Analysis
August 29, 2002InforMax Confidential1 Vector PathBlazer Product Overview.
Pathway databases Goto S, Bono H, Ogata H, Fujibuchi W, Nishioka T, Sato K, Kanehisa M. (1997) Organizing and computing metabolic pathway data in terms.
Systems Biology Biological Sequence Analysis
1 BrainWave Biosolutions Limited Accelerating Life Science Research through Technology.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Enzymatic Function Module (KEGG, MetaCyc, and EC Numbers)
XP New Perspectives on Microsoft Access 2002 Tutorial 71 Microsoft Access 2002 Tutorial 7 – Integrating Access With the Web and With Other Programs.
Genome database & information system for Daphnia Don Gilbert, October 2002 Talk doc at
TIBCO Designer TIBCO BusinessWorks is a scalable, extensible, and easy to use integration platform that allows you to develop, deploy, and run integration.
INTRODUCTION TO WEB DATABASE PROGRAMMING
Ch10. Intermolecular Interactions and Biological Pathways
Overviews, Omics Viewers, and Object Groups. SRI International Bioinformatics Introduction Each overview is a genome-scale diagram of cellular machinery.
Lecture 3: Pathway Generation Tool I: CellDesigner: A modeling tool of biochemical networks Y.Z. Chen Department of Pharmacy National University of Singapore.
Overviews and Omics Viewers. SRI International Bioinformatics Introduction Each overview is a genome-scale diagram of cellular machinery l Cellular Overview.
Introducing Dreamweaver MX 2004
Tutorial 1 Getting Started with Adobe Dreamweaver CS3
Bioinformatics Dr. Víctor Treviño BT4007
Copyright OpenHelix. No use or reproduction without express written consent1.
Copyright OpenHelix. No use or reproduction without express written consent1.
Networks and Interactions Boo Virk v1.0.
Fission Yeast Computing Workshop -1- Searching, querying, browsing downloading and analysing data using PomBase Basic PomBase Features Gene Page Overview.
BASys: A Web Server for Automated Bacterial Genome Annotation Gary Van Domselaar †, Paul Stothard, Savita Shrivastava, Joseph A. Cruz, AnChi Guo, Xiaoli.
SRI International Bioinformatics 1 The Structured Advanced Query Page Tomer Altman & Mario Latendresse Bioinformatics Research Group SRI, International.
Reconstruction of Transcriptional Regulatory Networks
Copyright OpenHelix. No use or reproduction without express written consent1.
Virtual Cell and CellML The Virtual Cell Group Center for Cell Analysis and Modeling University of Connecticut Health Center Farmington, CT – USA.
BIOINFORMATICS ON NETWORKS Nick Sahinidis University of Illinois at Urbana-Champaign Chemical and Biomolecular Engineering.
1 Relational Databases and SQL. Learning Objectives Understand techniques to model complex accounting phenomena in an E-R diagram Develop E-R diagrams.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
Top Four Essential TAIR Resources Debbie Alexander Metabolic Pathway Databases for Arabidopsis and Other Plants Peifen Zhang.
1 Departament of Bioengineering, University of California 2 Harvard Medical School Department of Genetics Metabolic Flux Balance Analysis and the in Silico.
Sharing Models. How Can I Exchange Models? SBML (Systems Biology Markup Language): de facto standard for representing cellular networks. A large number.
The ERATO Systems Biology Workbench: Enabling Interaction and Exchange Between Tools for Computational Biology Michael Hucka, Andrew Finney, Herbert Sauro,
es/by-sa/2.0/. Simulation Programs: What is out there? A critical evaluation. Prof:Rui Alves
A collaborative tool for sequence annotation. Contact:
Systems Biology Markup Language Ranjit Randhawa Department of Computer Science Virginia Tech.
Introduction to biological molecular networks
Metabolic pathway alteration, regulation and control (3) Xi Wang 01/29/2013 Spring 2013 BsysE 595 Biosystems Engineering for Fuels and Chemicals.
Lesson 13 Databases Unit 2—Using the Computer. Computer Concepts BASICS - 22 Objectives Define the purpose and function of database software. Identify.
Constraint-based Metabolic Reconstructions & Analysis © 2015 H. Scott Hinton Lesson: Introduction BIE 5500/6500Utah State University Introduction to Systems.
Chapter 5 Introduction To Form Builder. Lesson A Objectives  Display Forms Builder forms in a Web browser  Use a data block form to view, insert, update,
Copyright OpenHelix. No use or reproduction without express written consent1.
SRI International Bioinformatics 1 The Structured Advanced Query Page Mario Latendresse Tomer Altman Bioinformatics Research Group SRI International March,
SRI International Bioinformatics 1 Editing Pathway/Genome Databases Ron Caspi.
SAGExplore web server tutorial. The SAGExplore server has three different modules …
Welcome to Gramene’s RiceCyc (Pathways) Tutorial RiceCyc allows biochemical pathways to be analyzed and visualized. This tutorial has been developed for.
Copyright OpenHelix. No use or reproduction without express written consent1 1.
SRI International Bioinformatics 1 Pathway Tools Features Available Only in the Desktop Version PathoLogic.
SRI International Bioinformatics 1 The Structured Advanced Query Page Tomer Altman Mario Latendresse Bioinformatics Research Group SRI International April.
Recent Developments and Future Directions in Pathway Tools Peter D. Karp SRI International.
Department of Genetics • Stanford University School of Medicine
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
Overview of Microbial Pathway and Genome Databases
1 Department of Engineering, 2 Department of Mathematics,
Annotation Presentation
Tutorial 7 – Integrating Access With the Web and With Other Programs
Overview of the Pathway Tools FBA Module
Logging In Using CAT for the Participant Version 1.6
Presentation transcript:

BiGG: Biochemical, Genetic and Genomic Database Jun Young Park 1, Jan Schellenberger 2, Tom M. Conrad 3, Bernhard Ø. Palsson 1,2 1 Department of Bioengineering, University of California San Diego 2 Bioinformatics Program, University of California San Diego 3 Department of Chemistry and Biochemistry, University of California San Diego DATABASE CONTENTS Introduction EXPORTING BROWSING ABSTRACT References 1.Becker SA, Feist AM, Mo ML, Hannum G, Palsson BO, Herrgard MJ: Quantitative prediction of cellular metabolism with constraint-based models: The COBRA Toolbox. Nat Protocols 2007, 2(3): Ogata H, Goto S, Sato K, Fujibuchi W, Bono H, Kanehisa M: KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res 1999, 27(1): Hucka M, Finney A, Sauro HM, Bolouri H, Doyle JC, Kitano H, Arkin AP, Bornstein BJ, Bray D, Cornish-Bowden A et al: The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models. Bioinformatics 2003, 19(4): We describe BiGG, a database of Biochemically, Genetically and Genomically structured genome-scale metabolic network reconstructions. BiGG integrates several published genome-scale metabolic networks into one resource with standard nomenclature which allows components to be compared across different species. Furthermore, BiGG contains links to several publicly available databases where additional information can be found and integrated. In addition, BiGG contains a customized export tool that enables the generation of SBML files for further network analysis by external software packages. BiGG addresses a need in the systems biology community to have access to high quality curated metabolic reconstructions. CONCLUSION The scope of covered reactions is often greater than for other databases. BiGG uses both genetics and literature based data to assess whether a reaction is present. BiGG assigns confidence levels to each reaction which can be used when evaluating the resultant model. BiGG includes relationships between genes and proteins (GPR). Compartmentalization in BiGG gives a more accurate description of reactions involving membrane transporters. BiGG provides the gap between a reconstruction and a model. The BiGG database provides the first collection of curated high quality metabolic reconstructions suitable for study with COBRA methods. Gene-Protein-Reaction (GPR) associations Single Gene ReactionMultiple Gene Reaction DNA mRNA Protein Reaction translationcomplexing transcription activity sphingosine kinase 2platelet-activating factor acetylhydrolase The on or off state of each reaction in the network may be controlled by the genotype and expression level of associated genes. Some cases involve multiple genes and proteins whose relationship is described using Boolean logic. A single protein may be composed of subunits coded by two (or more) genes. GPRs may be used to evaluate the effects of gene knockouts and gene regulation on the metabolic reconstructions, ruling out reactions whose necessary genes are not available. E coli iAF1260 H sapiens S cerevisiae 240 (160) 134 (122) 106 (67) 200 (195) 3197 (2915) 1901 (1733) 691 (672) 766 (745) E coli iAF1260 H sapiens S cerevisiae Reactions Metabolites Reactions may be searched for by name, EC number, or associated gene as well as by using the model name as the only search parameter. Specifying compartment, pathway, or metabolite participation is also a possibility. Results may be limited by only including reactions with known gene associations, high or low confidence, or by excluding transport reactions. In addition, reactions may be searched across reconstructions allowing for model comparison. Lists of reactions matching a set of criteria may be exported as a tab delimited flat file. The exported files can contain information for multiple models, simplifying model comparison. Metabolites may be searched for by name, KEGG ID, CAS ID, or charge. Limiting searches by compartment, pathway, and organism is possible. In addition to basic metabolite information such as formula and charge, lists of reactions in which the metabolite participates are listed and categorized by the metabolites role as a reactant or a product. This feature facilitates the tracing of a metabolite through a pathway in the absence of graphical pathway maps. Lists of metabolites matching a set of search criteria may be exported, and contain information such as metabolite name, abbreviation, formula, KEGG ID, and CAS ID. Metabolic Maps The left diagram shows the number of reactions shared by the three largest reconstructions. The numbers in parentheses represent non-exchange reactions. The right diagram shows the number of metabolites shared by the three largest reconstructions. Compartmentalization Optional Information Simulation The BiGG database is capable of exporting reconstructions in SBML format. This XML format is widely used for distributing systems biology models. The user has several options to customize export on the Web. A compartment in a metabolic reconstruction has a distinct pool of metabolites and a set of reactions which may be unique to that compartment. By default, reactions and metabolites are compartmentalized in the models meaning they exist in distinct compartments such as the Cytosol or the Golgi. The user can choose the model to be partially decompartmentalized or fully decompartmentalized. If partially decompartmentalized, reactions and metabolites ordinarily assigned to subcompartments of the Cytosol (Mitochondria, Peroxisome, etc) are instead assigned to the Cytosol, while the Extraorganism compartment is untouched. In a fully decompartmentalized model, there are no compartments and all reactions and metabolites exist in an unsegregated single-compartment system. The user can choose which optional information to include in the SBML file. The notes field of the Reaction entries can include Boolean strings corresponding to the GPR statements. The GPR field is read and interpreted by the COBRA toolbox. The SBML file may also include information on genes, proteins and citations. Because the SBML specification does not include fields for this kind of data, this information is stored in the notes field of the reaction entries. Each model includes several metabolic maps. All the maps are drawn in SVG format and can be displayed on all major browsers. When there are available maps that include any chosen reaction or metabolite, the maps will be listed in details page under appropriate organisms. Primary molecules are drawn larger compared to other non-primary molecules. Molecules that are outside the cell (extraorganism) are colored yellow. Molecules in different compartment have different suffixes in their names. For example, Cytosol is [c] and Nucleus is [n]. In case of reversible reactions, reactant-side molecules are pointed with smaller arrowheads. The reaction or metabolite the user searched is highlighted red so that it is easier to locate it on the map. The components of the maps, lines and circles, are hyperlinked to display more information on them when they are clicked. This graphical representation would provide the user with another way of understanding chemical pathways. The map to the right shows a part of Carbohydrate Metabolism in human. The SBML file contains a few additional reactions that are necessary for simulation purposes. In case of H. pylori iIT341, reactions DM_HMFURN, sink_ahcys(c), and sink_amob are present in the exported model, for example. To run meaningful simulations, it is important that the bounds of exchange fluxes be specified to model the environment. By including the flux bound vectors in the SBML file, the simulation process is simplified for simulations. In addition, upper and lower flux bounds of all reactions may be refined before exporting so as to allow the user to create SBML files with customized parameters. The last ten years have seen the emergence of many genome-scale metabolic reconstructions. These manually-curated, component-by-component (bottom- up) reconstructions of genomic and bibliomic data have lead to a biochemically, genetically and genomically structured (BiGG) knowledgebase. Such reconstructions are of interest for their detailed curated content and for their utility in assessing metabolic capabilities. A metabolic reconstruction can be mathematically represented as an in silico model for computing allowable network states through the application of governing chemical and genetic constraints under the constraint based reconstruction and analysis (COBRA) framework. Furthermore, gap analysis identifies possible missing reactions by finding so called dead end metabolites which can be produced by the network but not consumed. BiGG includes seven different genome-scale reconstructions of six organisms: Homo sapiens Recon 1, Escherichia coli iJR 904 and iAF1260, Saccharomyces cerevisiae iND750, Staphylococcus aureus iSB619, Methanosarcina barkeri iAF692, and Helicobacter pylori iIT341. The Website The BiGG browser and exporter. SBML files are compatible with the COBRA toolbox which allows performing many computational procedures. Using the COBRA toolbox, the SBML file exported from BiGG may be imported as a network data structure into Matlab. COBRA Compatibility BiGG is available at NMN Metabolism in S. cerevisiae All queries are performed by a Linux/Apache Server using Perl with the CGI and DBI modules. Database Schema Reconstructions are developed in and stored on a Genomatica (San Diego, CA)-supplied Simpheny TM server running an Oracle TM database. Access to this database is provided by a read-only client with several tables and views for accessing information on Reactions, Metabolites, Genes, Proteins and Citations.