Introduction to the Pathway Tools Software David Walsh and Simon Eng bigDATA Workshop—May 29, 2010.

Slides:



Advertisements
Similar presentations
How pathway databases were created and curated Peifen Zhang Plant Metabolic Network (PMN)
Advertisements

The Arabidopsis Information Resource (TAIR)
SRI International Bioinformatics Comparative Analysis Q
Journal Club Jenny Gu October 24, Introduction Defining the subset of Superfamilies in LUCA Examine adaptability and expansion of particular superfamilies.
Overview of the Pathway Tools Software and Pathway/Genome Databases.
Overview of the Pathway Tools Software and Pathway/Genome Databases.
SRI International Bioinformatics 1 The consistency Checker, or Overhauling a PGDB By Ron Caspi.
Curation of the EcoCyc Database: The EcoCyc Update Project Martha Arnaud Scientific Database Curator Bioinformatics Research Group SRI International
KEGG: Kyoto Encyclopedia of Genes and Genomes Susan Seo Intro to Bioinformatics Fall 2004.
陳虹瑋 國立陽明大學 生物資訊學程 Genome Engineering Lab. Genome Engineering Lab The Newest.
Pathway/Genome Databases and Software Tools Peter D. Karp, Ph.D. Bioinformatics Research Group SRI International
Subsystem Approach to Genome Annotation National Microbial Pathogen Data Resource Claudia Reich NCSA, University of Illinois, Urbana.
Update on The Pathway Tools Software Peter D. Karp, Ph.D. Bioinformatics Research Group SRI International BioCyc.org EcoCyc.org MetaCyc.org.
Methods and resources for pathway analysis PABIO590B Week 2.
Enzymatic Function Module (KEGG, MetaCyc, and EC Numbers)
PathoLogic Pathway Predictor. SRI International Bioinformatics Inference of Metabolic Pathways Pathway/Genome Database Annotated Genomic Sequence Genes/ORFs.
1 SRI International Bioinformatics Advanced PGDB Editing: Regulation GO Terms Ingrid M. Keseler Bioinformatics Research Group SRI International
Integration of E. Coli Data (E. coli Pathway and Genomic Data from BioCyc) Jesse Walsh.
Ch10. Intermolecular Interactions and Biological Pathways
1 SRI International Bioinformatics BioCyc Tutorial Peter D. Karp, Ph.D. Bioinformatics Research Group SRI International BioCyc.org EcoCyc.org,
1 SRI International Bioinformatics The Pathway Tools Software and BioCyc Database Collection Peter D. Karp, Ph.D. Bioinformatics Research Group SRI International.
SRI International Bioinformatics 1 Pathway Tools: Recent Developments GMOD Meeting, June 2006.
1 SRI International Bioinformatics EcoCyc, MetaCyc, and the Pathway Tools Software Peter D. Karp, Ph.D. Bioinformatics Research Group SRI International.
Data Content of the BioCyc Databases. BioCyc Tier 1 Databases.
Sequence Databases What are they and why do we need them.
TAIR/Gramene/SGN Workshop I ASPB Meeting July 08, 2007 Chicago, IL Metabolic Databases.
The BioCyc Collection of Pathway/Genome Databases Alexander Shearer Bioinformatics Research Group SRI International BioCyc.org EcoCyc.org.
SRI International Bioinformatics 1 Recent Developments in Pathway Tools GMOD Workshop November ‘07 Suzanne Paley Bioinformatics Research Group SRI International.
SRI International Bioinformatics 1 Advanced Editing of Pathway/Genome Databases Ron Caspi.
Genome databases and webtools for genome analysis Become familiar with microbial genome databases Use some of the tools useful for analyzing genome Visit.
PlantCyc, AraCyc, PoplarCyc and more... Building databases and connecting to researchers at the Plant Metabolic Network kate dreher curator PMN/TAIR.
The consistency Checker, or Overhauling a PGDB By Ron Caspi.
MetaCyc and AraCyc: Plant Metabolic Databases Hartmut Foerster Carnegie Institution.
1 SRI International Bioinformatics GO Term Integration and Curation in Pathway Tools and EcoCyc Ingrid M. Keseler Bioinformatics Research Group SRI International.
Top Four Essential TAIR Resources Debbie Alexander Metabolic Pathway Databases for Arabidopsis and Other Plants Peifen Zhang.
SRI International Bioinformatics 1 Submitting pathway to MetaCyc Ron Caspi.
1 SRI International Bioinformatics And now for our ‘Feature’ presentation: Automatic Loading of Protein Sequence Annotation Data from UniProt to Pathway.
SRI International Bioinformatics 1 SmartTables & Enrichment Analysis Peter Karp SRI Bioinformatics Research Group September 2015.
Metabolic Pathway Databases and Tools Speaker and Schedule Update PMN (Peifen Zhang) KEGG (auto-slide show) MetaCrop (cancelled)
2009 IADR, MIAMI, FL, USA Hands-on Experience for using the Human Oral Microbiome Database (HOMD) 2009 IADR Workshop, Miami, FL, USA Tsute (George) Chen.
PlantCyc, AraCyc, PoplarCyc and more... Building databases with YOUR help at the Plant Metabolic Network kate dreher curator PMN/TAIR.
Western New York Genetics in Research Partnership Expanding Exposure, Career Exploration and Interactive Projects in Basic Genome Analysis and Bioinformatics.
1, StarOmics course,Lausanne, Monday November 19 th Training agenda Chemicals Reactions Enzymes Pathways.
Editing Pathway/Genome Databases Compounds, Reactions and Pathways Ron Caspi.
SRI International Bioinformatics Update your computers! To install a patch: Tools => Instant Patch => Download and Activate All Patches.
Building and Refining AraCyc: Data Content, Sources, and Methodologies Kate Dreher TAIR, AraCyc, PMN Carnegie Institution for Science.
1 AraCyc Metabolic Pathway Annotation. 2 AraCyc – An overview  AraCyc is a metabolic pathway database for Arabidopsis thaliana;  Computational prediction.
Copyright OpenHelix. No use or reproduction without express written consent1 1.
Reconstructing the metabolic network of a bacterium from its genome: the construction of LacplantCyc Christof Francke In silico reconstruction of the metabolic.
SRI International Bioinformatics 1 Pathway Tools Features Available Only in the Desktop Version PathoLogic.
SRI International Bioinformatics Selected PathoLogic Refining Tasks Creation of Protein Complexes Assignment of Modified Proteins Operon Prediction.
Recent Developments and Future Directions in Pathway Tools Peter D. Karp SRI International.
 What is MSA (Multiple Sequence Alignment)? What is it good for? How do I use it?  Software and algorithms The programs How they work? Which to use?
Lecture 4.31 Protein Pathways and Pathway Databases Shan Sundararaj University of Alberta Edmonton, AB
` Comparison of Gene Ontology Term Annotations Between E.coli K12 Databases REDDYSAILAJA MARPURI WESTERN KENTUCKY UNIVERSITY.
Pathway Team SNU, IDB Lab. DongHyuk Im DongHee Lee.
Comparative Analysis in BioCyc
Why Create a PGDB? Perform pathway analyses as part of a genome project Analyze omics data Create a central public information resource for the organism,
An Advanced Web Query Interface for Biological Databases
The Pathway Tools FBA Module
Building Metabolic Models
A Community Effort to Model the Human Microbiome
Mangaldai College, Mangaldai
INFORMATION FLOW AARTHI & NEHA.
Overview of Microbial Pathway and Genome Databases
Cuong Nguyen, Deng Xin, Dongmei, Zheng Wang
Incremental PathoLogic
Propagating Changed Annotation and Pathway Information
Overview of the Pathway Tools FBA Module
Overview of the Pathway Tools Software and Pathway/Genome Databases
Presentation transcript:

Introduction to the Pathway Tools Software David Walsh and Simon Eng bigDATA Workshop—May 29, 2010

Outline 1.Metabolic annotation from genomic data 2.Pathway Tools and MetaCyc facilitated genome/pathway annotation 3.Pathway/genome database (PGDB) exploration 4.Potential applications in environmental ‘omics

?!?!

Metabolic Annotation Workflow Assembly of DNA sequence from an organism Identification of protein-encoding genes GLIMMER FGENESB Functional annotation of genes InterProScan BLAST (COG, KEGG, etc.) Inference of metabolic pathways Microsoft Excel Illustrator Web browser Generation of a cellular/community metabolic workflow Illustrator Automated Manual Pathway Tools

is a software suite for working with pathway/genome databases

Pathway/Genome Databases (a.k.a. PGDBs) integrate genomic data with detailed functional annotations (including metabolic pathways)

The Three Tiers of PGDBs Tier 1 Literature derived 2 PGDBs MetaCyc and EcoCyc Tier 2 Computationally derived 15 PGDBs HumanCyc, AraCyc and other model organisms Tier 3 No curation ~400 PGDBs

Pathway Tools PGDBs MetaCyc EcoCyc YourCyc PGDB viewerPGDB editorPathoLogic

PGDB Viewer: Genome Overview

PGDB Viewer: Pathway Details

PGDB Viewer: Cellular Overview

MetaCyc is a continuously curated PGDB of reference pathways from organisms encompassing all domains of life

MetaCyc Statistics by Year Adapted from Metabolic pathways Reactions Enzymes Genes Chemical compounds Organisms Citations

The Metabolic Hierarchy of PGDBs Metabolic pathways in PGDBs are organized in hierarchies where each pathway can have superpathways and subpathways and variants e.g., superpathway of arginine and ornithine degradation Superclasses: Degradation/Utilization/Assimilation  Amino Acids Degradation  Arginine Degradation Superpathways Subpathways: 4-aminobutyrate degradation I, arginine degradation III (arginine decarboxylase/agmatinase pathway), superpathway of arginine, putrescine, and 4-aminobutyrate degradation, superpathway of ornithine degradation, putrescine degradation II, putrescine degradation I Variants: arginine degradation II (AST pathway)

Novel metabolic pathways can be created at will in Pathway Tools (PGDBs are therefore dynamic)

PathoLogic predicts metabolic pathways in a PGDB based on reference PGDBs

PathoLogic reads GenBank and nucleotide FASTA files (but it also has a native file format)

PathoLogic infers metabolic pathways from functional annotations Product names EC numbers GO terms

PathoLogic applies an iterative algorithm that keeps track of candidate and undecided pathways

How PathoLogic Works Determine candidate and undecided pathways Prune pathways using “keep tests” and “delete tests” Keep remaining candidate and undecided pathways

How PathoLogic Works Iterative algorithm that tracks candidate and undecided pathways “Keep tests” – “Mostly present” pathways – Pathways with unique reactions present – Pathways with “key reactions” present “Delete tests” – “Mostly absent” pathways – Biosynthentic pathways lacking final steps – Degradative pathways missing initial steps – Pathways missing “key reactions”

Demonstration on two rRNA-containing fosmids of an uncultivated organism

Resources Pathway Tools MetaCyc