Methods and resources for pathway analysis PABIO590B Week 2.

Slides:



Advertisements
Similar presentations
Unravelling the biochemical reaction kinetics from time-series data Santiago Schnell Indiana University School of Informatics and Biocomplexity Institute.
Advertisements

Predicting essential genes via impact degree on metabolic networks ISSSB’11 Takeyuki Tamura Bioinformatics Center, Institute for Chemical Research Kyoto.
Biological pathway and systems analysis An introduction.
CellDesigner Tutorial Laurence Calzone, Andrei Zinovyev UMR U900 INSERM/Institut Curie/Ecole des Mines de Paris Wednesday, April 30th.
Ontology annotation: mapping genomic regions biological function Paul D Thomas, Huaiyu Mi and Suzanna Lewis.
Gene Ontology John Pinney
August 19, 2002Slide 1 Bioinformatics at Virginia Tech David Bevan (BCHM) Lenwood S. Heath (CS) Ruth Grene (PPWS) Layne Watson (CS) Chris North (CS) Naren.
Gene expression analysis summary Where are we now?
Introduction to Bioinformatics - Tutorial no. 13 Probe Design Gene Networks.
Computational Molecular Biology (Spring’03) Chitta Baral Professor of Computer Science & Engg.
Introduction to the Pathway Tools Software David Walsh and Simon Eng bigDATA Workshop—May 29, 2010.
SCB : 1 Department of Computer Science Simulation and Complexity SCB : Simulating Complex Biosystems Susan Stepney Department of Computer Science Leo Caves.
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
Pathway databases Goto S, Bono H, Ogata H, Fujibuchi W, Nishioka T, Sato K, Kanehisa M. (1997) Organizing and computing metabolic pathway data in terms.
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
Data Mining Presentation Learning Patterns in the Dynamics of Biological Networks Chang hun You, Lawrence B. Holder, Diane J. Cook.
陳虹瑋 國立陽明大學 生物資訊學程 Genome Engineering Lab. Genome Engineering Lab The Newest.
Internet tools for genomic analysis: part 2
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
Computational Biology Networks and Pathways Lecture Slides Week 11.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
From Databases to Dynamics Dr. Raquell M Holmes Center for Computational Science Boston University.
Enzymatic Function Module (KEGG, MetaCyc, and EC Numbers)
Modeling Functional Genomics Datasets CVM Lessons 4&5 10 July 2007Bindu Nanduri.
Biological pathway and systems analysis An introduction.
Ch10. Intermolecular Interactions and Biological Pathways
1 SRI International Bioinformatics BioCyc Tutorial Peter D. Karp, Ph.D. Bioinformatics Research Group SRI International BioCyc.org EcoCyc.org,
Overviews, Omics Viewers, and Object Groups. SRI International Bioinformatics Introduction Each overview is a genome-scale diagram of cellular machinery.
Review of Ondex Bernice Rogowitz G2P Visualization and Visual Analytics Team March 18, 2010.
Knowledgebase Creation & Systems Biology: A new prospect in discovery informatics S.Shriram, Siri Technologies (Cytogenomics), Bangalore S.Shriram, Siri.
Shankar Subramaniam University of California at San Diego Data to Biology.
Overviews and Omics Viewers. SRI International Bioinformatics Introduction Each overview is a genome-scale diagram of cellular machinery l Cellular Overview.
Biological Pathways & Networks
© Wiley Publishing All Rights Reserved. Protein and Specialized Sequence Databases.
Networks and Interactions Boo Virk v1.0.
The BioCyc Collection of Pathway/Genome Databases Alexander Shearer Bioinformatics Research Group SRI International BioCyc.org EcoCyc.org.
SRI International Bioinformatics 1 Recent Developments in Pathway Tools GMOD Workshop November ‘07 Suzanne Paley Bioinformatics Research Group SRI International.
Tutorial on Current Biochemical Pathway Visualization Tools By Rana Khartabil.
1 Bio-Trac 40 (Protein Bioinformatics) October 8, 2009 Zhang-Zhi Hu, M.D. Associate Professor Department of Oncology Department of Biochemistry and Molecular.
Reconstruction of Transcriptional Regulatory Networks
Copyright OpenHelix. No use or reproduction without express written consent1.
Network & Systems Modeling 29 June 2009 NCSU GO Workshop.
Top Four Essential TAIR Resources Debbie Alexander Metabolic Pathway Databases for Arabidopsis and Other Plants Peifen Zhang.
Intel Confidential – Internal Only Co-clustering of biological networks and gene expression data Hanisch et al. This paper appears in: bioinformatics 2002.
Sharing Models. How Can I Exchange Models? SBML (Systems Biology Markup Language): de facto standard for representing cellular networks. A large number.
Other biological databases and ontologies. Biological systems Taxonomic data Literature Protein folding and 3D structure Small molecules Pathways and.
Unravelling the biochemical reaction kinetics from time-series data Santiago Schnell Indiana University School of Informatics and Biocomplexity Institute.
Bioinformatics MEDC601 Lecture by Brad Windle Ph# Office: Massey Cancer Center, Goodwin Labs Room 319 Web site for lecture:
Biological Networks & Systems Anne R. Haake Rhys Price Jones.
Structural Models Lecture 11. Structural Models: Introduction Structural models display relationships among entities and have a variety of uses, such.
Biological systems and pathway analysis
EB3233 Bioinformatics Introduction to Bioinformatics.
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
Introduction to biological molecular networks
A database of biological pathways and processes (borrowed from a presentation created by Steve Jupe)
Integrated Genomic and Proteomic Analyses of a Systematically Perturbed Metabolic Network Science, Vol 292, Issue 5518, , 4 May 2001.
Development of a Signaling Pathway Map for the FXM Gil Sambrano, Lily Jiang, Madhu Natarajan, Alex Gilman, Adam Arkin University of California San Francisco,
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Copyright OpenHelix. No use or reproduction without express written consent1 1.
Tools in Bioinformatics Ontologies and pathways. Why are ontologies needed? A free text is the best way to describe what a protein does to a human reader.
Modeling and Simulation of Signal Transduction Pathways Mark Moeller & Björn Oleson Supervisors: Klaus Prank Ralf Hofestädt.
 What is MSA (Multiple Sequence Alignment)? What is it good for? How do I use it?  Software and algorithms The programs How they work? Which to use?
Lecture 4.31 Protein Pathways and Pathway Databases Shan Sundararaj University of Alberta Edmonton, AB
University of California at San Diego
The Pathway Tools FBA Module
A Community Effort to Model the Human Microbiome
University of California at San Diego
Overview of Microbial Pathway and Genome Databases
Annotation Presentation
Computational Biology
Presentation transcript:

Methods and resources for pathway analysis PABIO590B Week 2

Pathways overview Introduction to pathways and networks Examples of pathways and networks Review of pathway databases and tools Representing pathways and networks Methods of inferring pathways and networks Pathway and cellular simulations

Pathways vs. networks Gene networks Clusters of genes (or gene products) with evidence of co- expression Connections usually represent degrees of co-expression In-depth knowledge of process is not necessary Networks are non-predictive Biochemical pathways Series of chained, chemical reactions Connections represent describable (and quantifiable) relations between molecules, proteins, lipids, etc. Enzymatic process is elucidated Changes via perturbation are predictable downstream

Pathways vs. networks Gene networksBiochemical pathways Curation Relatively easy: automated and manual Difficult: mostly manual Nodes Genes or gene productsAny general molecule Edges Levels of co- expression/influence or a qualitative relation Representation of possibly quantifiable mechanisms between compounds Fidelity Low – usually very little detail High – specific processes Predictive power Relatively lowRelatively high

Pathway and network granularity Level of detail Effort to curate General interaction networks Mathematical simulation models Probabilistic networks Qualitative networks Curated reaction pathways

Introduction to pathways and networks Examples of pathways and networks Review of pathway databases and tools Representing pathways and networks Methods of inferring pathways and networks Pathway and cellular simulations

Yeast gene interaction network Tong, et al., Science 303, 808 (2004)

Characteristics of the yeast gene network Some genes (e.g. regulatory factors) act as ‘hubs’ in a network and have many interactions –Degrees of connectivity follows the power law –Hubs may make interesting anti-cancer targets Clusters of genes with known function suggest function for hypothetical genes in same cluster Network characteristics can be used to predict protein- protein interactions Path between two genes tends to be short (average ~3.3 hops) Tong, et al., Science 303, 808 (2004)

E. coli metabolic pathway Karp, et al., Science 293, 2040 (2001) glycolysis

Pathways: E. coli metabolic map Encompasses >791 chemical compounds in >744 noted biochemical reactions Pathway was compiled via literature information extraction and extensive manual curation –System allows for users to indicate evidence of pathway annotations –Curation is done collaboratively with numerous experts outside of EcoCyc Karp, et al., Science 293, 2040 (2001)

Pathways in bioinformatics Most resources for pathways focus on metabolic pathways (signaling and regulatory gaining prominence) Pathways as a very specific subtype of networks –Like networks, can be made in computable (symbolic) form –Specificities in chemical reactions are more predictive –Pathways can chain together, forming larger pathways Karp, et al., Science 293, 2040 (2001)

Introduction to pathways and networks Examples of pathways and networks Review of pathway databases and tools Representing pathways and networks Methods of inferring pathways and networks Pathway and cellular simulations

Pathway repositories BioCyc/MetaCyc Kyoto Encyclopedia of Genes and Genomes (KEGG) PATHWAY DB BioCarta BioModels database

BioCyc database Pathway/genome database (PGDB) for organisms with completely sequenced genomes 409 full genomes and pathways deposited Species-specific pathways are inferred form MetaCyc Query/navigation/pathway creation support through the Pathway Tools software suite

MetaCyc database Non-redundant reference database for metabolic pathways, reactions, enzymes and compounds Curation through experimental verification and manual literature review >1200 pathways from species (mostly plants and microorganisms)

Glycolysis pathway in MetaCyc

KEGG PATHWAY database Consolidated set of databases that cover genomics (GENE), chemical compounds (LIGAND) and reaction networks (PATHWAY) Broad focus on metabolics, signal transduction, disease, etc. Species-specific views available (but networks are static across all organisms)

Glycolysis pathway in KEGG

Global Pathway Map

BioCarta database Corporate-owned, publicly-curated pathway database Series of interactive, “cartoon” pathway maps Predominantly human and mouse pathways Contains 120,000 gene entries and 355 pathways

Glycolysis pathway in BioCarta

BioModels database Database for published, quantitative models of biochemical processes All models/pathways curated manually, compliant with MIRIAM Models can be output in SBML format for quantitative modeling 86 curated models, 40 models pending curation

Glycolysis pathways in BioModels

Comparison of pathway databases MetaCyc/ BioCyc KEGG PATHWAYS BioCartaBioModels Curation Manual and automated AutomatedManual Size ~621+ pathways~289 reference pathways ~355 pathways~126 models Nomenclature EC, GOEC, KONoneGO Organism coverage ~500 speciesVariousPrimarily human and mouse ~475 species Visuals Species-specific custom Reference and species-specific Animated, cartoonish Non-standardized Primary usage PGDB, computational biology PGDB, pathway comparisons Human pathways, disease Simulations, modeling

Introduction to pathways and networks Examples of pathways and networks Review of pathway databases and tools Representing pathways and networks Methods of inferring pathways and networks Pathway and cellular simulations

Pathway formats Extensible Markup Language (XML) Systems Biology Markup Language (SBML) BioPax

Extensible Markup Language (XML) Standard of representing information in a machine-readable way Similar to HTML; tags can enclose or contain data Some data here More stuff here

Systems Biology Markup Language XML-based language for representing biochemical reactions Oriented towards software data-sharing Tiered, upward-compatible architecture (two, upward-compatible levels, third planned) Primary intended use is for quantitative model simulations

SBML

BioPax Like SBML, XML-based pathway representation Tiered structure –Level 1: Metabolic pathway information –Level 2: Level 1 + Molecular interaction, post- translational modification Intended to be a lingua franca for pathway databases

BioPax XML representation

Introduction to pathways and networks Examples of pathways and networks Review of pathway databases and tools Representing pathways and networks Methods of inferring pathways and networks Pathway and cellular simulations

Inferring pathways and networks Experimental methods –Microarray co-expression –Quantitative trait locus mapping (QTL) –Isotope-coded affinity tagging (ICAT) –Yeast two-hybrid assay –Green florescent protein tagging (GFP tagging) Computational methods –Database-driven protein-protein interactions –Expression clustering techniques –Literature-mining for specified interactions

Introduction to pathways and networks Examples of pathways and networks Review of pathway databases and tools Representing pathways and networks Methods of inferring pathways and networks Pathway and cellular simulations

Cellular simulations Study the effect perturbation has on a pathway (and thus the organism) Generally require extensive detail on the pathway or reactions of interest (flux equations, metabolite concentration, etc.) Cellular pathway simulations must manage both temporal and spatial complexity

Spatial dimension Adapted from Kelly, H., via Neal, Yngve 2006 VHS, UW MEBI 591http:// Temporal intervals 0.1 nm 10nm 1um 1mm1cm1m picosec. nanosec. microsec. millisec. sec. min. yr. quantum mechanics molecular dynamics cellular processes systems physiology organs and organisms

Simulation methods and techniques Biological processPhenomenaComputation scheme MetabolismEnzymatic reactionDifferential-algebraic equations, flux-based analysis Signal transductionBindingDifferential-algebraic equations, stochastic algorithms, diffusion- reaction Gene expressionBinding Polymerization Degradation Object-oriented modeling, differential-algebraic equations, stochastic algorithms, boolean networks DNA replicationBinding Polymerization Object-oriented modeling, differential-algebraic equations Membrane transportOsmotic pressure Membrane potential Differential-algebraic equations, electrophysiology Adapted from Tomita 2001

Research in simulation and modeling Virtual Cell (National Resource for Cell Analysis and Modeling) MCell (the Salk Institute) Gepasi (Virginia Tech) E-CELL (Institute for Advanced Biosciences, Keio University) Karyote/CellX (Indiana University)

Your task is to: Identify the functions of proteins X, Y & Z Identify the pathway(s) in which they are involved Look for differences in pathways between databases Examine the same pathway(s) in humans Exercise