

Pathway databases (KEGG vs EcoCyc)
  Goto S, Bono H, Ogata H, Fujibuchi W, Nishioka T, Sato K, Kanehisa M. (1997) Organizing and computing metabolic pathway data in terms of binary relations. Pac Symp Biocomput.
  P.R. Romero and P. Karp. Nutrition-Related Analysis of Pathway/Genome Databases. Pacific Symposium on Biocomputing 6 (2001).

Seems pretty obvious
  As compared to a sequence database
    –(DNA or amino acid sequences)
    –Prior organization based on sequence homologies
  Pathways as an alternative approach to the whole-cell model
    –But similar: based on systemic understanding
    –Building from the bottom up instead of top-down
    –Also similarly in its initial phases
  Technical details of implementation will be irritatingly vague
  Links constructed from functional interactions
    –Fairly logical next step for biologists
    –Moving from diagrams to graph/network structures
  So… metabolic networks and regulatory networks

Apoptosis

Citric acid cycle

Pain: the Boehringer-Mannheim wallcharts

More pain

Metabolic networks
  Each enzyme/reaction can be a path (edge) between nodes
    –Each node is an enzyme substrate (product or reactant)
  Converting individual reactions to paths and nodes
    –Produces directed graphs
  Classification of biochemical reactions
    –EC numbering system (Enzyme Commission)
    –Hierarchical numerical system i.e
    –Based on the organic chemistry involved, not on the proteins
  How to translate from function to specific proteins?
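The reaction-to-graph conversion described on this slide can be sketched roughly as follows. This is a minimal illustration, not KEGG's actual data model: the compound names (A, B, C, X) and reaction labels (R1, R2) are made up, and each reaction simply contributes one directed edge per reactant-product pair.

```python
from collections import defaultdict

# Hypothetical toy reactions: (label, reactants, products).
# Labels and compound names are placeholders, not real EC numbers.
REACTIONS = [
    ("R1", ["A"], ["B"]),
    ("R2", ["B", "X"], ["C"]),
]

def build_graph(reactions):
    """Return {compound: [(product, reaction_label), ...]} as a directed graph."""
    graph = defaultdict(list)
    for label, reactants, products in reactions:
        # Every reactant gets an edge to every product of the same reaction.
        for reactant in reactants:
            for product in products:
                graph[reactant].append((product, label))
    return dict(graph)

graph = build_graph(REACTIONS)
```

Compounds are the nodes and the reaction label rides along on each edge, so the same structure could later be annotated with an EC number or an enzyme.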

Function to Enzyme mapping in KEGG
  Original network based on biochemical studies
    –Boehringer Mannheim and Japanese Biochemical Society
  How to assign function to enzyme? Do it by hand
    –Then use FASTA sequence comparisons on each new genome
    –Why does this irritate me? (feel free to shoot me down here)
      Which assignments are supported by experimental evidence?
  But anyway…
    –Binary relations linking various pieces of data
      EC number to a particular enzyme
      Organism to gene
      Reactant to enzyme
      Enzyme to product
      Can produce derivative structures like reactant to product
  Lots of opportunities for relational database fun
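As a rough illustration of how a derived relation such as reactant-to-product could fall out of two binary relations, here is a small sketch. The pairs and the `join` helper are hypothetical and only show the idea of joining on a shared middle element; they are not KEGG's implementation.

```python
# Two binary relations stored as sets of pairs (toy data, names are made up).
reactant_enzyme = {("A", "E1"), ("B", "E2")}
enzyme_product = {("E1", "B"), ("E2", "C")}

def join(left, right):
    """Join two binary relations on the shared element: (a, b) + (b, c) -> (a, c)."""
    return {(a, c) for (a, b) in left for (b2, c) in right if b == b2}

# Derived relation: reactant -> product, obtained by joining through the enzyme.
reactant_product = join(reactant_enzyme, enzyme_product)
```

The same helper would work for any pair of relations that share a column, for example organism-to-gene joined with gene-to-enzyme.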

Examples of relational links

The join function in KEGG
  Query relaxation
    –Relaxing constraints on the lower two numbers in the EC number system, i.e. 5.1.x.x
    –Can search up hierarchy of sequences in EC family
  Path computation
    –Construction based on substrate-product relations
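Both ideas on this slide can be sketched in a few lines. This is an assumption-laden illustration: `relax`, `matches`, and `find_path` are hypothetical helpers, the EC strings are arbitrary four-field examples, and breadth-first search is just one plausible way to compute a path from substrate-product pairs.

```python
from collections import deque

def relax(ec, levels):
    """Replace the last `levels` fields of a four-field EC number with 'x'."""
    parts = ec.split(".")
    keep = len(parts) - levels
    return ".".join(parts[:keep] + ["x"] * levels)

def matches(ec, pattern):
    """True if every field of `ec` equals the pattern field or the pattern has 'x'."""
    return all(p == "x" or p == e
               for e, p in zip(ec.split("."), pattern.split(".")))

def find_path(pairs, start, goal):
    """Breadth-first search over (substrate, product) pairs; returns a compound list or None."""
    successors = {}
    for substrate, product in pairs:
        successors.setdefault(substrate, []).append(product)
    queue = deque([[start]])
    seen = {start}
    while queue:
        path = queue.popleft()
        if path[-1] == goal:
            return path
        for nxt in successors.get(path[-1], []):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(path + [nxt])
    return None

pattern = relax("5.1.3.2", 2)
path = find_path([("A", "B"), ("B", "C"), ("A", "D")], "A", "C")
```

Relaxing two levels turns an exact EC query into a family-level one, which is what allows a search to move up the hierarchy when no exact match exists.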

KEGG example

EcoCyc (E. coli encyclopedia)
  Takes approach based on single enzyme reactions
    –All assignments based on hand-annotations
    –Integrated with complete genome data
  Allows for building of metabolic network
    –You can start with known required compounds (growth media and inside cell)
    –Feed to get all required organic compounds (amino acids, nucleic acids, etc.)
    –Focus on small molecules, i.e. no polymers
      Proteins (amino acid polymers)
      DNA and RNA (nucleic acid polymers)

What they did
  List of metabolites produced by network:
    –Starting compound A, A -> B
    –B -> C
    –Therefore A -> C, repeat as necessary
  Finding precursors
    –Product C not produced from above computation
      Find all reactions that produce C, i.e. A + B -> C
    –Backtrack A and B to find their precursors
    –Repeat as necessary until no reaction can be found
    –This identifies earliest precursors with unknown origin
      »outputs every possible combination of precursors
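The two steps on this slide (forward propagation from the starting compounds, then backtracking from an unproduced compound) might look something like the sketch below. It is a simplification under stated assumptions: the reactions and compound names are invented, a reaction fires only when all its reactants are available, and `root_precursors` returns the union of origin-less precursors rather than every possible combination, which is what the slide says the original output contained.

```python
# Toy reactions as (reactants, products); all names are hypothetical.
REACTIONS = [
    (frozenset({"A"}), frozenset({"B"})),
    (frozenset({"B"}), frozenset({"C"})),
    (frozenset({"C", "Q"}), frozenset({"D"})),
]

def producible(seed, reactions):
    """Forward step: repeatedly fire reactions whose reactants are all available."""
    have = set(seed)
    changed = True
    while changed:
        changed = False
        for reactants, products in reactions:
            if reactants <= have and not products <= have:
                have |= products
                changed = True
    return have

def root_precursors(target, have, reactions, visiting=frozenset()):
    """Backward step: compounds needed for `target` that no reaction produces."""
    if target in have or target in visiting:
        return set()  # already available, or part of a loop being explored
    producers = [r for r, p in reactions if target in p]
    if not producers:
        return {target}  # earliest precursor with unknown origin
    roots = set()
    for reactants in producers:
        for compound in reactants:
            roots |= root_precursors(compound, have, reactions,
                                     visiting | {target})
    return roots

have = producible({"A"}, REACTIONS)
missing = root_precursors("D", have, REACTIONS)
```

Here D is not reachable from A alone, and backtracking reports Q as the compound whose origin the network cannot explain.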

Results (sort of)
  Known inputs and outputs
    –Necessary for cell survival and growth
    –“Bootstrapping” elements: already present in the cell
      Are required for their own synthesis

Results (2)
  Essential Elements
    –DNA, RNA, amino acids
    –Membrane components (phospholipids)
    –Extracellular components
      Peptidoglycan: cross-linked sugar-amino acid chains
      Cell wall (surrounds bacterial cell)

Results (3)
    –Identifies unknown synthesis pathways and oddities in simulation
    –Cob(I)alamin
      Already known that it can’t be synthesized
      Only used in anaerobic growth
      But it was simulated under aerobic conditions
      Aerobic version of reaction requires 5- …
        –Synthesis of which is unknown
    –Proteins are treated as bootstrap compounds
    –Some looped compounds
    –Others have unknown synthesis pathways