Joined up ontologies: incorporating the Gene Ontology into the UMLS.

Slides:



Advertisements
Similar presentations
The Gene Ontology Project: Content for the Semantic Web.
Advertisements

Annotation of Gene Function …and how thats useful to you.
Www. GeneOntology.org Gene Ontology Collaboration.
1 Knowledge Management for Disease Coding (KMDC): Background & Introduction Timothy Hays, Ph.D. Project Manager, Knowledge Management for Disease Coding.
The Role of the UMLS in Vocabulary Control CENDI Conference “Controlled Vocabulary and the Internet” Stuart J. Nelson, MD.
CACAO - Remote training Gene Function and Gene Ontology Fall 2011
1 Using Gene Ontology. 2 Assigning (or Hypothesizing About) Biological Meaning to Clusters What do you want to be able to to? –Identify over-represented.
Bioinformatics master course DNA/Protein structure-function analysis and prediction Lecture 13: Protein Function Centre for Integrative Bioinformatics.
Proteins and Protein Function Charles Yan Spring 2006.
Sequence-Structure-Function Sequence Structure Function Threading Ab initio BLAST Folding: impossible but for the smallest structures Function prediction.
Literature Mining Tools for Analysis of Genomic Data Ramin Homayouni, Ph.D. Associate Professor of Biology Director of Bioinformatics UTHSC BINF April.
CACAO - Penn State Gene Function and Gene Ontology January 2011
Genome database & information system for Daphnia Don Gilbert, October 2002 Talk doc at
Unified Medical Language System® (UMLS®) NLM Presentation Theater MLA 2007 National Library of Medicine National Institutes of Health U.S. Dept. of Health.
Gene Ontology Project
9/30/2004TCSS588A Isabelle Bichindaritz1 Introduction to Bioinformatics.
1 Betsy L. Humphreys, MLS Betsy L. Humphreys, MLS National Library of Medicine National Library of Medicine National Institutes of Health National Institutes.
Using The Gene Ontology: Gene Product Annotation.
Gene Ontology (GO) Project
Linking Diseases and Genes through Informatics Knowledge Bases and Ontologies Joyce A. Mitchell, Ph.D. National Library of Medicine University of Missouri.
GO and OBO: an introduction. Jane Lomax EMBL-EBI What is the Gene Ontology? What is OBO? OBO-Edit demo & practical What is the Gene Ontology? What is.
Olivier Bodenreider Lister Hill National Center for Biomedical Communications Bethesda, Maryland - USA Experiences in visualizing and navigating biomedical.
University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY BeeSpace: An Interactive Environment for Functional Analysis of Social Behavior.
Betsy L. Humphreys Betsy L. Humphreys Associate Director for Library Operations NLM, NIH, HHS NLM, NIH, HHS National Library.
The aims of the Gene Ontology project are threefold: - to compile vocabularies to describe components, functions and processes - to produce tools to query.
1 st June 2006 St. George’s University of LondonSlide 1 Using UMLS to map from a Library to a Clinical Classification: Improving the Functionality of a.
Only build an ontology if: You have a body of data to annotate.
March 24, Integrating genomic knowledge sources through an anatomy ontology Gennari JH, Silberfein A, and Wiley JC Pac Symp Biocomputing 2005:
Survey of Medical Informatics CS 493 – Fall 2004 September 27, 2004.
GENE ONTOLOGY FOR THE NEWBIES Suparna Mundodi, PhD The Arabidopsis Information Resources, Stanford, CA.
Gene Ontology Consortium
Shelly Warwick, MLS, Ph.D – Permission is granted to reproduce and edit this work for non-commercial educational use as long as attribution is provided.
The Gene Ontology: a real-life ontology, progress and future. Jane Lomax EMBL-EBI.
The Gene Ontology project Jane Lomax. Ontology (for our purposes) “an explicit specification of some topic” – Stanford Knowledge Systems Lab Includes:
Gene Ontology Project
Gene Ontology TM (GO) Consortium Jennifer I Clark EMBL Outstation - European Bioinformatics Institute (EBI), Hinxton, Cambridge CB10 1SD, UK Objectives:
UMLS Unified Medical Language System. What is UMLS? A Unified knowledge representation system Project of NLM Large scale Distributed First launched in.
Cell Signaling Ontology Takako Takai-Igarashi and Toshihisa Takagi Human Genome Center, Institute of Medical Science, University of Tokyo.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
DAVID R. SMITH DR. MARY DOLAN DR. JUDITH BLAKE Integrating the Cell Cycle Ontology with the Mouse Genome Database.
Integrating the Cell Cycle Ontology with the Mouse Genome Database David R. Smith Mary Dolan Dr. Judith Blake.
Modeling of complex systems: what is relevant? Arno Knobbe, Marvin Meeng, Joost Kok Leiden Institute of Advanced Computer Science (LIACS)
Workshop Aims NMSU GO Workshop 20 May Aims of this Workshop  WIIFM? modeling examples background information about GO modeling  Strategies for.
24th Feb 2006 Jane Lomax GO Further. 24th Feb 2006 Jane Lomax GO annotations Where do the links between genes and GO terms come from?
DAVID R. SMITH DR. MARY DOLAN DR. JUDITH BLAKE Integrating the Cell Cycle Ontology with the Mouse Genome Database.
The Gene Ontology and its insertion into UMLS Jane Lomax.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Sharing Ontologies in the Biomedical Domain Alexa T. McCray National Library of Medicine National Institutes of Health Department of Health & Human Services.
Copyright OpenHelix. No use or reproduction without express written consent1.
Other biological databases and ontologies. Biological systems Taxonomic data Literature Protein folding and 3D structure Small molecules Pathways and.
To Boldly GO… Amelia Ireland GO Curator EBI, Hinxton, UK.
Gene Ontology Project
Gene Ontology Consortium
Digital Libraries, Archives, and Large Data Sets Alexa T. McCray National Library of Medicine Bethesda, Maryland USA WHOI, June 3, 2004.
Scope of the Gene Ontology Vocabularies. Compile structured vocabularies describing aspects of molecular biology Describe gene products using vocabulary.
Gene Ontology Project
Computer Science Ph. D. Seminar Gene Ontology (GO) Based Search for Protein Structure Similarity Clustering Metrics Ph.D. Candidate Steve Johnson Committee.
The UMLS Semantic Network Alexa T. McCray Center for Clinical Computing Beth Israel Deaconess Medical Center Harvard Medical School
MAPPING OF SEQUENCES TO GENE ONTOLOGY. GO consortium.
Japan Consortium for Glycobiology and Glycotechnology DataBase 日本糖鎖科学統合データベース PACDB - Pathogen Adherence to Carbohydrate Database The Pathogen Adherence.
Tools in Bioinformatics Ontologies and pathways. Why are ontologies needed? A free text is the best way to describe what a protein does to a human reader.
Gene Ontology TM (GO) Consortium
 What is MSA (Multiple Sequence Alignment)? What is it good for? How do I use it?  Software and algorithms The programs How they work? Which to use?
Gene Annotation & Gene Ontology May 24, Gene lists from RNAseq analysis What do you do with a list of 100s of genes that contain only the following.
` Comparison of Gene Ontology Term Annotations Between E.coli K12 Databases REDDYSAILAJA MARPURI WESTERN KENTUCKY UNIVERSITY.
Sequence-Structure-Function Sequence Structure Function Threading Ab initio BLAST Folding: impossible but for the smallest structures Function prediction.
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
Annotating with GO: an overview
Department of Genetics • Stanford University School of Medicine
What is an Ontology An ontology is a set of terms, relationships and definitions that capture the knowledge of a certain domain. (common ontology ≠ common.
Presentation transcript:

Joined up ontologies: incorporating the Gene Ontology into the UMLS

The Gene Ontology (GO) Controlled vocabulary for describing molecular biology  hierarchical  multiple parentage allowed  defined terms

Structure of GO (Created using the tool GenNav, developed at NLM)

The ontologies Where does it act? What processes is it involved in? What does it do? gene product

The ontologies Where does it act? What processes is it involved in? What does it do?molecular function gene product

The ontologies Where does it act? What processes is it involved in? What does it do?molecular function biological process gene product

The ontologies Where does it act? What processes is it involved in? What does it do?molecular function cellular component biological process gene product

Gene annotation: assigning GO terms to gene products Genes or gene products GO terms “linked” to gene products Gene products annotated to all 3 ontologies May be linked to more than one term in each ontology DNA binding regulation of transcription nucleus ATP dependent helicase

Queries across databases mouse rat yeast fly Find me all gene products with ‘DNA binding activity’… DNA binding osmosensory signaling pathway regulation of transcription nucleus signal transducer nuclease DNA binding toxin catabolism helicase nucleus cytoplasm membrane mitotic cell cycle

Associating with different levels of ontology (Created using the tool GenNav, developed at NLM)

GO and other systems Useful to equate GO with other systems  Mappings files e.g. ec2go  References in GO as dbxrefs e.g. BioCyc  References in other systems e.g. BRENDA (in process) UMLS Metathesaurus

GO into UMLS Unified Medical Language System  Long-term project at NLM  Three parts: specialist lexicon; sematic network; Metathesaurus  Metathesaurus interrelates biomedical vocabularies  Includes ~60 vocabularies including SNOMED and MeSH.

Inserting GO into UMLS inversion  converting GO to correct format for UMLS insertion  inserting GO using matching algorithms editing  all concepts containing GO term reviewed by hand

Statistics % of GO in sources with other concepts, by source CSP2002 (Computer Retrieval of Information on Scientific Projects Thesaurus) 7.34 % MSH2003_2002_08_14 (Medical Subject Headings) % SNMI98 (Systemized Nomenclature of Human and Veterinary Medicine) % GO CRISP MeSH SNOMED

Potential applications Mining abstracts using GO terms: DNA helicase ; GO: UMLS MeSH term GO MeSH

Status of GO into UMLS Molecular function ontology already inserted Hope to insert other two ontologies by April Release GO with UMLS by end of year

FlyBase & Berkeley Drosophila Genome Project Saccharomyces Genome Database PomBase (Sanger Institute) Rat Genome Database Genome Knowledge Base (CSHL) The Institute for Genomic Research Compugen, Inc The Arabidopsis Information Resource WormBase DictyBase Mouse Genome Informatics Swiss-Prot/TrEMBL/InterPro Pathogen Sequencing Unit (Sanger Institute) National Library of Medicine Alexa McCray Stuart Nelson Bill Hole Oak Ridge Institute for Science and Education National Library of Medicine U. S. Department of Energy The Gene Ontology Consortium is supported by an R01 grant from the National Human Genome Research Institute (NHGRI) [grant HG02273]. SGD is supported by a P41, National Resources, grant from the NHGRI [grant HG01315]; MGD by a P41 from the NHGRI [grant HG00330]; GXD by the National Institute of Child Health and Human Development [grant HD33745]; FlyBase by a P41 from the NHGRI [grant HG00739] and by the Medical Research Council, London. TAIR is supported by the National Science Foundation [grant DBI ]. WormBase is supported by a P41, National Resources, grant from the NHGRI [grant HG02223]; RGD is supported by an R01 grant from the NHLBI [grant HL64541]; DictyBase is supported by an R01 grant from the NIGMS [grant GM064426].