Fahim IMAM, Stephen LARSON, Georgio ASCOLI, Gordon SHEPHERD,

Slides:



Advertisements
Similar presentations
Developing an application ontology for biomedical resource annotation and retrieval: challenges and lessons learned C. Torniai, M. Brush, N. Vasilevsky,
Advertisements

IPY and Semantics Siri Jodha S. Khalsa Paul Cooper Peter Pulsifer Paul Overduin Eugeny Vyazilov Heather lane.
The Neuroscience Information Framework Establishing a practical semantic framework for neuroscience Maryann Martone, Ph. D. University of California, San.
Data Landscapes neuinfo.org Anita Bandrowski, Ph. D. University of California, San Diego.
N EUROSCIENCE I NFORMATION F RAMEWORK S TANDARD O NTOLOGIES (NIFSTD) Fahim IMAM, Stephen LARSON, Georgio ASCOLI, Gordon SHEPHERD, Anita BANDROWSKI, Jeffery.
Ontology Notes are from:
Fungal Semantic Web Stephen Scott, Scott Henninger, Leen-Kiat Soh (CSE) Etsuko Moriyama, Ken Nickerson, Audrey Atkin (Biological Sciences) Steve Harris.
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
1 CIS607, Fall 2006 Semantic Information Integration Instructor: Dejing Dou Week 10 (Nov. 29)
UML CASE Tool. ABSTRACT Domain analysis enables identifying families of applications and capturing their terminology in order to assist and guide system.
How to Organize the World of Ontologies Barry Smith 1.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Amarnath Gupta Univ. of California San Diego. An Abstract Question There is no concrete answer …but …
Enriching the Ontology for Biomedical Investigations (OBI) to Improve Its Suitability for Web Service Annotations Chaitanya Guttula, Alok Dhamanaskar,
Ontology Development Kenneth Baclawski Northeastern University Harvard Medical School.
The use of ontologies within the Neuroscience Information Framework, a neuroscience-centered portal for searching and accessing diverse resources Maryann.
Mental Functioning and Semantic Search in the Neuroscience Information Framework Maryann Martone Fahim Imam Funded in part by the NIH Neuroscience Blueprint.
The Neuroscience Information Framework Establishing a practical semantic framework for neuroscience Maryann Martone, Ph. D. University of California, San.
Computational Biology and Informatics Laboratory Development of an Application Ontology for Beta Cell Genomics Based On the Ontology for Biomedical Investigations.
©Ian Sommerville 2000 Software Engineering, 6th edition. Chapter 7 Slide 1 System models l Abstract descriptions of systems whose requirements are being.
N IF S TD AND N EURO L EX : D EVELOPMENT OF A C OMPREHENSIVE N EUROSCIENCE O NTOLOGY Fahim IMAM, Stephen LARSON, Georgio ASCOLI, Gordon SHEPHERD, Anita.
Imports, MIREOT Contributors: Carlo Torniai, Melanie Courtot, Chris Mungall, Allen Xiang.
Open Biomedical Ontologies. Open Biomedical Ontologies (OBO) An umbrella project for grouping different ontologies in biological/medical field –a repository.
Principles and Practice of Ontology Development: Making Definitions Computable Chris Mungall LBL.
N IF : A C OMPREHENSIVE O NTOLOGY FOR N EUROSCIENCE & P RACTICAL G UIDE FOR D ATA -O NTOLOGY I NTEGRATION Maryann E. MARTONE, Fahim IMAM, Anita Bandrowski,
System models l Abstract descriptions of systems whose requirements are being analysed.
Alan Ruttenberg PONS R&D Task force Alan Ruttenberg Science Commons.
Neuroscience Information Framework Ontologies: Nerve cells in Neurolex and NIFSTD Maryann Martone University of California, San Diego.
Ontologies for Neuroscience and Neurology The Neuroscience Information Framework Fahim Imam, Stephen Larson, Georgio Ascoli, Gordon Shepherd, Anita Bandrowski,
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
The Neuroscience Information Framework Making Resources Discoverable for the Computational Neuroscience Community Jeffrey S. Grethe, Ph. D. Co-Principal.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Anatomy Ontology Community Melissa Haendel. The OBO Foundry More than just a website, it’s a community of ontology developers.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
University of California, San Diego Ontology-based annotation of multiscale imaging data: Utilizing and building the Neuroscience Information Framework.
N IF S TD : A C OMPREHENSIVE O NTOLOGY FOR N EUROSCIENCE Fahim IMAM 1, Stephen LARSON 1, Sridevi POLAVARAM 2, Georgio ASCOLI 2, Gordon SHEPHERD 3, Jeffery.
The Neuroscience information framework A User’s Guide.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
No Euphemisms Required: BIRN on the Leading Edge of Community Ontology Development.
ISO TC37/SC4 N435 Nov 12, 2007 Presented by Miran Choi/ETRI Written by Jae Sung Lee/Chungbuk National Univ.
2007 Mouse All Hands Meeting BIRN Ontology Day Jeff Grethe & Bill Bug (BIRN OTF) - March 7th, 2007.
Contributions to mouse BIRN tools and resources Maryann Martone and Mark Ellisman University of California, San Diego 2008.
Semantic Media Wiki Open Terminology Development - Initial Steps - Frank Hartel, Ph.D. Associate Director, Enterprise Vocabulary Services National Cancer.
EDM Council / Object Management Group Semantic Standards Workstream Definitions and Detailed Objectives May 04, 2011.
Semantic Web. P2 Introduction Information management facilities not keeping pace with the capacity of our information storage. –Information Overload –haphazardly.
Engineering, 7th edition. Chapter 8 Slide 1 System models.
OWL (Ontology Web Language and Applications) Maw-Sheng Horng Department of Mathematics and Information Education National Taipei University of Education.
Semantic Graph Mining for Biomedical Network Analysis: A Case Study in Traditional Chinese Medicine Tong Yu HCLS
Randi Vita, M.D. Better living through ontologies at the Immune Epitope Database La Jolla Institute for Allergy & Immunology Division of Vaccine Discovery.
The Semantic Web By: Maulik Parikh.
University of California, San Diego
Integrating SysML with OWL (or other logic based formalisms)
Ilya Zaslavsky Jeffrey Grethe amarnath Gupta burak Ozyurt
CCNT Lab of Zhejiang University
OASIS Quantities and Units of Measure Ontology Standard (QUOMOS) An Introduction v Rev. D / April
Development of the Amphibian Anatomical Ontology
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Mental Functioning and the Gene Ontology
Abstract descriptions of systems whose requirements are being analysed
ece 720 intelligent web: ontology and beyond
WikiNeuron: Semantic Neuro-Mashup
Harmonizing SNOMED CT with BioTopLite
OBO Foundry Principles
Chapter 20 Object-Oriented Analysis and Design
ece 627 intelligent web: ontology and beyond
OBI – Standard Semantic
An ontology for e-Research
LOD reference architecture
BUILDING A DIGITAL REPOSITORY FOR LEARNING RESOURCES
Semantic Interoperability in Digital Library Systems
Presentation transcript:

NifStd and NeuroLex: Development of a Comprehensive Neuroscience Ontology Fahim IMAM, Stephen LARSON, Georgio ASCOLI, Gordon SHEPHERD, Anita BANDROWSKI, Jeffery S. GRETHE, Amarnath GUPTA, Maryann E. MARTONE University of California, San Diego, CA George Mason University, Fairfax, VA Yale University, New Haven, CT ICBO Workshop 2011 July 26, 2011 Funded in part by the NIH Neuroscience Blueprint HHSN271200800035C via NIDA NEUROSCIENCE INFORMATION FRAMEWORK

NIF: Discover and utilize web-based Neuroscience resources A portal for finding and using neuroscience resources A consistent framework for describing resources Provides simultaneous search of multiple types of information, organized by category NIFSTD Ontology, a critical component Enables concept-based search UCSD, Yale, Cal Tech, George Mason, Harvard MGH Easier Supported by NIH Blueprint The Neuroscience Information Framework (NIF), http://neuinfo.org

NIF Standard Ontologies (NifStd) Bill Bug et al. Set of modular ontologies Covering neuroscience relevant terminologies Comprehensive ~60, 000 distinct concepts + synonyms Expressed in OWL-DL language Supported by common DL Resoners Closely follows OBO community best practices Avoids duplication of efforts Standardized to the same upper level ontologies e.g., Basic Formal Ontology (BFO), OBO Relations Ontology (OBO-RO), Phonotypical Qualities Ontology (PATO) Relies on existing community ontologies e.g., CHEBI, GO, PRO, OBI etc. Modules cover orthogonal domain e.g. , Brain Regions, Cells, Molecules, Subcellular parts, Diseases, Nervous system functions, etc.

NIFSTD External Community Sources Domain External Source Import/ Adapt Module Organism taxonomy NCBI Taxonomy, GBIF, ITIS, IMSR, Jackson Labs mouse catalog Adapt NIF-Organism Molecules IUPHAR ion channels and receptors, Sequence Ontology (SO), ChEBI, and Protein Ontology (PRO); pending: NCBI Entrez Protein, NCBI RefSeq, NCBI Homologene, NIDA drug lists IUPHAR, ChEBI;Import PRO, SO NIF-Molecule NIF-Chemical Sub-cellular Sub-cellular Anatomy Ontology (SAO). Extracted cell parts and subcellular structures. Imported GO Cellular Component Import NIF-Subcellular Cell CCDB, NeuronDB, NeuroMorpho.org. Terminologies; pending: OBO Cell Ontology NIF-Cell Gross Anatomy NeuroNames extended by including terms from BIRN, SumsDB, BrainMap.org, etc; multi-scale representation of Nervous System Macroscopic anatomy NIF-GrossAnatomy Nervous system function Sensory, Behavior, Cognition terms from NIF, BIRN, BrainMap.org, MeSH, and UMLS NIF-Function dysfunction Nervous system disease from MeSH, NINDS terminology; Disease Ontology (DO) Adapt/Import NIF- Dysfunction Phenotypic qualities PATO is Imported as part of the OBO foundry core NIF-Quality Investigation: reagents Overlaps with molecules above, especially RefSeq for mRNA NIF-Investigation Investigation: instruments, protocols Based on Ontology for Biomedical Investigation (OBI) to include entities for biomaterial transformations, assays, data transformations Investigation: Resource NIF, OBI, NITRC, Biomedical Resource Ontology (BRO) NIF-Resource Biological Process Gene Ontology’s (GO) biological process in whole NIF-BioProcess Cognitive Paradigm Cognitive Paradigm Ontology (CogPO)

Importing or Adapting a new ontology or vocabulary source Import/adapt a source already in OWL, uses the OBO-RO and the BFO and is orthogonal to existing modules the import simply involves adding an owl:import statement existing orthogonal ontology is in OWL but does not use the same foundational ontologies as NIFSTD an ontology-bridging module (explained later) is constructed declaring the deep level semantic equivalencies such as foundational objects and processes. external source is satisfied by the above two rules but observed to be too large for NIF’s scope of interests a relevant subset is extracted. MIREOT principles has been adopted external source has not been represented in OWL, or does not use the same foundation as NIFSTD, the terminology is adapted to OWL/RDF in the context of the NIFSTD foundational layer ontologies

NIFSTD Design Principles Single Inheritance for Named Classes Follows simple inheritance principle for named classes An asserted named class can have only one named class as its superclass Promotes the named classes to be univocal and to avoid ambiguities Classes with multiple named superclasses Can be inferred using automated reasoners Saves a great deal of manual labor and minimizes human errors Alan Rector’s Normalization principles.

Design Principles Unique Identifiers and Annotation Properties. NIFSTD entities are identified by a unique identifier and accompanied by a variety of annotation properties Derived from Dublin Core Metadata (DC) and Simple Knowledge Organization System (SKOS) model. Synonyms, acronyms, definition, defining source etc. Reuse the same URI through MIREOTed classes from external source, Allows to avoid extra mapping annotations, e.g., class identifiers remain unaltered.

Design Principles Annotation properties associated with versioning different levels of contents creation date and modification dates file level versioning for each of the modules annotations for retiring antiquated concept definitions hasFormerParentClass and isReplacesByClass etc. tracking former ontology graph position and replacement concepts.

Design Principles Object Properties and Bridge Modules. Mostly drawn from OBO Relations Ontology (OBO-RO) Intra-module relations are kept within the same module ONLY universal restrictions are considered e.g., partonomy relations within different brain regions The cross-module relations are specified in separate bridging modules Modules that only contain logical restrictions on a set of classes assigned between multiple modules. Allows main domain modules—e.g., anatomy, cell type, etc. to remain independent of one another

Design Principles Two example bridging modules in NIFSTD Helps keeping the modularity principles intact facilitate extensions for broader communities without NIF-centric views These bridging modules can easily be excluded in order to focus on core modules

Typical Knowledge Model A typical knowledge model in NIFSTD. Both cross-modular and intra-modular classes are associated through object properties mostly drawn from the OBO Relations ontology (RO).

An Analogy Easier Difficult Analogy for modularization of ontologies… Given 5 different lines with different colors and a given a set of possible angular relationships easier to build different shapes and patterns Difficult

Typical Use of Ontology in NIF Basic feature of an ontology Organizing the concepts involved in a domain into a hierarchy and Precisely specifying how the classes are ‘related’ with each other (i.e., logical axioms) Explicit knowledge are asserted but implicit logical consequences can be inferred A powerful feature of an ontology

Ontology – Asserted Hierarchy Class name Asserted defining (necessary & sufficient) expression Cerebellum neuron Is a ‘Neuron’ whose soma lies in any part of the ‘Cerebellum’ or ‘Cerebellar cortex’ Principal neuron Is a ‘Neuron’ which has ‘Projection neuron role’, i.e., a neuron whose axon projects out of the brain region in which its soma lies GABAergic neuron Is a ‘Neuron’ that uses ‘GABA’ as a neurotransmitter Here is an example, that would hopefully illustrates the strengths and usefulness of having our ontology. NIFSTD has various neuron types with an asserted simple hierarchy within the NIF-Cell module (here is an example with five neuron types). However, we assert various logical restrictions about these neurons. Class name Asserted necessary conditions Cerebellum Purkinje cell Is a ‘Neuron’ Its soma lies within 'Purkinje cell layer of cerebellar cortex’ It has ‘Projection neuron role’ It uses ‘GABA’ as a neurotransmitter It has ‘Spiny dendrite quality’

NIF Concept-Based Search Search Google: GABAergic neuron Search NIF: GABAergic neuron NIF automatically searches for types of GABAergic neurons Types of GABAergic neurons Having the defined classes enabled us to have useful concept-based queries through the NIF search interface. For example, while searching for ‘GABAergic neuron’, the system recognizes the term as ‘defined’ from the ontology, and looks for any neuron that has GABA as a neurotransmitter (instead of the lexical match of the search term like in Google) and enhances the query over those inferred list of neurons.

NifStd Current Version Key feature: Includes useful defined concepts to infer useful classification NIF Standard Ontologies

NifStd and NeuroLex Wiki Semantic wiki platform Provides simple forms for structured knowledge Can add concepts, properties Generate hierarchies without having to learn complicated ontology tools Good teaching tool for principles behind ontologies Community can contribute One of the largest roadblocks that we encountered during our ontology development was the lack of tools for domain experts to contribute their knowledge to NIFSTD. To bridge these gaps, NIF has created NeuroLex (http://neurolex.org), a semantic wiki interface for the domain experts as an easy entry point to the NIFSTD contents. It has been extensively used in the area of neuronal cell types where NIF is working with a group of neuroscientists such as Gordon Shephard and Georgio Ascoli, to create a comprehensive list of neurons and their properties. Stephen D. Larson et al. NIF Standard Ontologies

NeuroLex vs.NIFSTD NeuroLex NIFSTD A semantic mediawiki based website containing the content of the NIFSTD plus additional community contributions Collection of cohesive, unified modular ontologies deployed in OWL Categories Classes Content is fluid and can be updated at any time. Structure is based on OBO foundry principles Defines relationships between categories as simple properties Defines relationships between classes as OWL restrictions derived from RO At a glance guide to the differences between NeuroLex and NIFSTD Larson et. al

Top Down Vs. Bottom up NIFSTD NEUROLEX Larson et. al Top-down ontology construction A select few authors have write privileges Maximizes consistency of terms with each other Making changes requires approval and re-publishing Works best when domain to be organized has: small corpus, formal categories, stable entities, restricted entities, clear edges. Works best with participants who are: expert catalogers, coordinated users, expert users, people with authoritative source of judgment NIFSTD Bottom-up ontology construction Multiple participants can edit the ontology instantly Control of content is done after edits are made based on the merit of the content Semantics are limited to what is convenient for the domain Not a replacement for top-down construction; sometimes necessary to increase flexibility Necessary when domain has: large corpus, no formal categories, no clear edges Necessary when participants are: uncoordinated users, amateur users, naïve catalogers Neuroscience is a domain that is less formal and neuroscientists are more uncoordinated NEUROLEX Larson et. al

NeuroLex Wiki Contributions http://neurolex.org/wiki/Special:ContributionScores

NifStd/Neurolex Curation Workflow ‘has soma location’ in NeuroLex == ‘Neuron X’ has_part some (‘Soma’ and (part_of some ‘Brain region Y’)) in NIFSTD We envision NeuroLex as the main entry point for the broader community to access, annotate, edit and enhance the core NIFSTD content. The peer-reviewed contributions in the media wiki are later implanted in formal OWL modules. While the properties in NeuroLex are meant for easier interpretation, the restrictions in NIFSTD are usually based on rigorous OBO-RO standard relations. For example, the property ‘soma located in’ is translated as ‘Neuron X’ has_part some (‘Soma’ and (part_of some ‘Brain region Y’)) in NIFSTD.

Access to NIFSTD Contents NIFSTD is available as OWL Format http://ontology.neuinfo.org RDF and SPARQL Endpoint http://ontology.neuinfo.org/sparql-endpoint.html Specific contents through web services http://ontology.neuinfo.org/ontoquest-service.html Available through NCBO Bioportal Provides annotation and mapping services http://bioportal.bioontology.org/ NIF Standard Ontologies

Working to Incorporate Community NeuroPsyGrid http://www.neuropsygrid.org NDAR Autism Ontology http://ndar.nih.gov Disease Phenotype Ontology http://openccdb.org/wiki/index.php/Disease_Ontology Cognitive Paradigm Ontology (CogPO) http://wiki.cogpo.org Neural ElectroMagnetic Ontologies (NEMO) http://nemo.nic.uoregon.edu

Summary and Conclusions NIF with NIFSTD provides an example of how ontologies can be practically applied to enhance search and data integration across diverse resources We believe, we have defined a process to form complex semantics to various neuroscience concepts through NIFSTD and through NeuroLex collaborative environment. NIF encourages the use of community ontologies Moving towards building rich knowledgebase for Neuroscience that integrates with larger life science communities.

Point of Discussion Gaining OBO Foundry community consensus for a production system is difficult as we often need to move quickly along with the project We rather favor a system whereby we start with minimal complexity as required and add more as the ontologies evolve over time towards perfection What should be the most effective way to collaborate and gain community consensus? While the principles promote developing highly interoperable and reusable reference ontologies in ideal cases, following some of them in a rigid manner is often proven to be too ambitious for day-to-day development.