1 eXtended Metadata Registry (XMDR): Input for Open Ontology Repository OOR Panel - Ontology Registry and Repository Technology & Infrastructure Landscape.

Slides:



Advertisements
Similar presentations
Chapter 1: The Database Environment
Advertisements

Chapter 7 System Models.
1 Copyright ©2007 Sandpiper Software, Inc. Vocabulary, Ontology & Specification Management at OMG Elisa Kendall Sandpiper Software
OMV Ontology Metadata Vocabulary April 10, 2008 Peter Haase.
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
1 eXtended Metadata Registry (XMDR) Two Slides for Ontology Summit Presentation Bruce Bargmeyer Lawrence Berkeley National Laboratory and University of.
August 6, 2009 Joint Ontolog-OOR Panel 1 Ontology Repository Research Issues Joint Ontolog-OOR Panel Discussion Ken Baclawski August 6, 2009.
Putting the Pieces Together Grace Agnew Slide User Description Rights Holder Authentication Rights Video Object Permission Administration.
Who am I? Senior Researcher of The National Library of Korea (2007 ~ )
Cultural Heritage in REGional NETworks REGNET T1.4: Development of the system specification.
18 Copyright © 2005, Oracle. All rights reserved. Distributing Modular Applications: Introduction to Web Services.
Presented to: By: Date: Federal Aviation Administration Registry/Repository in a SOA Environment SOA Brown Bag #5 SWIM Team March 9, 2011.
Extended Metadata Registry (XMDR) September 2004 Bruce Bargmeyer +1 (510) Interagency/International Cooperation on Ecoinformatics.
Ecoinformatics International Technical Collaboration
Language Specification using Metamodelling Joachim Fischer Humboldt University Berlin LAB Workshop Geneva
Copyright 2006 Digital Enterprise Research Institute. All rights reserved. MarcOnt Initiative Tools for collaborative ontology development.
XML in the Emerging U.S. Federal Information Architecture Presented by Eliot Christian, USGS April 30, 2003.
31242/32549 Advanced Internet Programming Advanced Java Programming
Macromedia Dreamweaver MX 2004 – Design Professional Dreamweaver GETTING STARTED WITH.
1 Technical Projects Ecoinformatics International Technical Collaboration Seattle, Washington, USA January 26, 2010 Bruce Bargmeyer Lawrence Berkeley National.
1 Extended Metadata Registries and Semantics April 18, 2007 Bruce Bargmeyer University of California, Berkeley and Lawrence Berkeley National Laboratory.
Steffen Staab 1WeST Web Science & Technologies University of Koblenz ▪ Landau, Germany Structured Data on the Web Introduction to.
1 Distributed Agents for User-Friendly Access of Digital Libraries DAFFODIL Effective Support for Using Digital Libraries Norbert Fuhr University of Duisburg-Essen,
Direction of Proposals for New Edition (E3) of ISO/IEC 11179
SDC JE-xxxx. Bruce Bargmeyer EPA/OIRM/EIM Division Tel: (202) WWW URL:
Future of MDR - ISO/IEC Metadata Registries (MDR) Larry Fitzwater, SC 32 WG 2 Convener Computer Scientist U.S. Environmental Protection Agency May.
WG2 Tutorial ISO/JTC1/SC32 Larry Fitzwater (202) SDC JE-4029.
SC32 WG2 Metadata Standards Tutorial Metadata Registries and Big Data WG2 N1945 June 9, 2014 Beijing, China.
1 Future Database Needs SC 32 Study Period February 5, 2007 Bruce Bargmeyer, Lawrence Berkley National Laboratory University of California Tel:
9 th Open Forum on Metadata Registries Harmonization of Terminology, Ontology and Metadata 20th – 22nd March, 2006, Kobe Japan. XMDR Prototype Day: 21.
A Standard & Prototype Starting Point for An Open Ontology Repository: The Extended Metadata Registry Project John L. McCarthy XMDR Project Lawrence Berkeley.
Environmental Terminology Research in China HE Keqing, HE Yangfan, WANG Chong State Key Lab. Of Software Engineering
1 Collaborative Research, Development and Demonstration Ecoinformatics International Technical Collaboration Copenhagen, Denmark March, Bruce Bargmeyer.
SDC JE-Matsue May 1999 Bruce Bargmeyer U.S. Environmental Protection Agency Tel: (202) WWW URL:
1 eXtended Metadata Registry (XMDR) International Ecoinformatics Technical Collaboration Berkeley, California October 24, 2006 Bruce Bargmeyer, Lawrence.
Interoperability Standards at Levels of Syntax and Semantics Presented by Eliot Christian at the First Meeting of WMO / CBS / ISS / ET-ADRS (Expert Team.
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
1 Extended Metadata Registry (XMDR) November 2004 Bruce Bargmeyer +1 (510) ISO/IEC JTC 1/SC 32/WG 2.
SDC JE-8019 February 16, 1999 Bruce Bargmeyer EPA/OIRM/EIM Division Tel: (202) WWW URL:
Interfacing Registry Systems December 2000.
Cooperating Registries Draft Content for OASIS/ebXML Reg/Rep f2f November 1, 2001 Bruce Bargmeyer (510)
The Final Study Period Report on MFI 6: Model registration procedure SC32WG2 Meeting, Sydney May 26, 2008 H. Horiuchi, Keqing He, Doo-Kwon Baik SC32WG2.
1 eXtended Metadata Registry (XMDR) for Ecoinformatics Test Bed Interagency/International Cooperation on Ecoinformatics Copenhagen, Denmark June,
Requirements for Standardization on the Service Registries ISO/IEC JTC1 SC /10/161 A comment to WSSG, JTC1 SC32WG2 N
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Clinical Data Interchange Standards Consortium (CDISC) uses NCIt for its Study Data Tabulation Model (SDTM) and other global data standards for medical.
FEA Data and Information Reference Model (DRM): the Interoperability Message Presented by Eliot Christian, USGS based on work of ISO/IEC JTC1/SC32 Data.
Registry Services Bringing Value to US EPA, States, and Tribes Exchange Network Vendors Meeting April 24, 2007 Cynthia Dickinson EPA/OEI/OIC Data Standards.
th Open Forum on Metadata Registries, Kobe, Japan1 XMDR Project Overview Frank Olken & Kevin D. Keck Lawrence.
1 eXtended Metadata Registry (XMDR) Interagency/International Cooperation on Ecoinformatics Ispra, Italy January 17, 2006 Bruce Bargmeyer, Lawrence Berkley.
1 eXtended Metadata Registry (XMDR) Ecoterm Rome, Italy May 17, 2006 Bruce Bargmeyer, Lawrence Berkley National Laboratory University of California Tel:
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
1 SC 32/WG 2 Tutorial Metadata Registry Standards July 16, 2007 Bruce Bargmeyer University of California, Berkeley and Lawrence Berkley National Laboratory.
- EVS Overview - Biomedical Terminology and Ontology Resources Frank Hartel, Ph.D. Director, Enterprise Vocabulary Services NCI Center for Bioinformatics.
1 Technical Projects Workgroup Report to Plenary Ecoinformatics International Technical Collaboration April 10, 2008 Research Triangle Park, North Carolina,
SDC JE-2027 January 18, 2000 Bruce Bargmeyer Chair, SC 32 – Data Management and Interchange U.S. Environmental Protection Agency Telephone: (202)
1 Registry Services Overview J. Steven Hughes (Deputy Chair) Principal Computer Scientist NASA/JPL 17 December 2015.
SDC JE-xxxx September 1999 Bruce Bargmeyer U.S. Environmental Protection Agency Tel: (202) WWW URL:
Concept Proposal Sixth Open Forum on Metadata Registries Semantic Interoperability between Registries To be held January 20-24, 2003 Bruce Bargmeyer
International/Interagency Collaboration – IT for Environmental Information & Environmental Data Exchange Network Copenhagen, Denmark April 25, 2002 Bruce.
US-EU Research Cooperation Interagency/International Cooperation on Ecoinformatics September 2004 Bruce Bargmeyer +1 (510)
Update on Ecoinformatics Technical Working Group Activities Larry Fitzwater Computer Scientist US Environmental Protection Agency Rome, Italy – 17 May.
“Sharing and advancing knowledge and experience about standards, technologies and implementations. Sharing and advancing knowledge and experience about.
Extended Metadata Registries and Semantics (Part 2: Implementation) Karlo Berket Ecoterm IV Environmental Terminology Workshop April 18, 2007 Diplomatic.
Concept Presentation Sixth Open Forum on Metadata Registries To be held January 20-24, 2003 Bruce Bargmeyer
IPDA Registry Definitions Project Dan Crichton Pedro Osuna Alain Sarkissian.
ISO/IEC Past, Present, Future -- A Thumbnail Sketch
Report on Eighth Open Forum on Metadata Registries, Berlin, April 2005
The Re3gistry software and the INSPIRE Registry
Ecoinformatics Technical Projects Workgroup
Presentation transcript:

1 eXtended Metadata Registry (XMDR): Input for Open Ontology Repository OOR Panel - Ontology Registry and Repository Technology & Infrastructure Landscape February 28, 2008 Bruce Bargmeyer Lawrence Berkeley National Laboratory and University of California, Berkeley Tel:

Topics F Describe the technology/infrastructure that XMDR brings to the table for the OOR project. F How does that contribute to the overall OOR initiative F How does that fit in with the other things that the rest of the teams are bringing to the table 2

What XMDR Brings to the Table F Use cases - semantics challenges - and Requirements F Proposed specifications for ISO/IEC Edition 3 – Model, definitions, ontology F Modular software architecture and open source software modules F Open Source XMDR software F Test content 3

4 Challenge: Combine Data, Metadata & Concept Systems IDDateTempHg A B X NameDatatypeDefinitionUnits IDtext Monitoring Station Identifier not applicable DatedateDateyy-mm-dd Tempnumber Temperature (to 0.1 degree C) degrees Celcius Hgnumber Mercury contamination micrograms per liter Inference Search Query: find water bodies downstream from Fletcher Creek where chemical contamination was over 10 micrograms per liter between December 2001 and March 2003 Data: Metadata: BiologicalRadioactive Contamination leadcadmium mercury Chemical Concept system:

5 Challenge: Find and process non- explicit data Analgesic Agent Non-Narcotic Analgesic AcetominophenNonsteroidal Antiinflammatory Drug Analgesic and Antipyretic Datril Anacin-3Tylenol For example… Patient data on drugs contains brand names (e.g. Tylenol, Anacin-3, Datril,…); However, want to study patients taking analgesic agents

6 Challenge: Specify and compute across Relations, e.g., within a food web in an Arctic ecosystem An organism is connected to another organism for which it is a source of food energy and material by an arrow representing the direction of biomass transfer. Source: (from SPIRE)

7 Challenge: Use data from systems that record the same facts with different terms F Reduce the human toil of drawing information together and performing analysis -> shift to computer processing.

8 Challenge: Use data from systems that record the same facts with different terms Common Content OASIS/ebXML Registries Common Content ISO Registries Common Content Ontological Registries Common Content CASE Tool Repositories Common Content UDDI Registries Country Identifier Data Element XML Tag Term Hierarchy Attribute Business Specification Table Column Software Component Registries Common Content Database Catalogs Business Object Dublin Core Registries Common Content Coverage

9 Data Elements DZ BE CN DK EG FR... ZW ISO 3166 English Name ISO Numeric Code ISO Alpha Code Algeria Belgium China Denmark Egypt France... Zimbabwe Name: Context: Definition: Unique ID: 4572 Value Domain: Maintenance Org. Steward: Classification: Registration Authority: Others ISO 3166 French Name L`Algérie Belgique Chine Danemark Egypte La France... Zimbabwe DZA BEL CHN DNK EGY FRA... ZWE ISO Alpha Code Same Fact, Different Terms Algeria Belgium China Denmark Egypt France... Zimbabwe Name: Country Identifiers Context: Definition: Unique ID: 5769 Conceptual Domain: Maintenance Org.: Steward: Classification: Registration Authority: Others Data Element Concept

Challenge: Draw information together from a broad range of studies, databases, reports, etc. 10

11 Challenge: Gain Common Understanding of meaning between Data Creators and Data Users Users Information systems Data Creation Users EEA USGS DoD EPA environ agriculture climate human health industry tourism soil water air textdata environ agriculture climate human health industry tourism soil water air text ambiente agricultura tiempo salud hunano industria turismo tierra agua aero textdata environ agriculture climate human health industry tourism soil water air textdata Others... ambiente agricultura tiempo salud huno industria turismo tierra agua aero textdata A common interpretation of what the data represents

12 Semantics Challenges F Managing, harmonizing and vetting semantics is important for traditional data management. u In the past we just covered the basics F Managing, harmonizing, and vetting semantics is essential to enable enterprise semantic computing

Enterprise Vocabulary Services (EVS) Concepts Unite NCI MDR 13 Object Class Chemopreventive Agent Property NSCNumber Conceptual Domain Agent Data Element Concept Chemopreventive Agent NSC Number Data Element Chemopreventive Agent Name Value Domain NSC Code Context caCORE Representation Code Classification Schemes caDSRTraining Valid Values Cyclooxygenase Inhibitor Doxercalciferol Eflornithine … Ursodiol Source: Denise Warzel, National Cancer Institute

XMDR Prototype Demonstrate capabilities: F Register existing concept systems, based on their underlying structures, such as graphs of varying complexity. F Interrelate concepts systems with each other. n E.g., register mappings between multiple vocabularies F Support harmonization and vetting of concept systems for a community of interest. n E.g., Register, harmonize, validate, and vet definitions and relations F Interrelate concepts in concept systems with concepts in metadata and concepts in databases, knowledgebases, and text. F Provide semantic services needed to support traditional computing as well as semantic computing. u E.g., dereferencing the URIs used in creating RDF statements, by providing relevant information describing the referenced concept and its authoritative standing within some community of interest. F Register and manage the provenance of data F XMDR is part of the infrastructure for semantics and data management. 14

XMDR Use F Upside u Collaborative n Supports interaction with community of interest n Shared evolution and dissemination n Enables Review Cycle u Standards-based – dont lock semantics into proprietary technology u Foundation for strategic data centric applications u Lays the foundation for Ontology-based Information Management u Content is reusable for many purposes F Downside u Managing semantics is HARD WORK - No matter how friendly the tools u Needs integration with other components 15

Modular XMDR Archtitecture Registry Store Search & Content Serving (Jena, Lucene) XMDR metamodel (OWL & xml schema) standard XMDR files Logic Index Content Loading & Transformation (Lexgrid & custom) Human User Interface (HTML fromJSP and javascript; Exhibit) Metadata Sources concept systems, data elements USERS Web Browsers…..Client Software Application Program Interface (REST) Authentication Service Validation (XML Schema) Mapping Engine Logic Indexer (Jana & Pellet) Text Indexer (Lucene) Metamodel specs (UML & Editing) (Poseidon, Protege) XMDR data model & exchange format XML, RDF, OWL Text Index Postgres Database Third Party Software

Initial XMDR REST-style Application Programming Interface (API) F Search Methods (GET) u Text Search u SPARQL Search u XMDR Search (not documented yet) F Registry Information Methods u Summary information u registered models u Identified Items F Method Parameters u can be included as part of any method u as part of URL u Accept_type (what xml components to expect) u Stylesheet (how to display results)

REST API (Search Methods) ResourceURI (relative to application root) MethodRepresentationAccept RequestDescription Text Search search/text? query={queryText} GETapplication/xml (searchResult) Any (ignores)Start a text search. Text Search Results search/text/{queryID}? offset={offset}& maxResults={maxResults} GETapplication/xml (textResultSet) application/xml, application/*, or */* Retrieve the results of a text search. application/exhibi*application/exhibit SPARQL Search search/sparql? query={queryText}& model={modelNameN} GETapplication/xml (searchResult) Any (ignores)Start a SPARQL search. SPARQL Search Results search/sparql/{queryID}? offset={offset}& maxResults={maxResults} GETapplication/xml (sparqlResultSet) application/xml, application/*, or */* Retrieve the results of a SPARQL search. application/ sparql-results+xml ** application/ sparql-results+xml application/ sparql-results+json *** application/ sparql-results+json, application/json application/exhibit * application/exhibit

XMDR F Content (selected portions of): u ISO/IEC u ISO/IEC 3166 – Country codes u ISO 4217 – Currency codes u EPA Environmental Data Registry content (ISO/IEC based registry) u Standard Industrial Codes u North American Industrial Classification System u Mapping NAICS 02 to SIC 87 u Adult Mouse Anatomical Dictionary u Defense Technology Info. Center Thesaurus u NBII Biocomplexity Thesaurus u GEneral Multilingual Environmental Thesaurus u NCI_Thesaurus u Cancer Data Standards Repository (NCI registry based on ISO?IEC 11179) F Loading new content (ongoing) u OMEGA linguistic ontology u OpenCyc ontology u SIC – NAICS codes u Mapping of NAICS to SIC codes 19

Contribution How does that contribute to the overall OOR initiative? F It is free for the taking F Save time on development of use cases, specifications, architectures, software, etc. 20

Fitting In How does that fit in with the other things that the rest of the teams are bringing to the table? F Collaboration on standards development F Collaboration on prototype development and demonstration F Collaboration on proposals? 21

22 Align, Coordinate, Integrate Standards/Recommendations/Specifications for Semantic Computing ISO/IEC JTC 1/SC 32 Us er s ISO/IEC Metadata Registries Metadata Registry Terminology Thesaurus Taxonomy Data Standards Ontology Structured Metadata Terminology CONCEPT Referent Refers To Symbolizes Stands For Rose, ClipArt Rose ISO TC 37 Semantic Web W3C Object Management MOF ODM CWM IMM OMG Node Edge Subject Predicate Object Graph RDF

Standards Development Semantics Management and Semantics Services – Semantic Computing 23 OMG W3C ISO/IEC JTC 1 SC 32 Align, Co-develop, Fast Track, PAS Submission … OASIS ISO TC 154

Standards Development Semantics Management and Semantics Services – Semantic Computing 24 OMG W3C ISO/IEC JTC 1 SC 32 Align, integrate, co-develop, Fast Track, PAS Submission … Can we coordinate content? OASIS/ ISO TC 154

A Success 25 OMG ISO/IEC JTC 1 SC 32 Some text and figures are identical in the two standards. ISO/IEC OMG ODM ISO/IEC – Common Logic OMG Ontology Definition Metamodel

Standards Development Semantics Management and Semantics Services – Semantic Computing 26 ISO/IEC (Edition 3) ISO/IEC JTC 1 SC 32 Ongoing effort

Standards Development Semantics Management and Semantics Services – Semantic Computing 27 ISO/IEC (Edition 3) ISO/IEC JTC 1 SC 32 Hopeful? OMG IMM &

Other Possibilities F OASIS ebXML Registry F W3C Semantic Web Deployment WG F TC 37 28

Acknowledgements F John McCarthy, LBNL F Kevin Keck, LBNL F Harold Solbrig, Apelon F This material is based upon work supported by the National Science Foundation under Grant No , USEPA and USDOD. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation, USEPA or USDOD. 29