2006-03-219th Open Forum on Metadata Registries, Kobe, Japan1 XMDR Project Overview Frank Olken & Kevin D. Keck Lawrence.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

1 eXtended Metadata Registry (XMDR) Two Slides for Ontology Summit Presentation Bruce Bargmeyer Lawrence Berkeley National Laboratory and University of.
Ontology Assessment – Proposed Framework and Methodology.
Meta Data Larry, Stirling md on data access – data types, domain meta-data discovery Scott, Ohio State – caBIG md driven architecture semantic md Alexander.
1 Extended Metadata Registries and Semantics April 18, 2007 Bruce Bargmeyer University of California, Berkeley and Lawrence Berkeley National Laboratory.
CS570 Artificial Intelligence Semantic Web & Ontology 2
Direction of Proposals for New Edition (E3) of ISO/IEC 11179
IPY and Semantics Siri Jodha S. Khalsa Paul Cooper Peter Pulsifer Paul Overduin Eugeny Vyazilov Heather lane.
Ontology Notes are from:
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
SDC JE-xxxx. Bruce Bargmeyer EPA/OIRM/EIM Division Tel: (202) WWW URL:
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
Future of MDR - ISO/IEC Metadata Registries (MDR) Larry Fitzwater, SC 32 WG 2 Convener Computer Scientist U.S. Environmental Protection Agency May.
SC32 WG2 Metadata Standards Tutorial Metadata Registries and Big Data WG2 N1945 June 9, 2014 Beijing, China.
Ontology Development Kenneth Baclawski Northeastern University Harvard Medical School.
9 th Open Forum on Metadata Registries Harmonization of Terminology, Ontology and Metadata 20th – 22nd March, 2006, Kobe Japan. XMDR Prototype Day: 21.
A Standard & Prototype Starting Point for An Open Ontology Repository: The Extended Metadata Registry Project John L. McCarthy XMDR Project Lawrence Berkeley.
LexEVS 6.0 Overview Scott Bauer Mayo Clinic Rochester, Minnesota February 2011.
Environmental Terminology Research in China HE Keqing, HE Yangfan, WANG Chong State Key Lab. Of Software Engineering
RDF and OWL Developing Semantic Web Services by H. Peter Alesso and Craig F. Smith CMPT 455/826 - Week 6, Day Sept-Dec 2009 – w6d21.
Classification and the Metadata Registry Judith Newton NIST IRS XML Stakeholders/ XML Working Group May 18, 2004.
Baba Piprani (SICOM Canada) Robert Henkel (Transport Canada)
Nancy Lawler U.S. Department of Defense ISO/IEC Part 2: Classification Schemes Metadata Registries — Part 2: Classification Schemes The revision.
Metadata Management Case Study Date: 10/21/2008 Dan McCreary President Dan McCreary & Associates (952) M D Metadata Solutions.
Ontology Summit2007 Survey Response Analysis -- Issues Ken Baclawski Northeastern University.
2004 Open Forum for eBusiness and Metadata Technology Standardization Metamodel Framework for Ontology Keqing He, Yixin Jing, Yangfan He State Key Laboratory.
The Agricultural Ontology Service (AOS) A Tool for Facilitating Access to Knowledge AGRIS/CARIS and Documentation Group Library and Documentation Systems.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Value Set Resolution: Build generalizable data normalization pipeline using LexEVS infrastructure resources Explore UIMA framework for implementing semantic.
Revelytix SICoP Presentation DRM 3.0 with WordNet Senses in a Semantic Wiki Michael Lang February 6, 2007.
Ontology Summit2007 Survey Response Analysis Ken Baclawski Northeastern University.
FEA Data and Information Reference Model (DRM): the Interoperability Message Presented by Eliot Christian, USGS based on work of ISO/IEC JTC1/SC32 Data.
Registry Services Bringing Value to US EPA, States, and Tribes Exchange Network Vendors Meeting April 24, 2007 Cynthia Dickinson EPA/OEI/OIC Data Standards.
9 th Open Forum on Metadata Registries Harmonization of Terminology, Ontology and Metadata 20th – 22nd March, 2006, Kobe Japan. Presentation Title: Day:
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
Semantic web course – Computer Engineering Department – Sharif Univ. of Technology – Fall Knowledge Representation Semantic Web - Fall 2005 Computer.
Proposed NWI KIF/CG --> Common Logic Standard A working group was recently formed from the KIF working group. John Sowa is the only CG representative so.
1 eXtended Metadata Registry (XMDR) Interagency/International Cooperation on Ecoinformatics Ispra, Italy January 17, 2006 Bruce Bargmeyer, Lawrence Berkley.
A Systemic Approach for Effective Semantic Access to Cultural Content Ilianna Kollia, Vassilis Tzouvaras, Nasos Drosopoulos and George Stamou Presenter:
10/24/09CK The Open Ontology Repository Initiative: Requirements and Research Challenges Ken Baclawski Todd Schneider.
1 eXtended Metadata Registry (XMDR) Ecoterm Rome, Italy May 17, 2006 Bruce Bargmeyer, Lawrence Berkley National Laboratory University of California Tel:
A LexWiki-based Representation and Harmonization Framework for caDSR Common Data Elements Guoqian Jiang, Ph.D. Robert Freimuth, Ph.D. Harold Solbrig Mayo.
Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar
Overview of SC 32/WG 2 Standards Projects Supporting Semantics Management Open Forum 2005 on Metadata Registries 14:45 to 15:30 13 April 2005 Larry Fitzwater.
Strategies for subject navigation of linked Web sites using RDF topic maps Carol Jean Godby Devon Smith OCLC Online Computer Library Center Knowledge Technologies.
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Metadata Registries Workshop Metadata Registries Workshop U.S. Bureau of Labor Statistics Conference Center April 15-17, 1998.
Tutorial on XML Tag and Schema Registration in an ISO/IEC Metadata Registry Open Forum 2003 on Metadata Registries Tuesday, January 21, 2003; 4:45-5:30.
Extending the MDR for Semantic Web November 20, 2008 SC32/WG32 Interim Meeting Vilamoura, Portugal - Procedure for the Specification of Web Ontology -
ISO/IEC JTC 1/SC 32 Plenary and WGs Meetings Jeju, Korea, June 25, 2009 Jeong-Dong Kim, Doo-Kwon Baik, Dongwon Jeong {kjd4u,
Data Registry to support HIPAA standards The Health Insurance Portability and Accountability Act of 1996 Title II - Subtitle F Administrative Simplification.
SDC JE-2031 Linda Spencer U.S. EPA January 19, 2000 Open Forum on Metadata Registries Santa Fe, NM.
CS621 : Artificial Intelligence Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 12 RDF, OWL, Minimax.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
Terminology Components for Ecoinformatics Sharing Gail Hodge Consultant to USGS BIO/NBII Information International Associates, Inc. 28 January 2004 science.
ONION Ontologies In Ontology Community of Practice Leader
Data Element Classification ISO/IEC 11179, Part 2
Presented by Kyumars Sheykh Esmaili Description Logics for Data Bases (DLHB,Chapter 16) Semantic Web Seminar.
Semantics and the EPA System of Registries Gail Hodge IIa/ Consultant to the U.S. Environmental Protection Agency 18 April 2007.
Improvement of Semantic Interoperability based on Metadata Registry(MDR) Doo-Kwon Baik Dept. of CSE Korea University.
Update on Ecoinformatics Technical Working Group Activities Larry Fitzwater Computer Scientist US Environmental Protection Agency Rome, Italy – 17 May.
Ontology Technology applied to Catalogues Paul Kopp.
Extended Metadata Registries and Semantics (Part 2: Implementation) Karlo Berket Ecoterm IV Environmental Terminology Workshop April 18, 2007 Diplomatic.
Object Management Group Information Management Metamodel
Report on Eighth Open Forum on Metadata Registries, Berlin, April 2005
2. An overview of SDMX (What is SDMX? Part I)
2. An overview of SDMX (What is SDMX? Part I)
Presentation transcript:

th Open Forum on Metadata Registries, Kobe, Japan1 XMDR Project Overview Frank Olken & Kevin D. Keck Lawrence Berkeley National Laboratory Presentation to Open Metadata Forum Kobe, Japan March 21, 2006

th Open Forum on Metadata Registries, Kobe, Japan2 XMDR means: Extended Metadata Registry

th Open Forum on Metadata Registries, Kobe, Japan3 The Cast ● Bruce Bargmeyer (LBNL) = Principal Investigator ● Kevin Keck (LBNL) = architect & stds. (design) ● Frank Olken (LBNL) = content characterization & stds. (design) ● John McCarthy (LBNL) = prototype development (management) ● Karlo Berket (LBNL) = prototype development ● Harold Solbrig (Mayo) = content preprocessing via LexGrid, stds ● Gayle Hodge (USGS) = content characterization, acquisition ● Denise Warzel (NCI) = content acquisition, standards, design ● Larry Fitzwater (EPA) = program mgt. (vision, direction) ● Nancy Lawler (DOD) = program mgt. (vision, direction) ● Sam Chance (DOD) = program mgt. (vision, direction)

th Open Forum on Metadata Registries, Kobe, Japan4 Organizational Cast ● Lawrence Berkeley National Laboratory ● Environmental Protection Agency ● National Cancer Institute ● Mayo Clinic ● United States Geological Survey ● Department of Defense

th Open Forum on Metadata Registries, Kobe, Japan5 Goals ● Assist revisions of ISO/IEC Metadata Registry Standard to encompass additional semantic descriptions and resources  Vocabularies, thesauri, etc.  Ontologies  Relationships  Semantic types ● Design and implement prototype Extended Metadata Registry ● Load metadata content into prototype ● Demonstrate prototype

th Open Forum on Metadata Registries, Kobe, Japan6 Why Metadata Registries? ● Facilitate reuse/standardization/integration/exchange of data ● Design time:  Database / messaging / application / forms designers  Data warehouse design ● Run-time:  Query formulation / optimization  Federated data query optimization / processing  Extraction, Translation, Load (ETL) of Data Warehouses  Semantic services, composition, workflows,... ● Users  Finding, understanding data  Understanding data entry forms

th Open Forum on Metadata Registries, Kobe, Japan7 Why Standards? ● Developing metamodel to serve as design for next generation metadata registries ● Evolve ISO/IEC Metadata Data Registry Standard  Edition 2 (current) ● UML modeling, relational DB technology implementation  Edition 3 (new) ● UML + OWL (Ontology Web Language) / MOF (Meta Object Facility) / CL (Common Logic) modeling ● Add support for ontologies

th Open Forum on Metadata Registries, Kobe, Japan8 More on Why MDR Standards? ● MDR Standards  Can improve metadata creation practice  Can improve metadata and data reuse  Facilitate MDR adoption by organizations  Facilitate MDR interoperability  Facilitate MDR software marketing  Facilitate MDR procurement  Facilitate alignment / mapping among metadata schemas,...

th Open Forum on Metadata Registries, Kobe, Japan9 Proposed Changes to ISO/IEC ● Support for ontologies, etc. ● More formal modeling of relationships ● Semantic types (?)

th Open Forum on Metadata Registries, Kobe, Japan10 Changes to ISO/IEC Std. ● Add support for ontologies, vocabularies  Add ontologies  Add predicates (logical formulae)  Add axioms (asserted to be true)  Add support for modularization of ontologies ● Add inclusion mechanisms for concept systems and ontologies ● Assert axioms in context of containing ontology

th Open Forum on Metadata Registries, Kobe, Japan11 Why add support for ontologies? ● More precise specification of data semantics (than natural language definitions) ● Machine processing of semantic specifications of data  Classification, subsumption testing, alignment, spatial, temporal reasoning ● Reusable semantic specifications for subject domains ● Conceptual data models to facilitate data integration ● Encoding of much current work on data semantics and terminologies as ontologies ● Useful for machine learning.

th Open Forum on Metadata Registries, Kobe, Japan12 Issues in Including Ontologies in ISO/IEC ● Lack of agreement on logical formalisms  FOL, description logic (which?),... ● Hence, MDR std must be agnostic among logic formalisms ● Poses difficulties for:  Standards specification  MDR implementation  MDR interoperability ● See work of OMG Ontology Definition Metamodel (ODM) standard

th Open Forum on Metadata Registries, Kobe, Japan13 Changes to ISO/IEC Std. ● Formalize specification of semantic relationships  Refinement of Edition 2 Classification Schemes  Add relationships (types), roles, links (instances) among concepts  Specify attributes of relationships ● Reflexivity, irreflexivity, symmetry, anti-symmetry, transitivity  To support inference across semantic relationships ● e.g., transitive closure over is-a, part-of,...

th Open Forum on Metadata Registries, Kobe, Japan14 Relationship Modeling in ISO/IEC Edition 3 ● Edition 2 has classification schemes and specialized relationships among various metamodel entities ● Proposed for Edition 3 ● Binary and N-ary semantic relationships among concepts (a.k.a. relations) ● Treat data element concept, conceptual value domain, value meaning, etc. as subtypes of concept ● More detailed characterization of relationships:  Roles / links  Reflexivity, symmetry, anti-symmetry, transitivity,....

th Open Forum on Metadata Registries, Kobe, Japan15 Why care about relationship characterization? ● Who cares about reflexivity, irreflexivity, symmetry, transitivity? ● Answer: need this information for inference on semantic relationships (usually binary)  Example: Does it make sense to compute transitive closure? ● Is-a: transitive ● Part-of: sometimes transitive ● Equals: transitive, symmetric ● Similar: usually symmetric, typically not transitive

th Open Forum on Metadata Registries, Kobe, Japan16 Semantic Types for ISO/IEC ● ISO/IEC Edition 2 has “datatypes”  Associated with “value domain”  i.e., datatypes are an aspect of representation NOT semantics ● Semantic Types  Concern meaning rather than representation  Uses: ● Constraints over relationship roles ● Attribute of concepts, conceptual value domains,... ● Ubiquitous in ontologies, schemas,...

th Open Forum on Metadata Registries, Kobe, Japan17 Some Issues for Semantic Types ● Alternative approaches:  Build semantic types into metamodel  Reuse relationships for semantic type specifications  Treat semantic types as unary predicates in ontologies + axioms ● Should we have a standard set of semantic types (at least base types)  Yes, for interoperability  No, for flexibility ● Collection types, type constructors ?

th Open Forum on Metadata Registries, Kobe, Japan18 Why Construct A Prototype? ● To explore alternative revisions to ISO/IEC ● To demonstrate that proposed revisions to ISO/IEC Metadata Registry Std. are:  Feasible  Useful ● To experiment with alternative architectures / technologies for constructing extended metadata registries.  Text retrieval engines - Lucene  Inference engines – Jena, Kowari (?),....  Service oriented architecture (SOA) ● To facilitate deployment of revised ISO/IEC Metadata Registries  Example implementation  Open Source Code !

th Open Forum on Metadata Registries, Kobe, Japan19 Why Content? ● Content characterization assists in shaping revisions to ISO/IEC ● Content characterization assists in selection of content to load ● Content ingestion, installation, querying provides a means to exercise the prototype  Testing  Demonstration  Performance evaluation  Utility evaluation

th Open Forum on Metadata Registries, Kobe, Japan20 Metadata Content Activities ● Content Characterization  e.g., graph theoretic characterization ● Content Acquisition ● Content Preprocessing  Into standard formats for loading (H. Solbrig) ● Content Loading ● Content Querying

th Open Forum on Metadata Registries, Kobe, Japan21 Desiderata for Content Selection ● Accessibility  Licensing, source cooperation, unclassified ● Documentation, familiarity to XMDR collaborators ● Funder interest ● Diversity of metadata types, subject areas ● Diverse graph structures (of semantic relationships) ● OWL encodings available ● Moderate size ● Opportunities for mappings among metadata sets ● Multi-linguality

th Open Forum on Metadata Registries, Kobe, Japan22 Content Characterization ● Provenance: Name, source, contact,... ● Type of metadata:  thesauri, ontology, ISO/IEC metadata registry,... ● Graph Characterization  Tree, Faceted Classification, partial order (directed acyclic graph), cyclic graph,... ● Size: # concepts, # links, # bytes ● Definitions ? ● File Formats ● OWL encoding ? ● Multilingual ● Availability / licensing issues

th Open Forum on Metadata Registries, Kobe, Japan23 Why Graph-theoretic Content Characterization? ● Important structural taxonomy ● Impacts:  Expressivity required of registry  Content representation, index structures  Search, matching algorithms  Computational complexity of search, matching,...  Inference algorithms  Computational complexity of inference  Design / implementation / performance of metadata registries

th Open Forum on Metadata Registries, Kobe, Japan24 Loaded content metadatasets ● National Cancer Institute Thesaurus (NCIT) ● Defense Technology Information Center (DTIC) Thesaurus ● General Multilingual Environmental Thesaurus (GEMET) ● Adult Mouse Anatomical Dictionary ● EPA Terms of the Environment ● ISO 3166 Country Codes ● ISO 4217 Currency Codes

th Open Forum on Metadata Registries, Kobe, Japan25 Other Metadatasets of Interest ● NCI Cancer Data Standards Repository (caDSR) ● EPA Environmental Data Registry (EDR) ● NLM Uniform Medical Language System (UMLS) ● USGS Geographic Names Information System (GNIS) ● Integrated Taxonomic Information System (ITIS) ● NBII Biocomplexity Thesaurus ● ISO 639 Language Identifiers ● Logical Observations, Identifiers, Codes (LOINC) ● Getty Thesaurus of Geographical Names (TGN) ● NASA Semantic Web Earth and Environmental Terminologies (SWEET) ● Dublin Core Metadata (?)

th Open Forum on Metadata Registries, Kobe, Japan26 Conclusions ● XMDR Activities  ISO/IEC Revisions ● Support for ontologies, etc. ● Relationships ● Semantic types  Prototype Development  Content (characterization, loading, query)  Prototype testing, performance evaluation, demos

th Open Forum on Metadata Registries, Kobe, Japan27 Coming in Second Part of Talk (Kevin Keck) : ● Detailed discussion of the architecture and technology of the prototype...

th Open Forum on Metadata Registries, Kobe, Japan28 Acknowledgements ● Financial support from U.S. Dept. of Defense, U.S. Environmental Protection Agency ● In kind contributions from U.S. National Cancer Institute, Mayo Clinic, US Geological Survey ● Support from program managers: Nancy Lawler (DOD) and Sam Chance (DOD) ● Comments on drafts of this talk by John L. McCarthy

th Open Forum on Metadata Registries, Kobe, Japan29 Contact Information: ● Project:  ● Frank Olken:  Lawrence Berkeley National Laboratory   Tel:  URL: