Jena based Implementation of a ISO 11179 Meta-data Registry A. Anil Sinaci, SRDC 1 / 37.

Slides:



Advertisements
Similar presentations
Introduction The cancerGrid metadata registry (cgMDR) has proved effective as a lightweight, desktop solution, interoperable with caDSR, targeted at the.
Advertisements

Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
GSA Office of Intergovernmental Solutions Fostering a Collaborative Environment with Federal, State, Local and International Governments The Health IT.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Semantic Metadata Registry/Repository
Consistent and standardized common model to support large-scale vocabulary use and adoption Robust, scalable, and common API to reduce variation in clinical.
Status Report of the Study Group on MDR/MFI Implemenations ISO/IEC JTC 1/SC 32/WG2 Interim Meeting Santa Fe, NM, USA, November 11~15, 2013 Dongwon Jeong,
1 Metadata Registry Standards: A Key to Information Integration Jim Carpenter Bureau of Labor Statistics MIT Seminar June 3, 1999 Previously presented.
Direction of Proposals for New Edition (E3) of ISO/IEC 11179
ICT Monica Monachini – 1° KYOTO Workshop – Amsterdam 2/ KYOTO (ICT ) Yielding Ontologies for Transition-Based Organization Intelligent.
Edition 3 Metadata registry (MDR) Ray Gates May 12, /05/20151.
EleMAP: An Online Tool for Harmonizing Data Elements using Standardized Metadata Registries and Biomedical Vocabularies Jyotishman Pathak, PhD 1 Janey.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
Future of MDR - ISO/IEC Metadata Registries (MDR) Larry Fitzwater, SC 32 WG 2 Convener Computer Scientist U.S. Environmental Protection Agency May.
OpenMDR: Generating Semantically Annotated Grid Services Rakesh Dhaval Shannon Hastings.
SC32 WG2 Metadata Standards Tutorial Metadata Registries and Big Data WG2 N1945 June 9, 2014 Beijing, China.
OpenMDR: Alternative Methods for Generating Semantically Annotated Grid Services Rakesh Dhaval Shannon Hastings.
BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July 2005 Metadata Agents and Semantic Mediation Mikhaila Burgess Cardiff University.
Case Studies: Statistics Canada (WP 11) Alice Born Statistics UNECE Workshop on Statistical Metadata.
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. Towards Translating between XML and WSML based on mappings between.
Metadata Tools and Methods Chris Nelson Metanet Conference 2 April 2001.
9 th Open Forum on Metadata Registries Harmonization of Terminology, Ontology and Metadata 20th – 22nd March, 2006, Kobe Japan. XMDR Prototype Day: 21.
1 CSE 2102 CSE 2102 Ph.D. Proposal A Process Framework For Ontology Modeling, Design, And Development Realized By Extending OWL and ODM Candidate: Rishi.
Status report of : Framework for generating ontologies ISO/IEC JTC 1/SC 32/WG 2 Interim Meeting, Redwood City, USA, November 17, 2010 Dongwon Jeong,
Environmental Terminology Research in China HE Keqing, HE Yangfan, WANG Chong State Key Lab. Of Software Engineering
4 April 2007METIS Work Session1 Metadata Standards and Their Support of Data Management Needs Daniel W. Gillman Bureau of Labor Statistics Paul Johanis.
Classification and the Metadata Registry Judith Newton NIST IRS XML Stakeholders/ XML Working Group May 18, 2004.
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
Nancy Lawler U.S. Department of Defense ISO/IEC Part 2: Classification Schemes Metadata Registries — Part 2: Classification Schemes The revision.
Metadata Management Case Study Date: 10/21/2008 Dan McCreary President Dan McCreary & Associates (952) M D Metadata Solutions.
This material was developed by Duke University, funded by the Department of Health and Human Services, Office of the National Coordinator for Health Information.
Metadata Registries Workshop April 15, 1998 Slide 1 of 20 ANSI X Douglas D. Mann Stewardship Naming & Identification Classification.
Interfacing Registry Systems December 2000.
The Final Study Period Report on MFI 6: Model registration procedure SC32WG2 Meeting, Sydney May 26, 2008 H. Horiuchi, Keqing He, Doo-Kwon Baik SC32WG2.
The United States Health Information Knowledgebase: Federal/State Initiatives An AHRQ Research Project J. Michael Fitzmaurice, PhD, AHRQ Robin Barnes,
Interoperability Framework Overview Health Information Technology (HIT) Standards Committee June 24, 2010 Presented by: Douglas Fridsma, MD, PhD Acting.
H Using the Open Metadata Registry (OpenMDR) to generate semantically annotated grid services Rakesh Dhaval, MS, Calixto Melean,
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Value Set Resolution: Build generalizable data normalization pipeline using LexEVS infrastructure resources Explore UIMA framework for implementing semantic.
Issues for ISO/IEC : Procedure for the Specification of Web Ontology (PSO) ISO/IEC JTC 1/SC 32/WG 2 Interim Meeting London, UK, November 17, 2009.
12/03/ Second International Workshop on New Generation Enterprise and Business Innovation NGEBIS 2013 Semantic UBL-like documents for innovation.
9 th Open Forum on Metadata Registries Harmonization of Terminology, Ontology and Metadata 20th – 22nd March, 2006, Kobe Japan. Presentation Title: Day:
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
Query Health Concept-to-Codes (C2C) SWG Meeting #11 February 28,
Metadata Registries Registry: authoritative, centrally controlled store of information – W3C Web Services Glossary, 2004
A LexWiki-based Representation and Harmonization Framework for caDSR Common Data Elements Guoqian Jiang, Ph.D. Robert Freimuth, Ph.D. Harold Solbrig Mayo.
Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
Overview of SC 32/WG 2 Standards Projects Supporting Semantics Management Open Forum 2005 on Metadata Registries 14:45 to 15:30 13 April 2005 Larry Fitzwater.
Tutorial on XML Tag and Schema Registration in an ISO/IEC Metadata Registry Open Forum 2003 on Metadata Registries Tuesday, January 21, 2003; 4:45-5:30.
Extending the MDR for Semantic Web November 20, 2008 SC32/WG32 Interim Meeting Vilamoura, Portugal - Procedure for the Specification of Web Ontology -
ISO/IEC JTC 1/SC 32 Plenary and WGs Meetings Jeju, Korea, June 25, 2009 Jeong-Dong Kim, Doo-Kwon Baik, Dongwon Jeong {kjd4u,
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
Statistical Data and Metadata Exchange SDMX Metadata Common Vocabulary Status of project and issues ( ) Marco Pellegrino Eurostat
ISO TC37/SC4 N435 Nov 12, 2007 Presented by Miran Choi/ETRI Written by Jae Sung Lee/Chungbuk National Univ.
Extending the Metadata Registry for Semantic Web - Enforcing the MDR for supporting ontology concept - May 28, 2008 ISO/IEC JTC 1/SC 32 WG 2 Meeting Sydney,
Enable Semantic Interoperability for Decision Support and Risk Management Presented by Dr. David Li Key Contributors: Dr. Ruixin Yang and Dr. John Qu.
National Cancer Institute caDSR Briefing for Small Scale Harmonication Project Denise Warzel Associate Director, Core Infrastructure caCORE Product Line.
OASIS SET TC MeetingAugust 14, 2008 A Proposal for SET TC Requirements.
Collaborative Vocabulary Management
Achieving Semantic Interoperability of Cancer Registries
OPM/S: Semantic Engineering of Web Services
Federal Health IT Ontology Project (HITOP) Group
Networking and Health Information Exchange
Wsdl.
The Re3gistry software and the INSPIRE Registry
Metadata in the modernization of statistical production at Statistics Canada Carmen Greenough June 2, 2014.
2. An overview of SDMX (What is SDMX? Part I)
LOD reference architecture
Presentation transcript:

Jena based Implementation of a ISO Meta-data Registry A. Anil Sinaci, SRDC 1 / 37

About me PhD student at Senior Software Engineer at FP7 Projects: 2 / 37

Agenda Introduction Motivation FP7 – SALUS & BIVEE Projects Background ISO/IEC Common Data Elements Design & Implementation Use-case Summary 3 / 37

What is Meta-data? data about data… (deprecated) at design time the application contains no data Descriptive metadata Structural Metadata – data about the containers of data meta-data is data can be stored and managed meta-data registries 4 / 37

Importance of Meta-data Data Meta-data 5 / 37

Problem of Interoperability Syntactic vs. Semantic The ability to exchange information access The ability to use the information once it has been exchanged understand The figure is taken from a presentation of caSDR 6 / 37

Meta-data for Semantic Interoperability Precise knowledge about how data is structured More efficient and productive with a central, well-administered place to seek for meta-data Central, easily consumable Classifications with well-known terminology systems Build (or map) data models based on a common meta-model Patient Name Surname Birth Date Sex Patient Firstname Surname Date of Birth Gender MDR ISO/IEC / 37

Jena based ISO There are lots of MDR instances out there Most of them are based on ISO/IEC have the chance to interoperate semantically ISO/IEC ontology common vocabulary for meta-data level Manage all items, classifications, inter-relations and links to the external world (terminology systems, taxonomies, vocabularies) in a triple-store easily expose as RDF easily import as RDF 8 / 37

Interoperable through LOD Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. MDR RDFRDF RDFRDF RDFRDF TripleStore 9 / 37

Motivation The SALUS Project: Pharmacovigilance Current post-market safety surveillance and reporting activities are largely based on reports of suspected adverse drug reactions sent to the regulatory bodies 5% of all hospital admissions in Europe are due to an adverse drug reaction (ADR) ADRs are the fifth most common cause of hospital deaths drug withdrawals (eg. Vioxx) Interoperability between clinical care and clinical research domains  A semantic interoperability architecture based on commonly accepted data elements 10 / 37

SALUS Project Central & Semantic Meta-data Registry 11 / 37

Motivation The BIVEE Project: Business Innovation Management Software tools for the support of innovation & improvement management within Virtual Enterprises Document Centric Approach Identify document structures through common building blocks  common data elements CCTS – OASIS UBL based on ISO/IEC Production & Innovation Knowledge Repository  Semantic Descriptors Interoperability among business domains of collaborating partners within a Virtual Enterprise  An interoperability architecture based on commonly accepted semantic descriptors (meta-data) 12 / 37

BIVEE Project Business Processes KPIs Innovation related docs Production related docs SD Value Production Space F-PIKR Semantic Search/ Query/ Reasoning Semantic Search/ Query/ Reasoning Semantic Annotation Semantic Annotation KPIs Business Innovation Space SD Virtual Enterprise Environment BIVEE Ontologies (I-PIKR) Production Data External Resources SD Innovation Data VE Members (competencies) Central & Semantic Meta-data Registry 13 / 37

The requirement A clear need for a Common Data Element Repository to facilitate the semantic interoperability between different application domains to store the building blocks of data models of different domains and systems so that different data models are described through the aggregation and association of Common Data Elements should deal with several annotations and links to external world several vocabularies, classification schemes and terminology systems are currently in use for different domains should follow the characteristics of the Linked Data approach. 14 / 37

What is ISO/IEC ? Family of standards addressing the; Semantics of Data Representation of Data Registration of Data ISO/IEC is; Description of metadata in terms of Data Elements Procedures to manage registry of Data Elements 15 / 37

Parts of ISO/IEC Consists of 6 parts defining Framework for Specification Classification Registry Metamodel Formulations of Data Definitions Naming and Identification Principles Registration of Data Elements. 16 / 37

Purpose of ISO/IEC ISO/IEC is to promote Standard description of data Common understanding of data across organizational elements and between organizations Re-use and standardization of data over time, space, and applications Harmonization and standardization of data within an organization and across organizations Management of the components of data Re-use of the components of data 17 / 37

Benefits of ISO/IEC Similar CDE’s linked to same Concept’s; reduced search time All representations of a CDE can be shown together; increased filexibility CDE’s having same value domain can be shown together; easy administration of registry Concept of Object Class and Property; allows Linked Data representation Classification through External Vocabularies; allows Linked Data integration 18 / 37

Common Data Element Logical unit of data Belongs to one kind of information Set of attributes specifies; Identification Definition Representation Permissible value 19 / 37

Common Data Element Data Element Data Element Concept Object Class Property Value Domain Representation 20 / 37

Common Data Element Person Birth Date Value Person Birth Date Person Birth Date Birth Date Value Data Element Data type: Calendar Data Element Concept Object Class Property Value Domain The concept What? The representation How? 21 / 37

Common Data Element diagram adopted from Linked Data ICD9, ICD10 SNOMED CT LOINC RxNorm WHO ART MedDRA …. Linked Data Integration with other MDRs 22 / 37

Common Data Element Improves the quality of data Simplifies data sharing Knowledge sharing Promotes standard, consistent, universal data Ease of development data collection tools Data Interoperability between applications development teams enterprises …  All require precise definitions of data 23 / 37

ISO/IEC Implementations OneMeta MDR, Data Foundations Inc. extendible and configurable, commercial caDSR, US National Cancer Institute Extension to standard, persisted on RDBMS CCTS, UN/CEFACT Business data model standard based on UBL is an implementation of CCTS US National Information Exchange Model - NIEM 24 / 37

Organizations using ISO/IEC Australian Institute of Health and Welfare - METeOR US Department of Justice - Global Justice XML Data Model GJXDM US Environmental Protection Agency - Environmental Data Registry US Health Information Knowledgebase (USHIK) Ohio State University - open Metadata Repository (openMDR) Minnesota Department of Education Metadata Registry (K-12 Data) Minnesota Department of Revenue Property Taxation The Census Bureau Corporate Metadata Repository Statistics Canada Integrated MetaDataBase The Environmental Data Registry 25 / 37

Design & Implementation ISO/IEC Ontology 26 / 37

Ontology Design 27 / 37

Design & Implementation 28 / 37

Design & Implementation JENA RDF/OWL API 29 / 37

Use-case Once we have an implementation for a semantic MDR Need to populate with Common Data Elements Mining for CDEs Importers for different languages: XML Schema, UML, and ontology languages (RDFS/OWL) Other applications must be built on top of the semantic MDR New content models referring to the CDEs Matching and mapping  Strong reasoning Data Warehouses, Web Services, EHR Systems, Content Management Systems etc… 30 / 37

Use-case List all “ClassificationScheme”s List all “ObjectClass”es 31 / 37

Use-case Get all “Property”s of a Patient 32 / 37

Use-case List all “DataElement”s which are “classifiedBy” Myocardial Infarction (ClassificationSchemeItem) and Nifedipine (ClassificationSchemeItem) AND which have Allergy as “DataElementConcept” 33 / 37

Use-case II 34 / 37

Summary Meta-data Registry to facilitate Semantic Interoperability through Common Data Elements (CDE) For several different domains ISO/IEC based well-established and commonly accepted standard Pure triple-store implementation access through Jena API easy integration to Linked Data cloud together with other MDR implementations Importers for CDE identification XML Schema, UML (v1.x and v2.x), RDFS/OWL based ontologies Apache Wicket based Web interface 35 / 37

ISO/IEC Procedures for achieving metadata registry (MDR) content consistency formalized ontology generation with well-defined concepts MDRs (Sets of concepts) MDRs (Sets of concepts) realized build Metadata Registry (ISO/IEC 11179) Metadata Registry (ISO/IEC 11179) DEC CD DE OC... Our Proposal Web Ontology Scope of this Part utilized EDR (Environmental Data Registry) caDSR (US National Cancer Institute) METeO R (Metadata Online Registry) Process Manager Mapping Info. and Rulus 36 / 37

Questions Thank you for listening… Special thanks to A. Anil 37 / 37