SC32 WG2 Metadata Standards Tutorial Metadata Registries and Big Data WG2 N1945 June 9, 2014 Beijing, China.

Slides:



Advertisements
Similar presentations
Status Report of the Study Group on MDR/MFI Implemenations ISO/IEC JTC 1/SC 32/WG2 Interim Meeting Santa Fe, NM, USA, November 11~15, 2013 Dongwon Jeong,
Advertisements

1 Metadata Registry Standards: A Key to Information Integration Jim Carpenter Bureau of Labor Statistics MIT Seminar June 3, 1999 Previously presented.
Direction of Proposals for New Edition (E3) of ISO/IEC 11179
Edition 3 Metadata registry (MDR) Ray Gates May 12, /05/20151.
ICS (072)Database Systems: A Review1 Database Systems: A Review Dr. Muhammad Shafique.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
TC3 Meeting in Montreal (Montreal/Secretariat)6 page 1 of 10 Structure and purpose of IEC ISO - IEC Specifications for Document Management.
Data and Knowledge Management
Study Period Report: Metamodel for On Demand Model Selection (ODMS) Wang Jian, He Keqing, He Yangfan, Wang Chong State Key Lab of Software Engineering,
Data and Knowledge Management
Development Principles PHIN advances the use of standard vocabularies by working with Standards Development Organizations to ensure that public health.
Procedures to Develop and Register Data Elements in Support of Data Standardization September 2000.
Future of MDR - ISO/IEC Metadata Registries (MDR) Larry Fitzwater, SC 32 WG 2 Convener Computer Scientist U.S. Environmental Protection Agency May.
Final Report on MFI & MDR Harmonization Hajime Horiuchi May 2010 SC32WG2 N1425.
9 th Open Forum on Metadata Registries Harmonization of Terminology, Ontology and Metadata 20th – 22nd March, 2006, Kobe Japan. Presentation Title: A Multi-Metamodel.
Bridging : FGO and ISO/IEC JTC 1/SC 32/WG2 Interim Meeting Krakow, Poland, October 16, 2012 Dongwon Jeong, Kunsan National University
Chapter 6 System Engineering - Computer-based system - System engineering process - “Business process” engineering - Product engineering (Source: Pressman,
MFI Part-1: Reference Model 2 nd Edition Overview Co-editor: Hajime HORIUCHI Co-editor Keith GORDON For the discussion at Krakow: SC32WG2.
Metadata Tools and Methods Chris Nelson Metanet Conference 2 April 2001.
© 2010 TASC, Inc. | TASC Proprietary Laura J. Reece, Ph.D. for SOCoP workshop Dec 3, 2010 Standards Activities in Semantics and Ontologies.
Status report of : Framework for generating ontologies ISO/IEC JTC 1/SC 32/WG 2 Interim Meeting, Redwood City, USA, November 17, 2010 Dongwon Jeong,
Classification and the Metadata Registry Judith Newton NIST IRS XML Stakeholders/ XML Working Group May 18, 2004.
Baba Piprani (SICOM Canada) Robert Henkel (Transport Canada)
ISO Environmental management — Life cycle assessment — Data documentation format.
1 MFI-5: Metamodel for Process models registration HE Keqing, WANG Chong State Key Lab. Of Software Engineering, Wuhan University
2004 Open Forum for eBusiness and Metadata Technology Standardization Metamodel Framework for Ontology Keqing He, Yixin Jing, Yangfan He State Key Laboratory.
Metadata Models in Survey Computing Some Results of MetaNet – WG 2 METIS 2004, Geneva W. Grossmann University of Vienna.
Requirements for Standardization on the Service Registries ISO/IEC JTC1 SC /10/161 A comment to WSSG, JTC1 SC32WG2 N
ICS (072)Database Systems: An Introduction & Review 1 ICS 424 Advanced Database Systems Dr. Muhammad Shafique.
9 th Open Forum on Metadata Registries Harmonization of Terminology, Ontology and Metadata 20th – 22nd March, 2006, Kobe Japan. Presentation Title: Day:
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
Frank Farance, Farance Inc
Potential standardization items for the cloud computing in SC32 1 WG2 N1665 ISO/IEC JTC 1/SC 32 Plenary Meeting, Berlin, Germany, June 2012 Sungjoon Lim,
A Context Model based on Ontological Languages: a Proposal for Information Visualization School of Informatics Castilla-La Mancha University Ramón Hervás.
10/24/09CK The Open Ontology Repository Initiative: Requirements and Research Challenges Ken Baclawski Todd Schneider.
Ontology Mapping in Pervasive Computing Environment C.Y. Kong, C.L. Wang, F.C.M. Lau The University of Hong Kong.
"Would you tell me, please, which way I ought to go from here?” "That depends a good deal on where you want to get to," said the Cat. -Lewis Carroll: Alice’s.
Final Study Report on ROR May 2010 SC32WG2 Kunming, China Hajime Horiuchi SC32WG2-N1423.
Overview of SC 32/WG 2 Standards Projects Supporting Semantics Management Open Forum 2005 on Metadata Registries 14:45 to 15:30 13 April 2005 Larry Fitzwater.
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
Tutorial on XML Tag and Schema Registration in an ISO/IEC Metadata Registry Open Forum 2003 on Metadata Registries Tuesday, January 21, 2003; 4:45-5:30.
Extending the MDR for Semantic Web November 20, 2008 SC32/WG32 Interim Meeting Vilamoura, Portugal - Procedure for the Specification of Web Ontology -
ISO/IEC JTC 1/SC 32 Plenary and WGs Meetings Jeju, Korea, June 25, 2009 Jeong-Dong Kim, Doo-Kwon Baik, Dongwon Jeong {kjd4u,
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
1 Ontolog OOR-BioPortal Comparative Analysis Todd Schneider 15 October 2009.
Information Architecture The Open Group UDEF Project
IoT Meets Big Data Standardization Considerations
Extracting value from grey literature Processes and technologies for aggregating and analysing the hidden Big Data treasure of the organisations.
Statistical Data and Metadata Exchange SDMX Metadata Common Vocabulary Status of project and issues ( ) Marco Pellegrino Eurostat
ISO TC37/SC4 N435 Nov 12, 2007 Presented by Miran Choi/ETRI Written by Jae Sung Lee/Chungbuk National Univ.
Concept Proposal Sixth Open Forum on Metadata Registries Semantic Interoperability between Registries To be held January 20-24, 2003 Bruce Bargmeyer
Issues for Discussion on MFI-9 Wang Jian, He Keqing, Wang Chong, Feng Zaiwen, Fie He Wuhan University, China ISO/IEC JTC1/SC32/WG2 N1526.
International/Interagency Collaboration – IT for Environmental Information & Environmental Data Exchange Network Copenhagen, Denmark April 25, 2002 Bruce.
Extending the Metadata Registry for Semantic Web - Enforcing the MDR for supporting ontology concept - May 28, 2008 ISO/IEC JTC 1/SC 32 WG 2 Meeting Sydney,
Final Report on Harmonization of MFI & MDR and Disposition Hajime Horiuchi May18, 2011 SC32WG2 N1533-R1 SC32WG2.
Architecture Ecosystem SIG March 2010 Update Jacksonville FL.
Informatics for Scientific Data Bio-informatics and Medical Informatics Week 9 Lecture notes INF 380E: Perspectives on Information.
1 The XMSF Profile Overlay to the FEDEP Dr. Katherine L. Morse, SAIC Mr. Robert Lutz, JHU APL
SysML v2 Formalism: Requirements & Benefits
Workplan for Updating the As-built Architecture of the 2007 GEOSS Architecture Implementation Pilot Session 7B, 6 June 2007 GEOSS Architecture Implementation.
The Role of Ontologies for Mapping the Domain of Landscape Architecture An introduction.
knowledge organization for a food secure world
Report on Eighth Open Forum on Metadata Registries, Berlin, April 2005
MDR for the Semantic Web: Supporting Ontology Concept
Enterprise Data Model Enterprise Architecture approach Insights on application for through-life collaboration 2018 – E. Jesson.
Edition 3 Metadata registry (MDR)
Metadata The metadata contains
, editor October 8, 2011 DRAFT-D
Bird of Feather Session
ISO/IEC (MFI-6) Scope definition & Document Structure
Presentation transcript:

SC32 WG2 Metadata Standards Tutorial Metadata Registries and Big Data WG2 N1945 June 9, 2014 Beijing, China

WG2 Viewpoint Big Data magnifies the existing challenges and issues of managing and interpreting data.

Primary Scope of WG2 Metadata relevant to Big Data Scope: Standards for data management and interchange Within and among local and distributed information systems environments Goals: Facilitate interoperability Facilitate discovery, transformation, analysis and data integration Approach: Standardized data management facilities –Metadata Registration and management, naming conventions Structured semantics and syntax, use of ontologies and terminology to define meaning Reference models and frameworks – framework for registering and managing metadata Metadata describing the meaning and constrains of data fields – “Data Elements” – framework for registering models and mappings between models Ontologies, Role/Goal (actors), Processes, Services, Information Models, Form Design, mappings – common specification for logic languages Technical reports to support implementation MDR consistency MDR interoperability Other technical reports

Big Data Stakeholders* – Government – Commercial (Manufacturing and Distribution?) – Defense – Healthcare – Deep Learning – EcoSystem for Research – Astronomy and Physics – Earth and Environment – Polar Sciences – Energy * from JTC1 N0030

Metadata for Big Data Big Data What is the data about? Where is it? What is the structure? What do the data values mean? How was it created? Who is responsible for it? Is it suitable for use with program X? How “big” is it? How often is it updated? When was it last updated? Are there any publications related to the data? Who are the users of the data? Are there any services that can process the data? Where are those services located? What standards were used to produce or validate the data?

Gaps Identified by JTC1 1. Definitions, Vocabulary and Reference Architectures (e.g. system, data, platforms, online/offline, etc.) [for Big Data] Registration of Ontologies 2. Specifications and standardization of metadata including data provenance MDR 5. Domain-specific language and semantics of eventual consistency [for specific industry domains] Registration of Ontologies, Information Models, Mappings 7. General and domain specific ontologies and taxonomies for describing data semantics – Data Element Concepts, Value Domains; Information Models, Form Designs, Ontologies, Process registration, Service, Role/Goal (Actor)

Gaps Identified by JTC1 9. Remote, distributed, and federated analytics (taking the analytics to the data) including data and processing resource discovery – Ontologies, Process registration, Service, Role/Goal (Actor), On Demand Model selection 10. Data sharing and exchange – This is the main objective for and standards 12. Data analysis and mining – The use of structured metadata allow logic languages to reason over data and models facilitate use of structured metadata.

Volume The amount of data is sufficiently large to require special considerations. Large dataset, large individual data, high dimensionality, large memory requirements for analyzing the data Metadata for addressing above issues: – New project split: Registration of Dataset Metadata (metadta

Variety Variety of data represented in different formats (json, xml, sql, etc) – metadata is not relevant to different formats Variety between database structures for representing the data for similar universes of discourse Variety between information models for representing similar universes of discourse (different semantics and different way to represent semantics) Variety different terminology used in the data e.g. data = population of a city – “Soft Patterns” dependent on usage – What does it mean to be a city? How is the population defined/established/ what is meant by population? – Different meaning for the same term different context or universe of discourse for the same the term “city” can be a political entity, versus “city” geographic location Variety of data structure: structured (data represented in tables or objects e.g. SQL), unstructured (data that is a blob with not tags or external data model describing it, e.g. photo and freeform text), semi- structured (eg data that does not necessarily fully conform with an external data model, and/or may have tags embedded in the data, e.g. XML documents, JSON) Metadata standards help address above issue – registered Information models – registered data element – registered ontologies – Common Logic expressions of registered semantics (computable) and mappings between semantics/ontologies

Velocity Frequent changes: Where stale data is inappropriate; this refers to use cases where there is a high frequency of data updates e.g. continuous monitoring of environmental measurements or human data streams Rapid Response: A need for rapid response to changes/input data There are no existing relevant WG2 Metadata standards addressing these aspects of Velocity.

Veracity Uncertainty due to incompleteness, inconsistency, ambiguity Authenticity/truthfulness of the source of the data Accuracy and correctness of the data Metadata standards help address above issue – registered data elements – Common Logic to determine inconsistencies in the data and incompleteness

Value Data has perceived or quantifiable benefit to the organization Need to determine whether or not the data has value to the organization Metadata standards that help address above isseus. – RGPS registers the role and goal, and the process used to create the data – registered Process models – registered Service models – registered Role/Goal models – registered Ontology concepts – registered Information model – registered mappings between models – registered Form Designs – registered data element

Summary of WG2 Relevance to Big Data Understanding Big Data still requires an understanding of the meaning, structure and of data Existing WG2 standards help address big data metadata issues. WG2 standards can be extended to meet gaps Two new areas: – Dataset metadata – Provenance

ISO/IEC & families ISO/IEC specifies a Metadata Registry and associated procedures – Specifies metadata required to describe specific metadata items, such as data elements, data element concepts, conceptual domains, value domains and concept systems such as classification schemes; ISO/IEC specifies metamodels for registering various types of models and model mappings. It uses the registration procedures of ISO/IEC – Specifies metadata required to describe Ontologies, Information model, form designs, process models, service models, role and goal models

Questions and Discussion ???