Xiaogang Ma, John Erickson, Patrick West, Stephan Zednik, Peter Fox,

Slides:



Advertisements
Similar presentations
A Stepwise Modeling Approach for Individual Media Semantics Annett Mitschick, Klaus Meißner TU Dresden, Department of Computer Science, Multimedia Technology.
Advertisements

DCO-VIVO: A Collaborative Data Platform for the Deep Carbon Science Communities Han Wang 1 ( ), Yu Chen 1 Patrick West.
Evolving the BCO-DMO search interface - experience with semantic and smart search Cyndy Chandler (WHOI) Peter Fox (RPI and WHOI) Robert Groman, Dicky Allison.
Semantic Representation of Temporal Metadata in a Virtual Observatory Han Wang 1 Eric Rozell 1
Semantic Representation of Temporal Metadata in a Virtual Observatory Han Wang 1 Eric Rozell 1
CORDRA Philip V.W. Dodds March The “Problem Space” The SCORM framework specifies how to develop and deploy content objects that can be shared and.
Semantic Similarity Computation and Concept Mapping in Earth and Environmental Science Jin Guang Zheng Xiaogang Ma Stephan.
ToolMatch: Discovering What Tools can be used to Access, Manipulate, Transform, and Visualize Data Patrick West 1 Nancy Hoebelheinrich.
Key integrating concepts Groups Formal Community Groups Ad-hoc special purpose/ interest groups Fine-grained access control and membership Linked All content.
Provenance-Aware Faceted Search Deborah L. McGuinness 1,2 Peter Fox 1 Cynthia Chang 1 Li Ding 1.
Beyond a Data Portal: A Collaborative Environment for the Deep Carbon Science Communities Han Wang, Yu Chen, Patrick West, John Erickson, Xiaogang Ma,
Progress in Open-World, Integrative, Web-based Collaborative Research Platforms Peter Fox and the DCO-DS* Team Tetherless World Constellation.
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
Global Change Information System: Information Model and Semantic Application Prototypes (GCIS-IMSAP) Status 01/08/2013 Stephan Zednik 1, Curt Tilmes 2,
An Example in The DCO Data Portal Formal Specification of Data Types in the Deep Carbon Observatory Data Portal Xiaogang (Marshall) Ma
References: [1] [2] [3] Acknowledgments:
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
DCO's Data Science Day Introduction June 5, 2014, Troy NY Peter Fox (Rensselaer Polytechnic Institute)
Deep Carbon Observatory Data Science and Data Management Infrastructure Overview and Demonstration Patrick West – Tetherless World Constellation Rensselaer.
Catalog/ ID Selected Logical Constraints (disjointness, inverse, …) Terms/ glossary Thesauri “narrower term” relation Formal is-a Frames (properties) Informal.
TWC Adoption of RDA DTR and PID in Deep Carbon Observatory Data Portal Stephan Zednik, Xiaogang Ma, John Erickson, Patrick West, Peter Fox, & DCO-Data.
Modeling and Representing National Climate Assessment Information using Linked Data Jin Guang Zheng 1 Curt Tilmes 2
Adoption of RDA-DFT Terminology and Data Model to the Description and Structuring of Atmospheric Data Aaron Addison, Rudolf Husar, Cynthia Hudson-Vitale.
TWC Deep Earth Computer: A Platform for Linked Science of the Deep Carbon Observatory Community Xiaogang (Marshall) Ma, Yu Chen, Han Wang, Patrick West,
Prof. Peter #twcrpi) Tetherless World Constellation Chair, Earth and Environmental Science/ Computer Science/ Cognitive.
1 Semantic Provenance and Integration Peter Fox and Deborah L. McGuinness Joint work with Stephan Zednick, Patrick West, Li Ding, Cynthia Chang, … Tetherless.
Applying Provenance Extensions to OPeNDAP Framework Patrick West, James Michaelis, Tim Lebo, Deborah L. McGuinness Rensselaer Polytechnic Institute Tetherless.
Deepcarbon.net Xiaogang (Marshall) Ma, Yu Chen, Han Wang, John Erickson, Patrick West, Peter Fox Tetherless World Constellation Rensselaer Polytechnic.
TWC Adoption of RDA DTR and PID in Deep Carbon Observatory Data Portal Stephan Zednik, Xiaogang Ma, John Erickson, Patrick West, Peter Fox, & DCO-Data.
Resource Discovery for Extreme Scale Collaboration Benno Lee Patrick West 1 William Smith 2
Brief: Data Science Progress/ Activities and Renewal Plans DCO Executive Committee. Oct. 8-9, Rome (IT) DCO-DS = DCO Data Science.
DCO-VIVO: A Collaborative Data Platform for the Deep Carbon Science Communities Han Wang 1 ( ), Yu Chen 1 Patrick West.
VIVO Conference 2013 Panel on VIVO Use-Cases for Collaborative Science: From Researcher Networks to Semantic User Interfaces for Data Patrick West – Tetherless.
References: [1] Lebo, T., Sahoo, S., McGuinness, D. L. (eds.), PROV-O: The PROV Ontology. Available via: [2]
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Information Modeling and Semantic Web Application For National Climate Assessment Jin Guang Zheng 1 Curt Tilmes 2
Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.
Semantic Similarity Computation and Concept Mapping in Earth and Environmental Science Jin Guang Zheng Xiaogang Ma Stephan.
Determining Fitness-For-Use of Ontologies through Change Management, Versioning and Publication Best Practices Patrick West 1 Stephan.
Data Type Registries (DTR) RDA 4th WG/IG Collab Meeting NIST: Dec 2015 Larry Lannom CNRI.
 Key integrating concepts  Groups  Formal Community Groups  Ad-hoc special purpose/ interest groups  Fine-grained access control and membership 
DCO-DS: Moving Forward DCO Synthesis Meeting. Oct , 2015 DCO-DS = DCO Data Science.
Determining Fitness-For-Use of Ontologies through Change Management, Versioning and Publication Best Practices Patrick West 1 Stephan.
TWC A use case-driven iterative method for building a provenance-aware GCIS ontology Xiaogang Ma a, Jin Guang Zheng a, Justin Goldstein b,c, Linyun Fu.
Catalog/ ID Selected Logical Constraints (disjointness, inverse, …) Terms/ glossary Thesauri “narrower term” relation Formal is-a Frames (properties) Informal.
Deep Carbon Observatory Data Science and Data Management Infrastructure Overview and Demonstration Patrick West – Tetherless World Constellation Rensselaer.
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
TWC Adoption* of RDA DTR and PIT in the Deep Carbon Observatory Data Portal Xiaogang Ma, John Erickson, Patrick West, Stephan Zednik, Peter Fox, & the.
Data Typing BoF RDA Plenary 7 Tokyo: March 2016 Larry Lannom CNRI.
A Framework for Earth Science Search Interface Development Design and Implementation of S2S Presented by: Stephan Zednik, Tetherless World Constellation.
Bringing visibility to food security data results: harvests of PRAGMA and RDA Quan (Gabriel) Zhou, Venice Juanillas Ramil Mauleon, Jason Haga, Inna Kouper,
1 This slide indicated the continuous cycle of creating raw data or derived data based on collections of existing data. Identify components that could.
‘Ontology Management’ Peter Fox (Semantic Web Cluster lead)
WG Research Data Collections RDA P10 Montréal – September 2017
ACS 2016 Moving research forward with persistent identifiers
Ontology Evolution: A Methodological Overview
Stephan Zednik, Patrick West, Peter Fox Tetherless World Constellation
Stephan Zednik, Patrick West, Peter Fox Tetherless World Constellation
Deep Carbon Observatory Data Science Platform
Health Ingenuity Exchange - HingX
WG Research Data Collections An overview of the recommendation
Data types and persistent identifiers in
Semantic Annotation service
Agenda (AM) 9:30-10:15 Introduction to RDA
Modeling Data Set Versioning Operations
Adoption of RDA DTR and PIT in the Deep Carbon Observatory Data Portal
Agro Hackathon Hack 5: Agro Portal and VEST Registry
Science Data Platforms: Informatics Architectures at the Forefront.
Bird of Feather Session
1st Call for Collaboration Projects
Presentation transcript:

Adoption* of RDA DTR and PIT in the Deep Carbon Observatory Data Portal Xiaogang Ma, John Erickson, Patrick West, Stephan Zednik, Peter Fox, & the DCO-Data Science Team Tetherless World Constellation Rensselaer Polytechnic Institute *Funded by RDA/US (NSF) http://tw.rpi.edu/web/doc/20150921_slides_RDA_P6.pptx The Deep Carbon Observatory (DCO) community is building a cyber-enabled platform for linked science, made available to the community by a multi-institutional data portal. Persistent identifiers and domain specific data types have been identified as key technological issues the portal must address. This presentation focuses on the DCO portal’s planned adoption of RDA DTR and PID methodologies and technologies as a means to address the DCO community's need for persistently identifiable and understandable data type information.

DCO Research Requirements Each defined data type needs a stable and resolvable PID Provide semantics - meaning and context - to the defined data types Annotate datasets with one or more defined data types Each DCO-ID can help redirect to the Web profile of an object, where the detailed metadata can be found. In the DCO Data Portal each object is the instance of a class. The metadata items describing an instance are properties. All those classes and properties are organized by the DCO ontology. The classes and properties behind the instance and DCO-ID are comparable to the information types proposed in the RDA PIT works. The DCO-ID allows both humans and machines to retrieve metadata of any object deposited in the data portal. 2 DCO-ID as a mechanism of persistent identifier for both object registration and retrieval

Nature of efforts The DTR primitives are comparable to a list of BASIC DATA TYPE CLASSES in the DCO ontology, e.g. Dataset, Image, Video, Audio, etc. A registered DCO dataset is asserted as an instance of one of those basic data type classes. It is possible to further annotate the dataset with the SPECIFIC DATA TYPES defined within a DTR, and each data type has a unique PID. The primitives in a DTR are comparable to a list of BASIC DATA TYPE CLASSES in the DCO ontology, such as Dataset, Image, Video, and Audio, etc. A registered DCO dataset is asserted as an instance of one of those basic data type classes. It is possible to further annotate a registered dataset with the SPECIFIC DATA TYPES defined within a DTR, and each data type has a unique PID. 3

Results of data type specification Updates to the DCO Ontology: A new class dco:DataType. Each specific data type is an instance of it An object property dco:hasDataType linking a dataset and a data type A collection of other classes and properties associated with dco:DataType 4

Implementation of data type and DCO-ID The basic data type dco:dcoOntology rdf:type vivo:Dataset . Profile of the DCO Ontology. It is annotated with a data type. The specific data type dco:dcoOntology dco:hasDataType dco:RDF . 5

Profile of a registered data type dco:RDF a dco:DataType . dco:DataType a owl:Class . Each registered object, such as a data type, has a unique DCO-ID, which is resolvable by the global Handle System 6

A faceted browser for registered data types

Using Data Type as a facet in DCO dataset browser

Notable Machine accessibility and readability Given the DCO-ID of a data type, SPARQL queries can be sent to the triple store of the DCO data portal to retrieve information about the data type Such SPARQL queries can be derived from a query template that is tailored for data types To have a such a query template, further work may be needed to identify the metadata kernel of data type These also show the vision of a API for data type information

Conclusions The methodology of RDA DTR and PIT is highly implementable, especially in the environment of the Semantic Web. The technical framework in the current demonstration systems of DTR and PIT can be adapted or further extended for production uses. Initial good researcher response (they recognize their data types) Slides with backup detail at: http://tw.rpi.edu/web/doc/20150921_slides_RDA_P6.pptx Contact Marshall at max7@rpi.edu   Thank you! 11

Outline Background & Research questions Nature of efforts Approaches RDA-DTR, RDA-PIT, DCO Data Portal Nature of efforts Basic data type and Specific data type Approaches Integration architecture vs. Self-contained architecture DCO-ID Results and recommendation 12

Possible DCO-DTR Approaches An integration architecture DCO Data Portal is built on the VIVO platform DTR and DCO-VIVO as separate knowledge bases DCO-VIVO uses DTR API to access data type information A self-contained architecture To have the functionality of DTR completely within the DCO Data Portal Need to modify the DCO Ontology, e.g. add a class dco:DataType and collect properties associated with it We have worked on this approach 13

Background RDA - Data Type Registry (DTR) working group Addressed a core issue of data interoperability: to parse, understand, and reuse data retrieved from others RDA - Persistent Identifier Information Types (PIT) working group Addressed the essential types of information associated with persistent identifiers (PID) Deep Carbon Observatory (DCO) Data Portal Centrally-managed digital object identification, object registration, metadata management and knowledge graph curation. http://deepcarbon.net DCO anticipated a large number of digital object registrations, and therefore needed an appropriate mechanism to curate and reuse the registered information. 17

Nature of efforts The DTR primitives are comparable to a list of BASIC DATA TYPE CLASSES in the DCO ontology, e.g. Dataset, Image, Video, Audio, etc. The primitives in a DTR are comparable to a list of BASIC DATA TYPE CLASSES in the DCO ontology, such as Dataset, Image, Video, and Audio, etc. A registered DCO dataset is asserted as an instance of one of those basic data type classes. It is possible to further annotate a registered dataset with the SPECIFIC DATA TYPES defined within a DTR, and each data type has a unique PID. 18

Nature of efforts (cont.) A registered DCO dataset is asserted as an instance of one of those basic data type classes. The primitives in a DTR are comparable to a list of BASIC DATA TYPE CLASSES in the DCO ontology, such as Dataset, Image, Video, and Audio, etc. A registered DCO dataset is asserted as an instance of one of those basic data type classes. It is possible to further annotate a registered dataset with the SPECIFIC DATA TYPES defined within a DTR, and each data type has a unique PID. 19

Nature of efforts (cont.) It is possible to further annotate the dataset with the SPECIFIC DATA TYPES defined within a DTR, and each data type has a unique PID. The primitives in a DTR are comparable to a list of BASIC DATA TYPE CLASSES in the DCO ontology, such as Dataset, Image, Video, and Audio, etc. A registered DCO dataset is asserted as an instance of one of those basic data type classes. It is possible to further annotate a registered dataset with the SPECIFIC DATA TYPES defined within a DTR, and each data type has a unique PID. 20