TWC Adoption* of RDA DTR and PIT in the Deep Carbon Observatory Data Portal Xiaogang Ma, John Erickson, Patrick West, Stephan Zednik, Peter Fox, & the.

Presentation transcript:

Adoption* of RDA DTR and PIT in the Deep Carbon Observatory Data Portal
Xiaogang Ma, John Erickson, Patrick West, Stephan Zednik, Peter Fox, & the DCO-Data Science Team
Tetherless World Constellation, Rensselaer Polytechnic Institute
*Funded by RDA/US (NSF)

Outline
- Background
  - RDA-DTR, RDA-PIT, DCO Data Portal
  - DCO research requirements
- Approaches
  - Integration architecture vs. self-contained architecture
  - DCO-ID
- Nature of efforts
  - Basic data type and specific data type
  - Implementation
- Results and conclusions

Background
- RDA Data Type Registry (DTR) working group
  - Addressed a core issue of data interoperability: how to parse, understand, and reuse data retrieved from others
- RDA Persistent Identifier Information Types (PIT) working group
  - Addressed the essential types of information associated with persistent identifiers (PIDs)
- Deep Carbon Observatory (DCO) Data Portal
  - Centrally managed digital object identification, object registration, metadata management, and knowledge graph curation

DCO Research Requirements
- Each defined data type needs a stable and resolvable PID
- Provide semantics (meaning and context) for the defined data types
- Annotate datasets with one or more defined data types
DCO-ID serves as the persistent identifier mechanism for both object registration and retrieval.

Possible DCO-DTR Approaches
- An integration architecture
  - The DCO Data Portal is built on the VIVO platform
  - DTR and DCO-VIVO are separate knowledge bases
  - DCO-VIVO uses the DTR API to access data type information
- A self-contained architecture
  - The functionality of DTR sits completely within the DCO Data Portal
  - Requires modifying the DCO Ontology, e.g. adding a class dco:DataType and collecting the properties associated with it
  - We have worked on this approach

Nature of efforts
- The DTR primitives are comparable to a list of BASIC DATA TYPE CLASSES in the DCO ontology, e.g. Dataset, Image, Video, Audio, etc.
- A registered DCO dataset is asserted as an instance of one of those basic data type classes.
- It is possible to further annotate the dataset with the SPECIFIC DATA TYPES defined within a DTR, and each data type has a unique PID.

Results of data type specification
Updates to the DCO Ontology:
- A new class, dco:DataType; each specific data type is an instance of it
- An object property, dco:hasDataType, linking a dataset to a data type
- A collection of other classes and properties associated with dco:DataType

Implementation of data type and DCO-ID
The basic data type:
    dco:dcoOntology rdf:type vivo:Dataset .
The specific data type:
    dco:dcoOntology dco:hasDataType dco:RDF .
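
The statements above can be mimicked with a small in-memory triple set. This is only a sketch in plain Python, not the portal's implementation: the helper names `objects` and `describe` are hypothetical, and prefixed names are kept as plain strings rather than resolved to full IRIs.

```python
# Minimal in-memory triple set; prefixed names are kept as plain strings.
TRIPLES = {
    ("dco:dcoOntology", "rdf:type", "vivo:Dataset"),     # basic data type
    ("dco:dcoOntology", "dco:hasDataType", "dco:RDF"),   # specific data type
    ("dco:RDF", "rdf:type", "dco:DataType"),
    ("dco:DataType", "rdf:type", "owl:Class"),
}


def objects(subject, predicate, triples=TRIPLES):
    """Return the sorted objects of all triples matching subject and predicate."""
    return sorted(o for s, p, o in triples if s == subject and p == predicate)


def describe(dataset, triples=TRIPLES):
    """Collect the basic and specific data types asserted for a dataset."""
    return {
        "basic": objects(dataset, "rdf:type", triples),
        "specific": objects(dataset, "dco:hasDataType", triples),
    }


print(describe("dco:dcoOntology"))
```

Keeping the basic type (a class membership) separate from the specific type (an object property) mirrors the distinction the slides draw between the two.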

Profile of a registered data type
Each registered object, such as a data type, has a unique DCO-ID, which is resolvable through the global Handle System.
    dco:RDF a dco:DataType .
    dco:DataType a owl:Class .
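
Since a DCO-ID resolves through the Handle System, it can be turned into a resolvable URL via the public Handle proxy at hdl.handle.net. A minimal sketch follows; the function name `handle_url` and the sample identifier `12345/abcd-efgh` are hypothetical placeholders, not real DCO-IDs.

```python
from urllib.parse import quote

HANDLE_PROXY = "https://hdl.handle.net/"  # public Handle System proxy


def handle_url(dco_id: str) -> str:
    """Build a proxy URL for a handle of the form '<prefix>/<suffix>'."""
    dco_id = dco_id.strip()
    prefix, sep, suffix = dco_id.partition("/")
    if not (prefix and sep and suffix):
        raise ValueError(f"not a '<prefix>/<suffix>' handle: {dco_id!r}")
    # Percent-encode unsafe characters but keep the prefix/suffix separator.
    return HANDLE_PROXY + quote(prefix, safe=".") + "/" + quote(suffix, safe="/")


# Placeholder identifier, used only for illustration.
print(handle_url("12345/abcd-efgh"))
```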

A faceted browser for registered data types

Using Data Type as a facet in DCO dataset browser
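
Using data type as a facet amounts to grouping datasets by their dco:hasDataType values and counting each group. The sketch below rests on assumptions: the dataset records, the `ex:TypeB` data type, and the function names `facet_counts` and `filter_by_type` are all made up for illustration and do not come from the portal.

```python
from collections import Counter

# Hypothetical records: dataset name -> data types linked via dco:hasDataType.
DATASETS = {
    "dataset-a": ["dco:RDF"],
    "dataset-b": ["dco:RDF", "ex:TypeB"],
    "dataset-c": ["ex:TypeB"],
    "dataset-d": [],  # not yet annotated with a specific data type
}


def facet_counts(datasets):
    """Count how many datasets carry each data type (the facet values)."""
    counts = Counter()
    for types in datasets.values():
        counts.update(set(types))  # count each type once per dataset
    return dict(counts)


def filter_by_type(datasets, data_type):
    """Return the names of datasets annotated with the selected facet value."""
    return sorted(name for name, types in datasets.items() if data_type in types)


print(facet_counts(DATASETS))
print(filter_by_type(DATASETS, "dco:RDF"))
```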

Notable: machine accessibility and readability
- Given the DCO-ID of a data type, SPARQL queries can be sent to the triple store of the DCO Data Portal to retrieve information about that data type
- Such SPARQL queries can be derived from a query template tailored for data types
- Building such a query template may require further work to identify the metadata kernel of a data type
- Together, these point toward an API for data type information
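
One way such a query template might look is sketched below using Python's `string.Template`. Everything specific here is an assumption: the namespace IRI `http://example.org/dco#` is a placeholder, the property `dco:hasDcoId` is a hypothetical name for whatever links an object to its DCO-ID, and `build_query` is an illustrative helper, not part of the portal.

```python
from string import Template

# Hypothetical template: the namespace IRI and dco:hasDcoId are placeholders.
DATA_TYPE_QUERY = Template("""\
PREFIX dco: <$dco_ns>
SELECT ?type ?property ?value
WHERE {
  ?type a dco:DataType ;
        dco:hasDcoId "$dco_id" ;
        ?property ?value .
}
""")


def build_query(dco_id, dco_ns="http://example.org/dco#"):
    """Fill the template for one DCO-ID, rejecting characters unsafe in a literal."""
    if any(ch in dco_id for ch in '"\\\n\r'):
        raise ValueError("DCO-ID contains characters unsafe for a SPARQL literal")
    return DATA_TYPE_QUERY.substitute(dco_ns=dco_ns, dco_id=dco_id)


# Placeholder identifier, used only for illustration.
print(build_query("12345/abcd-efgh"))
```

The open `?property ?value` pattern returns everything asserted about the data type; once a metadata kernel is agreed on, it could be replaced with the specific kernel properties.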

Conclusions
- The methodology of RDA DTR and PIT is highly implementable, especially in a Semantic Web environment
- The technical framework in the current DTR and PIT demonstration systems can be adapted or further extended for production use
- Initial researcher response has been good (they recognize their data types)
Thank you!
