ToolMatch: Discovering What Tools can be used to Access, Manipulate, Transform, and Visualize Data Patrick West 1 Nancy Hoebelheinrich.

Slides:



Advertisements
Similar presentations
1 NASA CEOP Status & Demo CEOS WGISS-25 Sanya, China February 27, 2008 Yonsook Enloe.
Advertisements

Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
The North American Carbon Program Google Earth Collection Peter C. Griffith, NACP Coordinator; Lisa E. Wilcox; Amy L. Morrell, NACP Web Group Organization:
ESIP Semantic Web Working Group 2013 ESIP Winter Meeting 3:30PM EST, Wednesday, January 9.
DCO-VIVO: A Collaborative Data Platform for the Deep Carbon Science Communities Han Wang 1 ( ), Yu Chen 1 Patrick West.
Evolving the BCO-DMO search interface - experience with semantic and smart search Cyndy Chandler (WHOI) Peter Fox (RPI and WHOI) Robert Groman, Dicky Allison.
Semantic Representation of Temporal Metadata in a Virtual Observatory Han Wang 1 Eric Rozell 1
Semantic Representation of Temporal Metadata in a Virtual Observatory Han Wang 1 Eric Rozell 1
Experiences Developing a User- centric Presentation of A Domain- enhanced Provenance Data Model Cynthia Chang 1, Stephan Zednik 1, Chris Lynnes 2, Peter.
Applying Semantics in Dataset Summarization for Solar Data Ingest Pipelines James Michaelis ( ), Deborah L. McGuinness
Citation and Recognition of contributions using Semantic Provenance Knowledge Captured in the OPeNDAP Software Framework Patrick West 1
Semantic Similarity Computation and Concept Mapping in Earth and Environmental Science Jin Guang Zheng Xiaogang Ma Stephan.
ESIP Semantic Web Working Group 2013 ESIP Winter Meeting 3:30PM EST, Wednesday, January 9.
Key integrating concepts Groups Formal Community Groups Ad-hoc special purpose/ interest groups Fine-grained access control and membership Linked All content.
Linking Disparate Datasets of the Earth Sciences with the SemantEco Annotator Session: Managing Ecological Data for Effective Use and Reuse Patrice Seyed.
Rajashree Deka Tetherless World Constellation Rensselaer Polytechnic Institute.
Provenance-Aware Faceted Search Deborah L. McGuinness 1,2 Peter Fox 1 Cynthia Chang 1 Li Ding 1.
Beyond a Data Portal: A Collaborative Environment for the Deep Carbon Science Communities Han Wang, Yu Chen, Patrick West, John Erickson, Xiaogang Ma,
Configurable User Interface Framework for Cross-Disciplinary and Citizen Science Presented by: Peter Fox Authors: Eric Rozell, Han Wang, Patrick West,
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
Provenance Capture in Data Access And Data Manipulation Software Patrick West 1 Peter Fox
DM_PPT_NP_v01 SESIP_0715_AJ HDF Product Designer Aleksandar Jelenak, H. Joe Lee, Ted Habermann Gerd Heber, John Readey, Joel Plutchak The HDF Group HDF.
References: [1] [2] [3] Acknowledgments:
Catalog/ ID Selected Logical Constraints (disjointness, inverse, …) Terms/ glossary Thesauri “narrower term” relation Formal is-a Frames (properties) Informal.
Semantic Cyberinfrastructure for Knowledge and Information Discovery (SCiKID) Proposal Principle Investigator: Eric Rozell Tetherless World Constellation.
References: [1] Branch, B.D., Fosmire, M., The role of interdisciplinary GIS and data curation librarians in enhancing authentic scientific research.
Discovering accessibility, display, and manipulation of data in a data portal Nancy Hoebelheinrich Patrick West 2
Motivations and Challenges: Proper data management hinges on recording and maintaining “steps” applied to create data. Consumers require methods to assess.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
NEON non-specialist use case; Science data reuse in a classroom Peter Fox Brian Wee Patrick West 1
Modeling and Representing National Climate Assessment Information using Linked Data Jin Guang Zheng 1 Curt Tilmes 2
NEON non-specialist use case; Science data reuse in a classroom Peter Fox Brian Wee Patrick West 1
DOAP – Description of a Project Ontology DOAP provides us with the ability to represent software, software projects, releases of software, licensing information,
Citation and Recognition of contributions using Semantic Provenance Knowledge Captured in the OPeNDAP Software Framework Patrick West 1
1 Semantic Provenance and Integration Peter Fox and Deborah L. McGuinness Joint work with Stephan Zednick, Patrick West, Li Ding, Cynthia Chang, … Tetherless.
Applying Provenance Extensions to OPeNDAP Framework Patrick West, James Michaelis, Tim Lebo, Deborah L. McGuinness Rensselaer Polytechnic Institute Tetherless.
Deepcarbon.net Xiaogang (Marshall) Ma, Yu Chen, Han Wang, John Erickson, Patrick West, Peter Fox Tetherless World Constellation Rensselaer Polytechnic.
TWC Adoption of RDA DTR and PID in Deep Carbon Observatory Data Portal Stephan Zednik, Xiaogang Ma, John Erickson, Patrick West, Peter Fox, & DCO-Data.
M.Benno Blumenthal and John del Corral International Research Institute for Climate and Society OpenDAP 2007
ToolMatch Discovering What Tools can be used to Access, Manipulate, Transform, and Visualize Data Products Patrick West 1 Nancy Hoebelheinrich.
Semantic Technologies and Application to Climate Data M. Benno Blumenthal IRI/Columbia University CDW /04-01.
Resource Discovery for Extreme Scale Collaboration Benno Lee Patrick West 1 William Smith 2
Coding Provenance in Software and Matching Tools to Data OPeNDAP Provenance Project And ESIP ToolMatch Project Patrick West, Tetherless World Constellation.
GEON2 and OpenEarth Framework (OEF) Bradley Wallet School of Geology and Geophysics, University of Oklahoma
The VIRTUAL SOLAR-TERRESTRIAL OBSERVATORY - Exploring paradigms for interdisciplinary data-driven science Peter Fox 1 Don Middleton 2,
DCO-VIVO: A Collaborative Data Platform for the Deep Carbon Science Communities Han Wang 1 ( ), Yu Chen 1 Patrick West.
VIVO Conference 2013 Panel on VIVO Use-Cases for Collaborative Science: From Researcher Networks to Semantic User Interfaces for Data Patrick West – Tetherless.
References: [1] Lebo, T., Sahoo, S., McGuinness, D. L. (eds.), PROV-O: The PROV Ontology. Available via: [2]
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Information Modeling and Semantic Web Application For National Climate Assessment Jin Guang Zheng 1 Curt Tilmes 2
Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.
Semantic Similarity Computation and Concept Mapping in Earth and Environmental Science Jin Guang Zheng Xiaogang Ma Stephan.
A Semantic Web Approach for the Third Provenance Challenge Tetherless World Rensselaer Polytechnic Institute James Michaelis, Li Ding,
Determining Fitness-For-Use of Ontologies through Change Management, Versioning and Publication Best Practices Patrick West 1 Stephan.
 Key integrating concepts  Groups  Formal Community Groups  Ad-hoc special purpose/ interest groups  Fine-grained access control and membership 
Determining Fitness-For-Use of Ontologies through Change Management, Versioning and Publication Best Practices Patrick West 1 Stephan.
Supported by ESIP Semantic Web Cluster A service based on community-built semantic web applications Provide users with the means to match their datasets.
Catalog/ ID Selected Logical Constraints (disjointness, inverse, …) Terms/ glossary Thesauri “narrower term” relation Formal is-a Frames (properties) Informal.
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
Social and Personal Factors in Semantic Infusion Projects Patrick West 1 Peter Fox 1 Deborah McGuinness 1,2
TWC Adoption* of RDA DTR and PIT in the Deep Carbon Observatory Data Portal Xiaogang Ma, John Erickson, Patrick West, Stephan Zednik, Peter Fox, & the.
Poster: EGU Glossary: USGCRP – United States Global Change Research Program NCA – National Climate Assessment GCIS – Global Change Information.
Get the poster at Semantic Visualization Provenance Records:
Provenance Capture in Data Access And Data Manipulation Software
Xiaogang Ma, John Erickson, Patrick West, Stephan Zednik, Peter Fox,
Modeling Data Set Versioning Operations
ToolMatch Service: Finding Tools for Your Data & Data for Your Tools ESIP Summer 2014 A Collaboration between ESIP’s: Semantic Web Cluster & Product &
ToolMatch Discovering What Tools can be used to Access, Manipulate, Transform, and Visualize Data Products Patrick West1 Nancy
HDF-EOS Workshop XXI / The 2018 ESIP Summer Meeting
Modeling Data Set Versioning Operations
Presentation transcript:

ToolMatch: Discovering What Tools can be used to Access, Manipulate, Transform, and Visualize Data Patrick West 1 Nancy Hoebelheinrich Peter Fox Christopher Lynnes 3 ( 2 Tetherless World Constellation, Rensselaer Polytechnic Institute th St., Troy, NY, United States) ( 1 Knowledge Motifs, San Mateo, CA, United States) ( 3 Goddard Space Flight Center, NASA, Greenbelt, MD, United States) Abstract Glossary: CMAP/COE – Concept Mapping Application Ontology Editor, built on top of the IHMC CmapTools concept mapping software ESIP – Earth Science Information Partners ( FOAF - Friend of a Friend ( O&M – Observations and Measurements ( OWL – Web Ontology Language RDFs – Resource Description Framework Schema RPI/TWC – Rensselaer Polytechnic Institute / Tetherless World Constellation ( SADL – Semantic Application Design Language ( SPARQL – Simple Protocol and RDF Query Language Acknowledgments: Eric Rozell, Master’s Graduate of Rensselaer Polytechnic institute The accessibility of science data products is becoming increasingly easier, with more and more data and scientific community portals coming online all the time. But what can one do with the data product once it has been found? Can I visualize the data product as a map, plot, or graph? Can I import the data into a particular data manipulation tool like MatLab or IDL or iPython Notebook? How is the dataset accessible, and what kind of data products can be generated from it? ToolMatch is a crowd source approach (ontological model, information model, RDF Schema) that allows data and tool providers, and portal developers to enable user discovery of what can be done with a science data product, or conversely, which science data products are usable within a given tool. Example queries may include "I need data for Carbon dioxide (CO2) concentrations, a climate change indicator, for the summer of 2012, that can be accessed via OPeNDAP Hyrax and plotted as a timeseries.", or "I need data with measurements of atmospheric aerosol optical depth sliced along latitude and longitude, returned as netcdf data, and accessible in MatLab." This contribution outlines the progress of the ToolMatch development, plans for utilizing its capabilities, and efforts to leverage and enhance the use of ToolMatch in various portals. ToolMatch Description Tools To facilitate a crowd source approach for domain experts who are not ontologists, we develop an ontological model using one of the open source, English-like languages available that can help us. We develop an ontology that can help us: Determine what storage format a data collection is in, i.e. NetCDF4, HDF4 Determine what conventions the metadata follow Determine the types of information stored in the file Determine the type of server the data collection can be accessed from From this information we can then: Infer the various tools available that can visualize the given data collection Proposed Solution Resulting query The resulting query to find the set of tools available to visualize a data collection becomes very simple SELECT ?tool WHERE { toolmatch:visualizedBy ?tool. ?tool rdf:type toolmatch:Tool. } SADL is an open source language designed for domain experts who are not ontologists, but are still interested in building formal models of an OWL ontology, testing the validity of the models, expressing rules using ontological concepts, and retrieving information via ontologically based queries. SADL is designed to be English-like, and was used in an Eclipse-Indigo IDE for this project. From the SADL file we can: generate an rdf/xml file import the rdf/xml file into CMAP/COE to generate a relationship diagram import the rdf/xml file into our triple store and run the inferences over the information. The resulting information displayed to the user allows them to decide how best to visualize this information Relationship Diagram * Equivalent Class DataCollection and (isAccessedBy value OPeNDAP) or (hasDataStorageFormat value NetCDF) and (usesGridType value AuxiliaryLatLonGrid) or (usesGridType value RegularLatLonGrid) and usesConvention value ClimateForecast_CF * Subclass Of mappedBy value IDV and mappedBy value McIDAS-V and mappedBy value Panoply Inference * Equivalent Class DataCollection and (isAccessedBy value OPeNDAP) or (hasDataFormat value NetCDF) and usesConvention value CF1Convention and usesConvention value RegularLatLonGrid * Subclass Of mappedBy value Ferret and mappedBy value GrADS * Equivalent Class DataCollection and (isAccessedBy value GrADSDataServer) or (isAccessedBy value Hyrax) or (isAccessedBy value ThreddsDataServer) or (isAccessedBy value erddap) * Subclass Of isAccessedBy value OPeNDAP Next Steps Complete the Tools Concept Modeling Integrate the Model and Knowledge Information into an existing Virtual Observatory Feedback from the ESIP Community on the Information Model Crowdsourcing, getting people and groups to contribute to the knowledge store Poster