Ontology engineering for provenance enablement in the third National Climate Assessment

Xiaogang (Marshall) Ma 1, Jin Guang Zheng 1, Justin Goldstein 2,3, Steve Aulenbach 2,3, Curt Tilmes 3,4, Peter Fox 1

1 Tetherless World Constellation, Rensselaer Polytechnic Institute, Troy, NY 12180, USA; 2 University Corporation for Atmospheric Research, Boulder, CO 80301, USA; 3 U.S. Global Change Research Program, Washington, DC 20006, USA; 4 NASA Goddard Space Flight Center, Greenbelt, MD 20771, USA

Background and Motivation

Every four years, the U.S. Global Change Research Program (USGCRP) [1] produces a National Climate Assessment (NCA) that presents findings on global climate change and its impacts on the United States. The topic of global change builds on a huge collection of scientific research, which also generates provenance information about the entities, activities, and people involved in producing datasets, methods and findings. Capturing and presenting global change provenance, with links to the research papers, datasets, models, analyses, observations, satellites and other resources that support the key research findings in this domain, can increase understanding, credibility and trust of the assessment process and the resulting report, and aid in the reproducibility of results and conclusions. The USGCRP is now producing the third NCA report (NCA3) and is developing a Global Change Information System (GCIS) that will present the content of that report and its provenance, including the scientific support for the findings of the assessment. As the GCIS will be web-based, it provides a platform for representing the provenance information and implementing the results with semantic web technologies.

Method and Technology

We follow a use case-driven iterative development methodology [2]. The resulting GCIS will present provenance information through both a human-accessible website and a machine-readable interface for automated mining of the provenance graph. A use case illustrates an objective that a primary actor wants to accomplish and the sequence of interactions between the primary actor and a system through which that objective is achieved. A use case also establishes a context in which domain scientists and computer scientists can work together on a topic of interest. Key steps in the iterative methodology are described in Figure 1. On the technical side, we use the developing World Wide Web Consortium (W3C) PROV data model and ontology [3] to represent the provenance information in the GCIS.

Figure 1 Semantic Web methodology and technology development process [2]

Use Case 1: Visit data center website of dataset used to generate a report figure

A viewer wishes to identify the source of the data in a particular NCA3 figure. A reference to the paper in which the figure was originally published appears in the figure caption. Clicking that reference displays a page of information about the paper, including links to the datasets used in the paper. Following each of those links presents a page of information about the dataset, including links back to the agency or data center web page that describes the dataset in more detail and makes the actual data available for order or download. We collected the primary classes and relationships in this use case (Figure 2) and later adapted them into the GCIS ontology (Figure 5).

Figure 2 Classes and relationships recognized from Use Case 1

Use Case 2: Roles of people in the generation of a chapter in the NCA3 draft report

A reader sees that Chapter 6 (Agriculture) in the NCA3 draft report was written by a list of authors. On the title page of that chapter the reader can see the role of each author (convening lead author, lead author or contributing author) in the generation of the chapter. To make those roles machine-readable as well, we collected classes, instances and relationships in this use case (Figure 3) and adapted them into the GCIS ontology (Figure 5).

Figure 3 Roles of people in the writing of chapter 6 (Agriculture) in the NCA3 draft report. Here just three of the eight authors are shown. Each author had a specific role for this chapter.

Use Case 3: Provenance tracing of NASA contributions to Figure 1.2 in NCA3 draft report

A reader sees that Figure 1.2 “Sea Level Rise: Past, Present and Future” of the NCA3 draft report cites four data sources in the figure caption. Selecting the third citation displays a page of information about the paper and a citation to the dataset used in the paper. Clicking the citation link opens a page containing information about the dataset, including a statement that the dataset is derived from data produced by the TOPEX/Poseidon and Jason altimeter missions funded by NASA and CNES. Following the link for each of these missions presents a page about the platforms, instruments and sensors in that mission. To make this information both human- and machine-readable, we collected classes, instances and relationships in this use case (Figure 4) and adapted them into the GCIS ontology (Figure 5).

Figure 4 Platforms, instruments and sensors that contribute to Figure 1.2 in NCA3. Details of the Jason1 and Jason2 missions are omitted here.

Figure 5 Primary classes and relationships in current version of the GCIS ontology

Summary

The ongoing research concentrates on the provenance of the NCA3 report. Following the iterative development methodology, we have worked through a number of use cases to refine an ontology for describing entities, activities, agents and their inter-relationships in the NCA3 report. We also mapped those entities and relationships into the PROV-O ontology to give the provenance a formal representation. Several prototype systems have been developed that let users browse and search provenance information on topics of interest. In the future, the GCIS will collect and link records of publications, datasets, instruments, organizations, methods, people and more, eventually covering provenance information for the entire scope of global change.

Acknowledgments: We thank Stephan Zednik for his contributions to the earlier stage of the GCIS endeavor, and Ana Pinheiro Privette and Anne Waple for their comments on the GCIS.

Sponsors: National Science Foundation; University Corporation for Atmospheric Research; Tetherless World Constellation

References: [1] [2] [3]

Get the poster at:
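
The provenance tracing in Use Case 3 can be illustrated with a minimal Python sketch. It is not the GCIS implementation or the GCIS ontology: every `ex:` identifier is a hypothetical placeholder, and the choice of `prov:wasDerivedFrom` and `prov:wasGeneratedBy` (both real PROV-O properties) for each link is an assumption made only for illustration. The sketch stores provenance as subject-predicate-object triples and walks them from the report figure back to the missions.

```python
from collections import deque

# Hypothetical resources; only the prov: property names come from PROV-O.
# Which property links which pair is an illustrative assumption.
TRIPLES = {
    ("ex:nca3-figure-1.2", "prov:wasDerivedFrom", "ex:cited-paper-3"),
    ("ex:cited-paper-3", "prov:wasDerivedFrom", "ex:sea-level-dataset"),
    ("ex:sea-level-dataset", "prov:wasDerivedFrom", "ex:topex-poseidon-data"),
    ("ex:sea-level-dataset", "prov:wasDerivedFrom", "ex:jason-data"),
    ("ex:topex-poseidon-data", "prov:wasGeneratedBy", "ex:topex-poseidon-mission"),
    ("ex:jason-data", "prov:wasGeneratedBy", "ex:jason-mission"),
}

PROVENANCE_PREDICATES = ("prov:wasDerivedFrom", "prov:wasGeneratedBy")


def reachable(start, triples, predicates=PROVENANCE_PREDICATES):
    """Return every resource reachable from `start` by following the
    given provenance predicates (breadth-first), excluding `start`."""
    seen = {start}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for subject, predicate, obj in triples:
            if subject == node and predicate in predicates and obj not in seen:
                seen.add(obj)
                queue.append(obj)
    seen.discard(start)
    return seen


def origins(start, triples, predicates=PROVENANCE_PREDICATES):
    """Return the reachable resources that have no further provenance
    links of their own, i.e. the ends of the trace."""
    has_outgoing = {s for s, p, _ in triples if p in predicates}
    return {node for node in reachable(start, triples, predicates)
            if node not in has_outgoing}


if __name__ == "__main__":
    for resource in sorted(origins("ex:nca3-figure-1.2", TRIPLES)):
        print(resource)
```

In a real deployment the triples would more likely live in an RDF store and be queried with SPARQL; the in-memory set here only keeps the sketch self-contained.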