Presenting Provenance Based on User Roles Experiences with a Solar Physics Data Ingest System Patrick West, James Michaelis, Peter Fox, Stephan Zednik,

Slides:



Advertisements
Similar presentations
Geoinformatics 2008 Fox Semantic Provenance 1 Semantic Provenance for Image Data Processing Peter Fox (HAO/ESSL/NCAR) Deborah McGuinness (RPI) Jose Garcia,
Advertisements

April 24, 2007McGuinness NIST Interoperability Week Ontology Summit Semantic Web Perspective Deborah L. McGuinness Acting Director & Senior Research Scientist.
Ontology and Application for Reusable Search Interface Design Plans for Advanced Semantic Technologies Final Project Eric Rozell, Tetherless World Constellation.
McGuinness – Microsoft eScience – December 8, Semantically-Enabled Science Informatics: With Supporting Knowledge Provenance and Evolution Infrastructure.
Provenance-aware faceted search Peter Fox Stephan Zednik Patrick West Tetherless World Constellation, RPI EGU 2010.
Semantic Representation of Temporal Metadata in a Virtual Observatory Han Wang 1 Eric Rozell 1
Information Fusion: Moving from domain independent to domain literate approaches Professor Deborah L. McGuinness Tetherless World Constellation, Rensselaer.
Semantic Representation of Temporal Metadata in a Virtual Observatory Han Wang 1 Eric Rozell 1
Experiences Developing a User- centric Presentation of A Domain- enhanced Provenance Data Model Cynthia Chang 1, Stephan Zednik 1, Chris Lynnes 2, Peter.
Applying Semantics in Dataset Summarization for Solar Data Ingest Pipelines James Michaelis ( ), Deborah L. McGuinness
Citation and Recognition of contributions using Semantic Provenance Knowledge Captured in the OPeNDAP Software Framework Patrick West 1
Emerging Trends in Provenance Deborah L. McGuinness Tetherless World Constellation Chair Rensselaer Polytechnic Institute SWPM Workshop at ISWC November.
Linked Data and the Provenance Explosion Deborah L. McGuinness Tetherless World Constellation Chair Professor of Computer Science and Cognitive Science.
Semantic Similarity Computation and Concept Mapping in Earth and Environmental Science Jin Guang Zheng Xiaogang Ma Stephan.
ToolMatch: Discovering What Tools can be used to Access, Manipulate, Transform, and Visualize Data Patrick West 1 Nancy Hoebelheinrich.
Key integrating concepts Groups Formal Community Groups Ad-hoc special purpose/ interest groups Fine-grained access control and membership Linked All content.
Web Explanations for Semantic Heterogeneity Discovery Pavel Shvaiko 2 nd European Semantic Web Conference (ESWC), 1 June 2005, Crete, Greece work in collaboration.
Provenance-Aware Faceted Search Deborah L. McGuinness 1,2 Peter Fox 1 Cynthia Chang 1 Li Ding 1.
Configurable User Interface Framework for Cross-Disciplinary and Citizen Science Presented by: Peter Fox Authors: Eric Rozell, Han Wang, Patrick West,
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
SemantAqua: A Semantically-Enabled Provenance-Aware Water Quality Portal Evan W. Patton, Ping Wang, Jin Guang Zheng, Timothy Lebo, Li Ding, Joanne Luciano,
Web Science Trust Network RPI Web Science Research Center Kickoff Meeting Fall 2012 August 28, PM Winslow 1140.
References: [1] [2] [3] Acknowledgments:
Catalog/ ID Selected Logical Constraints (disjointness, inverse, …) Terms/ glossary Thesauri “narrower term” relation Formal is-a Frames (properties) Informal.
Semantic Cyberinfrastructure for Knowledge and Information Discovery (SCiKID) Proposal Principle Investigator: Eric Rozell Tetherless World Constellation.
Discovering accessibility, display, and manipulation of data in a data portal Nancy Hoebelheinrich Patrick West 2
A Semantically-Enabled Provenance- Aware Water Quality Portal Joint work with: Jin Guang Zheng, Ping Wang, Evan Patton, Timothy Lebo, Joanne Luciano Deborah.
TWC Adoption of RDA DTR and PID in Deep Carbon Observatory Data Portal Stephan Zednik, Xiaogang Ma, John Erickson, Patrick West, Peter Fox, & DCO-Data.
Semantically-Enabled Science Data Integration (SESDI) and The Virtual Solar-Terrestrial Observatory (VSTO) Semantically-enabled (large-scale) Scientific.
NEON non-specialist use case; Science data reuse in a classroom Peter Fox Brian Wee Patrick West 1
Modeling and Representing National Climate Assessment Information using Linked Data Jin Guang Zheng 1 Curt Tilmes 2
NEON non-specialist use case; Science data reuse in a classroom Peter Fox Brian Wee Patrick West 1
Citation and Recognition of contributions using Semantic Provenance Knowledge Captured in the OPeNDAP Software Framework Patrick West 1
1 Advanced Semantic Technologies Prof. Deborah McGuinness and Dr. Patrice Seyed CSCI CSCI ITWS ITWS TA: Justin.
1 Semantic Provenance and Integration Peter Fox and Deborah L. McGuinness Joint work with Stephan Zednick, Patrick West, Li Ding, Cynthia Chang, … Tetherless.
Applying Provenance Extensions to OPeNDAP Framework Patrick West, James Michaelis, Tim Lebo, Deborah L. McGuinness Rensselaer Polytechnic Institute Tetherless.
Deepcarbon.net Xiaogang (Marshall) Ma, Yu Chen, Han Wang, John Erickson, Patrick West, Peter Fox Tetherless World Constellation Rensselaer Polytechnic.
TWC Adoption of RDA DTR and PID in Deep Carbon Observatory Data Portal Stephan Zednik, Xiaogang Ma, John Erickson, Patrick West, Peter Fox, & DCO-Data.
Resource Discovery for Extreme Scale Collaboration Benno Lee Patrick West 1 William Smith 2
ICCS WSES BOF Discussion. Possible Topics Scientific workflows and Grid infrastructure Utilization of computing resources in scientific workflows; Virtual.
Semantically-Enabled Virtual Observatories: VSTO Highlights for Observational Data Deborah McGuinness Acting Director and Senior Research Scientist Knowledge.
The VIRTUAL SOLAR-TERRESTRIAL OBSERVATORY - Exploring paradigms for interdisciplinary data-driven science Peter Fox 1 Don Middleton 2,
VIVO Conference 2013 Panel on VIVO Use-Cases for Collaborative Science: From Researcher Networks to Semantic User Interfaces for Data Patrick West – Tetherless.
Applications and Requirements for Scientific Workflow Introduction May NSF Geoffrey Fox Indiana University.
Realities in Science Data and Information - Let's go for translucency AGU FM10 IN13B-02 Peter Fox (RPI) Tetherless World.
Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.
Semantic Similarity Computation and Concept Mapping in Earth and Environmental Science Jin Guang Zheng Xiaogang Ma Stephan.
A Semantic Web Approach for the Third Provenance Challenge Tetherless World Rensselaer Polytechnic Institute James Michaelis, Li Ding,
Determining Fitness-For-Use of Ontologies through Change Management, Versioning and Publication Best Practices Patrick West 1 Stephan.
 Key integrating concepts  Groups  Formal Community Groups  Ad-hoc special purpose/ interest groups  Fine-grained access control and membership 
Explainable Adaptive Assistants Deborah L. McGuinness, Tetherless World Constellation, RPI Alyssa Glass, Stanford University Michael Wolverton, SRI International.
Determining Fitness-For-Use of Ontologies through Change Management, Versioning and Publication Best Practices Patrick West 1 Stephan.
Catalog/ ID Selected Logical Constraints (disjointness, inverse, …) Terms/ glossary Thesauri “narrower term” relation Formal is-a Frames (properties) Informal.
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
Information Model Driven Semantic Framework Architecture and Design for Distributed Data Repositories AGU 2011, IN51D-04 December 9, 2011 Peter Fox (RPI)
Social and Personal Factors in Semantic Infusion Projects Patrick West 1 Peter Fox 1 Deborah McGuinness 1,2
TWC Adoption* of RDA DTR and PIT in the Deep Carbon Observatory Data Portal Xiaogang Ma, John Erickson, Patrick West, Stephan Zednik, Peter Fox, & the.
A Framework for Earth Science Search Interface Development Design and Implementation of S2S Presented by: Stephan Zednik, Tetherless World Constellation.
Annotating and Embedding Provenance in Science Data Repositories to Enable Next Generation Science Applications Deborah L. McGuinness.
The Semantic eScience Framework AGU FM10 IN22A-02 Deborah McGuinness and Peter Fox (RPI) Tetherless World Constellation.
Ontology and Application for Reusable Search Interface Design Plans for Advanced Semantic Technologies Final Project Eric Rozell, Tetherless World Constellation.
Poster: EGU Glossary: USGCRP – United States Global Change Research Program NCA – National Climate Assessment GCIS – Global Change Information.
Get the poster at Semantic Visualization Provenance Records:
improve the efficiency, collaborative potential, and
Xiaogang Ma, John Erickson, Patrick West, Stephan Zednik, Peter Fox,
Stephan Zednik, Patrick West, Peter Fox Tetherless World Constellation
Stephan Zednik, Patrick West, Peter Fox Tetherless World Constellation
Data types and persistent identifiers in
Adoption of RDA DTR and PIT in the Deep Carbon Observatory Data Portal
Science Data Platforms: Informatics Architectures at the Forefront.
Presentation transcript:

Presenting Provenance Based on User Roles Experiences with a Solar Physics Data Ingest System Patrick West, James Michaelis, Peter Fox, Stephan Zednik, Deborah McGuinness – Tetherless World Constellation ( – Rensselaer Polytechnic Institute ( AGUFM2010-IN43C-05 1

Outline of Presentation Prior Work in Selective Provenance Presentation Rationale for User Roles in Presentation Our Focus Area: Semantic Provenance Capture in Data Ingest Systems (SPCDIS) Advanced Coronal Observing System (ACOS) Applying user roles to provenance 2

Prior Work Significant prior work on provenance views + abstractions (Moreau, 2009) Two kinds approaches: Expanding Abstract Provenance (Hunter, 2007) Start with abstract provenance, expand to fine grained Abstracting Fine Grained Provenance (Davidson, 2008) Start with fine-grained, select desired components, then abstract away unwanted detail Common goal: manage complexity of provenance 3

Complexity 4

Kinds of Users In context of a Solar Physics Data System, two kinds of expertise: Scientific (Astro/Solar Physics) Technical (Pipeline + components) Kinds of Users: Project coordinators Knowledgeable in both science and technical Outside Domain Experts Citizen Scientists 5

Rationale for User Roles Different backgrounds for different users E.g., Domain Experts versus Citizen Scientists Abstract -> Fine-grained: can be time intensive process Fine-grained -> Abstract: requires background to know what you’re looking for Key idea: Initial presentation of provenance components can be important for end-users Finer grained components for experts Abstract components for novices 6

Multiple-domain knowledgebase Objective: Use Semantic Web technologies to combine provenance from different sources in an interoperable fashion. 7 Provenance Ontology Solar Physics Domain Data Processing Domain Extension of work on Virtual Solar Terrestrial Observatory Good/Bad/Ugly (GBU) ratings, Trust, Quality flags Proof Markup Language (PML)

8 Advanced Coronal Observing System Mauna Loa Solar Observatory (MLSO) Hawaii Intensity Images (GIF) Raw Image Data Captured by CHIP Chromospheric Helium-I Image Photometer Raw Data Capture National Center for Atmospheric Research (NCAR) Data Center. Boulder, CO Velocity Images (GIF) Follow-up Processing on Raw Data Quality Checking (Images Graded: GOOD, BAD, UGLY) Publishes

Provenance View – Citizen Scientist 9 Data Capture (MLSO) Data Processing (NCAR) Quality Check (NCAR) Good/Bad/Ugl y Rating Raw Image Data Calibrated Image Data

Provenance View – Domain Expert Data Capture (MLSO) Flat Field Calibration Good/Bad/Ugly Rating Hot Pixel Correction Centering/Trimming /Clipping Compute Sample Means Determine Test Channel Assign GBU Rating Data Processing Quality Check 10

Use Cases Different users wish to get overview of provenance for quality rating. Citizen Scientist: Sees high-level provenance. Wishes to know more about how Good/Bad/Ugly rating created Expands Quality Check node. Domain Expert: Starts with fine-grained provenance view, generates abstraction exposing quality check processes: Compute Sample Means Determine Test Channel Assign Good/Bad/Ugly Rating 11

Applying user roles Semantic Web (RDFS/OWL) Ontologies for defining domain knowledge needed. Specifically for defining: Workflow components. User roles. Component-Role Mapping. 12 RDFs – Resource Description Framework schema OWL – Web Ontology Language

Ongoing Issues Some inherent challenges Deciding on how to map components to roles. Will a given user necessarily fit into one of the pre- defined roles? Key research question pursued For preserving provenance interface usability, what a good middle ground between: Going from abstract to fine-grained provenance As well as fine-grained to abstract provenance 13

Summary Managing complexity is an important activity for presenting provenance. Just providing drill-down from abstract to more detailed views or fine-grained selection is not enough. The user can be provided an initial presentation of content based on their level of knowledge, from general interest to domain expert. What is needed is an approach that provides the right level of initial explanation based on the user’s role. 14

References L. Moreau, “The foundations for provenance on the web.” K. Cheung, J. Hunter, and Lashtabeg, A. and J. Drennan “SCOPE: a scientific compound object publishing and editing system.” International Journal of Digital Curation, 3(2), S. Cohen-Boulakia, O. Biton, S. Cohen and S. Davidson “Addressing the provenance challenge using ZOOM.” Concurrency and Computation: Practice and Experience, 20(5), p ,