Understanding the utility and fitness of Workflow Provenance for Experiment Reporting. Pınar Alper, Supervisor: Carole A. Goble

1 Understanding the utility and fitness of Workflow Provenance for Experiment Reporting. Pınar Alper, Supervisor: Carole A. Goble

2 Research, Reporting
[Figure: research and reporting diagram. Labels: Local Data, Local Tool, Analysis, Results, Data, Tool. Steps: select, recollect, share, package, publish]
Reporting tasks:
– Build a citation string
– Package results by origin
– Document important run parameters
C. Tenopir, S. Allard, et al. Data sharing by scientists: Practices and perceptions. PLoS ONE, 6(6):e21101, June 2011.

3 Provenance we have
– WF description (prospective)
– Execution provenance (retrospective)
Generic information: data artefacts, consumption/production relations, execution times/status
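
The two kinds of provenance on this slide can be pictured as plain records. The following is a minimal Python sketch, not the representation used by any particular workflow system: the class names `Step` and `Run`, the helper `lineage`, and the two-step example workflow are all hypothetical and chosen only for illustration. It shows how consumption/production relations recorded at execution time let you trace a result back to the data it was derived from.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass
class Step:
    """Prospective provenance: a step declared in the workflow description."""
    name: str
    inputs: list[str]
    outputs: list[str]


@dataclass
class Run:
    """Retrospective provenance: one recorded execution of a step."""
    step: str
    used: list[str]        # data artefacts consumed
    generated: list[str]   # data artefacts produced
    started: datetime
    ended: datetime
    status: str = "completed"


def lineage(artefact: str, runs: list[Run]) -> set[str]:
    """Return every artefact that `artefact` was transitively derived from."""
    producers = {out: run for run in runs for out in run.generated}
    seen: set[str] = set()
    stack = [artefact]
    while stack:
        current = stack.pop()
        run = producers.get(current)
        if run is None:
            continue  # a workflow input: nothing upstream is recorded
        for source in run.used:
            if source not in seen:
                seen.add(source)
                stack.append(source)
    return seen


# Hypothetical two-step workflow and its execution trace.
workflow = [
    Step("fetch", ["query"], ["records"]),
    Step("align", ["records"], ["alignment"]),
]
t0 = datetime(2000, 1, 1, 12, 0, 0)
trace = [
    Run("fetch", ["query_1"], ["records_1"], t0, t0.replace(second=5)),
    Run("align", ["records_1"], ["alignment_1"], t0.replace(second=5), t0.replace(second=30)),
]

upstream = lineage("alignment_1", trace)
```

Note that everything recorded here is generic: the trace says which artefact fed which step and when, but nothing about what the data means scientifically.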

4 Provenance that is reported
– Origin
– Methodological context
– Scientific context
Scientific Data Provenance

5 Motifs
Workflows as implementation artefacts: 240 workflows, 4 systems, 10 domains
A domain-independent characterization of activities (~90% characterizable)
– Majority (~70%): data preparation (value-copying)
– Minority (~30%): data creation
http://purl.org/net/wf-motifs#
D. Garijo, P. Alper, K. Belhajjame, O. Corcho, Y. Gil, C. Goble. Common motifs in scientific workflows: An empirical analysis. Future Generation Computer Systems. ISSN 0167-739X.

6 Research Framework
Components: I. WF Motifs (wMotifs), II. WF Summaries, III. Labeling WF
– Minimal additional design-time information
– High-level categorization, as semantic annotations
– Based on empirical evidence
– Process model for labeling
– Motifs inform when to collect and when to propagate labels
– Novelty: dynamic, domain specific
– Novelty: partial transparency
– Graph re-write primitives
– Configurable filters
– More informed abstraction
– Novelty: declarative abstraction and contextual grouping
– Grey-box
– Ground truth: user behavior
P. Alper, K. Belhajjame, C. Goble, P. Karagoz. Small Is Beautiful: Summarizing Scientific Workflows Using Semantic Annotations. IEEE Big Data, July 2013.
P. Alper, C. Goble, K. Belhajjame. On assisting scientific data curation in collection-based dataflows using labels. In Proceedings of the 8th Workshop on Workflows in Support of Large-Scale Science (WORKS '13). ACM, New York, NY, USA, 7-16, 2013. DOI: 10.1145/2534248.2534249
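
One possible reading of "motifs inform when to collect, when to propagate labels" is sketched below in Python. This is an illustrative assumption, not the process model from the cited papers: the function `propagate_labels`, the motif constants, and the example labels are all hypothetical. The sketch assumes that value-copying data-preparation steps pass their input labels through to their outputs, while data-creation steps are the points where new labels are collected.

```python
# Hypothetical motif names, loosely following the data-preparation / data-creation split.
PREPARATION = "data-preparation"
CREATION = "data-creation"


def propagate_labels(steps, input_labels, new_labels):
    """Walk steps in execution order and compute the labels of each data item.

    steps: list of (step_name, motif, input_names, output_names)
    input_labels: dict mapping workflow input names to a set of labels
    new_labels: dict mapping step_name to the labels collected at that step
    """
    labels = {name: set(values) for name, values in input_labels.items()}
    for step_name, motif, inputs, outputs in steps:
        if motif == PREPARATION:
            # Value-copying step: outputs inherit the labels of the inputs.
            carried = set().union(*(labels.get(i, set()) for i in inputs))
        elif motif == CREATION:
            # New data is created: collect fresh labels instead of inheriting.
            carried = set(new_labels.get(step_name, ()))
        else:
            carried = set()  # uncharacterized step: make no assumption
        for out in outputs:
            labels[out] = set(carried)
    return labels


# Hypothetical example: a filter step followed by a classification step.
steps = [
    ("filter", PREPARATION, ["catalog"], ["subset"]),
    ("classify", CREATION, ["subset"], ["classes"]),
]
result = propagate_labels(
    steps,
    input_labels={"catalog": {"origin:source_A"}},
    new_labels={"classify": {"method:classifier_X"}},
)
```

In this sketch the filtered subset keeps the origin label of the catalog it was copied from, whereas the classification output carries only the label collected at the step that created it.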

7 How do I use Taverna
– Taverna Workbench: make a wf
– scufl2-api: inquire about details
– Scufl2-wfdesc: we operate on the abstract wf description
Issues:
– Additional characteristics (port depths, iteration config)
– Annotation support in the UI with key-value pairs
– List handling representation
– Resource uniqueness

8 Thank you!
– Carole A. GOBLE, University of Manchester
– Khalid BELHAJJAME, Université Paris Dauphine
– Pinar KARAGOZ, Middle East Technical University
– Pinar ALPER, University of Manchester

