Download presentation
Presentation is loading. Please wait.
Published byMorgan Bruley Modified over 10 years ago
1
Feedback on OPM Yogesh Simmhan Microsoft Research Synthesis of pairwise conversations with: Roger Barga Satya Sahoo Microsoft Research Beth Plale Abhijit Borude Indiana University
2
Roles Using role annotations in OPM is not well defined… – Named relationships are used as first class objects as defined in the RDF model – Affect the way inferences are made – Semantically meaningful or not?
3
Bake Cake Eggs(3) John Flour Used(eggs) Eggs(1) wasGeneratedBy(unused) wasGeneratedBy(cake) Used(flour) Bake Cake Eggs(A, B, C) John Flour Used(eggs) Eggs(A) wasGeneratedBy(unused) wasGeneratedBy(cake) Used(flour)
4
Accounts Composite processes identified in OPM – Different granularity? – Different view (client vs service) – service/workflow composition using alternate accounts? – Should we specify composition more explicitly in edges as edge types? Subclasses? Customer A Baking Baker Baking [] [] [] Customer B Observer Observers
5
Data Collections does not seem to support the idea of granularity for data products Alternate accounts more suited for process granularity, less for data granularity – process types for data de/compositions? Subclasses?
6
Annotations Causality is not the only relationship between provenance entities – Relevant domain-specific relationships that are needed to answer a scientists query. Subclasses stronger form of annotations – Different? – Subclasses part of model – Annotations dependent on representation? Extensibility mechanisms?
7
Representation/Serialization OPM maps exactly to the W3C recommended standard to represent metadata Resource Description Framework (RDF) – OPM graph is differently named RDF graph XML, RDF, CSV…
8
Time OPM approach to incorporating temporal parameter in provenance using time interval to represent instantaneous is not well defined – based on granularity of values the query result will vary – Accuracy of timestamps affects inference – Logical timestamps? Do we need time range? – Long running process (provenance is past, notcurrent)…
9
Agent Loose form of control flow? – Workflow engine? – Commandline invoking workflow engine? – Researcher who starts commandline? – Previous component that triggers next component? – Where do we have TriggeredBy and where do we have ControlledBy?
10
Service Output data Input data WF Engine Service Output data Input data Client ? WF Engine? WF document Service Output data Input data WF document User ? ? Client ? WF Engine ? ? ?
11
Vagueness in Inferences Edge count limits? Weak and strong semantics P1 used A1 – P1 MUST have used A1 – P1 MAY have used A1 P1 used A1; A2 wasGenerated by P1 – A2 MUST have been derived from A1 – A2 MAY have been derived from A1 Weak is lowest common denominator – mayHaveBeenUsed <= mustHaveBeenUsed…subclass?
Similar presentations
© 2024 SlidePlayer.com Inc.
All rights reserved.