Provenance Aware Service Oriented Architecture (1 year on) Professor Luc Moreau University of Southampton

Slides:



Advertisements
Similar presentations
GRADD: Scientific Workflows. Scientific Workflow E. Science laboris Workflows are the new rock and roll of eScience Machinery for coordinating the execution.
Advertisements

Enabling and Supporting Provenance in e-Science Applications Luc Moreau University of Southampton
A Proof of Concept: Provenance in a Service Oriented Architecture Liming Chen, Victor Tan, Fenglian Xu, Alexis Biller, Paul Groth, Simon Miles, John Ibbotson,
Software change management
Configuration management
An Open Provenance Model for Scientific Workflows Professor Luc Moreau University of Southampton
Provenance: concepts, architecture and envisioned tools Professor Luc Moreau University of Southampton
UK e-Science All Hands Meeting 2005 Paul Groth, Simon Miles, Luc Moreau.
Architecture Tutorial Summary and Conclusions. Architecture Tutorial The Provenance Architecture.
Evaluation of a Large-scale VRE Implementation - ELVI Staff and students using the VRE benefit from the greater transparency and communication that it.
Principles of Personalisation of Service Discovery Electronics and Computer Science, University of Southampton myGrid UK e-Science Project Juri Papay,
Provenance in Distr. Organ Transplant Management Applying Provenance in Distributed Organ Management Sergio Álvarez, Javier Vázquez-Salceda, Tamás Kifor,
As computer network experiments increase in complexity and size, it becomes increasingly difficult to fully understand the circumstances under which a.
PrIMe PrIMe : Provenance Incorporating Methodology Steve Munroe The EU Grid Provenance Project University of Southampton UK
Architecture Tutorial 1 Overview of Today’s Talks Provenance Data Structures Recording and Querying Provenance –Break (30 minutes) Distribution and Scalability.
IBM Watson Research © 2004 IBM Corporation BioHaystack: Gateway to the Biological Semantic Web Dennis Quan
® IBM Software Group © 2006 IBM Corporation Rational Software France Object-Oriented Analysis and Design with UML2 and Rational Software Modeler 04. Other.
1 SWE Introduction to Software Engineering Lecture 22 – Architectural Design (Chapter 13)
Application architectures
Provenance Challenges and Technologies for Grids Luc Moreau University of Southampton
Application architectures
This chapter is extracted from Sommerville’s slides. Text book chapter
Electronically Querying for the Provenance of Entities Simon Miles Provenance-Aware Service-Oriented Architectures.
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
Massimiliano Assante – Leonardo Candela – Donatella Castelli – Pasquale Pagano Fourteenth International Conference on Grey Literature An Environment Supporting.
Implementation and Evaluation of a Protocol for Recording Process Documentation in the Presence of Failures Zheng Chen and Luc Moreau
Software Design Refinement Using Design Patterns Instructor: Dr. Hany H. Ammar Dept. of Computer Science and Electrical Engineering, WVU.
UK e-Science All Hands Meeting 2005 Paul Groth, Simon Miles, Luc Moreau.
OASIS WSQM TC Meeting Dugki Min. 컴퓨터공학부 건국대학교 Agenda 1. Roll Call 2. Review and approval of the agenda 3. Review and approval of the previous.
 To explain the importance of software configuration management (CM)  To describe key CM activities namely CM planning, change management, version management.
The GRIMOIRES Service Registry Weijian Fang and Luc Moreau School of Electronics and Computer Science University of Southampton.
Usage of `provenance’: A Tower of Babel Luc Moreau.
Architecture Tutorial Provenance: overview Professor Luc Moreau University of Southampton
Miguel Branco CERN/University of Southampton Enabling provenance on large-scale e-Science applications.
Architecture Tutorial 1 Overview of Today’s Talks Provenance Data Structures Recording and Querying Provenance –Break (30 minutes) Distribution and Scalability.
Configuration Management (CM)
Provenance: an open approach to experiment validation in e- Science Professor Luc Moreau University of Southampton
Composing Adaptive Software Authors Philip K. McKinley, Seyed Masoud Sadjadi, Eric P. Kasten, Betty H.C. Cheng Presented by Ana Rodriguez June 21, 2006.
20 October 2006Workflow Optimization in Distributed Environments Dynamic Workflow Management Using Performance Data David W. Walker, Yan Huang, Omer F.
Scalable Clustering on the Data Grid Patrick Wendel Moustafa Ghanem Yike Guo Discovery Net Department of Computing Imperial College,
Max Ong University of Sheffield, UK. AHM 2004 Session 2.3: Workflow Composition, Wednesday 1 st September 2004, 4pm. Workflow Advisor in DAME Abstract.
Provenance: an open approach to experiment validation in e- Science Professor Luc Moreau University of Southampton
Provenance Challenge Simon Miles, Mike Wilde, Ian Foster and Luc Moreau.
Agent-Oriented Data Curation in Bioinformatics Simon Miles University of Southampton PASOA project:
Security Issues in a SOA- based Provenance System Victor Tan, Paul Groth, Simon Miles, Sheng Jiang, Steve Munroe, Sofia Tsasakou and Luc Moreau PASOA/EU.
Framework for MDO Studies Amitay Isaacs Center for Aerospace System Design and Engineering IIT Bombay.
Infrastructures for Social Simulation Rob Procter National e-Infrastructure for Social Simulation ISGC 2010 Social Simulation Tutorial.
Metadata Mòrag Burgon-Lyon University of Glasgow.
July 27, 2005High Performance Distributed Computing 05 Recording and Using Provenance in a Protein Compressibility Experiment Paul Groth, Simon Miles,
Information Services Andrew Brown Jon Ludwig Elvis Montero grid:seminar1:lectures:seminar-grid-1-information-services.ppt.
OPODIS'04 A protocol for recording provenance in service-oriented Grids Paul Groth, Michael Luck, Luc Moreau University of Southampton.
Project Database Handler The Project Database Handler is a brokering application that mediates interactions between the project database and the external.
Enabling Grids for E-sciencE Astronomical data processing workflows on a service-oriented Grid architecture Valeria Manna INAF - SI The.
Formalising a protocol for recording provenance in Grids Paul Groth – University of Southampton.
Recording the Context of Action for Process Documentation Ian Wootten Cardiff University, UK
Recording Actor Provenance in Scientific Workflows Ian Wootten, Shrija Rajbhandari, Omer Rana Cardiff University, UK.
1 Chapter 12 Configuration management This chapter is extracted from Sommerville’s slides. Text book chapter 29 1.
Provenance in Distr. Organ Transplant Management EU PROVENANCE project: an open provenance architecture for distributed.
Welcome Grids and Applied Language Theory Dave Berry Research Manager 16 th October 2003.
© Geodise Project, University of Southampton, Workflow Support for Advanced Grid-Enabled Computing Fenglian Xu *, M.
Tools for Navigating and Analysis of Provenance Information Vikas Deora, Arnaud Contes and Omer Rana.
ETICS An Environment for Distributed Software Development in Aerospace Applications SpaceTransfer09 Hannover Messe, April 2009.
National Aeronautics and Space Administration Jet Propulsion Laboratory March 17, 2009 Workflow Orchestration: Conducting Science Efficiently on the Grid.
Fault Tolerance in Distributed Systems Gökay Burak AKKUŞ Cmpe516 – Fault Tolerant Computing.
18 May 2006CCGrid2006 Dynamic Workflow Management Using Performance Data Lican Huang, David W. Walker, Yan Huang, and Omer F. Rana Cardiff School of Computer.
Recording and Reasoning Over Data Provenance in Web and Grid Services Martin Szomszor and Luc Moreau University of Southampton.
Provenance: an open approach to experiment validation in e-Science
Provenance: Problem, Architectural issues, Towards Trust
CSSSPEC6 SOFTWARE DEVELOPMENT WITH QUALITY ASSURANCE
Module 01 ETICS Overview ETICS Online Tutorials
Presentation transcript:

Provenance Aware Service Oriented Architecture (1 year on) Professor Luc Moreau University of Southampton

The PASOA Team  PASOA Southampton  Simon Miles, Paul Groth, Miguel Branco, Luc Moreau  PASOA Cardiff  Ian Wootten, Shrija Rajbhandari, Omer Rana, David Walker

Provenance Definition  Merriam-Webster Online dictionary:  the origin, source;  the history of ownership of a valued object or work of art or literature  The provenance of a piece of data is the process that led to the data  Our aim is to conceive a computer-based representation of provenance that allows us to perform useful analysis and reasoning to support our use cases

Provenance Use Cases (1) High Energy Physics: tracking, analysing, verifying data sets in the ATLAS Experiment of the Large Hadron Collider (CERN) Bioinformatics: verification of “experiment validity”.

Provenance Use Cases (2) Aerospace engineering: maintain a historical record of design processes, up to 99 years. Organ transplant management: tracking of previous decisions, crucial to maximise the efficiency in matching and recovery rate of patients

The Provenance Problem Given a set of services in an open grid environment that are composed in order to produce a given result; How can we determine the process that generated the result? (especially after their composition, i.e., virtual organisation, has been disbanded)

Provenance “Lifecycle” Application Results Provenance Store Record Documentation of Execution Query Provenance of Data Manage Store and its contents Core Interfaces to Provenance Store

 Logical Architecture  Adopted by EU Provenance as strawman [Miles et al. 05]

Recording & Querying PReP [Groth et al. 04]  Protocol adopted by application components  Allow for multiple provenance stores (scalability) Query Interface [Miles et al.05]  Purpose  Obtain the provenance of some specific data  Allow for “navigation” of the data structure representing provenance  Abstract interface  Allows us to view the provenance store as if containing XML data structures  Based on XPath and XQuery clientservice invocation result invocation and result recording Provenance Store Provenance Store invocation and result recording

Assertions about Performance and Availability  A taxonomy of gathered information about performance  Recorded (invocation start/end time and counts)  Derived from Recorded Information (averages)  Queried against other actor owned metrics  Compilation of assertions in a measure of trust (both from service and client perspective) [Wootten, Rana 05] Trust is a subjective probability that an actor will perform a particular action [Gambetta] [Rajbhandari, Rana 05]

PReServ [Groth et al. 05]  Implementation of PReP protocol and Query Interface  Provenance store implemented as a Web Service  Client side libraries for using Provenance Store  Axis Handler for automatically recording communication between Axis-based Web Services Axis Handler Axis Handler Provenance Service Backend Store Interface File System Store In- Memory Store … Backend Stores PS Client Side Library PS Client Side Library Web Service WS Client Query Actor WS PS Client Side Library WS Calls Java Calls

Bioinformatics Application  Bioinformatics workflow studying compressibility of biological sequences  Implemented as a VDT workflow, scheduled by Condor  Each service, script, command records provenance [HPDC’05]

Bioinformatics Application (2)  Use Cases  Algorithm verification  A bioinformatician, A, downloads a protein sequence from the RefSeq database and runs the compressibility experiment.  A later performs the same experiment on the same sequence data, again downloaded from RefSeq.  A compares the two experiment results and notices a difference.  A determines whether the difference was caused by the algorithms used to process the sequence data having been changed.

Bioinformatics Application (3)  Recording Scalability  Querying Scalability

Other Applications  EU Provenance project  Pre-prototype about baking cakes  e-Demand  Detect sharing of services in workflow execution to offer more resilient execution [Townend et al 05] [Xu et al 05]

Conclusions  Mostly unexplored area that is crucial to develop trusted systems  Current work:  System and protocol designing, architecture specification, generic support for use cases  Pursue the deployment in concrete application and performance evaluation  Download our software from  Tell us about your use cases: we are keen to find new collaborations in this space!

 Talk to Paul and Simon

Publications 1.Paul Groth, Simon Miles, Weijian Fang, Sylvia C. Wong, Klaus-Peter Zauner, and Luc Moreau. Recording and Using Provenance in a Protein Compressibility Experiment. In Proceedings of the 14th IEEE International Symposium on High Performance Distributed Computing (HPDC'05), July Paul T. Groth. Recording Provenance in Service-Oriented Architectures. 9 Month Report, University of Southampton; Faculty of Engineering, Science and Mathematics; School of Electronics and Computer Science, Paul Groth, Michael Luck, and Luc Moreau. A protocol for recording provenance in service-oriented Grids. In Proceedings of the 8th International Conference on Principles of Distributed Systems (OPODIS'04), Grenoble, France, December Paul Groth, Michael Luck, and Luc Moreau. Formalising a protocol for recording provenance in Grids. In Proceedings of the UK OST e-Science second All Hands Meeting 2004 (AHM'04), Nottingham, UK, September Simon Miles, Paul Groth, Miguel Branco, and Luc Moreau. The requirements of recording and using provenance in e-Science experiments. Technical report, University of Southampton, Luc Moreau, Syd Chapman, Andreas Schreiber, Rolf Hempel, Omer Rana, Lazslo Varga, Ulises Cortes, and Steven Willmott. Provenance-based Trust for Grid Computing --- Position Paper. In, Paul Townend, Paul Groth, and Jie Xu. A Provenance-Aware Weighted Fault Tolerance Scheme for Service-Based Applications. In Proc. of the 8th IEEE International Symposium on Object-oriented Real-time distributed Computing (ISORC 2005), May 2005.