Workflows Description, Enactment and Monitoring in SAGA Ashiq Anjum, UWE Bristol Shantenu Jha, LSU 1.

Slides:



Advertisements
Similar presentations
Enabling the execution of various workflows (Kepler, Taverna, Triana, P-GRADE) on EGEE Tamas Kukla, Tamas Kiss, Gabor Terstyanszky.
Advertisements

The Next Generation Grid Kostas Tserpes, NTUA Beijing, 22 of June 2005.
Abstraction Layers Why do we need them? –Protection against change Where in the hourglass do we put them? –Computer Scientist perspective Expose low-level.
Generic MPI Job Submission by the P-GRADE Grid Portal Zoltán Farkas MTA SZTAKI.
Current status of grids: the need for standards Mike Mineter TOE-NeSC, Edinburgh.
EGEE-II INFSO-RI Enabling Grids for E-sciencE The gLite middleware distribution OSG Consortium Meeting Seattle,
Towards Intelligent Workflow Planning for Neuroimaging Analyses Irfan Habib, Ashiq Anjum, Peter Bloodsworth, Richard McClatchey Centre for Complex Cooperative.
Plateforme de Calcul pour les Sciences du Vivant SRB & gLite V. Breton.
Cloudmesh Resource Shifting 1 2. Cloudmesh: from IaaS(NaaS) to Workflow (Orchestration) Workflow Virtual Cluster Components Infrastructure iPython (Pegasus)
Ewa Deelman, Integrating Existing Scientific Workflow Systems: The Kepler/Pegasus Example Nandita Mangal,
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
NeuGRID PROJECT: A PRACTICAL EXAMPLE OF RESEARCH THROUGH THE GARR.
6th Biennial Ptolemy Miniconference Berkeley, CA May 12, 2005 Distributed Computing in Kepler Ilkay Altintas Lead, Scientific Workflow Automation Technologies.
Office of Science U.S. Department of Energy Grids and Portals at NERSC Presented by Steve Chan.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
CONDOR DAGMan and Pegasus Selim Kalayci Florida International University 07/28/2009 Note: Slides are compiled from various TeraGrid Documentations.
Developing Reusable Software Infrastructure – Middleware – for Multiscale Modeling Wilfred W. Li, Ph.D. National Biomedical Computation Resource Center.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) Grid Engine Riccardo Rotondo
1 Dr. Markus Hillenbrand, ICSY Lab, University of Kaiserslautern, Germany A Generic Database Web Service for the Venice Service Grid Michael Koch, Markus.
CONTENTS Arrival Characters Definition Merits Chararterstics Workflows Wfms Workflow engine Workflows levels & categories.
Flexibility and user-friendliness of grid portals: the PROGRESS approach Michal Kosiedowski
Grids and Portals for VLAB Marlon Pierce Community Grids Lab Indiana University.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
A General and Scalable Solution of Heterogeneous Workflow Invocation and Nesting Tamas Kukla, Tamas Kiss, Gabor Terstyanszky.
JRA1/Job Submission and Monitoring Moreno Marzolla on behalf of JRA1/Job Submission Task INFN Sezione di Padova,
DR Software: Essential Foundational Elements and Platform Components UCLA Smart Grid Energy Research Center (SMERC) Industry Partners Program (IPP) Meeting.
Combining the strengths of UMIST and The Victoria University of Manchester Utility Driven Adaptive Workflow Execution Kevin Lee School of Computer Science,
ITPA/IMAGE 7-10 May 2007 Software and Hardware Infrastructure for the ITM B.Guillerminet, on behalf of the ITM & ISIP teams (P Strand, F Imbeaux, G Huysmans,
Javascript Cog Kit By Zhenhua Guo. Grid Applications Currently, most grid related applications are written as separate software. –server side: Globus,
London e-Science Centre GridSAM Job Submission and Monitoring Web Service William Lee, Stephen McGough.
Chapter 10 Analysis and Design Discipline. 2 Purpose The purpose is to translate the requirements into a specification that describes how to implement.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
TeraGrid CTSS Plans and Status Dane Skow for Lee Liming and JP Navarro OSG Consortium Meeting 22 August, 2006.
Lawrence Tarbox, Ph.D. Washington University in St. Louis School of Medicine Mallinckrodt Institute of Radiology, Electronic Radiology Lab.
Wrapping Scientific Applications As Web Services Using The Opal Toolkit Wrapping Scientific Applications As Web Services Using The Opal Toolkit Sriram.
ServiceSs, a new programming model for the Cloud Daniele Lezzi, Rosa M. Badia, Jorge Ejarque, Raul Sirvent, Enric Tejedor Grid Computing and Clusters Group.
 Apache Airavata Architecture Overview Shameera Rathnayaka Graduate Assistant Science Gateways Group Indiana University 07/27/2015.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Intermediate Condor: Workflows Rob Quick Open Science Grid Indiana University.
ICCS WSES BOF Discussion. Possible Topics Scientific workflows and Grid infrastructure Utilization of computing resources in scientific workflows; Virtual.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
CaGrid Overview and Core Services caGrid Knowledge Center February 2011.
Metadata Mòrag Burgon-Lyon University of Glasgow.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE User Forum, Manchester, 10 May ‘07 Nicola Venuti
Enabling Grids for E-sciencE Astronomical data processing workflows on a service-oriented Grid architecture Valeria Manna INAF - SI The.
Conference name Company name INFSOM-RI Speaker name The ETICS Job management architecture EGEE ‘08 Istanbul, September 25 th 2008 Valerio Venturi.
Biomedical Informatics Research Network BIRN Workflow Portal.
NeuroLOG ANR-06-TLOG-024 Software technologies for integration of process and data in medical imaging A transitional.
In Vivo Imaging Middleware and Applications RSNA 2007 Berkant Barla Cambazoglu The Ohio State University Department of Biomedical Informatics.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
Globus: A Report. Introduction What is Globus? Need for Globus. Goal of Globus Approach used by Globus: –Develop High level tools and basic technologies.
CATI Pitié-Salpêtrière CATI: A national platform for advanced Neuroimaging In Alzheimer’s Disease Standardized MRI and PET acquisitions Across a wide network.
INFSO-RI Enabling Grids for E-sciencE Using of GANGA interface for Athena applications A. Zalite / PNPI.
EGI Technical Forum Amsterdam, 16 September 2010 Sylvain Reynaud.
INFSO-RI JRA2 Test Management Tools Eva Takacs (4D SOFT) ETICS 2 Final Review Brussels - 11 May 2010.
Grid Activities in CMS Asad Samar (Caltech) PPDG meeting, Argonne July 13-14, 2000.
© Geodise Project, University of Southampton, Workflow Support for Advanced Grid-Enabled Computing Fenglian Xu *, M.
PARALLEL AND DISTRIBUTED PROGRAMMING MODELS U. Jhashuva 1 Asst. Prof Dept. of CSE om.
Tutorial on Science Gateways, Roma, Catania Science Gateway Framework Motivations, architecture, features Riccardo Rotondo.
1 Adaptive Workflow to Support Knowledge Intensive Tasks Ann Macintosh AIAI The University of Edinburgh
Research Traceability using Provenance Services for Biomedical Analysis Dr Peter Bloodsworth CCCS Research Centre UWE, Bristol, UK
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Biomedical Informatics Research Network BIRN Workflow Portal.
PROCESS - H2020 Project Work Package WP6 JRA3
Grid Engine Riccardo Rotondo
Grid Engine Diego Scardaci (INFN – Catania)
a middleware implementation
gLite The EGEE Middleware Distribution
Presentation transcript:

Workflows Description, Enactment and Monitoring in SAGA Ashiq Anjum, UWE Bristol Shantenu Jha, LSU 1

neuGrid Recent progress in neuroimaging techniques and data formats has led to an explosive growth in neuroimaging data Analysis of this data can facilitate research in neuro-degenerative diseases.

Clinical Users Commercial Partners Academic Partners

Services in neuGRID

Generalised Services Generalised Services My Favourite Application NeuGrid Glueing Service (Uses SAGA) Job handling File Management Security Task Dependencies Monitoring Workflow Management Infrastructure gLite, Globus, OMII-UK, Cloud Infrastructure gLite, Globus, OMII-UK, Cloud User Portal Service 5 Generic Reusable Services Workflow Specific. ProvenanceQueryingLORISAnonymisation

Neuroimaging datasets are generally processed through Neuroimaging pipelines

CIVET produces 1100% more data than it consumes, and intermediate data usage is more than 4000%. Without optimisation runtime of a single workflow on single image is around 8 hrs 10,000 brain images in neuGrid by the end of this year, each Image between 70 to 120 MB

A Neuroimaging Workflow

LoNI KEPLER Pipeline Service API Enactment Abstraction (pluggin-like) MyFavoriteTool Service Based Enactor Task Based Enactor Glueing Service SCUFL MoMLXML GRID Pipelines Translation Component Pipeline Planner (Distribution Aware Pipeline Description) Pipeline Service : Generalisation

Pipeline Service : Overview Designed to provide the required functionality to author, transform and plan workflows And orchestrate and facilitate the retrieval of analysis data and intermediary output for Provenance capture. The Pipeline Service specifies workflows and retrieves the output via the Glueing Service.

Workflow Planning Approaches Approaches for workflow planning include: – Data-based Methods: Data elimination – Task-based Approaches: Task Clustering – Experimental evaluations concentrate on automated task clustering. Two types of clustering – Automated Horizontal Clustering – Collapse Factor Based – Bundle Factor Based – User defined clustering

Improve data reuseability in the workflows

Enactment via Glueing Service Enactment via Glueing Service Uses SAGA to communicate with an underlying infrastructure. Able to cater for multiple infrastructures  interoperability. Enables flow of data and control to and from the infrastructure (here gLite) for Provenance. 14

Glueing Service Provides file management; workflow submission & monitoring; and provenance retrieval functionality in a generic manner. Builds upon SAGA to provide a middleware agnostic way for services and users to interact with the Grid. The Glueing Service provides a SOAP wrapper over the OGF SAGA. In order to use the Glueing Service in a SAGA compliant manner we have developed the UWESOAP Adaptor.

SAGA: In a thousand words..

digedag digedag - prototype implementation of a SAGA- based workflow package, with: –an API for programatically expressing workflows –a parser for (abstract or concrete) workflow descriptions –an (in-time workflow) planner –a workflow enactor (using the SAGA engine) this will eventually be separated from digedag, but will continue to use SAGA Can accept mDAG output, or Pegasus output Can move back and forth between abstract and concrete DAG

Application Development Phase Generation & Exec. Planning Phase Execution Phase DAG-based Workflow Applications: Extensibility Approach

Digedag: SAGA Workflow Package Development Phase: Creation & management of nodes and edges of a DAG and parts of the DAG Planning Phase: Digedag planner is fired when creating and executing C-DAG – thus responding to dynamic changes instantly –When adding/removing nodes/edges –Node/edge firing succeeds/fails, or edge transfer fails/succeeds Mixed Planning and Execution Phase –Having the full A-DAG, current C-DAG and live Information Execution Phase: SAGA-based Enactor designed to support explicit dynamic execution –SAGA-based DAG enactor, which changes the Concrete-DAG on the fly, thus remapping workflow elements (DAG nodes).

Application Development Phase Generation & Exec. Planning Phase Execution Phase DAG-based Applications Extensibility and Higher-level API Monitoring requirements/model of DAGman tied with Condor

SAGA-based DAG Execution Preserving Performance

Glueing Service : Current Status V1.0 Available, integrated with LORIS & operable with gLite Secure authentication with the infrastructure is implemented. The glueing service software –Can be compiled from source –Can be deployed using binaries –Can be tested using preconfigured VM UWE SOAP Adaptor –Supports job submission, monitoring and file transfers –Supports file reading, writing, listing –Translates SAGA API calls written by an end user to SOAP calls –Supports SOAP attachments using Java activation framework

Future Work

Glueing Service (Future)