Workflow Project Status Update Luciano Piccoli - Fermilab, IIT Nov 07 2008.

Slides:



Advertisements
Similar presentations
GRADD: Scientific Workflows. Scientific Workflow E. Science laboris Workflows are the new rock and roll of eScience Machinery for coordinating the execution.
Advertisements

A Workflow Engine with Multi-Level Parallelism Supports Qifeng Huang and Yan Huang School of Computer Science Cardiff University
Testing Workflow Purpose
Pegasus on the Virtual Grid: A Case Study of Workflow Planning over Captive Resources Yang-Suk Kee, Eun-Kyu Byun, Ewa Deelman, Kran Vahi, Jin-Soo Kim Oracle.
1 OBJECTIVES To generate a web-based system enables to assemble model configurations. to submit these configurations on different.
Ewa Deelman, Integrating Existing Scientific Workflow Systems: The Kepler/Pegasus Example Nandita Mangal,
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
6th Biennial Ptolemy Miniconference Berkeley, CA May 12, 2005 Distributed Computing in Kepler Ilkay Altintas Lead, Scientific Workflow Automation Technologies.
Workload Management Massimo Sgaravatto INFN Padova.
Lower costs and improve predictability Automation Enable service owners to focus on work that adds business value Reduce error-prone manual activities.
The middleware that makes real time integration a reality.
Integrate into existing systems with PowerShell integration modules Extend by building PS modules to enable integrating into other systems Optimize.
WORKFLOWS IN CLOUD COMPUTING. CLOUD COMPUTING  Delivering applications or services in on-demand environment  Hundreds of thousands of users / applications.
A Semantic Workflow Mechanism to Realise Experimental Goals and Constraints Edoardo Pignotti, Peter Edwards, Alun Preece, Nick Gotts and Gary Polhill School.
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space Cracow Grid Workshop’10 Kraków, October 11-13,
CONDOR DAGMan and Pegasus Selim Kalayci Florida International University 07/28/2009 Note: Slides are compiled from various TeraGrid Documentations.
June Amsterdam A Workflow Bus for e-Science Applications Dr Zhiming Zhao Faculty of Science, University of Amsterdam VL-e SP 2.5.
W EB - BASED B IOINFORMATICS P IPELINES FOR B IOLOGISTS Integrative Services for Genomic Analysis (ISGA) Chris Hemmerich Center for Genomics and Bioformatics.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
Environment for Management of Experiments on the Grid Master of Science Thesis AGH University of Science and Technology, Krakow, Poland Faculty of Electrical.
Workflow Systems for LQCD SciDAC LQCD Software meeting, Boston, Feb 2008 Fermilab, IIT, Vanderbilt.
CS525: Special Topics in DBs Large-Scale Data Management Hadoop/MapReduce Computing Paradigm Spring 2013 WPI, Mohamed Eltabakh 1.
Cluster Reliability Project ISIS Vanderbilt University.
Workflow Project Luciano Piccoli Illinois Institute of Technology.
f ACT s  Data intensive applications with Petabytes of data  Web pages billion web pages x 20KB = 400+ terabytes  One computer can read
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
What are the main differences and commonalities between the IS and DA systems? How information is transferred between tasks: (i) IS it may be often achieved.
Testing Workflow In the Unified Process and Agile/Scrum processes.
Xiao Liu CS3 -- Centre for Complex Software Systems and Services Swinburne University of Technology, Australia Key Research Issues in.
The ACGT Workflow Editing & Enactment Environment Giorgos Zacharioudakis Institute of Computer Science, Foundation for Research & Technology – Hellas (ICS-FORTH)
Distributed Aircraft Maintenance Environment - DAME DAME Workflow Advisor Max Ong University of Sheffield.
Accelerating Scientific Exploration Using Workflow Automation Systems Terence Critchlow (LLNL) Ilkay Altintas (SDSC) Scott Klasky(ORNL) Mladen Vouk (NCSU)
Resource Brokering in the PROGRESS Project Juliusz Pukacki Grid Resource Management Workshop, October 2003.
Ganga A quick tutorial Asterios Katsifodimos Trainer, University of Cyprus Nicosia, Feb 16, 2009.
Grid Architecture William E. Johnston Lawrence Berkeley National Lab and NASA Ames Research Center (These slides are available at grid.lbl.gov/~wej/Grids)
LQCD Workflow Execution Framework: Models, Provenance, and Fault-Tolerance Luciano Piccoli 1,3, Abhishek Dubey 2, James N. Simone 3, James B. Kowalkowski.
 Apache Airavata Architecture Overview Shameera Rathnayaka Graduate Assistant Science Gateways Group Indiana University 07/27/2015.
1 Computing Challenges for the Square Kilometre Array Mathai Joseph & Harrick Vin Tata Research Development & Design Centre Pune, India CHEP Mumbai 16.
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
Framework for MDO Studies Amitay Isaacs Center for Aerospace System Design and Engineering IIT Bombay.
AgINFRA science gateway for workflows and integrated services 07/02/2012 Robert Lovas MTA SZTAKI.
ICCS WSES BOF Discussion. Possible Topics Scientific workflows and Grid infrastructure Utilization of computing resources in scientific workflows; Virtual.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
Enabling Grids for E-sciencE Astronomical data processing workflows on a service-oriented Grid architecture Valeria Manna INAF - SI The.
Course: COMS-E6125 Professor: Gail E. Kaiser Student: Shanghao Li (sl2967)
LQCD Workflow Project L. Piccoli October 02, 2006.
Hadoop/MapReduce Computing Paradigm 1 CS525: Special Topics in DBs Large-Scale Data Management Presented By Kelly Technologies
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI How to integrate portals with the EGI monitoring system Dusan Vudragovic.
+ Support multiple virtual environment for Grid computing Dr. Lizhe Wang.
Testing Overview Software Reliability Techniques Testing Concepts CEN 4010 Class 24 – 11/17.
D.Spiga, L.Servoli, L.Faina INFN & University of Perugia CRAB WorkFlow : CRAB: CMS Remote Analysis Builder A CMS specific tool written in python and developed.
© Geodise Project, University of Southampton, Workflow Support for Advanced Grid-Enabled Computing Fenglian Xu *, M.
Workflow Management Concepts and Requirements For Scientific Applications.
Collection and storage of provenance data Jakub Wach Master of Science Thesis Faculty of Electrical Engineering, Automatics, Computer Science and Electronics.
Tutorial on Science Gateways, Roma, Catania Science Gateway Framework Motivations, architecture, features Riccardo Rotondo.
Windows Workflow Foundation Guy Burstein Senior Consultant Advantech – Microsoft Division
InSilicoLab – Grid Environment for Supporting Numerical Experiments in Chemistry Joanna Kocot, Daniel Harężlak, Klemens Noga, Mariusz Sterzel, Tomasz Szepieniec.
Reliability and Workflow projects Jim Kowalkowski.
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT Our experience with NoSQL and MapReduce technologies Fabio Souto.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
INTRODUCTION TO XSEDE. INTRODUCTION  Extreme Science and Engineering Discovery Environment (XSEDE)  “most advanced, powerful, and robust collection.
Overview on the work performed during EPIKH Training Faiza MEDJEK /INFN, CATANIA 1.
Architecture and design
SDM workshop Strawman report History and Progress and Goal.
Wide Area Workload Management Work Package DATAGRID project
Overview of Workflows: Why Use Them?
A General Approach to Real-time Workflow Monitoring
Introduction to the SHIWA Simulation Platform EGI User Forum,
Presentation transcript:

Workflow Project Status Update Luciano Piccoli - Fermilab, IIT Nov

2 Reminder Workflow: aims to provide end users with an easier way to orchestrate and describe complex processing of data in a visual form, much like flow charts, but without the need to understand computers or programming. Participant: workflow task, usually a PBS job. Object that transforms inputs into outputs. Example of participants is: dCache (dccp), PBS (qsub), user applications and shell scripts. All Participants are considered to be atomic operations from the executing workflow's point of view.

3 Review Change of focus: development of workflow independent front and back-end systems –Parameterization –Run time history –Provenance –Secondary data storage Ruby on Rails and Ruote BPM workflow engine –Configuration generation and 2pt analysis workflows implemented

4 Web Interface

5 Confgen Workflow (in Ruote) class TuneProcessDefinition < OpenWFE::ProcessDefinition def make cursor do get_parameters _break :if => "${f:error} != none" prepare_tuning _break :if => "${f:error} != none" _loop do tune check _break :if => "${f:done} == true" _break :if => "${f:error} != none" end

6 Update: Implementation Addition of workflow to the data model –Allows recovery from failed participant (workflow task) –When recovering a workflow existing data products not generated again –Currently working on Configuration Generation Workflow

7 Update: Integration with Cluster Reliability enable, disable and query sensors report status and failures Workflow 1 Workflow 2 Workflow N … Participant (Workflow task) Pre-conditions Post-conditions Periodic sensor checks, truth conditions. (run PBS job) Local Job Scheduling (PBS) qsub, qdel Cluster Reliability Monitoring Database syslog-ng Workflow Engine(s) Workflow Tracking Database submit Cluster

8 Update: Participation in Conferences Paper for ‘Scientific Workflows and Business Workflow Standards in e-Science (SWBES)’ on LQCD workflow requirements and system evaluation. SuperComputing08 participation this month –Participation in the 3rd Workshop on Workflows in Support of Large-Scale Science (WORKS08) –Kepler Tutorial – implementation of confgen/2pt analysis

Outline for Review

10 January Review Outline Motivation for workflow project. –What is the LQCD workflow project? –What is workflow to us? Parameterized description of physics Grid/SOA vs. Cluster How does it differ from general workflow solutions? –Why do we need it? Show how the current process works, show how it would work when using workflows Common definition and storage for physics processes and products (reuse) Increase productivity and accountability Description of relevant LQCD workflow requirements, e.g. data handling and fault tolerance. –Describe LQCD campaign (2pt analysis and confgen example) –Show the parallelism involved by describing the two campaign layers: loop over configurations and processing on single configuration

11 January Review Outline Work that we’ve done –Evaluation of existing workflow systems and collaborations (Swift, Askalon, Ruote (Kepler and Pegasus)). –Development of database driven workflow and result tracking system. Workflow independent front and back-end infrastructure Integration with the Cluster Reliability subproject (failure monitoring and diagnostics) –Overall workflow/reliability architecture Community participation –U.Chicago/Swift, Innsbruck/Askalon –Lattice 2008 –E-Science conference 2008 –SuperComputing 2007, 2008 Backup slides –Real example showing need for fault tolerance: cluster time wasted.