Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June 29-04 July 2009 P-GRADE Portal tutorial part 1.: Managing workflow.

Slides:



Advertisements
Similar presentations
1 CEOS WGISS Meeting, May 8-12, 2006, Budapest MTA SZTAKI Computer and Automation Research Institute Hungarian Academy of Sciences
Advertisements

EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Introduction to EGEE hands-on Gergely Sipos.
1 P-GRADE Portal and GEMLCA Legacy Code Architecture Peter Kacsuk MTA SZTAKI
Legacy code support for commercial production Grids G.Terstyanszky, T. Kiss, T. Delaitre, S. Winter School of Informatics, University.
INFSO-RI Enabling Grids for E-sciencE Workload Management System and Job Description Language.
P. Kacsuk, G. Sipos, A. Toth, Z. Farkas, G. Kecskemeti and G. Hermann P. Kacsuk, G. Sipos, A. Toth, Z. Farkas, G. Kecskemeti and G. Hermann MTA SZTAKI.
1 MTA SZTAKI Application development on EGEE with P-GRADE Portal Gergely Sipos
EGEE-II INFSO-RI Enabling Grids for E-sciencE Grid application development with gLite and P-GRADE Portal Miklos Kozlovszky MTA SZTAKI.
MTA SZTAKI Hungarian Academy of Sciences Grid Computing Course Porto, January Introduction to Grid portals Gergely Sipos
INFSO-RI Enabling Grids for E-sciencE EGEE Middleware The Resource Broker EGEE project members.
WS-PGRADE: Supporting parameter sweep applications in workflows Péter Kacsuk, Krisztián Karóczkai, Gábor Hermann, Gergely Sipos, and József Kovács MTA.
Visual Solution to High Performance Computing Computer and Automation Research Institute Laboratory of Parallel and Distributed Systems
Grid Execution Management for Legacy Code Applications Exposing Application as Grid Services Porto, Portugal, 23 January 2007.
1 MTA SZTAKI Hungarian Academy of Sciences Grid application support by the P-GRADE Portal Peter Kacsuk.
1 P-GRADE Portal and GEMLCA: A workflow-oriented portal and application hosting environment Miklos Kozlovszky.
1 portal.p-grade.hu További lehetőségek a P-GRADE Portállal Gergely Sipos MTA SZTAKI Hungarian Academy of Sciences.
1 P-GRADE Portal: Towards a User-friendly Grid Environment Tamas Kiss Centre for Parallel Computing.
1 portal.p-grade.hu Further information on P-GRADE Gergely Sipos MTA SZTAKI Hungarian Academy of Sciences.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Grid application development with gLite and P-GRADE Portal Miklos Kozlovszky MTA SZTAKI.
SEE-GRID and EGEE Portal applications M. Kozlovszky, A. Balasko Computer and Automation Research Institute of the Hungarian Academy of Sciences PUCOWO,
:: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: :: GridKA School 2009 MPI on Grids 1 MPI On Grids September 3 rd, GridKA School 2009.
07/06/11 New Features of WS-PGRADE (and gUSE) 2010 Q Q2 Miklós Kozlovszky MTA SZTAKI LPDS.
SCI-BUS is supported by the FP7 Capacities Programme under contract nr RI Creating the Autodock gateway from WS-PGRADE/gUSE and making it cloud-enabled.
From P-GRADE to SCI-BUS Peter Kacsuk, Zoltan Farkas and Miklos Kozlovszky MTA SZTAKI - Computer and Automation Research Institute of the Hungarian Academy.
INFSO-RI Enabling Grids for E-sciencE Workload Management System Mike Mineter
1 Next steps with EGEE and P-GRADE Portal Gergely Sipos MTA SZTAKI EGEE Training and Induction.
INFSO-RI Enabling Grids for E-sciencE Supporting legacy code applications on EGEE VOs by GEMLCA and the P-GRADE portal P. Kacsuk*,
Introduction to WS-PGRADE and gUSE Tutorial Akos Balasko 04/17/
WS-PGRADE portal and its usage in the CancerGrid project M. Kozlovszky, P. Kacsuk Computer and Automation Research Institute of the Hungarian Academy of.
1 Advanced features of the P-GRADE portal Peter Kacsuk, Gergely Sipos Peter Kacsuk, Gergely Sipos MTA.
Grid Execution Management for Legacy Code Applications Grid Enabling Legacy Code Applications Tamas Kiss Centre for Parallel.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Training services offered by SZTAKI for EGEE and EGI Gergely Sipos MTA SZTAKI (Hungarian.
1 P-GRADE Portal tutorial MTA SZTAKI Gergely Sipos
MTA SZTAKI Hungarian Academy of Sciences Introduction to Grid portals Gergely Sipos
P-GRADE and GEMLCA.
1 P-GRADE Portal: a workflow-oriented generic application development portal Peter Kacsuk MTA SZTAKI, Hungary Univ. of Westminster, UK.
Getting started DIRAC Project. Outline  DIRAC information system  Documentation sources  DIRAC users and groups  Registration with DIRAC  Getting.
Grid Execution Management for Legacy Code Applications Grid Enabling Legacy Applications.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Status report on Application porting at SZTAKI.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Services for advanced workflow programming.
1 P-GRADE Portal: An easy to use graphical interface for Globus and EGEE Grids.
1 P-GRADE Portal tutorial at EGEE’09 Introduction to hands-on Gergely Sipos MTA SZTAKI EGEE.
The SEE-GRID-SCI initiative is co-funded by the European Commission under the FP7 Research Infrastructures contract no Workflow repository, user.
1 Practical information for the GEMLCA / P-GRADE hands-on Gergely Sipos On behalf of: MTA.
1 Practical information for the GEMLCA / P-GRADE hands-on Tamas Kiss University of Westminster.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Introduction to P-GRADE Portal hands-on Miklos Kozlovszky MTA SZTAKI
1 portal.p-grade.hu Workflow and parameter study management by P-GRADE Portal Gergely Sipos MTA SZTAKI Hungarian Academy of Sciences.
EGEE-II INFSO-RI Enabling Grids for E-sciencE P-GRADE overview and introduction: workflows & parameter sweeps (Advanced features)
1 Other features and next steps Gergely Sipos MTA SZTAKI Hungarian Academy of Sciences.
1 Egrid portal Stefano Cozzini and Angelo Leto. 2 Egrid portal Based on P-GRADE Portal 2.3 –LCG-2 middleware support: broker, CEs, SEs, BDII –MyProxy.
1 P-GRADE Portal hands-on Gergely Sipos MTA SZTAKI Hungarian Academy of Sciences.
1 Further information and next steps Further information and next steps Gergely Sipos MTA SZTAKI
Grid Execution Management for Legacy Code Architecture Exposing legacy applications as Grid services: the GEMLCA approach Centre.
1 P-GRADE Portal and Developer Alliance Gergely Sipos MTA SZTAKI Hungarian Academy of Sciences portal.p-grade.hu.
27/10/2006, Catania EGEE Industry Day Filling the gaps: Advanced Grid portal and desktop grid technologies for industry Robert Lovas MTA SZTAKI Laboratory.
RI EGI-TF 2010, Tutorial Managing an EGEE/EGI Virtual Organisation (VO) with EDGES bridged Desktop Resources Tutorial Robert Lovas, MTA SZTAKI.
1 Support for parameter study applications in the P-GRADE Portal Gergely Sipos MTA SZTAKI (Hungarian Academy of Sciences)
Grid Execution Management for Legacy Code Applications Grid Enabling Legacy Applications.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Application specific portlet other portal features and next steps Miklos Kozlovszky.
Tamas Kiss University Of Westminster
P-GRADE Portal tutorial
Introduction to P-GRADE Portal hands-on
P-GRADE and GEMLCA.
Lightweight introduction
Lightweight introduction
Application development on EGEE with P-GRADE Portal
Introduction to the SHIWA Simulation Platform EGI User Forum,
Workflow level parametric study support by the P-GRADE portal
Presentation transcript:

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July 2009 P-GRADE Portal tutorial part 1.: Managing workflow applications on EGEE Miklos Kozlovszky MTA SZTAKI

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Contents P-GRADE Portal in a nutshell Workflow development with P-GRADE Portal Workflow execution with P-GRADE Portal

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Family of P-GRADE Portal products P-GRADE portal –Creating (basic) workflows and parameter sweeps for service grids – P-GRADE/GEMLCA portal (University of Westminster) –To wrap legacy applications into Grid Services –To add legacy code services to P-GRADE Portal workflows – WS-PGRADE –Creating complex workflow and parameter sweeps for clusters, service grids and desktop grids –Creating complex applications using embedded workflows, legacy codes and community components from workflow repository –

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July References and P-GRADE installations P-GRADE Portal service is available for –SEE-GRID-SCI infrastructure –Central European VO of EGEE –GILDA: Training VO of EGEE –Many national Grids (UK National Grid Service, GridIreland, Turkish Grid, Croatian Grid, etc.) –US Open Science Grid –Economy-Grid, Swiss BioGrid, Bio and Biomed EGEE VOs, BioInfoGrid, BalticGrid –GIN VO of OGF –EGEE Respect program tool –Total:  12 community installations, connection to ~20 grids  ~1200 additional downloads from Sourceforge –Developers from: UK, Turkey, Croatia, Italy, France

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Some applications/services based on P-GRADE Portal Molecular Dynamics Study of Water Penetration (CHARMM): Parametric workflow resulting hundreds of long running jobs – R: Patient Readmission Analysis – In Silico Modelling using Autodock – MultiBayes – DNA sequence analyzer application GAMESS-UK - ab initio molecule structure analyzer application Minimizing operational cost of factories and logistic service providers (EMMIL): Parametric workflow resulting thousands of short running jobs Studying oscillons and magnetic monopole configurations: Parametric workflow resulting hundreds of short running jobs Ultra short range weather forecast (MEANDER): workflow that integrates 4 meteorological algorithms and one visualizer component

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Application Specific Portals The Rendering Portal Create a new rendering job by uploading file to be rendered Submit workflow Download rendered images Contact: Mr. Tamas Kiss,

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Download rendered images Set up the most optimal supplier chain Log in as buyer supplier, or logistic service provider. Create auctions or take your bids Parameter study optimization workflow is automatically generated and executed Best buyer-seller-logistic service provider pairs discovered, result presented in an Excel file Application Specific Portals (contd.) Supplier chain portal: EMMIL P-GRADE Procurement Portal

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Application Specific Portals (contd.) EMMIL P-GRADE Procurement Portal Buyers profile Buyer’s product management Buyer starts new auction Supplier places bid Supplier auction is closing 3PL places bid 3PL auction is closing Results

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Fast evolution of Grid systems and middleware: –GT2, OGSA, GT3 (OGSI), GT4 (WSRF), LCG-2, gLite, … Many production Grid systems are built with them –EGEE (LCG-2  gLite), UK NGS (GT2), Open Science Grid (GT2  GT4), NorduGrid (~GT2) Although the same set of core services are available everywhere, they are implemented in different ways –Data services –Computation services –Security services (single sign-on) –(Brokers) Current situation and trends in Grid computing

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July P-GRADE Portal in a nutshell General purpose, workflow-oriented computational Grid portal. Supports the development and execution of workflow-based Grid applications – a tool for Grid orchestration Based on GridSphere-2 –Easy to expand with new portlets (e.g. application-specific portlets) –Easy to tailor to end-user needs Grid services supported by the portal: ServiceEGEE gridsGlobus grids Job executionComputing ElementGRAM File storageStorage ElementGridFTP server Certificate managementMyProxy Information systemBDIIMDS-2, MDS-4 BrokeringWorkload Management SystemGTbroker Job monitoringMercury Workflow & job visualizationPROVE Solves Grid interoperability problem at the workflow level

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July The typical user scenario Part 1 - development phase Portal server Grid services START EDITOR OPEN & EDIT or DEVELOP WORKFLOW SAVE WORKFLOW Certificate servers

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Certificate servers Portal server Grid services TRANSFER FILES, SUBMIT JOBS DOWNLOAD (SMALL) RESULTS The typical user scenario Part 2 - execution phase VISUALIZE JOBS and WORKFLOW PROGRESS MONITOR JOBS DOWNLOAD PROXY CERTIFICATES

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Define workflow structure 2.Define binaries & setup parameters and properties (JDL) 3.Define resource selection 4.Setup In/Outputs 5.Create or upload file/directory structure on SE (optional) 6.Name & Save the workflow (Upload binaries, and other files) Main steps Defining workflow with the P-GRADE Portal

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July What is a P-GRADE Portal workflow? a directed acyclic graph where –Nodes represent jobs (batch programs to be executed on a computing element) –Ports represent input/output files the jobs expect/produce –Arcs represent file transfer operations semantics of the workflow: –A job can be executed if all of its input files are available

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Two levels of parallelism by a workflow The workflow concept of the P- GRADE Portal enables the efficient parallelization of complex problems Semantics of the workflow enables two levels of parallelism: The job can be a parallel program – Parallel execution inside a workflow node – Parallel execution among workflow nodes Multiple jobs can run parallel

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July x 10 x 25 x 5 x Forecasting dangerous weather situations (storms, fog, etc.), crucial task in the protection of life and property Processed information: surface level measurements, high- altitude measurements, radar, satellite, lightning, results of previous computed models Requirements: Execution time < 10 min High resolution (1km) Ultra-short range weather forecast (Hungarian Meteorology Service)

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July The typical user scenario Development phase: Certificate servers Portal server Grid services START EDITOR OPEN & EDIT or DEVELOP or IMPORT WORKFLOW SAVE WORKFLOW

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Workflow development Opening the workflow editor The editor is a Java Webstart application download and installation is only one click!

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Workflow Editor Defining the graph The aim is to define a DAG of batch jobs: 1.Drag & drop components: jobs and ports 2.Define their properties 3.Connect ports by channels (no cycles, no loops, no conditions)

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Workflow Editor Properties of a job Properties of a job: Binary executable Type of executable Number of required processors Command line parameters The resource to be used for the execution: Grid/VO (Computing element)

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Direct resource selection: Which computing element to use? The information system portlet queries BDII and GIIS servers I still don’t know which resource to use!

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Automatic resource selection 1.Select a broker Grid/VO for the job (e.g. GILDA_LCG2_broker/GILDA_gLite_broker) 2.(Describe the ranks & requirements of the job in JDL) 3.The portal will use the broker to find the best resource for the job!

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Workflow Editor Defining broker jobs Select a Grid with broker! (*_BROKER) Ignore the resource field! If default JDL is not sufficient use the built-in JDL editor!

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Workflow Editor Built-in JDL editor JDL  look at the gLite Users’ manual!

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Workflow Editor Defining input-output files File properties Type: input: the job reads output: the job generates File type: local: comes from my desktop remote: comes from an SE File: location of the file Internal file name: Executable reads the file in this name – fopen(“file.in”, …) File storage type (output files only): Permanent: final result Volatile: only data channel

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Client side location: result.dat LFC logical file name lfn:/grid/gilda/kozlovszky/11-04_-_result.dat Local file Remote file How to refer to an I/O file? Client side location: c:\experiments\11-04.dat LFC logical file name lfn:/grid/gilda/kozlovszky/11-04.dat Input fileOutput file

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Local vs. remote files Portal server Grid services Computing elements Storage elements REMOTE INPUT FILES REMOTE OUTPUT FILES LOCAL INPUT FILES & EXECUTABLES LOCAL OUTPUT FILES LOCAL INPUT FILES & EXECUTABLES LOCAL OUTPUT FILES Only the permanent files! Your jobs can access storage files directly too!

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Job level file transfer in EGEE VOs Portal server EGEE VO Computing Element Storage Elements MyJob REMOTE INPUT FILE 23 gLite File Catalog Service REMOTE OUTPUT FILE REGISTER REMOTE OUTPUT FILE Custom file transfer Your code CAN but DOES NOT HAVE TO speak grid protocols EGEE WMS Pre script Binary of MyJob Post script 01 LOCAL FILES: Local input User binary Pre script Post script Generated by the portal Input sandbox LOCAL OUTPUT FILE Output sandbox Shell scripts

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Reminder: Grid files and JDL Example JDL file Executable = “gridTest”; StdError = “stderr.log”; StdOutput = “stdout.log”; InputSandbox = {“/home/joda/test/gridTest”}; OutputSandbox = {“stderr.log”, “stdout.log”}; InputData = “lfn:/grid/gilda/mydir/testbed ”; OutputData = “lfn:/grid/gilda/mydir/result ”; lfn: logical file name The file itself is NOT transferred by the gLite (or Globus) middleware! Your binary must transfer input/output grid files! P-GRADE Portal tranfers the file for you. Your executable does not have to know any grid protocol if it is used in P-GRADE Portal

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Workflow Editor Saving the workflow Workflow is defined! Let’s execute it!

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Download proxies 2.Submit workflow 3.Observe workflow progress 4.If some error occurs correct the graph 5.Download result Main steps Executing workflows with the P-GRADE Portal

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July The typical user scenario Execution phase – step 1: Certificate servers Portal server Grid services DOWNLOAD PROXY CERTIFICATES

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July MyProxy interaction in P-GRADE: Certificate Manager To start your session on the Grid you must create a proxy certificate on the portal server “Certificates” portlet: to upload a proxy into MyProxy servers to download a proxy from MyProxy into the portal server Certificates portlet

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Certificate Manager Downloading a proxy 1.MyProxy server access details: Hostname Port number User name (from upload) Password (from upload) 2.Proxy parameters: Lifetime Comment 3.Grid association

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July MyProxy server Portal server Grid services I have to do this before workflow submission Proxy 1 Certificates, proxies with gLite VOs: Download VOMS server Proxy 2 VOMS ext. Proxy 2 VOMS ext.

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Certificate Manager Associating the proxy with a grid This operation displays the details of the certificate and the list of available Grids (defined by portal administrator)

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Certificate Manager browsing proxies Multiple proxies can be available on the portal server at the same time! SEE-GRID CEs and SEs HUNGRID CEs and SEs

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Certificate servers Portal server Grid services TRANSFER FILES, SUBMIT JOBS The typical user scenario Execution phase - step 2:

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Workflow Management (workflow portlet) The portlet presents the status, size and output of the available workflow in the “Workflow” list It has a Quota manager to control the users’ storage space on the server The portlet also contains the “Abort”, “Attach”, “Details”, “Delete” and “Delete all” buttons to handle execution of workflows The “Attach” button opens the workflow in the Workflow Editor The “Details” button gives an overview about the jobs of the workflow

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July White/Red/Green color means the job is initial/running/finished state Workflow Execution (observation by the workflow portlet)

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July White/Red/Green color means the job is initial/running/finished state Workflow Execution (observation by the workflow portlet)

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July White/Red/Green color means the job is initial/running/finished state Workflow Execution (observation by the workflow portlet)

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July White/Red/Green color means the job is initial/running/finished state Workflow Execution (observation by the workflow portlet)

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Workflow Execution (observation by the workflow portlet) White/Red/Green color means the job is initialised/running/finished

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Certificate servers Portal server Grid services The typical user scenario Execution phase – step 3: VISUALIZE JOBS and WORKFLOW PROGRESS MONITOR JOBS

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July The portal monitors and visualizes workflow progress On-Line Monitoring both at the workflow and job levels (workflow portlet)

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Rescuing a failed workflow 1. A job failed during workflow execution Read the error log to know why

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Rescuing a failed workflow 2. Map the failed job onto a different CE or download a new proxy for it. Don’t touch the finished jobs! The execution can continue from the point of failure!

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Rescuing a failed workflow 3.

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Rescuing a failed workflow 4. Resume workflow by the Rescue button

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Logs provided for each job

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Analysis of the log :32:19 - Proxy with VOMS extensions created for VO "voce" with accounting group "" :32:19 - Job submission in progress :32:23 - Job has been submitted successfully! :32:23 - Job identifier is: " :32:26 - EGEE job's status has changed to "Waiting" (host is ) :33:00 - EGEE job's status has changed to "Ready" (host is ce1- egee.srce.hr) :35:46 - EGEE job's status has changed to "Waiting" (host is egee- ce.grid.niif.hu) :36:19 - EGEE job's status has changed to "Ready" (host is ce.cyf- kr.edu.pl) :36:53 - EGEE job's status has changed to "Waiting" (host is ce.cyf-kr.edu.pl) :37:26 - EGEE job's status has changed to "Done" (host is egee-ce.grid.niif.hu) :37:26 - Job found to be finished. Checking again if this is really the case :38:03 - EGEE job's status has changed to "Ready" (host is egee-ce1.gup.uni-linz.ac.at).

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Fault-tolerance by P-GRADE portal 09:33: the broker assigned the job to a site: ce1-egee.srce.hr 09:35: The broker moved the job to another site: egee-ce.grid.niif.hu 09:36: Again the broker moved the job to another site: ce.cyf-kr.edu.pl 09:37: The broker indicated that the job is Done, but. 09:38:... It turned out that the job was not finished (Done - Failed status), only it was moved to another site: egee-ce1.gup.uni-linz.ac.at 09:39: Again the broker moved the job to another site: ares02.cyf- kr.edu.pl 09:39: Again the broker moved the job to another site: ce.cyf-kr.edu.pl 09:40: After trying 10 different sites the VOCE broker gave it up and aborted the job (the Shallow RetryCount was set for 10): :40:16 - The job has been aborted!

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Fault-tolerance by P-GRADE portal Our fault-tolerant portal did not give it up: :40:16 - The job can be submitted again (try 1 out of 3, excluding host(s): ce.cyf-kr.edu.pl) :40:17 - Proxy with VOMS extensions created for VO "voce" with accounting group "" :40:17 - Job submission in progress :40:27 - Job has been submitted successfully! :40:27 - Job identifier is: " :40:30 - EGEE job's status has changed to "Waiting" (host is ) :41:04 - EGEE job's status has changed to "Ready" (host is eszakigrid66.inf.elte.hu).

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July :41:37 - EGEE job's status has changed to "Scheduled" (host is eszakigrid66.inf.elte.hu) :44:57 - EGEE job's status has changed to "Done" (host is eszakigrid66.inf.elte.hu) :44:57 - Job found to be finished. Checking again if this is really the case :45:34 - EGEE job's status has changed to "Waiting" (host is eszakigrid66.inf.elte.hu) :06:06 - The job's status hasn't changed for 20 minutes, resubmitting... It is a quite frequently occurring problem in EGEE-like grids that the broker leaves jobs stuck in CEs. queues.) In such case the portal automatically kills the job on this site and resubmits it to the broker :06:06 - Proxy with VOMS extensions created for VO "voce" with accounting group "" :06:06 - Job submission in progress :06:12 - Job has been submitted successfully! 10:10: The job successfully finished with exit code 0 on site: ce.ui.savba.sk Fault-tolerance by P-GRADE portal

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Certificate servers Portal server Grid services DOWNLOAD (SMALL) RESULTS The typical user scenario Execution phase – step 5

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July Downloading the results…

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July File management portlet

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July P-GRADE portal Pros. –Easy-to-use workflow system with graphical editor –Easy-to-use parameter sweep concept at workflow level –Multi-grid / multi-VO access mechanism: job submission to LCG, gLite and GT2 –Intelligent handling of grid errors –Open source community on Sourceforgeon Sourceforge –Reliable, production installations for several Grid, EGEE VOsinstallations –Part of EGEE RESPECT programmeRESPECT programme –…–… –Features can serve most grid scenarios

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July To become a user of P-GRADE Portal Become user of EGEE Select a portal installation: Portal installations

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July To become a user of P-GRADE Portal contd. Get access

Miklos Kozlovszky - EGEE and EDGeS Summer School on Grid Application Support ‘09 June July 2009 Thank you Miklos Kozlovszky