Presentation is loading. Please wait.

Presentation is loading. Please wait.

1 www.portal.p-grade.hu www.cpc.wmin.ac.uk/gemlca P-GRADE Portal and GEMLCA: A workflow-oriented portal and application hosting environment Miklos Kozlovszky.

Similar presentations


Presentation on theme: "1 www.portal.p-grade.hu www.cpc.wmin.ac.uk/gemlca P-GRADE Portal and GEMLCA: A workflow-oriented portal and application hosting environment Miklos Kozlovszky."— Presentation transcript:

1 1 www.portal.p-grade.hu www.cpc.wmin.ac.uk/gemlca P-GRADE Portal and GEMLCA: A workflow-oriented portal and application hosting environment Miklos Kozlovszky m.kozlovszky@sztaki.hu MTA SZTAKI (Hungarian Academy of Sciences)

2 2 Contents Motivation of creating the tools P-GRADE Portal and GEMLCA in a nutshell Lifecycle of GEMLCA / P-GRADE applications Services provided for application developers Introduction to the hands-on exercises Hands-on

3 3 Context Basic Grid services: AA, job submission, info, … Higher-level grid services (brokering,…) Application toolkits, standards Application Grid middleware services Middleware specific clients Middleware independent services and interfaces of P-GRADE/GEMLCA Graphical interface

4 4 Current situation and trends in Grid computing Fast evolution of Grid systems and middleware: –GT2, OGSA, GT3 (OGSI), GT4 (WSRF), LCG-2, gLite, … Many production Grid systems are built with them –EGEE (LCG-2  gLite), UK NGS (GT2), Open Science Grid (GT2  GT4), NorduGrid (~GT2) Although the same set of core services are available everywhere, they are implemented in different ways –Data services (file management) –Computation services (job submission) –Security services (proxy based single sign-on) –Brokers (not in every middleware, but e.g. in gLite - WMS)

5 5 E-scientists’ concerns The GEMLCA / P-GRADE Portal give you the answers! How to concentrate on my own research if the tool I would like to use is in continuous change? How can I learn and understand the usage of the Grid? How can I develop Grid applications? How can I execute grid applications? How to tackle performance issues? How to use several VOs or several grids at the same time? How to migrate my application from one grid or VO to another? How can I collaborate with fellow researchers?

6 6 P-GRADE Portal in a nutshell General purpose, workflow-oriented computational Grid portal. Supports the development and execution of workflow based Grid applications – a Grid orchestration environment Based on GridSphere web portal framework –Functionalities are accessed through portlets –Easy to expand with new portlets (e.g. application-specific portlets) –Easy to tailor to end-user or community needs Developed by SZTAKI (1.0 in 2003, now 2.5) Grid services supported by P-GRADE Portal 2.5: ServiceEGEE grids (LCG/gLite)Globus 2 grids Job executionComputing ElementGRAM File storageStorage Element, File catalogGridFTP server Certificate managementMyProxy server, VOMS server Information systemBDIIMDS-2, MDS-4 BrokeringWorkload Management System Job monitoringMercury Workflow & job visualizationPROVE Solves Grid interoperability problem at the workflow level TODAY’S FOCUS

7 7 GEMLCA extension of the P-GRADE Portal P-GRADE Portal extended with GEMLCA Grid service back-end –To share jobs and legacy codes as application components with others –A step towards collaborative e-Science Developed by the University of Westminster (London) Support for Globus 4 grids (besides GT2 and EGEE) Available on the NGS and OGF GIN P-GRADE Portal GEMLCA Globus 4 VOs Globus 2 VOs LCG / gLite VOs job

8 8 Related projects The development, operation and training of P-GRADE Portal and GEMLCA is supported by the following projects: –SEE-GRID www.see-grid.eu Development, application support –Coregrid www.coregrid.net Research, development –EGEE www.eu-egee.org gLite training, application development –ICEAGE www.iceage-eu.org Grid training and education

9 9 A Grid application in the GEMLCA / P-GRADE Portal A directed acyclic graph where –Nodes represent jobs or services (a batch program executed on a computing resource) –Ports represent input/output files the components expect/produce –Arcs represent file transfer operations Semantics of the workflow: –A job can be executed if all of its input files are available –Responsibility of the built-in workflow manager

10 10 Three levels of parallelism within a P-GRADE Portal application The workflow concept of the GEMLCA/ P-GRADE Portal enables the efficient parallelization of complex problems Semantics of the workflow enables two levels of parallelism: The job/service can be a parallel code – Parallel execution inside a workflow node – Parallel execution among workflow nodes Multiple nodes can run parallel Multiple instances of the same workflow process different data files – Parametric sweep execution of the workflow (SIMD)

11 11 25 x 10 x 25 x 5 x Forecasting dangerous weather situations (storms, fog, etc.), crucial task in the protection of life and property Processed information: surface level measurements, high- altitude measurements, radar, satellite, lightning, results of previous computed models Requirements: Execution time < 10 min High resolution (1km) Ultra-short range weather forecast (Hungarian Meteorology Service)

12 12 Workflow-level Grid interoperability: The GIN Resource Testing portal Grid Interoperability Now VO Portal: OGF effort to demonstrate workflow level grid interoperability between major production Grids and to monitor these resources

13 13 The typical user scenario Part 1 - development phase MyProxy servers Portal server Grid services START EDITOR OPEN & EDIT or DEVELOP WORKFLOW or PS WF SAVE WF / PS REUSE WORKFLOW COMPONENTS

14 14 MyProxy servers Portal server Grid services TRANSFER FILES, SUBMIT JOBS DOWNLOAD (SMALL) RESULTS The typical user scenario Part 2 - execution phase VISUALIZE JOBS and WORKFLOW PROGRESS MONITOR JOBS DOWNLOAD PROXY CERTIFICATES Keep large files on Grid storage resources

15 15 Portal server Grid services The typical user scenario Part 3 - collaborative phase Share workflow components with other users of the same portal Export and share workflows with users of the same, or another portal MyProxy servers

16 16 Inside the portal server Tomcat DAGMan workflow manager Information systems MyProxy server & VOMS P-GRADE Portal portlets (JSR-168, Gridsphere 2): Workflow, Certificates, Information System, Settings, GEMLCA Information system clients CoG API & scripts Java Webstart workflow editor Web browser shell scripts Grid middleware services (WMS, LFC, SE, …) Client P-GRADE Portal server Grid Grid middleware clients Mercury monitor service Mercury API GEMLCA service (WSRF) Optional plug-in: Technology specific gateways File transfer Proxy management Load monitoring

17 17 Workflow Editor Defining the graph Define a Directed Acyclic Graph (DAG) of jobs and services (GEMLCA jobs): 1.Drag & drop components: nodes and ports 2.Define their properties 3.Connect ports by channels (no cycles, no loops, no conditions…)

18 18 Workflow Editor Properties of a job component Properties of a job: Type of executable Client side location of the binary Number of required processors Command line parameters The resource to be used for the execution: Grid (VO) Resource / broker

19 19 Workflow Editor Defining broker jobs Select a Grid with broker! (*_BROKER) Ignore the resource field! If default JDL is not sufficient use the built-in JDL editor!

20 20 Workflow Editor Built-in JDL editor for brokered jobs JDL  look at the gLite Users’ manual! Rank & Requirement

21 21 Workflow Editor Properties of a service component (GEMLCA job) Properties of a service: The location of the service: Grid (VO) Resource / broker An application (binary) associated with that resource Input parameter values for the service

22 22 Workflow Editor Defining job / service input-output data File properties Type: input: the component reads output: the component writes File type: local: originates from my desktop remote: originates from a grid storage element File: location of the file File storage type (for outputs only): Permanent: final result Volatile: used only for inter-component data transfer

23 23 How to refer to an I/O file? Client side location: c:\experiments\11-04.dat LFC logical file name (LFC file catalog is required – EGEE VOs) lfn:/grid/gilda/sipos/11-04.dat GridFTP address (in Globus Grids): gsiftp://somengshost.ac.uk/mydir/11-04.dat Input fileOutput file Local file Remote file Client side location: result.dat LFC logical file name (LFC file catalog is required – EGEE VOs) lfn:/grid/gilda/sipos/11-04_-_result.dat GridFTP address (in Globus Grids): gsiftp://somengshost.ac.uk/mydir/result.dat

24 24 Workflow level file transfer by the workflow manager Portal server Grid infrastructure Computing elements Storage elements REMOTE INPUT FILES REMOTE OUTPUT FILES LOCAL INPUT FILES & BINARIES LOCAL OUTPUT FILES LOCAL INPUT FILES & BINARIES LOCAL OUTPUT FILES GEMLCA repository User level storage Binaries of GEMLCA jobs

25 25 Job / service level file transfer by the workflow manager Portal server Grid infrastructure Computing Element Storage Elements 01 2 binary Post script Custom file transfer Pre script 3 REMOTE INPUT FILE LOCAL INPUT FILE 0 1 REMOTE OUTPUT FILE LOCAL OUTPUT FILE 23 Generated by the portal

26 26 Reminder: grid files in JDL Example JDL file Executable = “gridTest”; StdError = “stderr.log”; StdOutput = “stdout.log”; InputSandbox = {“/home/joda/test/gridTest”}; OutputSandbox = {“stderr.log”, “stdout.log”}; InputData = “lfn:/grid/VOname/mydir/testbed0-00019”; Requirements = other.Architecture==“INTEL” && \ other.OpSys==“LINUX” && other.FreeCpus >=4; Rank = “other.GlueHostBenchmarkSF00”; lfn: logical file name RB uses File Catalog to find file location The file itself is NOT transferred by the middleware! Your binary must transfer input/output grid files! Higher level tools can tranfer the file for you. E.g. P-GRADE Portal Your code does not have to “speak” storage protocols if it is developed in P-GRADE Portal!

27 27 Information system portlet to browse computing elements Graphical interface for BDII servers

28 28 1.Download proxies 2.Submit workflow 3.Observe workflow progress 4.If some error occurs correct the graph 5.Download result Main steps Workflow execution

29 29 Certificate Manager Certificates portlet To start your session on the Grid you must create a proxy certificate on the portal server “Certificates” portlet: to upload a proxy into MyProxy servers to download a proxy from MyProxy into the portal server

30 30 Certificate Manager Downloading a proxy 1.MyProxy server access details: Hostname Port number User name (from upload) Password (from upload) 2.Proxy parameters: Lifetime Comment 3.Grid association

31 31 Certificate Manager Multi-grid portal  Multi-proxy environment Multiple proxies can be available on the portal server at the same time! Certificate from EGEE CA: SEE-GRID CEs and SEs Certificate from Hungarian CA: HUNGRID CEs and SEs

32 32 Certificates, proxies: Upload MyProxy server Portal server Grid services Generate and upload a long proxy I have to do this once, when I use the portal for the first time… Certificate Private key Proxy 1 Delete private key and certificate

33 33 MyProxy server Portal server Grid services I have to do this every time when I want to execute workflows Proxy 1 Certificates, proxies with gLite VOs: Download VOMS server Proxy 2 VOMS ext. Proxy 2 VOMS ext.

34 34 Workflow Management (workflow portlet) The portlet presents the status, size and output of the available workflow in the “Workflow” list It has a Quota manager to control the users’ storage space on the server The portlet also contains the “Abort”, “Attach”, “Details”, “Delete” and “Delete all” buttons to handle execution of workflows The “Attach” button opens the workflow in the Workflow Editor The “Details” button gives an overview about the jobs of the workflow

35 35 White/Red/Green color means the job is initial/running/finished state Workflow Execution (observation by the workflow portlet)

36 36 White/Red/Green color means the job is initial/running/finished state Workflow Execution (observation by the workflow portlet)

37 37 White/Red/Green color means the job is initial/running/finished state Workflow Execution (observation by the workflow portlet)

38 38 White/Red/Green color means the job is initial/running/finished state Workflow Execution (observation by the workflow portlet)

39 39 Workflow Execution (observation by the workflow portlet) White/Red/Green color means the job is initialised/running/finished

40 40 On-Line Monitoring both at the workflow and job levels (workflow portlet) -The portal monitors and visualizes workflow progress -The portal monitors and visualizes parallel jobs (if they are prepared for Mercury monitor)

41 41 Rescuing a failed workflow 1. A job failed during workflow execution Read the error log to know why

42 42 Rescuing a failed workflow 2. Map the failed job onto a different resource or download a new proxy for it Don’t touch the finished jobs! The execution can continue from the point of failure

43 43 Downloading the results…

44 44 Sharing a successfully finished job with other users: GEMLCA repository GEMLCA repository

45 45 Collaborative grid applications Combine services and your code in the same workflow!Serviceinvocation Service invocation Job submission

46 46 File Management through LFC and LCG File / Directory management through LFC and LCG –listing LFC hosts for the selected VO –browsing a LFC directory –creating a new directory –removing a directory/file –displaying details (owner-group info, last modification, access rights) of a directory/file –renaming a directory/file –changing access rights of a directory/file –uploading a (local) file to a storage element –downloading a file from a storage element –listing replicas of a file –replicating a file –deleting a replica of a file In the frame of the Portal Developer Alliance Birsen Omay from the Middle East Technical University has created this new P-GRADE Portal extension.

47 47 File Management through LFC and LCG LFC Host Portal server LFC file and directory management Storage Element

48 48 MyProxy Credential Manager Getting information about a MyProxy credential MyProxy Credential Management –getting info about a previously stored MyProxy credential –changing passphrase of a MyProxy credential –removing a credential In the frame of the Portal Developer Alliance Birsen Omay from the Middle East Technical University has created this new P-GRADE Portal extension

49 49 MyProxy Credential Manager Getting information about a MyProxy credential MyProxy server access details: Hostname Port number User name (from upload) Password (from upload) Display information about MyProxy credential

50 50 Certificates, proxies: Getting information MyProxy server Portal server Grid services Get information (owner, start date, end date) about the credential stored for “username” from MyProxy Server Request information about the credential

51 51 MyProxy Credential Manager Getting information about a MyProxy credential Information about the credential for username “birsen”

52 52 Certificates, proxies: Changing Passphrase MyProxy server Portal server Grid services Modify password of the credential stored for “username” Change password for “username”

53 53 Certificates, proxies: Removing a Credential MyProxy server Portal server Grid services Destroy the credential for “username” Remove the credential for “username” from MyProxy server

54 54 How to get access? P-GRADE Portal service is available: –SEE-GRID infrastructure –Central European VO of EGEE –GILDA: Training VO of EGEE –Many national Grids (UK National Grid Service, HunGrid, Turkish Grid, etc.) –US Open Science Grid, TeraGrid –Economy-Grid, Swiss BioGrid, Bio and Biomed EGEE VOs, BalticGrid –OGF Grid Interoperability Now (GIN) VO portal.p-grade.hu/index.php?m=5&s=0

55 55 Summary and conclusion P-GRADE Portal hides the complexity of Grid systems –Globus 2, Globus 4, LCG, gLite Various components can be integrated into workflows Sequential codes MPI codes Legacy code services (with the GEMLCA-specific version) Workflows can be executed as parameter studies –Storage management –Generators –Collectors Your code does not have to contain grid specific calls Graphical interfaces for –grid application development –certificate management –application execution and monitoring Support for collaborative work –Share workflow components –Share workflows Built by standard portlet API  customizable to specific needs

56 56 Questions? www.portal.p-grade.hu pgportal@lpds.sztaki.hu Learn once, use everywhere Develop once, execute anywhere


Download ppt "1 www.portal.p-grade.hu www.cpc.wmin.ac.uk/gemlca P-GRADE Portal and GEMLCA: A workflow-oriented portal and application hosting environment Miklos Kozlovszky."

Similar presentations


Ads by Google