Support to MPI, Schedulers and Complex Workflows

Slides:



Advertisements
Similar presentations
Nimrod/K: Towards Massively Parallel Dynamic Grid Workflows David Abramson, Colin Enticott, Monash Ilkay Altinas, UCSD.
Advertisements

Experiences with GridWay on CRO NGI infrastructure / EGEE User Forum 2009 Experiences with GridWay on CRO NGI infrastructure Emir Imamagic, Srce EGEE User.
Challenges for Interactive Grids a point of view from Int.Eu.Grid project Remote Instrumentation Services in Grid Environment RISGE BoF Manchester 8th.
Distributed Systems Architecture Research Group Universidad Complutense de Madrid EGEE UF4/OGF25 Catania, Italy March 2 nd, 2009 State and Future Plans.
EGC 2005, CrossGrid technical achievements, Amsterdam, Feb. 16th, 2005 WP2-3 New Generation Environment for Grid Interactive MPI Applications M igrating.
EUFORIA FP7-INFRASTRUCTURES , Grant Scientific Workflows Kepler and Java API 4 HPC/GRID ITM meeting Juelich 2009 Michał Owsiak Marcin Płóciennik.
Defining France Grilles resource allocation strategy Gilles Mathieu, IN2P3 Computing Centre France Grilles International Advisory Committee – March 2011.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
EUFORIA FP7-INFRASTRUCTURES , Grant JRA3 B. Guillerminet on behalf of the JRA3 project 22 January 2008 Kick-Off Meeting January 2008.
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space Cracow Grid Workshop’10 Kraków, October 11-13,
EUFORIA FP7-INFRASTRUCTURES , Grant Scientific Workflows Kepler and Java API 4 HPC/GRID Hands on tutorial - ITM Meeting 2009 Michal Owsiak.
DORII Joint Research Activities DORII Joint Research Activities Status and Progress 4 th All-Hands-Meeting (AHM) Alexey Cheptsov on.
EGI: SA1 Operations John Gordon EGEE09 Barcelona September 2009.
EGI-InSPIRE Steven Newhouse Interim EGI.eu Director EGI-InSPIRE Project Director.
OGF 25/EGEE User Forum Catania, March 2 nd 2009 Meta Scheduling and Advanced Application Support on the Spanish NGI Enol Fernández del Castillo (IFCA-CSIC)
:: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: :: GridKA School 2009 MPI on Grids 1 MPI On Grids September 3 rd, GridKA School 2009.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Lessons learnt from the EGEE Application Porting Support activity Gergely Sipos Coordinator.
ITPA/IMAGE 7-10 May 2007 Software and Hardware Infrastructure for the ITM B.Guillerminet, on behalf of the ITM & ISIP teams (P Strand, F Imbeaux, G Huysmans,
Resource Brokering in the PROGRESS Project Juliusz Pukacki Grid Resource Management Workshop, October 2003.
EUFORIA FP7-INFRASTRUCTURES , Grant EUFORIA: EU Fusion fOR ITER Applications Marcus Hardt
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks, An Overview of the GridWay Metascheduler.
Migrating Desktop Marcin Płóciennik Marcin Płóciennik Kick-off Meeting, Santander, Graphical.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE-EGI Grid Operations Transition Maite.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks, Novelties and Features around the GridWay.
SEE-GRID-SCI The SEE-GRID-SCI initiative is co-funded by the European Commission under the FP7 Research Infrastructures contract no.
BalticGrid-II Project The Second BalticGrid-II All-Hands Meeting, Riga, May, Joint Research Activity Enhanced Application Services on Sustainable.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGI Operations Tiziana Ferrari EGEE User.
EGI-InSPIRE Steven Newhouse Interim EGI.eu Director EGI-InSPIRE Project Director Technical Director EGEE-III 1GDB - December 2009.
CERN, DataGrid PTB, April 10, 2002 CrossGrid – DataGrid Collaboration (Framework) Marian Bubak and Bob Jones.
EUFORIA FP7-INFRASTRUCTURES , Grant Migrating Desktop Uniform Access to the Grid Marcin Płóciennik Poznan Supercomputing and Networking Center.
PIC port d’informació científica EGEE – EGI Transition for WLCG in Spain M. Delfino, G. Merino, PIC Spanish Tier-1 WLCG CB 13-Nov-2009.
Dr. Isabel Campos Plasencia (IFCA-CSIC) Spanish NGI Coordinator ES-GRID The Spanish National Grid Initiative.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI MPI and Parallel Code Support Alessandro Costantini, Isabel Campos, Enol.
BalticGrid-II Project EGEE UF’09 Conference, , Catania Partner’s logo Framework for Grid Applications Migrating Desktop Framework for Grid.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Spanish National Research Council- CSIC Isabel.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Interfacing gLite services with the Kepler.
Migrating Desktop Uniform Access to the Grid Marcin Płóciennik Poznan Supercomputing and Networking Center Poznan, Poland EGEE’07, Budapest, Oct.
EGI-InSPIRE RI EGI Community Forum 2012 EGI-InSPIRE EGI-InSPIRE RI EGI Community Forum 2012 Kepler Workflow Manager.
INFSO-RI JRA2 Test Management Tools Eva Takacs (4D SOFT) ETICS 2 Final Review Brussels - 11 May 2010.
Migrating Desktop Uniform Access to the Grid Marcin Płóciennik Poznan Supercomputing and Networking Center Poland EGEE’08 Conference, Istanbul, 24 Sep.
CERN - IT Department CH-1211 Genève 23 Switzerland t IT-GD-OPS attendance to EGEE’09 IT/GD Group Meeting, 09 October 2009.
EMI INFSO-RI Testbed for project continuous Integration Danilo Dongiovanni (INFN-CNAF) -SA2.6 Task Leader Jozef Cernak(UPJŠ, Kosice, Slovakia)
EGI-InSPIRE Project Overview1 EGI-InSPIRE Overview Activities and operations boards Tiziana Ferrari, EGI.eu Operations Unit Tiziana.Ferrari at egi.eu 1.
CMB & LSS Virtual Research Community Marcos López-Caniego Enrique Martínez Isabel Campos Jesús Marco Instituto de Física de Cantabria (CSIC-UC) EGI Community.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The Dashboard for Operations Cyril L’Orphelin.
1 An unattended, fault-tolerant approach for the execution of distributed applications Manuel Rodríguez-Pascual, Rafael Mayo-García CIEMAT Madrid, Spain.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks GOCDB4 Gilles Mathieu, RAL-STFC, UK An introduction.
EGI-InSPIRE RI EGI-InSPIRE RI EGI-InSPIRE Software provisioning and HTC Solution Peter Solagna Senior Operations Manager.
Advantages of adopting late-binding techniques through standardised interfaces for workflow managers. A.J. Rubio-Montero 1, M. Plociennik 2, I. Marín-Carrión.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Shared Services and Tools MPI John Walsh, Isabel Campos, Antonio Laganà EGITF-2010,
Piotr Bała, Marcin Radecki, Krzysztof Benedyczak
JRA1 Middleware re-engineering
Bob Jones EGEE Technical Director
Accessing the VI-SEEM infrastructure
C Loomis (CNRS/LAL) and V. Floros (GRNET)
User Interfaces: Science Gateways, Workflows and Toolkits
Scientific workflow in Kepler – hands on tutorial
European Middleware Initiative (EMI)
Management of Virtual Machines in Grids Infrastructures
PRACE-EGI helpdesk integration
N.B. Please always use EGI-InSPIRE templates!
Maite Barroso, SA1 activity leader CERN 27th January 2009
Action U-E-5 Technical Coordination – User Technical Support
Interoperability & Standards
Management of Virtual Machines in Grids Infrastructures
PROCESS - H2020 Project Work Package WP6 JRA3
Leigh Grundhoefer Indiana University
Operations Management Board April 30
Introduction to the SHIWA Simulation Platform EGI User Forum,
Presentation transcript:

Support to MPI, Schedulers and Complex Workflows Compiled by Isabel Campos Spanish NGI Director ES-NGI: Enol Fernandez (IFCA-CSIC, Santander) and Ruben S. Montero (U. Complutense de Madrid) GRID-IRELAND: John Walsh (TCD, Dublin) PL-Grid: Marcin Plociennik (PSNC, Poznan)

EGI-InSPIRE proposal WE NEED TO PROGRESS NOW TOWARDS A DESCRIPTION MPI MPI Tools based on mpi-start Schedulers GRIDWAY Complex Workflows SOMA (Life Sciences env.) TAVERNA (Life Sciences env.) KEPLER-RAS (Fusion env.) WE NEED TO PROGRESS NOW TOWARDS A DESCRIPTION OF THE WORK TO BE DONE TO SUPPORT THE HEAVY USER COMMUNITIES IN SA3

Support to MPI Input from Enol Fernandez (IFCA-CSIC) John Walsh (TCD) + MPI Working Group

Support to MPI: final steps in EGEE JRA1 Missing functionality Only if critical User Input gathered by the MPI WG Bad functioning of a particular feature JRA1 SA3 SA1 MPI Task Force remit Abnormal failure rates at the sites SA1 SAM  Nagios

Support to MPI: final steps in EGEE CLOSING EGEE-III WITH A STABLE NUMBER OF SITES WITH PROPER MPI SUPPORT FROM WHICH TO GROW A WELL DEFINED MPI SUPPORTING INFRASTRUCTURE IN EGI CURRENTLY 94 SITES SUPPORT MPI, OF WHICH 84% ARE SUCCESFULLY PASSING THE SAM/NAGIOS TESTS A KNOWLEDGE-DATABASE FOR SITE SUPPORT IS IN PLACE HTTP://WIKI.IFCA.ES/E-CIENCIA/INDEX.PHP/MPI_ERRORS DEFINE THE SET OF REQUIREMENTS THAT USERS GROUPS FIND NECESSARY FOR MORE ADVANCED MPI FEATURES IN THE EGI ERA A DOCUMENT IS BEING WORKED OUT INSIDE THE MPI WORKING GROUP

Providing MPI Support to EGI mpi-start will be maintained by CSIC inside the EMI project Testing & Certification of midldeware components will be organized from Ibergrid (Spain + Portugal) For MPI components LIP (Portugal), CESGA (Spain) will count on the support from TCD for the certification effort

Recent developments to improve user support in mpi-start BASIC FEATURES OF MPI-START Supports OpenMPI and MPICH Supports file distribution in non-shared filesistems Hooks mechanism in place to ease I/O at pre- and post-run time CURRENT VERSION 0.61 (ALREADY CERTIFIED) Weaknesses in error reporting identified and fixed Improved file distribution mechanism (allows using $HOME and also other more generic i/o spaces) Automatic detection of 32bit or 64bit compiled libraries FUTURE SUPPORT FOR ADVANCED SELECTION OF CORES/NODE Important for a proper MPI process allocation Important for OpenMP support (multithreaded codes)

Summary of actions Rollout of RPMs will take place with the general mechanism foreseen in EGI User Support and Site support will be organized in the EGI Helpdesk We expect to get feedback and requirements from the EGI user communities EGI Requirements will be transmited to the Software Providers EMI will provide: mpi-start (CSIC) and MPI-utils (TCD) Testing & Certification will take place organized by Ibergrid (CESGA, LIP) and TCD

Input from Ruben Santiago Universidad Complutense de Madrid Schedulers: GridWay Input from Ruben Santiago Universidad Complutense de Madrid

A Metascheduler to enable interoperation with GT4 and clouds Different Middleware stacks Different Data/Execution models Global user identities Integration through adapters NGI - Broker Users GridWay Applications Middleware GT4 GT4 gLite gLite gLite gLite SGE Cluster PBS Cluster SGE Cluster PBS Cluster SGE Cluster PBS Cluster

Integration of glite + GT4 in ES-NGI Integrate NGI-GT4 Resources GridWay Broker instance deployed (RedIRIS) Backup/Testing available at UCM Integrate with NGI global services User access to GT4 resources

Complex Workflows: Kepler/RAS Input from Marcin Plociennik PSNC, Poznan (PL-GRID)

Handling Scientific workflows

Kepler/RAS - overview Kepler – workflow orchestration A framework for design, execution and deployment of scientific workflows Support for concurrent modelling, design and execution Precisely defined models of computation and component interaction An intuitive GUI that lets rapid workflow composition A modular, reusable and extendable object-oriented environment An XML based workflow definition – MoML Developed in US (UC Davis, UC Santa Barbara, and UC San Diego) In terms of Euforia project extended with GRID/HPC execution actors (with usage of RAS services) Chosen and used by fusion community (EFDA ITM) RAS – Roaming Access Server (part of Migrating Desktop) Support for different middleware stacks (gLite/UNICORE) Developed in terms of int.eu.grid/BalticGrid II/Euforia Integrated with VineToolkit/gLogin Providing interactive services

Support for Scientific workflows Support for different complex use cases Different level of integration with applications Mixed Grid & HPC workflows Support fot workflows requiring visualisation and interactive access Kepler Engine Different middleware stacks Integration through plugins/adapters i2glogin RAS gLite UI I2G UI Vine Toolkit i2g GRID and HPC infrastructures

Activities planned To maintain the integration of Kepler/RAS with the different underlying middleware stacks To maintain Kepler/RAS services But also to Support next application use cases Supporting different workflow scenarios Customisation according to specific user’s requirements Initial target – Fusion community, however since the framework provides generic services, open to support wider user communities (like coming from ES, A&A, LS or other)