Consorzio COMETA - Progetto PI2S2 UNIONE EUROPEA HPC Applications on the Sicilian Grid Infrastructure Marcello Iacono-Manno

Slides:



Advertisements
Similar presentations
Challenges for Interactive Grids a point of view from Int.Eu.Grid project Remote Instrumentation Services in Grid Environment RISGE BoF Manchester 8th.
Advertisements

High Performance Computing Course Notes Grid Computing.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Supporting MPI Applications on EGEE Grids Zoltán Farkas MTA SZTAKI.
INFSO-RI Enabling Grids for E-sciencE EGEE Middleware The Resource Broker EGEE project members.
High Performance Computing (HPC) at Center for Information Communication and Technology in UTM.
FESR Consorzio COMETA - Progetto PI2S2 Using MPI to run parallel jobs on the Grid Marcello Iacono Manno Consorzio COMETA
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Supporting MPI applications on the EGEE Grid.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Special Jobs Matias Zabaljauregui UNLP.
SICSA student induction day, 2009Slide 1 Social Simulation Tutorial Session 6: Introduction to grids and cloud computing International Symposium on Grid.
Connecting OurGrid & GridSAM A Short Overview. Content Goals OurGrid: architecture overview OurGrid: short overview GridSAM: short overview GridSAM: example.
EGEE is a project funded by the European Union under contract IST Input from Generic and Testing Roberto Barbera NA4 Generic Applications Coordinator.
BESIII distributed computing and VMDIRAC
FESR Consorzio COMETA Grid Introduction and gLite Overview Corso di formazione sul Calcolo Parallelo ad Alte Prestazioni (edizione.
Computational grids and grids projects DSS,
:: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: :: GridKA School 2009 MPI on Grids 1 MPI On Grids September 3 rd, GridKA School 2009.
Enabling Grids for E-sciencE ENEA and the EGEE project gLite and interoperability Andrea Santoro, Carlo Sciò Enea Frascati, 22 November.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
Enabling Grids for E-sciencE Workload Management System on gLite middleware Matthieu Reichstadt CNRS/IN2P3 ACGRID School, Hanoi (Vietnam)
INFSO-RI Enabling Grids for E-sciencE Workload Management System Mike Mineter
Group 1 : Grid Computing Laboratory of Information Technology Supervisors: Alexander Ujhinsky Nikolay Kutovskiy.
November SC06 Tampa F.Fanzago CRAB a user-friendly tool for CMS distributed analysis Federica Fanzago INFN-PADOVA for CRAB team.
E-science grid facility for Europe and Latin America E2GRIS1 Gustavo Miranda Teixeira Ricardo Silva Campos Laboratório de Fisiologia Computacional.
E-science grid facility for Europe and Latin America Using Secure Storage Service inside the EELA-2 Infrastructure Diego Scardaci INFN (Italy)
Cloud Age Time to change the programming paradigm?
June 24-25, 2008 Regional Grid Training, University of Belgrade, Serbia Introduction to gLite gLite Basic Services Antun Balaž SCL, Institute of Physics.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Feb. 06, Introduction to High Performance and Grid Computing Faculty of Sciences,
FESR Consorzio COMETA - Progetto PI2S2 The COMETA consortium and its activities for Grid adoption by Industry in the context of.
Conference name Company name INFSOM-RI Speaker name The ETICS Job management architecture EGEE ‘08 Istanbul, September 25 th 2008 Valerio Venturi.
Campus grids: e-Infrastructure within a University Mike Mineter National e-Science Centre 14 February 2006.
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Grid2Win: Porting of gLite middleware to.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid2Win : gLite for Microsoft Windows Roberto.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
FESR Consorzio COMETA - Progetto PI2S2 GRID Success Stories in Sicily (The TriGrid VL and PI2S2 Projects) Roberto Barbera University.
FESR Consorzio COMETA - Progetto PI2S2 Porting a program to run on the Grid Marcello Iacono Manno Consorzio COMETA
LSF Universus By Robert Stober Systems Engineer Platform Computing, Inc.
INFSO-RI Enabling Grids for E-sciencE GILDA Praticals Giuseppe La Rocca INFN – Catania gLite Tutorial at the EGEE User Forum CERN.
Università di Perugia Enabling Grids for E-sciencE Status of and requirements for Computational Chemistry NA4 – SA1 Meeting – 6 th April.
E-infrastructure shared between Europe and Latin America FP6−2004−Infrastructures−6-SSA Special Jobs Valeria Ardizzone INFN - Catania.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Spanish National Research Council- CSIC Isabel.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Practical using WMProxy advanced job submission.
Performance Evaluation of Parallel Algorithms on a Computational Grid Environment Simona Blandino 1, Salvatore Cavalieri 2 1 Consorzio COMETA, 2 Faculty.
D.Spiga, L.Servoli, L.Faina INFN & University of Perugia CRAB WorkFlow : CRAB: CMS Remote Analysis Builder A CMS specific tool written in python and developed.
FESR Consorzio COMETA Giuseppe Andronico INFN Sez. CT & Consorzio COMETA Workshop Grids vs. Clouds Beijing, Consorzio.
FESR Consorzio COMETA - Progetto PI2S2 Porting MHD codes on the GRID infrastructure of COMETA Germano Sacco & Salvatore Orlando.
Breaking the frontiers of the Grid R. Graciani EGI TF 2012.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Overview of gLite, the EGEE middleware Mike Mineter Training Outreach Education National.
II EGEE conference Den Haag November, ROC-CIC status in Italy
Consorzio COMETA - Progetto PI2S2 UNIONE EUROPEA Grid2Win : gLite for Microsoft Windows Elisa Ingrà - INFN.
Claudio Grandi INFN Bologna Virtual Pools for Interactive Analysis and Software Development through an Integrated Cloud Environment Claudio Grandi (INFN.
FESR Trinacria Grid Virtual Laboratory The Trinacria Grid Virtual Laboratory Roberto Barbera University of Catania and INFN Grid Open Day.
FESR Consorzio COMETA - Progetto PI2S2 Using MPI to run parallel jobs on the Grid Marcello Iacono Manno Consorzio Cometa
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Enabling Grids for E-sciencE Work Load Management & Simple Job Submission Practical Shu-Ting Liao APROC, ASGC EGEE Tutorial.
Enabling Grids for E-sciencE Claudio Cherubino INFN DGAS (Distributed Grid Accounting System)
Accessing the VI-SEEM infrastructure
Using MPI to run parallel jobs on the Grid
SuperComputing 2003 “The Great Academia / Industry Grid Debate” ?
gLite MPI Job Amina KHEDIMI CERIST
Special jobs with the gLite WMS
The gLite Workload Management System
Grid Computing.
CompChem VO: User experience using MPI
Special Jobs: MPI Alessandro Costa INAF Catania
University of Technology
Using MPI to run parallel jobs on the Grid
How To Integrate an Application on Grid
Presentation transcript:

Consorzio COMETA - Progetto PI2S2 UNIONE EUROPEA HPC Applications on the Sicilian Grid Infrastructure Marcello Iacono-Manno Consorzio COMETA OGF25 / 4th EGEE User Forum Catania, March 5 th, 2009

OGF25 / 4 th EGEE User Forum, Catania, 2009, March 5th 2 Overview Sicilian Infrastructure Grid & HPC Middleware Scheduling Policies License Server Use Cases –FLUENT, FLASH, OpenFOAM, etc...

OGF25 / 4 th EGEE User Forum, Catania, 2009, March 5th Objectives of an e-Infrastructure in Sicily Create a Virtual Laboratory in Sicily, both for scientific and industrial applications, built on top of a Grid infrastructure and with a special support for HPC Connect the Sicilian e-Infrastructure to those already existing in Italy, in Europe, and in the rest of the world improving the scientific collaboration and increasing the “competitiveness” of e-Science and e-Industry “made in Sicily” Disseminate the “grid paradigm” through the organization of dedicated events and training courses Trigger/foster the creation of spin-offs in the ICT area in order to reduce the “brain drain” of brilliant young people to other parts of Italy and beyond

OGF25 / 4 th EGEE User Forum, Catania, 2009, March 5th 4 The Sicilian E-Infrastructure ~ € in 3 years ! ~350 FTEs! (2/3 new employees)‏ More than 2000 CPUs More than 300 TBytes

OGF25 / 4 th EGEE User Forum, Catania, 2009, March 5th 5 Why HPC on Grid Provide a Grid supporting HPC is important to involve new communities –Many HPC users will be happy to move to Grid and reduce the costs to achieve comparable performances –This is the main way towards sustainability PI2S2 has pushed a big effort to enable its infrastructure for HPC applications –Choosing infrastructure components satisfying the requirements of many HPC applications  es. server with multicore CPU, InfiniBand for node interconnection –Developing new solutions to integrate in the Grid middleware so to better manage HPC applications –Developing a license server to run on Grid so to allow the execution of commercial parallel applications  many users required commercial application such as FLUENT and others

OGF25 / 4 th EGEE User Forum, Catania, 2009, March 5th 6 Grid vs. HPC GRID –Heterogeneous  Resources  Jobs  Users –Common network infrastructure (generally GigaBit Ethernet)‏ –Focus on the overall infrastructure performance  Number of jobs –Allows resources sharing  Reduce the ownership of the infrastructure  Stability is more difficult to be attained  High overall computer power and storage capacity available at a reasonable cost HPC –Clusters dedicated to a single user (or community)‏  One user at a time –Components optimised for one or a few applications –Ad-hoc network technologies (InfiniBand) –Cutting edge components to get the best performance on each single job –Too expensive for many small companies and/or research groups –Limited availability of calculus power and/or storage capacity

OGF25 / 4 th EGEE User Forum, Catania, 2009, March 5th 7 GigaBit vs InfiniBand The advantage of using a low – latency network becomes more evident the greater the number of nodes #cycles

OGF25 / 4 th EGEE User Forum, Catania, 2009, March 5th 8 MPI & Grid An MPI program is an application including calls to MPI library functions that synchronise the execution flow among the cooperating nodes Official gLite supports only two MPI implementations: –MPICH –MPICH2 Several patches enable MPI on “old” GigaBit Ethernet and “new” low-latency InfiniBand nets –In COMETA infrastructure MPI jobs run on either GigaBit (MPICH, MPICH2) or InfiniBand (MVAPICH, MVAPICH2) Currently, MPI parallel jobs can run only inside a single Computing Elements (CE)‏ –The possibility of executing parallel jobs spread on different CEs is under investigation –We are thinking about wiring the whole Catania Campus with IB

OGF25 / 4 th EGEE User Forum, Catania, 2009, March 5th 9 Submission of an MPI application MPI jobs are specified by setting the JDL JobType attribute to MPICH The implementation variant is specified with the attribute MPIType – _, e.g. MVAPICH2_PGI706 The NodeNumber defines the number of required cores Es. Matchmaking: the Resource Broker (RB) chooses a CE (if any!) with enough free Processing Elements (PE = CPU cores)‏ e.g.: free PE# ≥ NodeNumber Type = " Job "; JobType = " MPICH "; MPIType = " MVAPICH_gcc4 "; NodeNumber = 12 ; Executable = " mergesort-ib1-gcc4 ";

OGF25 / 4 th EGEE User Forum, Catania, 2009, March 5th 10 Matchmaking Replaced with the real value When these attributes are included in a JDL script the following requirements are automatically added by the WMS: (other.GlueCEInfoTotalCPUs >= NodeNumber) && Member (,other.GlueHostApplicationSoftwareRunTimeEnvironment) ‏ These requirements allow the WMS to find out the best resource where the job can be executed

OGF25 / 4 th EGEE User Forum, Catania, 2009, March 5th 11 Additional elements Executable specifies the MPI executable Arguments specifies the WN command line –Executable + Arguments form the command line on the WN mpi.pre.sh is a special script file that is sourced before launching MPI executable mpi.post.sh is a special script file that is sourced after MPI executable termination –warning: they run only on the master node The mpirun command is issued by the middleware (… what if a proprietary script/bin?)‏ MPIGranularity is a JDL attribute specifying the CPU core distribution‏ –Foster the integration of shared memory applications into Grid

OGF25 / 4 th EGEE User Forum, Catania, 2009, March 5th 12 Execution The WMS Job wrapper copies all the files indicated in the InputSandbox on the master and ALL of the “slave” nodes host based ssh authentication MUST BE well configured between all the WNs If additional environment variables are needed ONLY on the “master” node, they can be set by the mpi.pre.sh –If required ON ALL THE NODES a static installation is the only method (middleware extension is under consideration)‏ The WMS start the execution on the master

OGF25 / 4 th EGEE User Forum, Catania, 2009, March 5th 13 mpi.jdl [ Type = "Job"; JobType = "MPICH"; MPIType = "MPICH_GCC4"; Executable = “MPIparallel_exec”; NodeNumber = 2; Arguments = “arg1 arg2 arg3"; StdOutput = "test.out"; StdError = "test.err"; InputSandbox = {“mpi.pre.sh”,“mpi.post.sh”, “MPIparallel_exec”}; OutputSandbox = {“test.err”, “test.out”, “executable.out”}; Requirements = other.GlueCEInfoLRMSType == "PBS" || other.GlueCEInfoLRMSType == "LSF"; ] Local Resource Manager (LRMS) = PBS/LSF only Local Resource Manager (LRMS) = PBS/LSF only Pre e Post Processing Scripts Executable

OGF25 / 4 th EGEE User Forum, Catania, 2009, March 5th 14 Scheduling Policy Heterogeneous jobs running on the same infrastructure make Grids different from dedicated clusters –Resources should be effectively shared among users having diverse constraint PI2S2 sites have queues for different jobs duration –increasing priority for shorter jobs –their policy is very complex due to the variety of requirements Several kinds of jobs are identified –emergency jobs need absolute priority so they perform pre- emption interrupting the execution of short jobs  emergency jobs are not very long, so pre-emption is acceptable HPC job scheduling policy is based on resource reservation so they collect cores up to needed amount –This policy is reasonably effective as the number of HPC jobs is far lower compared to the number of short jobs

OGF25 / 4 th EGEE User Forum, Catania, 2009, March 5th 15 License Server Many HPC programs require a software license A unique license must be shared among the users on different and geographically distant sites –Currently licenses are generally connected to a user name, or a physical net address  Both these solution do not fit the distributed environment of Grid infrastructures A float license delivered to remote sites is required Several tools allow the management of floating license –FlexLM is the most used free software license server –The applications must know the address of the server and they ask at run-time for an execution license Current license servers do not identify Grid users so it is not easy to define usage policy for commercial software

OGF25 / 4 th EGEE User Forum, Catania, 2009, March 5th 16 GridLM: Grid License Manager GridLM has been developed by COMETA and INFN Catania to avoid problems with commercial software Execution permission is granted only for users belong to an authorised group inside a VO –Users have to specify their group during the proxy creation  es. voms-proxy-init --voms cometa:/cometa/example_sw –Additionally, the software has to be specified in the JDL The communication among sites use the Grid security mechanism –Every message is encrypted with the proxy certificates A further development aims to associate the license with delegated user proxy linked to a job (robot certificate)

OGF25 / 4 th EGEE User Forum, Catania, 2009, March 5th 17 Use Case: FLUENT FLUENT is a commercial software for Computer Fluid Dynamics (CFD) –mainly used for for flow modelling, and heat and mass transfer simulations Current distribution include the support for many MPI cluster configuration including the MPICH on InfiniBand No special effort required to execute FLUENT on Grid –The MPI wrapper was by-passed since FLUENT use its own wrapper –The pre-processing script is responsible to run the application –The other JDL parameters are used to find the execution resources

OGF25 / 4 th EGEE User Forum, Catania, 2009, March 5th 18 Use Case: FLASH FLASH is a 3D astrophysical hydrodynamic code for supercomputers –used in current astrophysical research Compiled with PGI compiler and optimised for COMETA infrastructure Used in production with job using up to 128 cores and running for several days Some tests on performance enhancement after the optimisation have been executed Performance is comparable to other platforms

OGF25 / 4 th EGEE User Forum, Catania, 2009, March 5th 19 Speed-Up

OGF25 / 4 th EGEE User Forum, Catania, 2009, March 5th 20 Use Case: OpenFOAM OpenFOAM is a free simulation environment –commonly used as an equation solver in solving CFD problems Very complex interaction with gLite m/w –a special set-up is required in any execution node but Grid mechanisms allow access only to the master –the MPICH support requires to recompile the code with a supported compiler  this is not a real limitation but need an extra effort to test the application –writing a solver requires to recompile the entire software so a mechanism to support new developer has to be investigated

OGF25 / 4 th EGEE User Forum, Catania, 2009, March 5th 21 MPI on the web... GENERAL MPI WEB PAGES PI2S2 RELATED MPI PAGES ometahttps://grid.ct.infn.it/twiki/bin/view/PI2S2/WikiConsorzioC ometa

OGF25 / 4 th EGEE User Forum, Catania, 2009, March 5th 22 Thank you for your kind attention ! Any questions ?