OGF 25/EGEE User Forum, Catania, March 2nd, 2009
Meta Scheduling and Advanced Application Support on the Spanish NGI
Enol Fernández del Castillo (IFCA-CSIC)



Outline
1. Spanish NGI
2. Metaschedulers
   – GridWay
   – CrossBroker
3. Advanced Application Support
   – Interactivity
   – MPI
4. Summary

Spanish NGI: NGI-ES
A national-level entity that operates a general-purpose e-science infrastructure.
Objectives:
– Establish a collaboration framework among all participating institutions to foster the coordinated development of a Grid infrastructure in Spain
– Propose a sustainable design of the Grid infrastructure that covers the ecosystem of different Grid projects, computing centers, grid infrastructures, etc.
– Run central services to keep the infrastructure running

Components of NGI-ES
Core: Spanish institutions participating in Grid research and development projects:
– EGEE, EUFORIA, DORII, EELA, i2g, … with a common middleware based on gLite
– The new infrastructure GRID-CSIC
– Universities and computing centers with Globus Toolkit 4 middleware
– RedIRIS (Spanish NREN) support for EUGridPMA certificates
Relation with the Spanish Supercomputing Network (RES):
– This network comprises several Spanish research centers that operate a common supercomputing infrastructure
– Analyze the possibility of mixed workflows between both infrastructures
Relation with the Portuguese NGI in the framework of Ibergrid

NGI-ES Interoperability
Interoperability is needed for the sustainability of the NGI:
– Key issue for the creation of the EGI infrastructure
– Allows users to select the resources that best fit their needs
– Potential access to a significantly larger set of resources
Management overheads are reduced if only a single Grid middleware system needs to be deployed at each site

NGI-ES Architecture
VO Oriented: the architecture of NGI-ES is oriented towards the support of Virtual Organizations.
Key Issues:
– Advanced VO Services
– User Support
– Monitoring & Accounting
– Application porting and support
– Middleware driven by application requirements
[Diagram labels: Virtual Organizations, Central Services, Resource Centers]

NGI-ES Architecture
[Diagram labels: Users, VO Applications, User Interface, gLite WMS, CrossBroker, GridWay, VOMS, BDII, Information Repositories, Monitoring & Accounting, gLite RC, GT4 RC, LCG CE, GRAM, MDS, SE, SRM, GridFTP, Cluster Manager, WN]

Resource Centers
[Figure: Resource Centers Map]
Initial Call:
– 18 sites
– 1300 to 4500 cores, depending on local occupancy
– ~350 TB of online storage

Deployed Services
Monitoring and accounting services at CESGA:
– Accounting Portal:
– Monitoring Portal:
Global Information Repositories:
– OpenLDAP + GlueSchema server at IFCA, integrated with NGI-PT
– Global MDS server at RedIRIS for GT4 resources
Metaschedulers:
– GridWay for the NGI at RedIRIS
– CrossBroker for gLite resources at IFCA
– gLite WMS for bulk submission of serial jobs at IFIC
VOMS server at IFCA
HelpDesk at RedIRIS

NGI-ES Added Value
Interoperability:
– Resources from EU projects based on gLite (EGEE, I2G, EELA, WLCG, DORII)
– Globus Toolkit 4 resources
Metaschedulers developed by Spanish groups:
– GridWay
– CrossBroker
Advanced Application Support:
– Interactivity
– MPI

Metaschedulers: GridWay
GridWay allows the efficient use of the computing resources of a Grid:
– Included in the Globus distribution
– Support for both gLite and GT4 resources
– Standards compliant: JSDL, DRMAA, …
Used by several grid projects and initiatives worldwide
Developed by UCM
More information at:

Metaschedulers: CrossBroker
CrossBroker provides support for parallel and interactive jobs:
– High-priority treatment for interactive jobs through the use of multiprogramming
– Interoperable with EGEE; provides the same services as the gLite WMS
Used in production environments:
– Used in the EU CrossGrid, int.eu.grid and EUFORIA projects (12K to 55K jobs per month)
Developed by UAB + CSIC
More info:

CrossBroker
[Diagram labels: User Interface, Information Index, Replica Location Service, Computing Element, Scheduling Agent, Resource Searcher, Application Launcher, Job Starter, Interactive Agent]

Interactivity Support
Interactivity allows researchers to visualize results and obtain them faster.
Requirements:
– Fast startup: the possibility of starting the application immediately, even in high-occupancy scenarios
– Online input-output streaming: the ability to have application input and output online

Interactivity Support: Multiprogramming
The idea:
– Each job is encapsulated in an agent that takes control of the WN independently of its LRMS
Lightweight “Virtual Machines”:
– Each Worker Node is divided into 2 execution slots
– Each VM can execute jobs independently (e.g. batch and interactive)
– NOT a full virtual machine (Xen, VMware, …)
– NO need for special privileges on the WN
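The two-slot idea above can be sketched as a small model. This is only an illustration, not CrossBroker code: the `WorkerNode` class, the `place_interactive` helper, and the choice of reserving one slot for batch and one for interactive work are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class WorkerNode:
    """Hypothetical model of a WN split into two execution slots."""
    name: str
    # Assumption: one slot is used for batch work, the other for interactive work.
    slots: Dict[str, Optional[str]] = field(
        default_factory=lambda: {"batch": None, "interactive": None}
    )

    def start(self, job_id: str, interactive: bool) -> bool:
        """Start a job in the matching slot; return False if that slot is busy."""
        kind = "interactive" if interactive else "batch"
        if self.slots[kind] is not None:
            return False
        self.slots[kind] = job_id
        return True


def place_interactive(nodes: List[WorkerNode], job_id: str) -> Optional[str]:
    """Return the first node whose interactive slot is free, or None."""
    for node in nodes:
        if node.start(job_id, interactive=True):
            return node.name
    return None
```

In this model an interactive job can start on a node whose batch slot is already busy, which is the fast-startup property the slide asks for in high-occupancy scenarios.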

[Diagram labels: CrossBroker, Grid Resource, Computing Element, LRMS, WN, Job]

Interactivity Support: Multiprogramming
[Diagram labels: CrossBroker, Grid Resource, Computing Element, LRMS, WN, Slot 1, Slot 2, Job Agent, Job; timings shown: < 40 s, > 200 s]

Interactivity Support: Interactive Agents
Connection parameters:
– IP address
– Port
[Diagram labels: CrossBroker, User Interface, glogin, Worker Node, Slot 1, Slot 2, User Application, Batch]

MPI Support
Many application areas need MPI support:
– Earth Sciences, Biological Sciences, Computational Chemistry, Nuclear Fusion
– Representative results can be obtained using on the order of tens to hundreds of CPUs
Many clusters are MPI-ready:
– In local mode, by direct submission
– Shared filesystems with a high-performance intranet
It is useful to offer this capability when the user is working inside a Grid infrastructure:
– As an infrastructure in its own right
– As a testbed for small runs before executing on large HPC systems

MPI Support: Issues
There is no standard way of starting an MPI application:
– No common syntax for mpirun; mpiexec support is optional
– Schedulers (PBS, SGE, Condor, …) handle machinefiles in different ways
– Metascheduler services need to handle different implementations (OpenMPI, MPICH, LAM, …) in a flexible and extensible way
– Shared vs. non-shared filesystems
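As one illustration of the machinefile issue, the sketch below normalizes two scheduler host-list formats into a single host-to-slots mapping. It assumes the usual layouts (a PBS nodefile with one hostname per allocated slot, and an SGE `PE_HOSTFILE` with `host slots queue processors` per line); the function names are hypothetical and are not taken from any tool mentioned in the slides.

```python
from collections import Counter
from typing import Dict


def parse_pbs_nodefile(text: str) -> Dict[str, int]:
    """PBS style: one hostname per line, repeated once per allocated slot."""
    hosts = (line.strip() for line in text.splitlines() if line.strip())
    return dict(Counter(hosts))


def parse_sge_pe_hostfile(text: str) -> Dict[str, int]:
    """SGE style: 'host slots queue processors' on each line."""
    slots: Dict[str, int] = {}
    for line in text.splitlines():
        fields = line.split()
        if len(fields) >= 2:
            slots[fields[0]] = slots.get(fields[0], 0) + int(fields[1])
    return slots


def to_host_list(slots: Dict[str, int]) -> str:
    """Write a plain list with one hostname per slot, one per line."""
    lines = [host for host, n in slots.items() for _ in range(n)]
    return "\n".join(lines) + "\n"
```

Once both formats map to the same dictionary, the layer that launches the MPI program no longer needs to know which scheduler produced the allocation.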

MPI Support: MPI-Start
– Specifies a unique interface to the upper layer in the middleware to describe MPI jobs
– Supports basic file distribution
– Implemented as portable shell scripts
– Extensible via user hooks and plugins at the site level
[Diagram labels: Core; MPI: Open MPI, MPICH2, MPICH, PACX, LAM; Scheduler: PBS, SGE, LSF; Hooks]

MPI Support: MPI-Start
MPI-Start is used by the CrossBroker to support:
– Intra-cluster applications with OpenMPI or MPICH
– Inter-cluster applications using PACX-MPI or MPICH-G2
The user does not need to specify low-level details of jobs:
– MPI-Start copies all the input files to the WN (if the filesystem is not shared)
– MPI-Start invokes the MPI program using the best configuration for the site
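The behaviour described above (copy inputs when there is no shared filesystem, then launch with the command suited to the MPI implementation) can be sketched as a small plugin table. This is not MPI-Start itself, which the slides describe as portable shell scripts: the `LAUNCHERS` table, `plan_job`, and the use of `scp` for file distribution are assumptions for illustration, and the launcher flags shown are common ones that may differ between versions and sites.

```python
from typing import Callable, Dict, List

Command = List[str]


def _openmpi(np: int, machinefile: str, app: Command) -> Command:
    # Common Open MPI style invocation (illustrative).
    return ["mpirun", "-np", str(np), "--hostfile", machinefile] + app


def _mpich(np: int, machinefile: str, app: Command) -> Command:
    # Common MPICH style invocation (illustrative).
    return ["mpirun", "-np", str(np), "-machinefile", machinefile] + app


# Hypothetical plugin table: one launcher builder per MPI implementation.
LAUNCHERS: Dict[str, Callable[[int, str, Command], Command]] = {
    "openmpi": _openmpi,
    "mpich": _mpich,
}


def plan_job(flavor: str, np: int, machinefile: str, app: Command,
             input_files: List[str], hosts: List[str],
             shared_fs: bool) -> List[Command]:
    """Return the commands to run, in order; nothing is executed here."""
    if flavor not in LAUNCHERS:
        raise ValueError(f"no launcher plugin for {flavor!r}")
    steps: List[Command] = []
    if not shared_fs:
        # Assumption: inputs are pushed to every host with scp.
        for host in hosts:
            for path in input_files:
                steps.append(["scp", path, f"{host}:{path}"])
    steps.append(LAUNCHERS[flavor](np, machinefile, app))
    return steps
```

Keeping the launchers in a table means a site can register another implementation without touching the planning logic, which mirrors the plugin and hook extensibility the slides mention.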

Summary
The NGI Grid infrastructure is up and running:
– Deployment of the NGI infrastructure is ongoing
– Integrates EGEE-like resources (i2g, EUFORIA, DORII, EELA, …) and GT4 resources
Added value of NGI-ES:
– Metascheduler middleware
– Support for MPI and interactive jobs
Interest in the European Grid Infrastructure
More info: