Introduction to Grid Technology

Slides:



Advertisements
Similar presentations
Workload Management David Colling Imperial College London.
Advertisements

EU 2nd Year Review – Jan – Title – n° 1 WP1 Speaker name (Speaker function and WP ) Presentation address e.g.
Workload management Owen Maroney, Imperial College London (with a little help from David Colling)
INFSO-RI Enabling Grids for E-sciencE Workload Management System and Job Description Language.
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
Job Submission The European DataGrid Project Team
A Computation Management Agent for Multi-Institutional Grids
WP 1 Grid Workload Management Massimo Sgaravatto INFN Padova.
SEE-GRID-SCI Hands-On Session: Workload Management System (WMS) Installation and Configuration Dusan Vudragovic Institute of Physics.
INFSO-RI Enabling Grids for E-sciencE EGEE Middleware The Resource Broker EGEE project members.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Services Abderrahman El Kharrim
The DataGrid Project NIKHEF, Wetenschappelijke Jaarvergadering, 19 December 2002
DataGrid Kimmo Soikkeli Ilkka Sormunen. What is DataGrid? DataGrid is a project that aims to enable access to geographically distributed computing power.
Makrand Siddhabhatti Tata Institute of Fundamental Research Mumbai 17 Aug
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
Computational grids and grids projects DSS,
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
DataGrid is a project funded by the European Union CHEP 2003 – March 2003 – M. Sgaravatto – n° 1 The EU DataGrid Workload Management System: towards.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
Enabling Grids for E-sciencE Workload Management System on gLite middleware Matthieu Reichstadt CNRS/IN2P3 ACGRID School, Hanoi (Vietnam)
M. Sgaravatto – n° 1 The EDG Workload Management System: release 2 Massimo Sgaravatto INFN Padova - DataGrid WP1
DataGrid WP1 Massimo Sgaravatto INFN Padova. WP1 (Grid Workload Management) Objective of the first DataGrid workpackage is (according to the project "Technical.
INFSO-RI Enabling Grids for E-sciencE Workload Management System Mike Mineter
- Distributed Analysis (07may02 - USA Grid SW BNL) Distributed Processing Craig E. Tull HCG/NERSC/LBNL (US) ATLAS Grid Software.
November SC06 Tampa F.Fanzago CRAB a user-friendly tool for CMS distributed analysis Federica Fanzago INFN-PADOVA for CRAB team.
EGEE-II INFSO-RI Enabling Grids for E-sciencE An Introduction to the EGEE Project Presented by Min Tsai ISGC 2007, Taipei With thanks.
June 24-25, 2008 Regional Grid Training, University of Belgrade, Serbia Introduction to gLite gLite Basic Services Antun Balaž SCL, Institute of Physics.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Feb. 06, Introduction to High Performance and Grid Computing Faculty of Sciences,
TERENA 2003, May 21, Zagreb TERENA Networking Conference, 2003 MOBILE WORK ENVIRONMENT FOR GRID USERS. TESTBED Miroslaw Kupczyk Rafal.
M. Sgaravatto – n° 1 Overview of release 2 of the EDG WP1 Workload Management System deployed in the INFN production Grid Massimo Sgaravatto INFN Padova.
Glite. Architecture Applications have access both to Higher-level Grid Services and to Foundation Grid Middleware Higher-Level Grid Services are supposed.
INFSO-RI Enabling Grids for E-sciencE Αthanasia Asiki Computing Systems Laboratory, National Technical.
High-Performance Computing Lab Overview: Job Submission in EDG & Globus November 2002 Wei Xing.
EGEE is a project funded by the European Union under contract IST WS-Based Advance Reservation and Co-allocation Architecture Proposal T.Ferrari,
Workload Management System Jason Shih WLCG T2 Asia Workshop Dec 2, 2006: TIFR.
Summary from WP 1 Parallel Section Massimo Sgaravatto INFN Padova.
The DataGrid Project NIKHEF, Wetenschappelijke Jaarvergadering, 19 December 2002
EGEE-II INFSO-RI Enabling Grids for E-sciencE Overview of gLite, the EGEE middleware Mike Mineter Training Outreach Education National.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Services for Distributed e-Infrastructure Access Tiziana Ferrari on behalf.
Introduction to Computing Element HsiKai Wang Academia Sinica Grid Computing Center, Taiwan.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Enabling Grids for E-sciencE Work Load Management & Simple Job Submission Practical Shu-Ting Liao APROC, ASGC EGEE Tutorial.
EU 2nd Year Review – Feb – WP1 Demo – n° 1 WP1 demo Grid “logical” checkpointing Fabrizio Pacini (Datamat SpA, WP1 )
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI solution for high throughput data analysis Peter Solagna EGI.eu Operations.
Gri2Win: Porting gLite to run under Windows XP Platform
Grid2Win Porting of gLite middleware to Windows XP platform
Overview of the Training Agenda
The EDG Testbed Deployment Details
Workload Management System on gLite middleware
Design rationale and status of the org.glite.overlay component
Workload Management System ( WMS )
gLite Grid Services Salma Saber
Grid Operations Procedures
EGEE tutorial, Job Description Language - more control over your Job Assaf Gottlieb Tel-Aviv University EGEE is a project.
Job Submission in the DataGrid Workload Management System
Grid2Win: Porting of gLite middleware to Windows XP platform
Workload Management System
Overview of IPB responsibilities in EGEE-III SA1
Grid Services Ouafa Bentaleb CERIST, Algeria
Antun Balaz SCL, Institute of Physics Belgrade Serbia
Short update on the latest gLite status
5. Job Submission Grid Computing.
Gri2Win: Porting gLite to run under Windows XP Platform
The gLite Workload Management System
Overview of the EGEE project and the gLite middleware
EGEE Middleware: gLite Information Systems (IS)
Wide Area Workload Management Work Package DATAGRID project
gLite Job Management Christos Theodosiou
gLite The EGEE Middleware Distribution
Job Submission M. Jouvin (LAL-Orsay)
Presentation transcript:

Introduction to Grid Technology Antun Balaz SCL, Institute of Physics Belgrade Serbia antun@ipb.ac.rs 25/03/2011

Agenda NGI_AEGIS, EGI and EGI-InSPIRE AEGIS infrastructure and management gLite overview and basic services Managing jobs with gLite

NGI_AEGIS and EGI EGI.eu created in February 2010 Established as an international consortium based in Amsterdam Serbia represented in the EGI Council and other bodies by IPB Coordinates EGI-InSPIRE project, May 2010 – April 2014 IPB represents Serbia as a partner

EGI-InSPIRE FP7 RI-261323 project, ESFRI WP1: Management (NA1) WP2: External relations (NA2) WP3: User community coordination (NA3) WP4: Operations (SA1) WP5: Provisioning the software infrastructure (SA2) WP6: Services for HUC (SA3) WP7: Operational tools (JRA1)

IPB and EGI-InSPIRE IPB is involved in NA2, NA3, SA1 Operations: AEGIS operations Coordination of middleware deployment OMB OTAG

AEGIS infrastructure (1) Production: AEGIS01-IPB-SCL (704 CPUs, 26 TB) AEGIS02-RCUB (48 CPUs, 113 GB) AEGIS03-ELEF-LEDA (64 CPUs, 1.5 TB) AEGIS04-KG (48 CPUs, 480 GB) AEGIS07-IPB-ATLAS (128 CPUs) AEGIS11-MISANU (64 CPUs)

AEGIS infrastructure (2) Certification: AEGIS05-ETFBG AEGIS09-FTN-KM Demo/training: AEGIS08-IPB-DEMO New: UOB Faculty of Physics

AEGIS management NGI_AEGIS management (A. Balaz, D. Vudragovic, V. Slavnic) Helpdesk: helpdesk.aegis.rs Nagios: nagios.aegis.rs Mailing lists

gLite – Grid middleware The Grid relies on advanced software – the middleware - which interfaces between resources and the applications The GRID middleware Finds convenient places for apps to run Optimises use of resources Organises efficient access to data Deals with authentication at different sites Runs the job & monitors progress Transfers the result back to the scientist

gLite – Overview First release 2005 currently gLite 3.13.2 Developed from existing components (globus, condor,..) Interoperability & Co-existence with deployed infrastructure Robust: Performance & Fault tolerance Open Source license

Set of basic Grid services Job submission/management File transfer (individual, queued database access) Data management (replication, metadata) Monitoring/Indexing system information Advanced School in High Performance and GRID Computing – Concepts and Applications, ICTP, Trieste, Italy

Basic services of gLite User Interface Information System Workload Management System Submit job query Retrieve status & output create credential query publish state Submit job File and Replica Catalog Retrieve output Job status Logging Computing Element Storage Element Site X Job status Authorization Service (VOMS) process Logging and bookkeeping

User interface Local Workstation User describes job in text file using Job Description Language Submits job to WMS using (usually) the command-line interface ssh UI UI (user interface) has preinstalled client software WMS Workload Management System CEs

Managing jobs with gLite User Interface Submit Input “sandbox” Information System stderr.txt User interface stdout.txt Get output Output “sandbox” Job status update Job Submit Event Status / log query stderr.txt stdout.txt publish state Input “sandbox” Output “sandbox” Slide inherited from EDG – European Data Grid Job status update A worker node is allocated by the local jobmanager Logging & bookkeeping STD input stream is read from file STD out and err. streams are redirected into files stderr.txt /bin/hostname stdout.txt Computing Element

Characteristics of resources Location of files LFC Network Daemon User Interface Characteristics of resources Workload Manager Inform. Service Job Contr. - CondorG CE characts & status WMS SE characts & status Computing Element Storage Element

glite-wms-job-submit myjob.jdl Daemon responsible for accepting incoming requests waiting submitted LFC Network Daemon User Interface JDL Input Sandbox files Workload Manager Inform. Service RB storage glite-wms-job-submit myjob.jdl Job Contr. - CondorG CE characts & status WMS SE characts & status Computing Element Storage Element

WM: responsible to take the appropriate actions to satisfy the request waiting submitted LFC Network Daemon User Interface Job Workload Manager Inform. Service RB storage WM: responsible to take the appropriate actions to satisfy the request Job Contr. - CondorG CE characts & status WMS SE characts & status Computing Element Storage Element

RB WMS waiting submitted LFC Network Daemon User Interface Match- Maker/ Broker Workload Manager Inform. Service RB storage Where this job can be executed ? Job Contr. - CondorG CE characts & status WMS SE characts & status Computing Element Storage Element

Matchmaker: responsible to find the “best” CE where to submit a job waiting submitted LFC Network Daemon User Interface Matchmaker: responsible to find the “best” CE where to submit a job Match- Maker/ Broker Workload Manager Inform. Service RB storage Job Contr. - CondorG CE characts & status WMS SE characts & status Computing Element Storage Element

RB WMS waiting submitted LFC Network Daemon User Interface Match- Where is the needed InputData ? waiting submitted LFC Network Daemon User Interface Match- Maker/ Broker Workload Manager Inform. Service RB storage What is the status of the Grid ? Job Contr. - CondorG CE characts & status WMS SE characts & status Computing Element Storage Element

RB WMS waiting submitted LFC Network Daemon User Interface Match- Maker/ Broker Workload Manager Inform. Service RB storage CE choice Job Contr. - CondorG CE characts & status WMS SE characts & status Computing Element Storage Element

JA: responsible for the final “touches” waiting submitted LFC Network Daemon User Interface Workload Manager Inform. Service RB storage Job Adapter Job Contr. - CondorG CE characts & status JA: responsible for the final “touches” to the job before performing submission (e.g. creation of wrapper script, etc.) WMS SE characts & status Computing Element Storage Element

JC: responsible for the actual job management submitted waiting ready LFC Network Daemon User Interface Workload Manager Inform. Service RB storage Job Job Contr. - CondorG JC: responsible for the actual job management operations (done via CondorG) CE characts & status WMS SE characts & status Computing Element Storage Element

RB WMS submitted waiting ready scheduled LFC Network Daemon User Interface Workload Manager Inform. Service RB storage Job Contr. - CondorG Input Sandbox files CE characts & status WMS SE characts & status Job Computing Element Storage Element

RB WMS submitted waiting ready scheduled running LFC Network Daemon User Interface Workload Manager Inform. Service RB storage Job Contr. - CondorG Input Sandbox WMS “Grid enabled” data transfers/ accesses Computing Element Storage Element Job

RB WMS submitted waiting ready scheduled running done LFC Network Daemon User Interface Workload Manager Inform. Service RB storage Job Contr. - CondorG Output Sandbox files WMS Computing Element Storage Element

glite-wms-get-output <jobID> submitted waiting ready scheduled running done LFC Network Daemon User Interface Workload Manager Inform. Service RB storage glite-wms-get-output <jobID> Job Contr. - CondorG Output Sandbox WMS Computing Element Storage Element

RB WMS submitted LFC Network Daemon User Interface waiting ready Output Sandbox files Workload Manager Inform. Service RB storage scheduled Job Contr. - CondorG running done WMS cleared Computing Element Storage Element

Job monitoring glite-wms-job-status <jobID> glite-wms-job-logging-info <jobID> User Interface Network Daemon LB: receives and stores job events; processes corresponding job status LB proxy Workload Manager Job status Logging & Bookkeeping Job Contr. - CondorG WMS Log of job events Computing Element

Enjoy further details in presentations and hands-on sessions during the day!