The NorduGrid project: Using Globus toolkit for building Grid infrastructure presented by Aleksandr Konstantinov Mattias Ellert Aleksandr Konstantinov.

Slides:



Advertisements
Similar presentations
ATLAS/LHCb GANGA DEVELOPMENT Introduction Requirements Architecture and design Interfacing to the Grid Ganga prototyping A. Soroko (Oxford), K. Harrison.
Advertisements

Data Management Expert Panel - WP2. WP2 Overview.
Interaction model of grid services in mobile grid environment Ladislav Pesicka University of West Bohemia.
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
Grid Resource Allocation Management (GRAM) GRAM provides the user to access the grid in order to run, terminate and monitor jobs remotely. The job request.
NorduGrid Grid Manager developed at NorduGrid project.
Condor-G: A Computation Management Agent for Multi-Institutional Grids James Frey, Todd Tannenbaum, Miron Livny, Ian Foster, Steven Tuecke Reporter: Fu-Jiun.
A Computation Management Agent for Multi-Institutional Grids
MTA SZTAKI Hungarian Academy of Sciences Grid Computing Course Porto, January Introduction to Grid portals Gergely Sipos
WP 1 Grid Workload Management Massimo Sgaravatto INFN Padova.
A conceptual model of grid resources and services Authors: Sergio Andreozzi Massimo Sgaravatto Cristina Vistoli Presenter: Sergio Andreozzi INFN-CNAF Bologna.
USING THE GLOBUS TOOLKIT This summary by: Asad Samar / CALTECH/CMS Ben Segal / CERN-IT FULL INFO AT:
Globus Toolkit 4 hands-on Gergely Sipos, Gábor Kecskeméti MTA SZTAKI
Swedish participation in DataGrid and NorduGrid Paula Eerola SWEGRID meeting,
Slides for Grid Computing: Techniques and Applications by Barry Wilkinson, Chapman & Hall/CRC press, © Chapter 1, pp For educational use only.
Office of Science U.S. Department of Energy Grids and Portals at NERSC Presented by Steve Chan.
1-2.1 Grid computing infrastructure software Brief introduction to Globus © 2010 B. Wilkinson/Clayton Ferner. Spring 2010 Grid computing course. Modification.
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
UMIACS PAWN, LPE, and GRASP data grids Mike Smorul.
Grids and Globus at BNL Presented by John Scott Leita.
Globus Computing Infrustructure Software Globus Toolkit 11-2.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Resource Management Reading: “A Resource Management Architecture for Metacomputing Systems”
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
BaBar WEB job submission with Globus authentication and AFS access T. Adye, R. Barlow, A. Forti, A. McNab, S. Salih, D. H. Smith on behalf of the BaBar.
Overview of the NorduGrid Information System Balázs Kónya 3 rd NorduGrid Workshop 23 May, 2002, Helsinki.
Don Quijote Data Management for the ATLAS Automatic Production System Miguel Branco – CERN ATC
Introduction to NorduGrid ARC / Arto Teräs Slide 1(16) Introduction to NorduGrid ARC Arto Teräs Free and Open Source Software Developers' Meeting.
Grid Computing - AAU 14/ Grid Computing Josva Kleist Danish Center for Grid Computing
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
03/27/2003CHEP20031 Remote Operation of a Monte Carlo Production Farm Using Globus Dirk Hufnagel, Teela Pulliam, Thomas Allmendinger, Klaus Honscheid (Ohio.
NorduGrid Architecture and tools CHEP2003 – UCSD Anders Wäänänen
Computational grids and grids projects DSS,
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
File and Object Replication in Data Grids Chin-Yi Tsai.
National Computational Science National Center for Supercomputing Applications National Computational Science NCSA-IPG Collaboration Projects Overview.
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Data Replication Service Sandeep Chandra GEON Systems Group San Diego Supercomputer Center.
Resource Brokering in the PROGRESS Project Juliusz Pukacki Grid Resource Management Workshop, October 2003.
Author - Title- Date - n° 1 Partner Logo EU DataGrid, Work Package 5 The Storage Element.
Quick Introduction to NorduGrid Oxana Smirnova 4 th Nordic LHC Workshop November 23, 2001, Stockholm.
Production Tools in ATLAS RWL Jones GridPP EB 24 th June 2003.
Communicating Security Assertions over the GridFTP Control Channel Rajkumar Kettimuthu 1,2, Liu Wantao 3,4, Frank Siebenlist 1,2 and Ian Foster 1,2,3 1.
Using the EMI testbed ARC middleware Marek Kočan University of P. J. Šafárik, Košice.
The NorduGrid Information System Balázs Kónya GGF July, 2002, Edinburgh.
June 24-25, 2008 Regional Grid Training, University of Belgrade, Serbia Introduction to gLite gLite Basic Services Antun Balaž SCL, Institute of Physics.
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
CLRC and the European DataGrid Middleware Information and Monitoring Services The current information service is built on the hierarchical database OpenLDAP.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
MTA SZTAKI Hungarian Academy of Sciences Introduction to Grid portals Gergely Sipos
Globus Toolkit Massimo Sgaravatto INFN Padova. Massimo Sgaravatto Introduction Grid Services: LHC regional centres need distributed computing Analyze.
GRIDS Center Middleware Overview Sandra Redman Information Technology and Systems Center and Information Technology Research Center National Space Science.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]
ATLAS Data Challenge on NorduGrid CHEP2003 – UCSD Anders Wäänänen
Introduction to Grid Computing and its components.
The NorduGrid toolkit user interface Mattias Ellert Presented at the 3 rd NorduGrid workshop, Helsinki,
Globus: A Report. Introduction What is Globus? Need for Globus. Goal of Globus Approach used by Globus: –Develop High level tools and basic technologies.
Application examples Oxana Smirnova (Lund, EPF) 3 rd NorduGrid Workshop, May23, 2002.
1 A Scalable Distributed Data Management System for ATLAS David Cameron CERN CHEP 2006 Mumbai, India.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Services for Distributed e-Infrastructure Access Tiziana Ferrari on behalf.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
The EDG Testbed Deployment Details
Oxana Smirnova, Jakob Nielsen (Lund University/CERN)
Peter Kacsuk – Sipos Gergely MTA SZTAKI
Oxana Smirnova (Lund, EPF) 3rd NorduGrid Workshop, May23, 2002
Presentation transcript:

The NorduGrid project: Using Globus toolkit for building Grid infrastructure presented by Aleksandr Konstantinov Mattias Ellert Aleksandr Konstantinov Balázs Kónya Oxana Smirnova Anders Wäänänen

2 ACAT'2002, Moscow Introduction Launched in spring 2001, with the aim of creating a Grid infrastructure in the Nordic countries. Partners from Denmark, Norway, Sweden, and Finland. Powered mainly by ATLAS groups (Lund, Copenhagen, Stockholm, Uppsala, Oslo). Relatively short term project - ends in October Relies on very limited human resources (3 full-time researchers, few part-time ones) with funding from NorduNet2. More info

3 ACAT'2002, Moscow Introduction (cont.) The purpose of the project is to create and operate functional testbed. Use approved tools => Globus Toolkit TM (developed at Argonne National Laboratory and University of Southern California) and tools developed at European Data Grid project. Aim at High Energy Physics applications - take into account while choosing what to implement first. No temporary solutions (it is better not to implement something, than to be forced to provide backward compatibility for limited solution).

4 ACAT'2002, Moscow Globus Toolkit TM evaluation Widely accepted de-facto standard for Grid computing. Provides collection of (mostly) robust protocols, libraries and low-level services. Security built-in. Continuously evolving (??). Missing few important high-level services: grid-level scheduler job data stagein/stageout user-friendly grid entry points (simple user-interface, web portals, etc.) grid-level authorization system grid-level accounting and quotas

5 ACAT'2002, Moscow No single point of failure No central sandbox (unlike EDG) Lightweight brokering integrated into User Interface Job should not be Computing Element (cluster) specific Non grid-aware jobs allowed ("grid functionality" is provided by middleware on Computing Element) Job runs in as restrictive environment as possible (do not expect network on computing nodes) Minimal environment is provided on Computing Element Adequate and full (enough) information provided by InfoSystem Natural computing unit is cluster Queue, job and user information NorduGrid requirements

6 ACAT'2002, Moscow NorduGrid architecture

7 ACAT'2002, Moscow Information System NorduGrid operates an MDS based, hierarchically distributed Information System: new information model for clusters, queues, jobs, users, SE, RC efficient providers all the job monitoring, resource discovery, status monitoring and brokering are exclusively built on top of the MDS MDS hierarchy with dynamic site registrations

8 ACAT'2002, Moscow Information System(example) cluster entry queue entry job entry user entry

9 ACAT'2002, Moscow Information System (hierarchy)

10 ACAT'2002, Moscow Information System (interfaces)

11 ACAT'2002, Moscow Grid Manager - cluster middleware Provide job control and data handling functionality (HEP applications requirements are first priority). The Grid Manager is based on Globus Toolkit TM libraries and services. The following parts of Globus are used: GridFTP - fast and reliable data access for Grid GASS Copy interface - support for different data access protocols Replica Catalog - metadata storage GRAM - resource request RSL - expandable Resource Specification Language

12 ACAT'2002, Moscow Grid Manager (features) Stage in input data and executables. Possible sources: Job submission machine. GridFTP (preferred), FTP, HTTP or HTTPS servers. Files registered in Globus Replica Catalog. Secure authentication. Destination is chosen automatically or can be forced. Stage out output data. Possible destinations: Keep on cluster till user downloads. GridFTP, FTP, HTTP or HTTPS servers. Files can be registered in Globus Replica Catalog. Destination and protocol are obtained from Location information.

13 ACAT'2002, Moscow Grid Manager (features) notification of job status changes. Support for software runtime environment configuration. Jobs will be started with environment setup properly for requested application Customizable GridFTP server local access through plugins certificate oriented local file system access plugin job submission/access plugin - start job/upload input files/download output files through the same interface Limitation: Data is handled only at that beginning and end of the job. User must provide information about input and output data.

14 ACAT'2002, Moscow Extensions to RSL (evaluation) RSL stands for Resource Specification Language. Introduced to communicate job requirements to the Global Resource Allocation Manager (GRAM). Useful features: Allows basic logical expressions Set of attributes is expandable Unknown attributes are passed through. Allows different parts to be processed at different levels. Can be used to assist in writing brokers or filters which refine an RSL specification

15 ACAT'2002, Moscow Extensions to RSL (new attributes) To support additional features new attributes introduced. The most important are inputFiles=( [ ])...- list of files to be transferred to the computing node from a given location. outputFiles=( [ ])...-list of files to be preserved after the job completion and transferred to a given location. executables=... -list of files to be given executable permissions. notify=...- notification on job status change.

16 ACAT'2002, Moscow Extensions to RSL (new attributes) runTimeEnvironment=... - a pplication-specific runtime environment (e.g., ATLAS-3.2.1) middleware= -required middleware (e.g., NorduGrid ) cluster= -specific cluster request rerun= -number of attempts to re-run the job lifeTime= -maximum time for the session directory to remain on the execution node (can not override local policy) ftpThreads= -number of GridFTP threads to be used for file transfers

17 ACAT'2002, Moscow User Interface The NorduGrid toolkit user interface consists of a set of commands that can be executed from the command line ngsub- for job submission ngstat- to obtain the status of jobs and clusters ngcat- to display the stdout or stderr of a running job ngget- to retrieve the result from a finished job ngkill- to kill a running job ngclean- to delete a job from a remote cluster ngsync- to recreate local information about jobs

18 ACAT'2002, Moscow User Interface Job request is done through xRSL processes user-level xRSL request and transforms to one suitable for GM user-friendly values for some attributes conditional submission and xRSL transformation Performs brokering analyzes information about the different clusters obtained from the MDS servers from all suitable queues one is chosen randomly, with a weight proportional to the amount of free computing resources Passes modified job request to GM through GRAM or GridFTP interface and uploads input files.

19 ACAT'2002, Moscow User Authentication Management Using Globus certificates NorduGrid Certification Authority established Access control through gridmapfiles User access control is delegated to Virtual Organization managers Gridmapfiles are generated automatically from VO database GSI enabled secure LDAP server contains the Subject Names of the user's certificates VO managers User Groups and Group Managers Local site adminisrators have total control over their gridmapfiles

20 ACAT'2002, Moscow Applications It is possible to run any application with predefined set of input and output data From as simple as "Hello World" ngsub '&(executable=/bin/echo)(arguments="Hello World")(stdout=out.txt)'

21 ACAT'2002, Moscow Applications (cont.) to as difficult as Atlas Data Challenge ngsub '&(executable = prod)(arguments = "0002" "2" "100") (stdout = atlas.0002.log)(join = yes) (replicacollection = ldap://grid.uio.no/lc=ATLAS,rc=NorduGrid,dc=nordugrid, dc=org) (inputfiles = ("atlsim.makefile" "") ("atlas.kumac" "") ("gen0017_1.root" "rc:///gen0017_1.root") ) (outputfiles = ("atlas.0002.zebra" "rc:///results/atlas.0002.zebra") ("atlas.0002.his" "") ) (runtimeenvironment="ATLAS-3.2.0") (middleware="NorduGrid")'

22 ACAT'2002, Moscow Conclusions The minimal environment for Grid computing is established. Globus tools alone are not enough for convenient usage, but provide solid base. Additional layer of tools/services were developed to provide required infrastructure. A lot of things to do: Runtime data handling. Accounting. Better support for different LRMS. Enhanced Information System - more stability, access control, better and richer information providers etc....