The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) Overveiw of the gLite middleware Yaodong Cheng

Slides:



Advertisements
Similar presentations
E-science grid facility for Europe and Latin America gLite Overview Vanessa Hamar Universidad de Los Andes.
Advertisements

INFSO-RI Enabling Grids for E-sciencE Workload Management System and Job Description Language.
EGEE-II INFSO-RI Enabling Grids for E-sciencE The gLite middleware distribution OSG Consortium Meeting Seattle,
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
INFSO-RI Enabling Grids for E-sciencE EGEE Middleware The Resource Broker EGEE project members.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Services Abderrahman El Kharrim
E-science grid facility for Europe and Latin America gLite Overview User and Site Admin Tutorial Riccardo Bruno – INFN Sez. Catania Dublin.
Makrand Siddhabhatti Tata Institute of Fundamental Research Mumbai 17 Aug
AliEn uses bbFTP for the file transfers. Every FTD runs a server, and all the others FTD can connect and authenticate to it using certificates. bbFTP implements.
INFSO-RI Enabling Grids for E-sciencE Comparison of LCG-2 and gLite Author E.Slabospitskaya Location IHEP.
INFSO-RI Enabling Grids for E-sciencE gLite Data Management Services - Overview Mike Mineter National e-Science Centre, Edinburgh.
FESR Consorzio COMETA Grid Introduction and gLite Overview Corso di formazione sul Calcolo Parallelo ad Alte Prestazioni (edizione.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) Riccardo Rotondo
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
Enabling Grids for E-sciencE ENEA and the EGEE project gLite and interoperability Andrea Santoro, Carlo Sciò Enea Frascati, 22 November.
L ABORATÓRIO DE INSTRUMENTAÇÃO EM FÍSICA EXPERIMENTAL DE PARTÍCULAS Enabling Grids for E-sciencE Grid Computing: Running your Jobs around the World.
DataGrid WP1 Massimo Sgaravatto INFN Padova. WP1 (Grid Workload Management) Objective of the first DataGrid workpackage is (according to the project "Technical.
The EDGeS project receives Community research funding 1 SG-DG Bridges Zoltán Farkas, MTA SZTAKI.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE middleware: gLite Data Management EGEE Tutorial 23rd APAN Meeting, Manila Jan.
Enabling Grids for E-sciencE Introduction Data Management Jan Just Keijser Nikhef Grid Tutorial, November 2008.
June 24-25, 2008 Regional Grid Training, University of Belgrade, Serbia Introduction to gLite gLite Basic Services Antun Balaž SCL, Institute of Physics.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
E-infrastructure shared between Europe and Latin America FP6−2004−Infrastructures−6-SSA gLite Information System Pedro Rausch IF.
Segundo Taller Latino Americano de Computación GRID – Primer Taller Latino Americano de EELA – Primer Tutorial Latino Americano de EELA
Glite. Architecture Applications have access both to Higher-level Grid Services and to Foundation Grid Middleware Higher-Level Grid Services are supposed.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE Site Architecture Resource Center Deployment Considerations MIMOS EGEE Tutorial.
INFSO-RI Enabling Grids for E-sciencE Αthanasia Asiki Computing Systems Laboratory, National Technical.
INFSO-RI Enabling Grids for E-sciencE Introduction Data Management Ron Trompert SARA Grid Tutorial, September 2007.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Alexandre Duarte CERN IT-GD-OPS UFCG LSD 1st EELA Grid School.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Grid2Win: Porting of gLite middleware to.
EGI-Engage Data Services and Solutions Part 1: Data in the Grid Vincenzo Spinoso EGI.eu/INFN Data Services.
Segundo Taller Latino Americano de Computación GRID – Primer Taller Latino Americano de EELA – Primer Tutorial Latino Americano de EELA
The Institute of High Energy of Physics, Chinese Academy of Sciences Sharing LCG files across different platforms Cheng Yaodong, Wang Lu, Liu Aigui, Chen.
13th EELA Tutorial, La Antigua, 18-19, October E-infrastructure shared between Europe and Latin America FP6−2004−Infrastructures−6-SSA
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) Riccardo Bruno
EGEE-II INFSO-RI Enabling Grids for E-sciencE Overview of gLite, the EGEE middleware Mike Mineter Training Outreach Education National.
INFSO-RI Enabling Grids for E-sciencE gLite Overview Riccardo Bruno, Salvatore Scifo gLite - Tutorial Catania, dd.mm.yyyy.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Services for Distributed e-Infrastructure Access Tiziana Ferrari on behalf.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
CE design report Luigi Zangrando
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI solution for high throughput data analysis Peter Solagna EGI.eu Operations.
EGEE Data Management Services
Gri2Win: Porting gLite to run under Windows XP Platform
Grid2Win Porting of gLite middleware to Windows XP platform
gLite Basic APIs Christos Filippidis
Grid Computing: Running your Jobs around the World
StoRM: a SRM solution for disk based storage systems
Vincenzo Spinoso EGI.eu/INFN
Practical: The Information Systems
gLite Grid Services Salma Saber
GDB 8th March 2006 Flavia Donno IT/GD, CERN
Comparison of LCG-2 and gLite v1.0
Introduction to gLite GRID Enviroment
Introduction to Data Management in EGI
Grid2Win: Porting of gLite middleware to Windows XP platform
Introduction to Grid Technology
Grid2Win: Porting of gLite middleware to Windows XP platform
Grid Services Ouafa Bentaleb CERIST, Algeria
CRC exercises Not happy with the way the document for testbed architecture is progressing More a collection of contributions from the mware groups rather.
Short update on the latest gLite status
Gri2Win: Porting gLite to run under Windows XP Platform
From Prototype to Production Grid
EGEE Middleware: gLite Information Systems (IS)
The GENIUS portal and the GILDA t-Infrastructure
Grid Introduction and gLite Overview
gLite Grid Services Riccardo Bruno
Overview of gLite Middleware
Installation/Configuration
Information Services Claudio Cherubino INFN Catania Bologna
Presentation transcript:

The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) Overveiw of the gLite middleware Yaodong Cheng IHEP, CAS Asia EPIKH School for Grid Site Administrators Beijing,

EPIKH School for Grid Site Administrators, Beijing, /33 Outline Introduction gLite main components Site architecture and deployment

EPIKH School for Grid Site Administrators, Beijing, /33 gLite Grid gLite Grid System aims to: Integrate Virtualize Manage RESOURCEs and SERVICEs across different VOs The gLite middleware is the set of software packages able to do this

EPIKH School for Grid Site Administrators, Beijing, /33 Definitions Computational Resource –The machine where users can execute programs (called jobs), can store or access to stored data in a transparent fashion; no matter where they are located. Job –An executable or a batch script that the users want to execute on the Worker Node Job Submission –Is the action when the best available computational resource is assigned by the system to the user in order to execute the job.

EPIKH School for Grid Site Administrators, Beijing, /33 Overview The user joins to a VO INTERNET

EPIKH School for Grid Site Administrators, Beijing, /33 The user joins to a VO Each VO shares grid resources to other VOs accordingly to several policies. INTERNET Overview

EPIKH School for Grid Site Administrators, Beijing, /33 The user joins to a VO Each VO shares grid resources to other Vos accordingly to several policies. The Grid middleware allow to use and share: –“Computing Elements (CE)” –“Storage Element (SE)” INTERNET Overview

EPIKH School for Grid Site Administrators, Beijing, /33 Overview The users join to a VO Each VO shares grid resources to other Vos accordingly to several policies. The Grid middleware allow to use and share: –“Computing Elements (CE)” –“Storage Element (SE)” Plus Additional services to empower the capabilities of the Grid INTERNET

EPIKH School for Grid Site Administrators, Beijing, /33 The users join to a VO Each VO shares grid resources to other Vos accordingly to several policies. The Grid middleware allow to use and share: –“Computing Elements (CE)” –“Storage Element (SE)” Plus Additional servicees to empower the capabilities of the Grid Result: COLLABORATION INTERNET Overview

EPIKH School for Grid Site Administrators, Beijing, /33 Grid Requirements Heterogeneous (OSes, Devs, Apps.) VO Resource Sharing (Management, Security and Accounting) Resource Utilisation (Reservation, Metering, Monitoring and Logging) Job Execution (VO access, QoS, SLA) Data Services (Integration, Provisioning, Cataloguing, Metadata) Security (Authentication, Authorisation and Auditing) Administrative Costs (Provisioning, Deployment and Configuration) Scalability Availability (Disaster Recovery, Fault Management) Specific Requirements: (EGEE: HEP, BioMed)

EPIKH School for Grid Site Administrators, Beijing, /33 gLite evolution GTK2Web services based gLite-2gLite-1LCG-2LCG-1gLite-3 GTK4 gLite - Lightweight Middleware for Grid Computing –

EPIKH School for Grid Site Administrators, Beijing, /33 gLite Main components services User Interface (UI) User Interface (UI):The place where users logon to the Grid Computing Element (CE) Computing Element (CE): A batch queue on a site’s computers where the user’s job is executed Storage Element (SE) Storage Element (SE): provides (large-scale) storage for files Resource Broker (RB/WMS) Resource Broker (RB/WMS): Matches the user requirements with the available resources on the Grid Information System Information System: Characteristics and status of CE and SE (Uses “GLUE schema”)

EPIKH School for Grid Site Administrators, Beijing, /33 The User Interface (UI) is the user entry point normally considered the user workstation. It is normally considered as a WMS component. The Resource Broker (RB) or Workload Management System (WMS) its a set of services having the responsability to find the best available computing element where submit user’ jobs in a transparent fashion The Logging and bookkeeping service (LB), keep track of user job execution in terms of statuses: Ready, Scheduled, Waiting, Running, Done The Computing element (CE) is the computational resource, the entry point to a cluster or PCs handled by a job queue management system; in particular: TORQUE, PBS, LSF, CONDOR The Worker Nodes are the machines where jobs are really executed and managed by the CE’ queue management system Components

EPIKH School for Grid Site Administrators, Beijing, /33 The Information System and Monitoring (IS and MON now deprecated), maintain data related to available grid resources and their health statuses. The Virtual Organization Management service (VOMS), is the way gLite improves the management of authentication and authorization to the Grid resources. The VOMS allows to their own members to define different access rights to VO’ resources The Storage element (SE) and the File catalogue (LFC), allow to manage Grid files and offer a mechanism to locate them easily for users and jobs. Components

EPIKH School for Grid Site Administrators, Beijing, /33 Components interaction File Replica Catalogue Logging & Book-keeping ResourceBroker StorageResource ComputingResource = batch queue InformationService Job Status Datasets info Author. &Authen. Job Submit Event Job Query Job Status Input files Input Output Output files Publish Resource info User/Grid interface

EPIKH School for Grid Site Administrators, Beijing, /33 Job life cycle

EPIKH School for Grid Site Administrators, Beijing, /33 gLite services gLite services can be grouped in 5 main high level set of services Grid Access Security Information system & Monitoring Job Workload Management System Data Management

EPIKH School for Grid Site Administrators, Beijing, /33 gLite – Grid access Two possibilities: APIs or CLI Built on top of them there exist GridPortals and GUIs

EPIKH School for Grid Site Administrators, Beijing, /33 gLite Security User authentication is based on X.509 (PKI)‏ Authorized Certification Authorities (CA) can generate user and service certificates who identify univocally people or Grid services in the whole Grid Each Grid service may support or not certificates coming from different CAs To reduce the vulnerabilities the identification of users in to the grid is done through the use of proxy certificates. Proxyes are singed ‘copies’ of the original user certificate, having a limited lifetime. The use of Proxy certificates allows the following: Delegation: Any grid service can operates on the user behalf making signed copies of the original proxy. (Single Sign On) Add additional info (Add VO specific information provided by VOMS)‏ Store a long term proxy on a secure server (MyProxy) Renewal (A Proxy close to the expiration time can be automatically renewed)‏

EPIKH School for Grid Site Administrators, Beijing, /33 gLite Security: AutH/AhtZ Authentication –The user receive a certificate from a CA (PKI third party) –He connects to the UI via SSH –He Creates the proxy (single sign on) – All grid services will use this proxy to identify the user. Authorization –The user has to subscribe to a VO (VOMS) –The VO establishes the user rights –In any Grid service it will be verified if the user belongs to the VO and assigns the proper access rights to the user –A special configuration file named the “gridmapfile”, maintains the correspondency between grid users and resource users (unix pool accounts)

EPIKH School for Grid Site Administrators, Beijing, /33 Virtual Organization Membership Service –Manages many Virtual Organizations (VOs). –Multiple user roles can be defined inside each VO  Extends the X509 schema  Extensions are Digitally Signed –Service maintenance provided by a web front-end –Support MyProxy (stored proxies) –Allow the access rights by VO or by Role –Each Grid site associates to each VO member or role a a local authentication (unix pool accounts)‏.  Allows to implement fine grained security policies to grid resources VOMS

EPIKH School for Grid Site Administrators, Beijing, /33 MyProxy –Stores a long term proxy certificates to allow the automatic proxy renewal mechanism –Allow to execute jobs requesting a computation time larger that the normal proxy liferime (normally 12 hrs)  The WMS is the responsable of the proxy renewal  Users should not use long lived proxy directly –Allow the user to access grid resources without carrying out the public and private keys.  Proxy Delegation MyProxy

EPIKH School for Grid Site Administrators, Beijing, /33 Berkeley Database Information Index (BDII) The information hierarchically stored via tree modeling (The LDAP implementation of GLUE) GRISStores information at resource level Site BDII/GIIS (deprecated) Stores information at site level BDIIStores information at VO level VO Level Site Level Resource Level Information Systema and Monitoring

EPIKH School for Grid Site Administrators, Beijing, /33 WMS set of middleware components responsible of distribution and management of jobs across Grid resources. Two core components of WMS WM : accepts and satisfy requests for job management. (Matchmaking) is the process of assigning the best available resource. Logging & Bookeeping : keeps track of job execution in term of events: (Submitted, Running, Done,...) Workload Management

EPIKH School for Grid Site Administrators, Beijing, /33 Service that represents the computing resource that is responsible of to manage the queue of jobs to execute The CE may be used by a Generic Client: an end-user interacting directly with the Computing Element, or by the Workload Manager, which submits a given job to an appropriate CE found by the matchmaking process. Two job submission models : PUSH (Eager Scheduling) (jobs pushed to CE), PULL (Lazy Scheduling) (jobs coming from WMS when CE has free slots) Computing Element

EPIKH School for Grid Site Administrators, Beijing, /33 CEAComputing Element Acceptance JCJob Controller MONMonitoring LRMSLocal Resource Management System Computing Element: Architecture

EPIKH School for Grid Site Administrators, Beijing, /33 SE Services are at least: Storage back-end (Drivers and Hardware) Storage Resource Manager (SRM) Interface ( Interface to manage the specific storage solution : dpm, rfio, …) Transfer service ( Protocols: GridFTP ( gsiftp ), glubus-url-copy, …) Native POSIX like file I/O API (GFAL) Auxiliary Accounting and Logging services ( accounting: SAGE) Data are stored on Disk Pool Servers or Mass Storage Systems Storage Element

EPIKH School for Grid Site Administrators, Beijing, /33 OS like file access metaphor. LFN (Logical file name) GUID (Grid unique identifier) SimLinks SURL (Site URL) TURL (Transfer URL) LFC File Catalog

EPIKH School for Grid Site Administrators, Beijing, /33 AMGA – Arda Metadata Grid Application Metadata is data about data On the Grid: information about files  Describe files  Locate files based on their contents through queries But also makes DB access a simple task on the Grid  Many Grid applications need structured data  Many applications require only simple schemas  Can be modelled as metadata  Main advantage: better integration with the Grid environment  Metadata Service is a Grid component  Grid security  Hide DB heterogeneity Metadata

EPIKH School for Grid Site Administrators, Beijing, /33 Deployment Considerations Basic Site Architecture –User Interface (UI): User login environment –Storage Element (SE): Storage resource Services –Computing Element (CE): Gateway to computing resources  Small site will also install Site BDII Batch system manager Shared file system (NFS, Lustre…) for VO software –Worker Node (WN): job execution machine –MON: R-GMA server for accounting UI MON SE Switch CE Shared File system WN

EPIKH School for Grid Site Administrators, Beijing, /33 Deployment considerations Central services –BDII: Top Level information system service  Available regionally –Resource Broker (RB): job management  RB or WMS –VO services  LFC: LCG file catalogue Maps VO’s logical file names to physical file names  VOMS: VO membership service Manages list of VO Members UI MON SE Switch CE Shared File system WN Central services BDII RB VO services LFC VOMS

EPIKH School for Grid Site Administrators, Beijing, /33 Network consideration Grid Services –Public IP required by each grid service –Forward and reverse DNS configuration Worker node –Public IP for parallel stream file transfer –Private IP is possible  Single stream transfer for WNs to remote SE Storage Elements –Bandwidth to and from worker Nodes –Bandwidth to WAN Network Firewall requirements –

EPIKH School for Grid Site Administrators, Beijing, /33 Questions …