Based on material by Sergio Andreozzi INFN-CNAF

Slides:



Advertisements
Similar presentations
Grid Computing - DCC/FCUP
Advertisements

EGEE-II INFSO-RI Enabling Grids for E-sciencE The gLite middleware distribution OSG Consortium Meeting Seattle,
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE Middleware Claudio Grandi (INFN – Bologna) Workshop Commissione.
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Services Abderrahman El Kharrim
EGEE-II INFSO-RI Enabling Grids for E-sciencE Slides based on material from Sergio Andreozzi INFN-CNAF and from Pedro.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Based on material by Sergio Andreozzi INFN-CNAF OMII-Europe All-Hands.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Giuseppe Andronico INFN Sezione di Catania.
IST E-infrastructure shared between Europe and Latin America Overview of gLite Middleware Pedro Henrique Rausch Bello Instituto.
Makrand Siddhabhatti Tata Institute of Fundamental Research Mumbai 17 Aug
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Configuring and Maintaining EGEE Production.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
Enabling Grids for E-sciencE ENEA and the EGEE project gLite and interoperability Andrea Santoro, Carlo Sciò Enea Frascati, 22 November.
L ABORATÓRIO DE INSTRUMENTAÇÃO EM FÍSICA EXPERIMENTAL DE PARTÍCULAS Enabling Grids for E-sciencE Grid Computing: Running your Jobs around the World.
INFSO-RI Enabling Grids for E-sciencE Workload Management System Mike Mineter
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Security and Job Management.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE – paving the way for a sustainable infrastructure.
Enabling Grids for E-sciencE Introduction Data Management Jan Just Keijser Nikhef Grid Tutorial, November 2008.
INFSO-RI Enabling Grids for E-sciencE The gLite Workload Management System Elisabetta Molinari (INFN-Milan) on behalf of the JRA1.
June 24-25, 2008 Regional Grid Training, University of Belgrade, Serbia Introduction to gLite gLite Basic Services Antun Balaž SCL, Institute of Physics.
EGEE-II INFSO-RI Enabling Grids for E-sciencE JRA1 in EGEE II Claudio Grandi (INFN and CERN) EGEE II Transition Meeting.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
INFSO-RI Enabling Grids for E-sciencE EGEE Middleware reengineering Claudio Grandi – JRA1 Activity Manager - INFN EGEE Final EU.
Glite. Architecture Applications have access both to Higher-level Grid Services and to Foundation Grid Middleware Higher-Level Grid Services are supposed.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE Site Architecture Resource Center Deployment Considerations MIMOS EGEE Tutorial.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Alexandre Duarte CERN IT-GD-OPS UFCG LSD 1st EELA Grid School.
EGEE-II INFSO-RI Enabling Grids for E-sciencE middleware status and plans Claudio Grandi (INFN and CERN) John White.
13th EELA Tutorial, La Antigua, 18-19, October E-infrastructure shared between Europe and Latin America FP6−2004−Infrastructures−6-SSA
EGEE-II INFSO-RI Enabling Grids for E-sciencE Overview of gLite, the EGEE middleware Mike Mineter Training Outreach Education National.
First South Africa Grid Training June 2008, Catania (Italy) OVERVIEW of the gLite COMPONENTS Marcello Iacono Manno FIRST.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Services for Distributed e-Infrastructure Access Tiziana Ferrari on behalf.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) Overveiw of the gLite middleware Yaodong Cheng
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI solution for high throughput data analysis Peter Solagna EGI.eu Operations.
EGEE Data Management Services
Gri2Win: Porting gLite to run under Windows XP Platform
Grid2Win Porting of gLite middleware to Windows XP platform
gLite Basic APIs Christos Filippidis
Grid Computing: Running your Jobs around the World
gLite: status and perspectives
Claudio Grandi – JRA1 Activity Manager INFN and CERN
JRA1 Middleware Re-engineering Status Report
StoRM: a SRM solution for disk based storage systems
U.S. ATLAS Grid Production Experience
Practical: The Information Systems
gLite Grid Services Salma Saber
GDB 8th March 2006 Flavia Donno IT/GD, CERN
Comparison of LCG-2 and gLite v1.0
BOSS: the CMS interface for job summission, monitoring and bookkeeping
BOSS: the CMS interface for job summission, monitoring and bookkeeping
gLite Middleware Status
Slides contributed by EGEE Team
Accounting at the T1/T2 Sites of the Italian Grid
Grid2Win: Porting of gLite middleware to Windows XP platform
Introduction to Grid Technology
Grid2Win: Porting of gLite middleware to Windows XP platform
EGEE support for HEP and other applications
BOSS: the CMS interface for job summission, monitoring and bookkeeping
Grid Services Ouafa Bentaleb CERIST, Algeria
Current status of gLite
Grid Deployment Board meeting, 8 November 2006, CERN
Short update on the latest gLite status
Gri2Win: Porting gLite to run under Windows XP Platform
Data Management cluster summary
Report on GLUE activities 5th EU-DataGRID Conference
Overview of the EGEE project and the gLite middleware
The GENIUS portal and the GILDA t-Infrastructure
gLite Grid Services Riccardo Bruno
Overview of gLite Middleware
gLite The EGEE Middleware Distribution
Presentation transcript:

Based on material by Sergio Andreozzi INFN-CNAF The middleware Based on material by Sergio Andreozzi INFN-CNAF OMII-Europe All-Hands Meeting Bologna, 12-13 February 2007

Disclaimer This presentation is based on materials provided and authorized by the EGEE project and is freely available to download and use according to the terms of the following license: http://creativecommons.org/licenses/by-nc-sa/2.5/ gLite @ OMII-Europe All-Hands meeting, Bologna, 12-13 February 2007

OUTLINE The EGEE Project The gLite middleware Objective Relationship to other projects The gLite middleware Middleware decomposition Foundation High-level services gLite @ OMII-Europe All-Hands meeting, Bologna, 12-13 February 2007

Part I The EGEE Project gLite @ OMII-Europe All-Hands meeting, Bologna, 12-13 February 2007

The EGEE project EGEE EGEE-II EGEE-III Objectives 1 April 2004 – 31 March 2006 71 partners in 27 countries, federated in regional Grids EGEE-II 1 April 2006 – 31 March 2008 91 partners in 32 countries 13 Federations EGEE-III 1 April 2008 – 31 March 2010 More than 120 partners Objectives Large-scale, production-quality infrastructure for e-Science Attracting new resources and users from industry as well as science Improving and maintaining “gLite” Grid middleware US partners in EGEE-II: Univ. Chicago Univ. South. California Univ. Wisconsin RENCI gLite @ OMII-Europe All-Hands meeting, Bologna, 12-13 February 2007

Main lines of the EGEE project Infrastructure operation Currently includes sites across 39 countries Continuous monitoring of grid services & automated site configuration/management Middleware Production quality middleware distributed under business friendly open source licence User Support - Managed process from first contact through to production usage Training Expertise in grid-enabling applications Online helpdesk Networking events (User Forum, Conferences etc.) Interoperability Expanding geographical reach and interoperability with related infrastructures TWGRID KnowARC gLite @ OMII-Europe All-Hands meeting, Bologna, 12-13 February 2007

Applications on EGEE Applications from an increasing number of domains Astrophysics Computational Chemistry Earth Sciences Financial Simulation Fusion Geophysics High Energy Physics Life Sciences Multimedia Material Sciences … Book of abstracts: http://doc.cern.ch//archive/electronic/egee/tr/egee-tr-2006-005.pdf gLite @ OMII-Europe All-Hands meeting, Bologna, 12-13 February 2007

EU projects related to EGEE GRID gLite @ OMII-Europe All-Hands meeting, Bologna, 12-13 February 2007

Sustainability: Beyond EGEE-II Need to prepare for permanent Grid infrastructure Ensure a reliable and adaptive support for all sciences Independent of short project funding cycles Infrastructure managed in collaboration with national grid initiatives Expand the idea and problems of the JRU gLite @ OMII-Europe All-Hands meeting, Bologna, 12-13 February 2007

Part II The gLite middleware Programming the Grid with gLite http://doc.cern.ch//archive/electronic/egee/tr/egee-tr-2006-001.pdf gLite @ OMII-Europe All-Hands meeting, Bologna, 12-13 February 2007

Middleware structure Applications Applications have access both to Higher-level Grid Services and to Foundation Grid Middleware Higher-Level Grid Services are supposed to help the users building their computing infrastructure but should not be mandatory Foundation Grid Middleware will be deployed on the EGEE infrastructure Must be complete and robust Should allow interoperation with other major grid infrastructures Should not assume the use of Higher-Level Grid Services Higher-Level Grid Services Workload Management Replica Management Visualization Workflow Grid Economies ... Foundation Grid Middleware Security model and infrastructure Computing (CE) and Storage Elements (SE) Accounting Information and Monitoring Overview paper http://doc.cern.ch//archive/electronic/egee/tr/egee-tr-2006-001.pdf gLite @ OMII-Europe All-Hands meeting, Bologna, 12-13 February 2007

gLite Services Decomposition 6 High Level Services + CLI & API Legend: Available Foreseen in the architecture (only Job provenance will be available by the end of EGEE-II) Site proxy – allows outbound connectivity from hidden networks gLite @ OMII-Europe All-Hands meeting, Bologna, 12-13 February 2007

gLite components UI: User Interface CE: Computing Element SE: Storage Element WN: Worker Node WMS: Workload Management System VOMS: Virtual Organization Membership Service LB: Logging and Bookkeeping MonBOX: monitoring LFC: Logical File Catalog BDII: Berkeley Database Information Index, stores all infomation about the resources available in the grid infrastructure gLite @ OMII-Europe All-Hands meeting, Bologna, 12-13 February 2007

Job Workflow in gLite LFC Catalog Information Service Resource Broker UI JDL Input “sandbox” DataSets info voms-proxy-init Information Service Output “sandbox” SE & CE info Resource Broker Output “sandbox” Expanded JDL Job Submit Event Job Query Job Status Input “sandbox” + Broker Info Publish Author. &Authen. Storage Element Globus RSL Job Submission Service Job Status Logging & Book-keeping Computing Element Job Status gLite @ OMII-Europe All-Hands meeting, Bologna, 12-13 February 2007

Job Workflow in gLite LFC Catalog Information Index Resource Broker UI JDL Input “sandbox” DataSets info voms-proxy-init Information Index Output “sandbox” SE & CE info Resource Broker Output “sandbox” Expanded JDL Job Submit Event Job Query Job Status Input “sandbox” + Broker Info Publish Author. &Authen. Storage Element Globus RSL Job Submission Service Job Status Logging & Book-keeping WMProxy Computing Element Job Status gLite @ OMII-Europe All-Hands meeting, Bologna, 12-13 February 2007

High Level Services: Workload Manag. Resource brokering, workflow management, I/O data management Web Service interface: WMProxy Task Queue: keep non matched jobs Information SuperMarket: optimized cache of information system Match Maker: assigns jobs to resources according to user requirements Job submission & monitoring Condor-G ICE (to CREAM) External interactions: Information System Data Catalogs Logging&Bookkeeping Policy Management system (G-PBox) OSG Consortium Meeting - Seattle - 21-23 August 2006

Grid Foundation: Security Authentication based on X.509 PKI infrastructure Certificate Authorities (CA) issue (long lived) certificates identifying individuals (much like a passport) Commonly used in web browsers to authenticate to sites Trust between CAs and sites is established (offline) In order to reduce vulnerability, on the Grid user identification is done by using (short lived) proxies of their certificates Proxies can Be delegated to a service such that it can act on the user’s behalf Include additional attributes (like VO information via the VO Membership Service VOMS) Be stored in an external proxy store (MyProxy) Be renewed (in case they are about to expire) gLite @ OMII-Europe All-Hands meeting, Bologna, 12-13 February 2007

Grid foundation: Information Systems Generic Information Provider (GIP) Provides LDIF information about a grid service in accordance to the GLUE Schema BDII: Information system in gLite 3.0 (by LCG) LDAP database that is updated by a process More than one DBs is used separate read and write A port forwarder is used internally to select the correct DB GIP Cache Provider Plugin LDIF File Config File 2171 LDAP 2172 2173 2170 Port Fwd Update DB & Modify DB Swap DBs gLite @ OMII-Europe All-Hands meeting, Bologna, 12-13 February 2007

Grid foundation: Information Systems R-GMA: provides a uniform method to access and publish distributed information and monitoring data Used for job and infrastructure monitoring in gLite 3.0 Working to add authorization Service Discovery: Provides a standard set of methods for locating Grid services Currently supports R-GMA, BDII and XML files as backends Will add local cache of information Used by some DM and WMS components in gLite 3.0 gLite @ OMII-Europe All-Hands meeting, Bologna, 12-13 February 2007

Grid foundation: Computing Element Three flavours available now: LCG-CE (GT2 GRAM) In production now but will be phased-out next year gLite-CE (GSI-enabled Condor-C) Already deployed but still needs thorough testing and tuning. Being done now CREAM (WS-I based interface) Deployed on the JRA1 preview test-bed. After a first testing phase will be certified and deployed together with the gLite-CE Our contribution to the OGF-BES group for a standard WS-I based CE interface CREAM and WMProxy demo at SC06! BLAH is the interface to the local resource manager (via plug-ins) CREAM and gLite-CE Information pass-through: pass parameters to the LRMS to help job scheduling WMS, Clients Information System Grid Computing Element bdII R-GMA CEMon Site glexec + LCAS/ LCMAPS BLAH WN LRMS gLite @ OMII-Europe All-Hands meeting, Bologna, 12-13 February 2007

Grid foundation: Accounting APEL: Uses R-GMA to propagate and display job accounting information for infrastructure monitoring Reads LRMS log files provided by LCG-CE and BLAH Preparing an update for gLite 3.0 to use the files form BLAH DGAS: Collects, stores and transfers accounting data. Compliant with privacy requirements Reads LRMS log files provided by LCG-CE and BLAH. Stores information in a site database (HLR) and optionally in a central HLR. Access granted to user, site and VO administrators Not yet certified in gLite 3.0. Deployment plan: DGAS is in certification at INFN It will send records to the GOC via DGAS2APEL gLite @ OMII-Europe All-Hands meeting, Bologna, 12-13 February 2007

Grid foundation: Storage Element Common interface: SRMv1, migrating to SRM v2.2 Various implementation from LCG and other external projects disk-based: DPM, dCache / tape-based: Castor, dCache Support for ACLs in DPM (in future in Castor and dCache) After the summer: synchronization of ACLs between SEs Common rfio library for Castor and DPM being added Posix-like file access: Grid File Access Layer (GFAL) by LCG Support for ACL in the SRM layer (currently in DPM only) Support for SRMv2 being added now gLite I/O Support for ACLs from the file catalog and interfaced to Hydra for data encryption Not certified in gLite 3.0. To be dismissed when all functionalities will be also available in GFAL. gLite @ OMII-Europe All-Hands meeting, Bologna, 12-13 February 2007

High Level Services: Catalogues File Catalogs LFC from LCG In June: interface to POOL. In the summer: LFC replication and backup. Hydra: stores keys for data encryption Being interfaced to GFAL (done by July) Currently only one instance, but in future there will be 3 instances: at least 2 need to be available for decryption. Not yet certified in gLite 3.0. Certification will start soon. AMGA Metadata Catalog: generic metadata catalogue Joint JRA1-NA4 (ARDA) development. Used mainly by Biomed gLite @ OMII-Europe All-Hands meeting, Bologna, 12-13 February 2007

High Level Services: File transfer FTS: Reliable, scalable and customizable file transfer Manages transfers through channels mono-directional network pipes between two sites Web service interface Automatic discovery of services Support for different user and administrative roles Adding support for pre-staging and new proxy renewal schema Support for SRMv2.2, delegation, VOMS-aware proxy renewal in certification gLite @ OMII-Europe All-Hands meeting, Bologna, 12-13 February 2007

High Level Services: Workload mgmt. WMS helps the user accessing computing resources Resource brokering, management of job input/output, ... LCG-RB: GT2 + Condor-G To be replaced when the gLite WMS proves to be reliable gLite WMS: Web service (WMProxy) + Condor-G Management of complex workflows (DAGs) and compound jobs bulk submission and shared input sandboxes support for input files on different servers (scattered sandboxes) Support for shallow resubmission of jobs Job File Perusal: file peeking during job execution Supports collection of information from CEMon, BDII, R-GMA and from DLI and StorageIndex data management interfaces Support for parallel jobs (MPI) when the home dir is not shared Deployed for the first time in gLite 3.0 gLite @ OMII-Europe All-Hands meeting, Bologna, 12-13 February 2007

WMS/LB/UI and CE New WMS deployed and thoroughly debugged CMS: 100 collections * 200 jobs/collection, 3 UIs, 33 CEs ~ 2.5 h to submit jobs 0.5 seconds/job ~ 17 hours to transfer jobs to a CE 3 seconds/job 26K jobs/day Negligible failure rate due to WMS Shallow resubmission failure rate drops to less than 1% with 3 resubmissions CMS Stability problems investigating also other deployment scenarios to make it more robust gLite CE still to be tested and optimized ATLAS gLite @ OMII-Europe All-Hands meeting, Bologna, 12-13 February 2007

High Level Services: Workflows Direct Acyclic Graph (DAG) is a set of jobs where the input, output, or execution of one or more jobs depends on one or more other jobs A Collection is a group of jobs with no dependencies basically a collection of JDL’s nodeE nodeC nodeA nodeD nodeB A Parametric job is a job having one or more attributes in the JDL that vary their values according to parameters Using compound jobs it is possible to have one shot submission of a (possibly very large, up to thousands) group of jobs Submission time reduction Single call to WMProxy server Single Authentication and Authorization process Sharing of files between jobs Availability of both a single Job ID to manage the group as a whole and an ID for each single job in the group gLite @ OMII-Europe All-Hands meeting, Bologna, 12-13 February 2007

High Level Services: Job Information Logging and Bookkeeping service Tracks jobs during their lifetime (in terms of events) LBProxy for fast access L&B API and CLI to query jobs Support for “CE reputability ranking“: maintains recent statistics of job failures at CE’s and feeds back to WMS to aid planning Job Provenance: stores long term job information Supports job rerun If deployed will also help unloading the L&B Not yet certified in gLite 3.0. gLite @ OMII-Europe All-Hands meeting, Bologna, 12-13 February 2007

Highlights: Job Priorities Applications ask for the possibility to diversify the access to fast/slow queues depending on the user role/group inside the VO GPBOX is a tool that provides the possibility to define, store and propagate fine-grained VO policies based on VOMS groups and roles enforcement of policies at sites: sites may accept/reject policies Not yet certified. Certification will start when requested by the TCG. Current activities: test job prioritization without GPBOX: - Map VOMS groups to batch system shares - Publish info on the share in the CE GLUE 1.2 schema (VOView) - WMS match-making depending on submitter VOMS certificate - Settings are not dynamic (via e-mail or CE updates) - GIP available for Torque/Maui only. Working on the LSF one   - mainly a deployment issue gLite @ OMII-Europe All-Hands meeting, Bologna, 12-13 February 2007

Summary gLite 3 is the next generation middleware for grid computing developed according to a well defined process controlled by the EGEE Technical Coordination Group deployed on the EGEE production infrastructure More than 200 sites development is continuing to provide increased robustness, usability, and functionality On the preview testbed CREAM, Job Provenance, glexec on the WNs, GPBOX gLite sources: http://glite.cvs.cern.ch/cgi-bin/glite.cgi/ gLite @ OMII-Europe All-Hands meeting, Bologna, 12-13 February 2007

www.glite.org gLite @ OMII-Europe All-Hands meeting, Bologna, 12-13 February 2007