EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE www.eu-egee.org www.glite.org middleware status and plans Claudio Grandi (INFN and CERN) John White.

Slides:



Advertisements
Similar presentations
Middleware Roadmap for GridPP3 R.Middleton GridPP16- QMUL.
Advertisements

Grid Computing - DCC/FCUP
EGEE-II INFSO-RI Enabling Grids for E-sciencE The gLite middleware distribution OSG Consortium Meeting Seattle,
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE Middleware Claudio Grandi (INFN – Bologna) Workshop Commissione.
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Services Abderrahman El Kharrim
EGEE-II INFSO-RI Enabling Grids for E-sciencE Slides based on material from Sergio Andreozzi INFN-CNAF and from Pedro.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Based on material by Sergio Andreozzi INFN-CNAF OMII-Europe All-Hands.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Giuseppe Andronico INFN Sezione di Catania.
EGEE-II INFSO-RI Enabling Grids for E-sciencE The middleware Roberto Barbera University of Catania and INFN ISSGC’06 Ischia,
IST E-infrastructure shared between Europe and Latin America Overview of gLite Middleware Pedro Henrique Rausch Bello Instituto.
Makrand Siddhabhatti Tata Institute of Fundamental Research Mumbai 17 Aug
INFSO-RI Enabling Grids for E-sciencE XACML and G-PBox update MWSG 14-15/09/2005 Presenter: Vincenzo Ciaschini.
EGEE-II INFSO-RI Enabling Grids for E-sciencE gLite Overview Gang Chen CC-IHEP, Chinese Academy of Sciences
INFSO-RI Enabling Grids for E-sciencE Comparison of LCG-2 and gLite Author E.Slabospitskaya Location IHEP.
INFSO-RI Enabling Grids for E-sciencE gLite Data Management Services - Overview Mike Mineter National e-Science Centre, Edinburgh.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
INFSO-RI Enabling Grids for E-sciencE Status and Plans of gLite Middleware Erwin Laure 4 th ARDA Workshop 7-8 March 2005.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Security and Job Management.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks JRA1 summary Claudio Grandi EGEE-II JRA1.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE middleware: gLite Data Management EGEE Tutorial 23rd APAN Meeting, Manila Jan.
INFSO-RI Enabling Grids for E-sciencE The gLite Workload Management System Elisabetta Molinari (INFN-Milan) on behalf of the JRA1.
EGEE-II INFSO-RI Enabling Grids for E-sciencE JRA1 in EGEE II Claudio Grandi (INFN and CERN) EGEE II Transition Meeting.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Features and Future Frédéric Hemmer - CERN Deputy Head of IT Department.
Stefano Belforte INFN Trieste 1 Middleware February 14, 2007 Resource Broker, gLite etc. CMS vs. middleware.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America gLite Overview Roberto Barbera Univ. of.
INFSO-RI Enabling Grids for E-sciencE gLite Data Management and Interoperability Peter Kunszt (JRA1 DM Cluster) 2 nd EGEE Conference,
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
1 Andrea Sciabà CERN Critical Services and Monitoring - CMS Andrea Sciabà WLCG Service Reliability Workshop 26 – 30 November, 2007.
INFSO-RI Enabling Grids for E-sciencE EGEE Middleware reengineering Claudio Grandi – JRA1 Activity Manager - INFN EGEE Final EU.
Glite. Architecture Applications have access both to Higher-level Grid Services and to Foundation Grid Middleware Higher-Level Grid Services are supposed.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE Site Architecture Resource Center Deployment Considerations MIMOS EGEE Tutorial.
INFSO-RI Enabling Grids for E-sciencE The gLite File Transfer Service: Middleware Lessons Learned form Service Challenges Paolo.
Enabling Grids for E-sciencE gLite for ATLAS Production Simone Campana, CERN/INFN ATLAS production meeting May 2, 2005.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Alexandre Duarte CERN IT-GD-OPS UFCG LSD 1st EELA Grid School.
INFSO-RI Enabling Grids for E-sciencE gLite Middleware Status Frédéric Hemmer, JRA1 Manager, CERN On behalf of JRA1.
INFSO-RI Enabling Grids for E-sciencE /10/20054th EGEE Conference - Pisa1 gLite Configuration and Deployment Models JRA1 Integration.
EGEE-II INFSO-RI Enabling Grids for E-sciencE gLite and Condor present and future Claudio Grandi (INFN – Bologna)
David Adams ATLAS ATLAS-ARDA strategy and priorities David Adams BNL October 21, 2004 ARDA Workshop.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE Middleware reengineering Claudio Grandi (INFN – Bologna) EGEE Final.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE JRA1 All Hands Meeting July 10-12, 2006 Pilsen, CZ.
INFSO-RI Enabling Grids for E-sciencE Policy management and fair share in gLite Andrea Guarise HPDC 2006 Paris June 19th, 2006.
Enabling Grids for E-sciencE INFSO-RI Enabling Grids for E-sciencE Gavin McCance GDB – 6 June 2007 FTS 2.0 deployment and testing.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks ROCs Top 5 Middleware Issues Daniele Cesini,
INFSO-RI Enabling Grids for E-sciencE gLite Test and Certification Effort Nick Thackray CERN.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Data Management cluster summary David Smith JRA1 All Hands meeting, Catania, 7 March.
INFSO-RI Enabling Grids for E-sciencE DGAS, current status & plans Andrea Guarise EGEE JRA1 All Hands Meeting Plzen July 11th, 2006.
INFSO-RI Enabling Grids for E-sciencE Upcoming Releases Markus Schulz CERN SA1 15 th June 2005.
Enabling Grids for E-sciencE EGEE-III-INFSO-RI EGEE and gLite are registered trademarks Francesco Giacomini JRA1 Activity Leader.
SAM Status Update Piotr Nyczyk LCG Management Board CERN, 5 June 2007.
Status of gLite-3.0 deployment and uptake Ian Bird CERN IT LCG-LHCC Referees Meeting 29 th January 2007.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE Middleware reengineering.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Overview of gLite, the EGEE middleware Mike Mineter Training Outreach Education National.
First South Africa Grid Training June 2008, Catania (Italy) OVERVIEW of the gLite COMPONENTS Marcello Iacono Manno FIRST.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Job Management Claudio Grandi.
INFSO-RI Enabling Grids for E-sciencE Padova site report Massimo Sgaravatto On behalf of the JRA1 IT-CZ Padova group.
INFSO-RI Enabling Grids for E-sciencE EGEE Middleware Claudio Grandi (INFN – Bologna) EGEE-NAREGI meeting March 20-22,
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Enabling Grids for E-sciencE Claudio Cherubino INFN DGAS (Distributed Grid Accounting System)
EGEE-II INFSO-RI Enabling Grids for E-sciencE Status of INFN middleware in gLite Claudio Grandi INFNGrid EB CNAF,
EGEE Data Management Services
gLite: status and perspectives
Claudio Grandi – JRA1 Activity Manager INFN and CERN
GDB 8th March 2006 Flavia Donno IT/GD, CERN
gLite Middleware Status
Short update on the latest gLite status
Francesco Giacomini – INFN JRA1 All-Hands Nikhef, February 2008
Data Management cluster summary
Based on material by Sergio Andreozzi INFN-CNAF
gLite The EGEE Middleware Distribution
Presentation transcript:

EGEE-II INFSO-RI Enabling Grids for E-sciencE middleware status and plans Claudio Grandi (INFN and CERN) John White (HIP)

Enabling Grids for E-sciencE EGEE-II INFSO-RI Claudio Grandi, John White - LCG-MB- CERN 30 May Outline Process Status and plans

Enabling Grids for E-sciencE EGEE-II INFSO-RI Claudio Grandi, John White - LCG-MB- CERN 30 May gLite process Process controlled by the Technical Coordination Group Task Forces with developers, applications, testers and deployment experts After gLite 3.0 adopt a continuous release process –No more big-bang releases with fixed deadlines for all –Develop components as requested by users and sites –Deploy or upgrade as soon as testing is satisfactory Major releases synchronized with large scale activities of VOs (SCs) –Next major release foreseen for the autumn

Enabling Grids for E-sciencE EGEE-II INFSO-RI Claudio Grandi, John White - LCG-MB- CERN 30 May SA3 Testing & Certification Functional Tests Testbed Deployment gLite Software Process JRA1 Development Software Error Fixing SA3 Integration Deployment Packages Integration Tests Installation Guide, Release Notes, etc SA1 Pre- Production Scalability Tests Pre-Production Deployment Fail Pass SA1 Production Infrastructure Release Problem Serious problem Directives

Enabling Grids for E-sciencE EGEE-II INFSO-RI Claudio Grandi, John White - LCG-MB- CERN 30 May Middleware structure Applications have access both to Higher-level Grid Services and to Foundation Grid Middleware Higher-Level Grid Services are supposed to help the users building their computing infrastructure but should not be mandatory Foundation Grid Middleware will be deployed on the EGEE infrastructure –Must be complete and robust –Should allow interoperation with other major grid infrastructures –Should not assume the use of Higher-Level Grid Services In line with BS group results

Enabling Grids for E-sciencE EGEE-II INFSO-RI Claudio Grandi, John White - LCG-MB- CERN 30 May gLite status and plans In the following the current status and the plans for each component are given For all components the main priorities for the developers are: –support on the production infrastructure (GGUS, 2 nd line support) –bug-fixing (critical and not) –support for SL(C)4 and for x86-64 and IA64 –Task Forces together with applications and site experts –addressing requests for functionality improvements from users, site administrators, etc... (through the TCG) –improve robustness and usability (efficiency, error reporting,...) References in the presentation from: nIssuesTF but JRA1 has been working with: nIssuesTF

Enabling Grids for E-sciencE EGEE-II INFSO-RI Claudio Grandi, John White - LCG-MB- CERN 30 May Security VOMS: provides a way to add attributes to a certificate proxy. Enables VO policies. In gLite 3.0 (1.a) LCAS/LCMAPS: maps grid to local users. In gLite 3.0 VOMS-aware proxy renewal: used by WMS and available as separate library in one of the next updates of gLite 3.0 (1.d) glexec: change user identity on a service –available on the CE. A service may be made available on the WNs if accepted by sites (support for pilot jobs, 5.k,l) Delegation: delegate some subset of user privileges to another entity. In gLite 3.0 JobRepository: Collects the user, job and user- mapping information from jobs handled by LCMAPS –Not in gLite 3.0, work for later and with OSG

Enabling Grids for E-sciencE EGEE-II INFSO-RI Claudio Grandi, John White - LCG-MB- CERN 30 May Information System BDII: based on an LDAP, by LCG, is used as Information system in gLite 3.0 R-GMA: provides a uniform method to access and publish distributed information and monitoring data –used for job and infrastructure monitoring in gLite 3.0 (6.c,d) Service Discovery: provides a standard set of methods for locating Grid services. –Currently supports R-GMA, BDII and XML files as backends –Used by some DM and WMS components in gLite 3.0 CEMon: Web service to publish status of a computing resource to clients –Supports synchronous queries and asynchronous notification (6.d) –Uses the same information (GIP) used by BDII –In gLite 3 CEMon will be available to the users but the baseline is that the WMS queries the bdII

Enabling Grids for E-sciencE EGEE-II INFSO-RI Claudio Grandi, John White - LCG-MB- CERN 30 May Computing resources LCG-CE: based on GT2 GRAM –To be replaced when other CEs prove reliability gLite-CE: based on GSI enabled Condor-C –supported by Condor. More efficient. Uses BLAH (see below) –deployed for the first time on the PS in gLite 3.0 CREAM: new lightweight web service CE –Not in gLite 3 release. Will need exposure to users on dedicated system. –WSDL interface (5.j) –Will support bulk submission of jobs from WMS and optimization of input/output file transfer (5.d). Uses BLAH BLAH: interfaces the CE and the local batch system –May handle arbitrary information passing from CE to LRMS  patches to support this and logging for accounting being added now –Used by gLite-CE and CREAM

Enabling Grids for E-sciencE EGEE-II INFSO-RI Claudio Grandi, John White - LCG-MB- CERN 30 May Accounting APEL: Uses R-GMA to propagate and display job accounting information for infrastructure monitoring –Reads LRMS log files provided by LCG-CE and BLAH –Preparing an update for gLite 3.0 to use the files form BLAH DGAS: Collects, stores and transfers accounting data. Compliant with privacy requirements –Reads LRMS log files provided by LCG-CE and BLAH. –Stores information in a site database (HLR) and optionally in a central HLR. Access granted to user, site and VO administrators –Not yet certified in gLite 3.0. Deployment plan:  certify and activate local sensors and site HLR in parallel with APEL  replace APEL sensors with DGAS (DGAS2APEL)  certify and activate central HLR. Test scalability to the PS scale –(1.b, 6.a,b)

Enabling Grids for E-sciencE EGEE-II INFSO-RI Claudio Grandi, John White - LCG-MB- CERN 30 May Storage resources - file access Storage Element –Common interface: SRMv1,migrating to SRMv2 (3.a) –Various implementation from LCG and other external projects  disk-based: DPM, dCache; tape-based: Castor, dCache –Support for ACLs in DPM, in future in Castor and dCache (3.c)  After the summer: synchronization of ACLs between SEs (4.4.d) –Common rfio library for Castor and DPM by end of May Posix-like file access: –Grid File Access Layer (GFAL) by LCG  Support for ACL in the SRM layer (currently in DPM only)  Support for SRMv2 being added now (May). In the summer add thread safety and interface to the information system (3.b,e) –gLite I/O  Support for ACLs from the file catalog and interfaced to Hydra for data encryption  Not certified in gLite 3.0. To be dismissed when all functionalities are in GFAL.

Enabling Grids for E-sciencE EGEE-II INFSO-RI Claudio Grandi, John White - LCG-MB- CERN 30 May Catalogues File Catalogs –LFC from LCG (4.3.a)  In June: interface to POOL (4.3.c)  In the summer: LFC replication and backup –Fireman  Not certified in gLite 3.0. To be dismissed when all functionalities are in LFC Hydra: stores keys for data encryption –Being interfaced to GFAL (finish by July) –Currently only one instance, but in future there will be 3 instances: at least 2 need to be available for decryption –Not yet certified in gLite 3.0. Certification will start soon AMGA Metadata Catalog: generic metadata catalogue –Joint JRA1-NA4 (ARDA) development. Used mainly by Biomed –Not yet certified in gLite 3.0. Certification will start soon

Enabling Grids for E-sciencE EGEE-II INFSO-RI Claudio Grandi, John White - LCG-MB- CERN 30 May File transfer FTS: Reliable, scalable and customizable file transfer (4.1.a,b,c,d,h,i) –Manages transfers through channels  mono-directional network pipes between two sites –Web service interface –Automatic discovery of services (addresses 4.2.e) –Support for different user and administrative roles (1.b) –Adding support for pre-staging and new proxy renewal schema (4.1.f). In the medium term add support for SRMv2 (3.b,4.1.g), delegation, VOMS-aware proxy renewal (1.d) –Status at

Enabling Grids for E-sciencE EGEE-II INFSO-RI Claudio Grandi, John White - LCG-MB- CERN 30 May Job services 1/2 WMS helps the user accessing computing resources –resource brokering, management of job input/output,... (5.c) LCG-RB: GT2 + Condor-G –To be replaced when the gLite WMS proves reliability gLite WMS: Web service (WMProxy) + Condor-G –Management of complex workflows (DAGs) and compound jobs  bulk submission and shared input sandboxes (5.d)  support for input files on different servers (scattered sandboxes) –Support for shallow resubmission of jobs –Job File Perusal: file peeking during job execution (5.i) –Supports collection of information from CEMon, BDII, R-GMA and from DLI and StorageIndex data management interfaces –Support for parallel jobs (MPI) when the home dir is not shared –Deployed for the first time on the PS with gLite 3.0

Enabling Grids for E-sciencE EGEE-II INFSO-RI Claudio Grandi, John White - LCG-MB- CERN 30 May Job services 2/2 –Short term updates: migration to GLUE 1.2 (see job priorities), bug fixing (shallow resubmission of compound jobs) (2) –Medium term updates: new Condor (increase efficiency), fixes to increase performance (5.b), job prologue/epilogue, short jobs (SDJ), improve interactive access to jobs (e.g. top, ls), UI round- robin (5.i) –Long term updates: bulk match making, high availability RB (5.a) Logging and Bookkeeping service –Tracks jobs during their lifetime (in terms of events) –LBProxy for fast access –L&B API and CLI to query jobs –Support for “CE reputability ranking“: maintains recent statistics of job failures at CE’s and feeds back to WMS to aid planning Job Provenance: long term job information (6.d) –If deployed will also help unloading the L&B –Not yet certified in gLite 3.0. Certification will start when requested by the TCG

Enabling Grids for E-sciencE EGEE-II INFSO-RI Claudio Grandi, John White - LCG-MB- CERN 30 May Job priorities GPBOX: Interface to define, store and propagate fine- grained VO policies (1.b, 5.f,h) –based on VOMS groups and roles –enforcement of policies at sites: sites may accept/reject policies –Not yet certified in gLite 3.0. Certification will start when requested by the TCG. Current plans: test job prioritization without GPBOX: 1.Mapping of VOMS groups to batch system shares (via GIDs?) 2.Two queues (long/short) for ATLAS & CMS 3.Publish info on the share in the CE GLUE 1.2 schema (VOView)  The gLite WMS is being modified to support GLUE 1.2  Testing with GPBOX if the “Service Class” is published 4.WMS match-making depending on submitter VOMS certificate  But no ranking of resources based on priority offered yet 5.Settings are not dynamic (via or CE updates) If GPBOX is needed for LHC, tests must start now!  12 months are needed to bring it to production quality

Enabling Grids for E-sciencE EGEE-II INFSO-RI Claudio Grandi, John White - LCG-MB- CERN 30 May Summary New gLite components in production –address requirements in terms of functionality and scalability –components deployed for the first time need extensive testing New gLite components ready for evaluation All Critical and High priority items in the Open Issues Twiki page are addressed or being addressed Need to uniform the “requirement lists”: too many! –unique list together with non-HEP applications –uniform requirements and make them consistent –propose to use the TCG list