Report on Installed Resource Capacity Flavia Donno CERN/IT-GS WLCG GDB, CERN 10 December 2008.

Slides:



Advertisements
Similar presentations
Storage: Futures Flavia Donno CERN/IT WLCG Grid Deployment Board, CERN 8 October 2008.
Advertisements

Storage Issues: the experiments’ perspective Flavia Donno CERN/IT WLCG Grid Deployment Board, CERN 9 September 2008.
EGEE SA1 Operations Workshop Stockholm, 13-15/06/2007 Enabling Grids for E-sciencE Service Level Agreement Metrics SLA SA1 Working Group Łukasz Skitał.
A conceptual model of grid resources and services Authors: Sergio Andreozzi Massimo Sgaravatto Cristina Vistoli Presenter: Sergio Andreozzi INFN-CNAF Bologna.
Storage Accounting John Gordon, STFC GDB June 2012.
Africa & Arabia ROC tutorial The GSTAT2 Grid Monitoring tool Mario Reale GARR - Italy ASREN-JUNET Grid School - 24 November 2011 Africa & Arabia ROC Tutorial.
BINP/GCF Status Report BINP LCG Site Registration Oct 2009
Status report on SRM v2.2 implementations: results of first stress tests 2 th July 2007 Flavia Donno CERN, IT/GD.
SRM 2.2: tests and site deployment 30 th January 2007 Flavia Donno, Maarten Litmaath IT/GD, CERN.
SRM 2.2: status of the implementations and GSSD 6 th March 2007 Flavia Donno, Maarten Litmaath INFN and IT/GD, CERN.
CERN IT Department CH-1211 Geneva 23 Switzerland t Storageware Flavia Donno CERN WLCG Collaboration Workshop CERN, November 2008.
Maarten Litmaath (CERN), GDB meeting, CERN, 2006/02/08 VOMS deployment Extent of VOMS usage in LCG-2 –Node types gLite 3.0 Issues Conclusions.
CERN Using the SAM framework for the CMS specific tests Andrea Sciabà System Analysis WG Meeting 15 November, 2007.
GLUE 2 Open Issues in Storage Information Providers 16 th May 2014.
1 User Analysis Workgroup Discussion  Understand and document analysis models  Best in a way that allows to compare them easily.
INFSO-RI Enabling Grids for E-sciencE Enabling Grids for E-sciencE Pre-GDB Storage Classes summary of discussions Flavia Donno Pre-GDB.
WLCG Grid Deployment Board, CERN 11 June 2008 Storage Update Flavia Donno CERN/IT.
1 Andrea Sciabà CERN Critical Services and Monitoring - CMS Andrea Sciabà WLCG Service Reliability Workshop 26 – 30 November, 2007.
CERN SRM Development Benjamin Coutourier Shaun de Witt CHEP06 - Mumbai.
Jens G Jensen RAL, EDG WP5 Storage Element Overview DataGrid Project Conference Heidelberg, 26 Sep-01 Oct 2003.
Storage Accounting John Gordon, STFC GDB March 2013.
Automatic Resource & Usage Monitoring Steve Traylen/Flavia Donno CERN/IT.
SAM Sensors & Tests Judit Novak CERN IT/GD SAM Review I. 21. May 2007, CERN.
DataTAG is a project funded by the European Union DataTAG WP4 meeting, Bologna 29/07/2003 – n o 1 GLUE Schema - Status Report DataTAG WP4 meeting Bologna,
Testing and integrating the WLCG/EGEE middleware in the LHC computing Simone Campana, Alessandro Di Girolamo, Elisa Lanciotti, Nicolò Magini, Patricia.
Service Availability Monitor tests for ATLAS Current Status Tests in development To Do Alessandro Di Girolamo CERN IT/PSS-ED.
Report from GSSD Storage Workshop Flavia Donno CERN WLCG GDB 4 July 2007.
LCG Accounting Update John Gordon, CCLRC-RAL WLCG Workshop, CERN 24/1/2007 LCG.
WLCG Grid Deployment Board CERN, 14 May 2008 Storage Update Flavia Donno CERN/IT.
Report on Installed Resource Capacity Flavia Donno CERN/IT-GS WLCG Management Board, CERN 25 November 2008.
Handling of T1D0 in CCRC’08 Tier-0 data handling Tier-1 data handling Experiment data handling Reprocessing Recalling files from tape Tier-0 data handling,
SRM v2.2 Production Deployment SRM v2.2 production deployment at CERN now underway. – One ‘endpoint’ per LHC experiment, plus a public one (as for CASTOR2).
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES Andrea Sciabà Ideal information system - CMS Andrea Sciabà IS.
INFSO-RI Enabling Grids for E-sciencE Enabling Grids for E-sciencE Storage Element Model and Proposal for Glue 1.3 Flavia Donno,
Definitions Information System Task Force 8 th January
CMS: T1 Disk/Tape separation Nicolò Magini, CERN IT/SDC Oliver Gutsche, FNAL November 11 th 2013.
Grid Deployment Board 5 December 2007 GSSD Status Report Flavia Donno CERN/IT-GD.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks GLUE Schema Configuration for SRM 2.2 Stephen.
1 SRM v2.2 Discussion of key concepts, methods and behaviour F. Donno CERN 11 February 2008.
The Grid Storage System Deployment Working Group 6 th February 2007 Flavia Donno IT/GD, CERN.
SAM Status Update Piotr Nyczyk LCG Management Board CERN, 5 June 2007.
Installation Accounting Status Flavia Donno CERN/IT-GS WLCG Management Board, CERN 28 October 2008.
SRM 2.2: experiment requirements, status and deployment plans 6 th March 2007 Flavia Donno, INFN and IT/GD, CERN.
SAM architecture EGEE 07 Service Availability Monitor for the LHC experiments Simone Campana, Alessandro Di Girolamo, Nicolò Magini, Patricia Mendez Lorenzo,
DGAS Distributed Grid Accounting System INFN Workshop /05/1009, Palau Giuseppe Patania Andrea Guarise 6/18/20161.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI GLUE 2: Deployment and Validation Stephen Burke egi.eu EGI OMB March 26 th.
LCG Accounting Update John Gordon, CCLRC-RAL 10/1/2007.
WLCG Information System Status Maria Alandes Pradillo, CERN CERN IT Department, Support for Distributed Computing Group GDB 9 th September 2015.
John Gordon EMI TF and EGI CF March 2012 Accounting Workshop.
Accounting Update John Gordon. Outline Multicore CPU Accounting Developments Cloud Accounting Storage Accounting Miscellaneous.
Storage Accounting John Gordon STFC GDB, Lyon 6 th April2011 GDB January 2012.
Maria Alandes Pradillo, CERN Training on GLUE 2 information validation EGI Technical Forum September 2013.
Implementation of GLUE 2.0 support in the EMI Data Area Elisabetta Ronchieri on behalf of JRA1’s GLUE 2.0 Working Group INFN-CNAF 13 April 2011, EGI User.
SRM v2.2: service availability testing and monitoring SRM v2.2 deployment Workshop - Edinburgh, UK November 2007 Flavia Donno IT/GD, CERN.
The Grid Information System Maria Alandes Pradillo IT-SDC White Area Lecture, 4th June 2014.
Accounting Review Summary and action list from the (pre)GDB Julia Andreeva CERN-IT WLCG MB 19th April
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI MPI VT report OMB Meeting 28 th February 2012.
Status of the SRM 2.2 MoU extension
Flavia Donno CERN GSSD Storage Workshop 3 July 2007
Short term improvements to the Information System: a status report
Towards GLUE Schema v Sergio Andreozzi – INFN/CNAF sergio
SRM2 Migration Strategy
Proposal for obtaining installed capacity
Summary from last MB “The MB agreed that a detailed deployment plan and a realistic time scale are required for deploying glexec with setuid mode at WLCG.
GLUE 2 Support in gLite Data Management
The INFN Tier-1 Storage Implementation
A conceptual model of grid resources and services
New Types of Accounting Beyond CPU
Stephen Burke egi.eu EGI TF Prague September 20th 2012
Site availability Dec. 19 th 2006
Presentation transcript:

Report on Installed Resource Capacity Flavia Donno CERN/IT-GS WLCG GDB, CERN 10 December 2008

Outline Details about reporting installed capacity through the Information System for: Computing Storage The new version of the document is v1.8 publicly available A few more comments received. They will be integrated in the next days Operational plan will be prepared starting in February 2009 WLCG GDB, CERN 10 December

Goals Publishing information in order to provide the WLCG management with a view of the total installed capacity and resource usage by VOs at sites. Focus on four main use cases: Publishing information to provide the WLCG management with a view of the total installed capacity at a site. Publishing information to provide the WLCG and VO management with a view of the resource assignment per VO at a site Publishing information to allow VO operators to monitor the VO usage of the resources at a site Allowing clients (consumers of the information published) to select the correct resources and their characteristics WLCG GDB, CERN 10 December

Organization of the document It includes an explanatory part where all involved Glue Schema v1.3 classes and attributes are explicitly described. Appendixes with specific instructions and examples. Logically organized in two parts. Computing (theory + appendix) Storage (theory + appendix to come) Explanation of commonly published attributes and recommendations Specific directions to publish specific Glue classes and attributes in order to satisfy the requirements stated in the goals. WLCG GDB, CERN 10 December

Computing Resources Main Glue Classes. They are mandatory for WLCG sites CE Cluster <- SubCluster (Host) SubClusters: Are mostly hetherogeneous For WMS matchmaking it is recommended to publish homogeneous queues using one Cluster and one SubCluster(Host) Mandatory SubCluster attributes: PhysicalCPUs - Total number of real CPUs/physical chips in the Subcluster (includes offline/down nodes) LogicalCPUs – Total number of cores/hyperthreaded CPUs in the SubCluster (includes offline/down nodes) BenchmarkSI00 – Average SpecInt2000 rating per LogicalCPU WLCG GDB, CERN 10 December

Computing Resources Other recommended SubCluster attributes: ProcessorModel ProcessorSpeed SMPSize – number of Logical CPUs (cores) per WN ProcessorOtherDescription: Cores= RAMSize – Total physical memory of a WN in the SubCluster VirtualSize – Total virtual memory of a WN in the SubCluster CE attributes to be published AssignedJobSlots – Total number of assigned Job Slots in the queue available at a given moment CECapability: CPUScalingReferenceSI00= the CPU SI00 for which the published GlueCEMaxCPUTimes are valid CECapability: Share= : ; This value is used to express a specific VO share if VO shares are in operation (this attribute is not mandatory). Published only for WLCG VOs. Value between 0 and 100. The sum of the values over all WLCG VOs <= 100. WLCG GDB, CERN 10 December

Computing Resources The installed computing capacity at a site is then expressed by the following formula: WLCG GDB, CERN 10 December

Computing Resources: notes for admins When a new head node is added at a site, YAIM configures automatically a new Cluster and SubCluster associated, causing double counting of resources. For the newly created Clusters and SubClusters insisting on the same resources, site administrators MUST set the GlueSubClusterPhysicalCPUs and GlueSubClusterLogicalCPUs attributes to 0. Explicit instructions on how to set YAIM variables are given in section A.2 of Appendix A in the document. An explicit example of information published by a site with correction is given in section A.3 of Appendix A in the document. WLCG GDB, CERN 10 December

Storage Resources Main Glue Classes. They are mandatory for WLCG sites SE SA <- VOInfo Service; SEControlProtocol; SEAccessProtocol SE: UniqueID – The value SHOULD be the FQDN of the SRM host. One to one mapping between SE and SRM Service Status – Production, Closed, Draining, Queueing ImplementationName, ImplementationVersion Architecture – disk, multidisk, tape, other SizeTotal, SizeFree, TitalOnlineSize and UsedOnlineSize (GB) WLCG GDB, CERN 10 December

Storage Resources AccessProtocol LocalID Type – Must be set to one of the allowed type. If secure, GSI MUST be published in SupportedSecurity Version – Version of the protocol ControlProtocol (MUST be published): Endpoint – The value MUST be the full SOAP endpoint for SRM: httpg://srm.example.com:8443/srm/managerv2 Type – SRM Version – or WLCG GDB, CERN 10 December

Storage Resources StorageArea (SA): describes a logical view of a portion of physical space that can include disks and tape resources. It can correspond to SRM static space reservations or not. SAs MAY overlap. One SA can be shared by several VOs. LocalID AccessControlBaseRule – VO: or VOMS: Reserved[Online|Nearline]Size (GB) – Space statically reserved to a VO. 0 if SA does not represent SRM reserved space Total[Online|Nearline]Size (GB) – Less than or equal to ReservedSpace iff ReservedSpace > 0 Used[Online|Nearline]Size (GB) – Space occupied by available and accessible files not candidates for garbage collection. Used <= Total. Free[Online|Nearline]Size (GB) – Total – Used. WLCG GDB, CERN 10 December

Storage Resources StorageArea: the Capability special attribute Installed[Online|Nearline]Capacity= (GB) – MUST be published. Introduced for accounting purposes only. It expresses the size of the physical space of a Storage Area. It can be 0 for overlapping SAs. scratch – Storage Area is of type T0D0 stage – Storage Area is used for staging activities only WLCG GDB, CERN 10 December

Storage Resources VOInfo: describes VO specific atributes of a Storage Area. One SA can be associated to one or more VOInfo objects. VOInfo represents the ability of a client/end-user to write into a particular Storage Area with a set of parameters specified by the VOInfo attributes. LocalID – MUST be set AccessControlBaseRule – VO: or VOMS: VOInfoTag – Space Token Description VOInfoPath– It describes the default path an application can use to write into the associated SA. WLCG GDB, CERN 10 December

Storage Capacity WLCG GDB, CERN 10 December The installed storage online capacity at a site is then expressed by the following formula: Installed Online Capacity = ( ∑ WLCG GlueSA GlueSACapability(InstalledOnlineCapacitySize) The installed storage nearline capacity at a site is then expressed by the following formula: Installed Nearline Capacity = ( ∑ WLCG GlueSA GlueSACapability(InstalledNearlineCapacitySize)

Validation procedures Gstat: Computing resources sanity checks in preparation Storage resources sanity checks already available but not yet in production More accurate validation from executing jobs at sites SAM tests WNCM JobWrapper scripts Reports published WLCG GDB, CERN 10 December

Installed Capacity: status WLCG GDB, CERN 10 December The latest version of the document (v1.8-1) is published with all details, operational instructions for site administrators and explicative examples (OSG missing) ges/WLCG_GlueSchemaUsage-1.8.pdf Technical agreement on the essential content of the the document : OSG requested and was accorded one more month to digest the proposals. Minor changes are still expected Operative instructions and examples from OSG needed

Installed Capacity: plan WLCG GDB, CERN 10 December Operational plan in preparation. We will start in February 2009 as soon as document formally approved. Precise instructions for site admins Examples Local test suites Coordination for deployment Many site admins ask for support. Support group needed. Client tools: evaluating the effort for conformance to the document Gstat sanity checks. GridMAP is being adapted. Client tools gfal/lcg-utils will improve resource selection algorithm with next releases. Reviewing SAM tests Nothing yet done for APEL OSG ?

Installed Capacity: plan WLCG GDB, CERN 10 December Coordination with WN working group Includes changes in YAIM and specific recommendations Storage is mostly automatic GRIF and RAL have new info providers for DPM and CASTOR. dCache and StoRM expected in January. dCache requires info about nearline storage to be set by hands. gstat sanity checks for storage ensure coherence of published information. Experience is needed in order to better plan A few sites are already publishing according to the document. This is really useful.