Farida Naz Andrea Sciabà

Slides:



Advertisements
Similar presentations
Accounting Update Dave Kant Grid Deployment Board Nov 2007.
Advertisements

:: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: :: GridKA School 2009 MPI on Grids 1 MPI On Grids September 3 rd, GridKA School 2009.
Maarten Litmaath (CERN), GDB meeting, CERN, 2006/02/08 VOMS deployment Extent of VOMS usage in LCG-2 –Node types gLite 3.0 Issues Conclusions.
Enabling Grids for E-sciencE SGE J. Lopez, A. Simon, E. Freire, G. Borges, K. M. Sephton All Hands Meeting Dublin, Ireland 12 Dec 2007 Batch system support.
1 Andrea Sciabà CERN Critical Services and Monitoring - CMS Andrea Sciabà WLCG Service Reliability Workshop 26 – 30 November, 2007.
Portal Update Plan Ashok Adiga (512)
CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Job Priorities update Andrea Sciabà IT/GS Ulrich Schwickerath IT/FIO.
USATLAS deployment We currently use VOMS Role based authorization in production within USATLAS. In the VO we have defined 4 groups/roles that satisfy our.
Certification and test activity ROC/CIC Deployment Team EGEE-SA1 Conference, CNAF – Bologna 05 Oct
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Update Authorization Service Christoph Witzig,
ATLAS intentions on Job Priorities Simone Campana.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks SA3 partner collaboration tasks & process.
INFSO-RI Enabling Grids for E-sciencE Policy management and fair share in gLite Andrea Guarise HPDC 2006 Paris June 19th, 2006.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks gLite configuration (plans) Robert Harakaly.
TP: Grid site installation BEINGRID site installation.
INFSO-RI Enabling Grids for E-sciencE gLite Test and Certification Effort Nick Thackray CERN.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE Operations: Evolution of the Role of.
Installation Accounting Status Flavia Donno CERN/IT-GS WLCG Management Board, CERN 28 October 2008.
II EGEE conference Den Haag November, ROC-CIC status in Italy
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks CREAM: current status and next steps EGEE-JRA1.
Job Priorities and Resource sharing in CMS A. Sciabà ECGI meeting on job priorities 15 May 2006.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Simone Campana (CERN) Job Priorities: status.
WLCG Operations Coordination Andrea Sciabà IT/SDC GDB 11 th September 2013.
Virtual Organization Management Registration Service (VOMRS) T. Levshina J. Weigand S. White Co-Authors: L. Bauerdick, G. Carcassi, I. Fisk, A. Heavey,
CREAM Status and plans Massimo Sgaravatto – INFN Padova
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI MPI VT report OMB Meeting 28 th February 2012.
Status of the SL5 migration ALICE TF Meeting
Applications Area News
CEMon
NGI and Site Nagios Monitoring
Xiaomei Zhang CMS IHEP Group Meeting December
David Bouvet Fabio Hernandez IN2P3 Computing Centre - Lyon
The EMT Oliver Keeble, SA3 CERN.
VOs and ARC Florido Paganelli, Lund University
Security aspects of the CREAM-CE
Andreas Unterkircher CERN Grid Deployment
Workload Management System
Summary on PPS-pilot activity on CREAM CE
Bulk production of Monte Carlo
GDB 8th March 2006 Flavia Donno IT/GD, CERN
CREAM Status and Plans Massimo Sgaravatto – INFN Padova
Global Banning List and Authorization Service
OMII evaluation: Preliminary results Current status of T6
Accounting at the T1/T2 Sites of the Italian Grid
Moving from CREAM CE to ARC CE
The CREAM CE: When can the LCG-CE be replaced?
Grid2Win: Porting of gLite middleware to Windows XP platform
UI Installation and Configuration
AAA from HEP* Perspective
The Workload Management System
Update on gLite WMS tests
CREAM-CE/HTCondor site
1 VO User Team Alarm Total ALICE ATLAS CMS
QoS in the Tier1 batch system(LSF)
Grid Deployment Board meeting, 8 November 2006, CERN
Short update on the latest gLite status
Future Test Activities SA3 All Hands Meeting Dublin
Glexec/SCAS Pilot: IN2P3-CC status
Artem Trunov, Günter Quast EKP – Uni Karlsruhe
Developments in Batch and the Grid
LCG Job Reliability Julia Andreeva, Benjamin Gaidioz, Juha Herrala, Birger Koblitz, Massimo Lamanna, Pablo Saiz, Andrea Sciaba`
Discussions on group meeting
Francesco Giacomini – INFN JRA1 All-Hands Nikhef, February 2008
The New Virtual Organization Membership Service (VOMS)
From Prototype to Production Grid
Argus: General Introduction
gLite Job Management Christos Theodosiou
WMS+LB Server Installation and Configuration
Site availability Dec. 19 th 2006
The LHCb Computing Data Challenge DC06
Presentation transcript:

Farida Naz Andrea Sciabà Job Priorities update Farida Naz Andrea Sciabà Grid Deployment Board February 6, 2008

Status of testing on PPS The JP mechanism has been installed on a CERN PPS CE lxb1937.cern.ch gLite 3.1 LCG CE on SLC4 PBS, Torque Server on SLC3 three queues with VOViews attached short with 6 VOViews VO:cms, DENY:/cms/Role=lcgadmin, DENY:/cms/Role=production VOMS:/cms/Role=lcgadmin VOMS:/cms/Role=production VO:atlas, DENY:/atlas/Role=lcgadmin, DENY:/atlas/Role=production VOMS:/atlas/Role=lcgadmin VOMS:/atlas/Role=production cms with 1 VOView atlas with 1 VOView Job submission with a CMS certificate using a gLite 3.1 WMS

Tests performed Goals Method Verify that the jobs are submitted to the queues where the FQAN in the proxy is authorized Verify that the number of waiting and running jobs are correctly reported in the VOView information Verify that the WMS takes the information used to calculate the rank from the correct VOView Method Submit jobs with different VOMS FQANs and check

Results All checks eventually succeeded It took some time to make things work Some misconfigurations The experience will come useful to prepare the documentation

Configuration Preliminary documentation in a Wiki page https://twiki.cern.ch/twiki/bin/view/EGEE/JPTest The CE was configured by an administrator (Farida) with no previous experience with job priorities stuff Important to understand the problems a site administrator could run into Notes Need to define FQANVOVIEWS=yes in YAIM Need to define for each queue a variable {queue}_GROUP_ENABLE listing the VOViews (1-1 correspondence with FQANs for now) whose value is something like {queue)_GROUP_ENABLE="atlas /VO=atlas/GROUP=/atlas/ROLE=production /VO=atlas/GROUP=/atlas/ROLE=lcgadmin cms /VO=cms/GROUP=/cms/ROLE=lcgadmin /VO=cms/GROUP=/cms/ROLE=production" Need to install two rpms with respect to the latest gLite release lcg-info-dynamic-scheduler-pbs-2.0.0-1 lcg-info-dynamic-scheduler-generic-2.1.0-1 Without doing so, the number of running and waiting jobs published was wrong

Still to do Repeat the tests on a LSF CE Test with 2 or more VOs at the same time Probably things will work, but… Check that the fair share assignment works But this is really a batch system configuration thing Prepare the documentation for the release Including clear examples for the batch system configuration

Conclusions The Job Priorities mechanism has been successfully tested on the PPS at CERN The certification process can start The documentation will be extremely important Unthinkable to spend 2-3 days per site to debug misconfigurations, if the probability of a misconfiguration is more than a few percent!