EGEE is a project funded by the European Union under contract INFSO-RI-508833 Grid accounting with GridICE Sergio Fantinel, INFN LNL/PD LCG Workshop November.

Presentation transcript:

Grid accounting with GridICE
Sergio Fantinel, INFN LNL/PD
LCG Workshop, November 2004, CERN
EGEE is a project funded by the European Union under contract INFSO-RI-508833

Information & Sources

GridICE Server

Std. GRIS (port 2135) (CE, SE)
Basic info:
- Number of queues
- Jobs running/waiting (simple LRMS publish)
- Storage Areas info
- CPUSLOTS per queue

EX GRIS (port 2136) (GridICE collector node)
Extended info:
- Job Monitoring (effective VO, user & all related info)
- Disk partitions space, Network Adapters activity
- Role based (CE, SE, RB, RLS, WN,…) user defined services (daemons, agents,…)
- More… (MEM, physical CPU, swap, interrupts, reg. open files, sockets, procs, INodes, host power w/ HT detection,…)

GRIS status info:
- GRIS Service Online/Offline

Job Monitoring Info (1/2)

Each job is related to the user certificate, the VO, and the site (resource). A sample of the job-related metrics stored in the RDBMS:

General Info
  LocalID          local job identifier (given by the LRMS)
  GlobalID*        Grid identifier (EDGJobId)
  LocalOwner       local user account
  GlobalOwner      user certificate subject
  ExecutionTarget  execution host
  ExitStatus       exit status given by the LRMS

* The GlobalID (EDGJobId) is available only for jobs that remain on the LRMS for at least 10 to 20 minutes (the exact figure depends on how often the job monitoring info provider is configured to run).

Job Monitoring Info (2/2)

Each job is related to the user certificate, the VO, and the site (resource). A sample of the job-related metrics stored in the RDBMS:

Resources Usage Metering Info
  CPUTime       CPU time usage (sec)
  WallTime      time on the execution host (sec)
  CreationTime  when the job was submitted to the LRMS (timestamp)
  StartTime     when the job was started on the execution host (timestamp)
  EndTime       when the job finished (timestamp)
  RAMUsed       RAM used (KB)
  VirtualUsed   virtual memory used (KB)
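The per-job metrics listed on the two Job Monitoring Info slides can be pictured as one record per job. The sketch below is only an illustration: the Python field names, the types, the use of Unix timestamps, and the two derived helpers are assumptions, not the GridICE database schema.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class JobRecord:
    """One monitored job; comments give the metric name used on the slides."""
    # General info
    local_id: str                # LocalID: identifier given by the LRMS
    global_id: Optional[str]     # GlobalID (EDGJobId); None if not (yet) published
    local_owner: str             # LocalOwner: local user account
    global_owner: str            # GlobalOwner: user certificate subject
    execution_target: str        # ExecutionTarget: execution host
    exit_status: Optional[int]   # ExitStatus: None while the job is still running
    # Resource usage metering info
    cpu_time: int                # CPUTime, seconds
    wall_time: int               # WallTime, seconds
    creation_time: int           # CreationTime (assumed to be a Unix timestamp)
    start_time: Optional[int]    # StartTime: None while the job is still queued
    end_time: Optional[int]      # EndTime: None until the job finishes
    ram_used_kb: int             # RAMUsed, KB
    virtual_used_kb: int         # VirtualUsed, KB

    def queue_wait(self) -> Optional[int]:
        """Seconds between submission to the LRMS and start on the host."""
        if self.start_time is None:
            return None
        return self.start_time - self.creation_time

    def cpu_efficiency(self) -> Optional[float]:
        """CPU time divided by wall time; None if no wall time was recorded."""
        if self.wall_time <= 0:
            return None
        return self.cpu_time / self.wall_time
```

Keeping GlobalID optional reflects the footnote on slide 1/2: short jobs may leave the LRMS before the info provider can publish their Grid identifier.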

Info relationship: accounting info

The info can be aggregated and retrieved along different dimensions:
- per user (certificate DN)
- per site
- per VO

This means that, given a time interval (the last few hours, week, month,…), it is possible to generate graphs and/or statistics such as:
- Site usage (CPU/RAM) by a single user or an entire VO
- Total/average usage of all the resources (CPU/RAM) by a single user or VO
- Site grid usage (number of grid jobs run by the site; CPU usage,…)
- Number of distinct users that submitted jobs to the GRID (all the GRID, per site, per VO)
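The aggregation described on this slide can be sketched in a few lines. This is a minimal illustration, not the GridICE implementation: the dictionary keys, the function name `aggregate_usage`, and the sample jobs are all invented, and a real deployment would more likely run an equivalent grouped query on the RDBMS.

```python
from collections import defaultdict


def aggregate_usage(jobs, dimension, start, end):
    """Sum usage per value of `dimension` ("user", "site" or "vo")
    for the jobs that ended in the half-open interval [start, end)."""
    buckets = defaultdict(lambda: {"jobs": 0, "cpu_seconds": 0, "users": set()})
    for job in jobs:
        end_time = job["end_time"]
        if end_time is None or not (start <= end_time < end):
            continue  # still running, or outside the requested interval
        bucket = buckets[job[dimension]]
        bucket["jobs"] += 1
        bucket["cpu_seconds"] += job["cpu_time"]
        bucket["users"].add(job["user"])
    return {
        key: {
            "jobs": b["jobs"],
            "cpu_hours": b["cpu_seconds"] / 3600,
            "distinct_users": len(b["users"]),
        }
        for key, b in buckets.items()
    }


# Invented sample data, for illustration only.
SAMPLE_JOBS = [
    {"user": "/CN=User A", "vo": "cms", "site": "ba.infn.it",
     "cpu_time": 3600, "end_time": 100},
    {"user": "/CN=User B", "vo": "cms", "site": "other.example.org",
     "cpu_time": 1800, "end_time": 200},
    {"user": "/CN=User A", "vo": "lhcb", "site": "ba.infn.it",
     "cpu_time": 7200, "end_time": 300},
    {"user": "/CN=User C", "vo": "lhcb", "site": "ba.infn.it",
     "cpu_time": 900, "end_time": 900},  # falls outside the window used below
]

per_vo = aggregate_usage(SAMPLE_JOBS, "vo", start=0, end=500)
per_site = aggregate_usage(SAMPLE_JOBS, "site", start=0, end=500)
```

The same function covers all three dimensions because the user, the site and the VO are recorded on every job; only the grouping key changes.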

Screenshot from the online GridICE interface: number of jobs per VO

Number of jobs per VO
Real case (Grid.it), period 1st August to 23rd August 2004

Resources occupancy per VO
Real case (Grid.it), period 1st August to 23rd August 2004

LHCb vs. site (number of jobs)
Real case (Grid.it), period 1st August to 23rd August 2004

LHCb vs. site (resources occupancy, CPU hours)
Real case (Grid.it), period 1st August to 23rd August 2004

Reconstructed time profile per FARM: ba.infn.it and VO: LHCb

Highlights

- Each job can be associated with all the execution host metrics (load, CPU, file system, network adapter, …).
- LSF has native support; PBS and TORQUE are also supported by our info providers.
- Online usage metering: all resource usage is metered continuously from the moment the job is submitted (no need to send the local accounting DB), so the info is ready to be processed at any time.
- We record only GRID resource activity, with a single local info provider (the local site manager can also turn on the recording of local activity).
- Through the GlobalID we can:
  - Interoperate with other accounting/monitoring systems
  - Relate our collected info to L&B systems
  - Produce statistics of RB usage against resources
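As an illustration of the last point, RB usage against resources could be counted from the GlobalID alone. The sketch below rests on an assumption that is not stated in the slides: that the GlobalID (EDGJobId) is a URL whose host part names the L&B/RB server that handled the job. The function names, record keys and sample identifiers are hypothetical.

```python
from collections import Counter
from urllib.parse import urlparse


def broker_host(global_id):
    """Host part of a GlobalID, assuming it is a URL such as
    https://<broker host>:<port>/<unique string> (an assumption)."""
    return urlparse(global_id).hostname


def rb_usage_by_site(jobs):
    """Count jobs per (broker host, site) pair.

    Jobs without a GlobalID are skipped, since they cannot be related
    to a broker.
    """
    counts = Counter()
    for job in jobs:
        global_id = job.get("global_id")
        if not global_id:
            continue
        counts[(broker_host(global_id), job["site"])] += 1
    return counts


# Invented sample data, for illustration only.
SAMPLE = [
    {"global_id": "https://rb01.example.org:9000/AbCdEf", "site": "ba.infn.it"},
    {"global_id": "https://rb01.example.org:9000/GhIjKl", "site": "ba.infn.it"},
    {"global_id": "https://rb02.example.org:9000/MnOpQr", "site": "ba.infn.it"},
    {"global_id": None, "site": "ba.infn.it"},  # job left the LRMS too quickly
]

usage = rb_usage_by_site(SAMPLE)
```

The same key would serve to join GridICE job records with records from another accounting or L&B system, provided both sides store the identifier in the same form.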

Next Steps

- We will improve the web interface for obtaining accounting reports, graphs and statistics.
- We may consider sending reports to key people (GOC, CIC, ROC) on a regular basis.
- We need input to understand what information is needed (types of reports, graphs and statistics).

Experience on Data Validation

During the CMS DC04 data challenge, the data recorded by GridICE was validated against the CMS BOSS application, confirming that the acquired data was good.

Graph and analysis provided by: M. Maggi et al., INFN Bari CMS group