MND review. Main directions of work  Development and support of the Experiment Dashboard Applications - Data management monitoring - Job processing monitoring.

Slides:



Advertisements
Similar presentations
1 User Analysis Workgroup Update  All four experiments gave input by mid December  ALICE by document and links  Very independent.
Advertisements

WLCG Monitoring Consolidation NEC`2013, Varna Julia Andreeva CERN IT-SDC.
Copyright 2009 FUJITSU TECHNOLOGY SOLUTIONS PRIMERGY Servers and Windows Server® 2008 R2 Benefit from an efficient, high performance and flexible platform.
A tool to enable CMS Distributed Analysis
Client/Server Grid applications to manage complex workflows Filippo Spiga* on behalf of CRAB development team * INFN Milano Bicocca (IT)
Analysis demos from the experiments. Analysis demo session Introduction –General information and overview CMS demo (CRAB) –Georgia Karapostoli (Athens.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES News on monitoring for CMS distributed computing operations Andrea.
LHC Experiment Dashboard Main areas covered by the Experiment Dashboard: Data processing monitoring (job monitoring) Data transfer monitoring Site/service.
CERN - IT Department CH-1211 Genève 23 Switzerland t Monitoring the ATLAS Distributed Data Management System Ricardo Rocha (CERN) on behalf.
Status of WLCG Tier-0 Maite Barroso, CERN-IT With input from T0 service managers Grid Deployment Board 9 April Apr-2014 Maite Barroso Lopez (at)
CERN IT Department CH-1211 Geneva 23 Switzerland t The Experiment Dashboard ISGC th April 2008 Pablo Saiz, Julia Andreeva, Benjamin.
CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services GS group meeting Monitoring and Dashboards section Activity.
Enabling Grids for E-sciencE Overview of System Analysis Working Group Julia Andreeva CERN, WLCG Collaboration Workshop, Monitoring BOF session 23 January.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks VO-specific systems for the monitoring of.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Julia Andreeva CERN (IT/GS) CHEP 2009, March 2009, Prague New job monitoring strategy.
CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Job Monitoring for the LHC experiments Irina Sidorova (CERN, JINR) on.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES PhEDEx Monitoring Nicolò Magini CERN IT-ES-VOS For the PhEDEx.
ATLAS in LHCC report from ATLAS –ATLAS Distributed Computing has been working at large scale Thanks to great efforts from shifters.
And Tier 3 monitoring Tier 3 Ivan Kadochnikov LIT JINR
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks WMSMonitor: a tool to monitor gLite WMS/LB.
PanDA Monitor Development ATLAS S&C Workshop by V.Fine (BNL)
Enabling Grids for E-sciencE System Analysis Working Group and Experiment Dashboard Julia Andreeva CERN Grid Operations Workshop – June, Stockholm.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Overview of STEP09 monitoring issues Julia Andreeva, IT/GS STEP09 Postmortem.
Dashboard program of work Julia Andreeva GS Group meeting
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
Julia Andreeva, CERN IT-ES GDB Every experiment does evaluation of the site status and experiment activities at the site As a rule the state.
Grid Deployment Enabling Grids for E-sciencE BDII 2171 LDAP 2172 LDAP 2173 LDAP 2170 Port Fwd Update DB & Modify DB 2170 Port.
WLCG Monitoring Roadmap Julia Andreeva, CERN , WLCG workshop, CERN.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Ricardo Rocha CERN (IT/GS) EGEE’08, September 2008, Istanbul, TURKEY Experiment.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks MSG - A messaging system for efficient and.
PanDA Status Report Kaushik De Univ. of Texas at Arlington ANSE Meeting, Nashville May 13, 2014.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
XROOTD AND FEDERATED STORAGE MONITORING CURRENT STATUS AND ISSUES A.Petrosyan, D.Oleynik, J.Andreeva Creating federated data stores for the LHC CC-IN2P3,
SAM Sensors & Tests Judit Novak CERN IT/GD SAM Review I. 21. May 2007, CERN.
Testing and integrating the WLCG/EGEE middleware in the LHC computing Simone Campana, Alessandro Di Girolamo, Elisa Lanciotti, Nicolò Magini, Patricia.
ATP Future Directions Availability of historical information for grid resources: It is necessary to store the history of grid resources as these resources.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks CRAB: the CMS tool to allow data analysis.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
CERN IT Department CH-1211 Genève 23 Switzerland t HEPiX Conference, ASGC, Taiwan, Oct 20-24, 2008 The CASTOR SRM2 Interface Status and plans.
Julia Andreeva on behalf of the MND section MND review.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES Andrea Sciabà Hammercloud and Nagios Dan Van Der Ster Nicolò Magini.
Conclusions on Monitoring CERN A. Read ADC Monitoring1.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI User-centric monitoring of the analysis and production activities within.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Monitoring of the LHC Computing Activities Key Results from the Services.
CERN IT Department CH-1211 Genève 23 Switzerland t CERN IT Monitoring and Data Analytics Pedro Andrade (IT-GT) Openlab Workshop on Data Analytics.
Global ADC Job Monitoring Laura Sargsyan (YerPhI).
FTS monitoring work WLCG service reliability workshop November 2007 Alexander Uzhinskiy Andrey Nechaevskiy.
GridView - A Monitoring & Visualization tool for LCG Rajesh Kalmady, Phool Chand, Kislay Bhatt, D. D. Sonvane, Kumar Vaibhav B.A.R.C. BARC-CERN/LCG Meeting.
Enabling Grids for E-sciencE CMS/ARDA activity within the CMS distributed system Julia Andreeva, CERN On behalf of ARDA group CHEP06.
Mardi 8 mars 2016 Status of new features in CIC Portal Latest Release of 22/08/07 Osman Aidel, Hélène Cordier, Cyril L’Orphelin, Gilles Mathieu IN2P3/CNRS.
Enabling Grids for E-sciencE Experience Supporting the Integration of LHC Experiments Computing Systems with the LCG Middleware Simone.
LCG Issues from GDB John Gordon, STFC WLCG MB meeting September 28 th 2010.
WLCG Transfers Dashboard A unified monitoring tool for heterogeneous data transfers. Alexandre Beche.
Predrag Buncic (CERN/PH-SFT) CernVM Status. CERN, 24/10/ Virtualization R&D (WP9)  The aim of WP9 is to provide a complete, portable and easy.
CERN - IT Department CH-1211 Genève 23 Switzerland t IT-GD-OPS attendance to EGEE’09 IT/GD Group Meeting, 09 October 2009.
MND section. Summary of activities Job monitoring In collaboration with GridView and LB teams enabled full chain from LB harvester via MSG to Dashboard.
CMS Experience with the Common Analysis Framework I. Fisk & M. Girone Experience in CMS with the Common Analysis Framework Ian Fisk & Maria Girone 1.
Acronyms GAS - Grid Acronym Soup, LCG - LHC Computing Project EGEE - Enabling Grids for E-sciencE.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The Dashboard for Operations Cyril L’Orphelin.
WLCG Accounting Task Force Update Julia Andreeva CERN GDB, 8 th of June,
Using HLRmon for advanced visualization of resource usage Enrico Fattibene INFN - CNAF ISCG 2010 – Taipei March 11 th, 2010.
1 DIRAC Project Status A.Tsaregorodtsev, CPPM-IN2P3-CNRS, Marseille 10 March, DIRAC Developer meeting.
Site notifications with SAM and Dashboards Marian Babik SDC/MI Team IT/SDC/MI 12 th June 2013 GDB.
WLCG Transfers monitoring EGI Technical Forum Madrid, 17 September 2013 Pablo Saiz on behalf of the Dashboard Team CERN IT/SDC.
Daniele Bonacorsi Andrea Sciabà
Key Activities. MND sections
POW MND section.
Experiment Dashboard overviw of the applications
Monitoring of the infrastructure from the VO perspective
Presentation transcript:

MND review

Main directions of work  Development and support of the Experiment Dashboard Applications - Data management monitoring - Job processing monitoring - Monitoring of the distributed sites and services  Development and support of the high level view monitoring applications showing LHC distributed computing in action : - Dashboard Google Earth - SiteView  Development and support of the Dashboard framework. Dashboard framework is used not only by the Dashboard applications but by other projects as well. ES Group Meeting

Progress report. Experiment Dashboard applications  Usage of all Experiment Dashboard applications by the LHC experiments is steadily growing. Example: For CMS server # unique visitors from March to April increased more than 20% Volume of accessed data from March to April increased increased more than 30% Same trend for other Dashboard servers  Several application are shared by all LHC experiments : Dashboard SAM portal, Site Status Board ES Group Meeting

4  Prove to scale well though the load on the servers had considerably increased after start of data taking. In some cases when scalability issues were discovered, they were promptly addressed (see next slides)  New functionality requested by the experiments is enabled quickly (see next slides).  Started to work on migration of the Dashboard SAM portal to the new SAM structure Progress report. Experiment Dashboard applications

Progress report. Performance and scalability improvements  ATLAS DDM example During ATLAS reprocessing week in April, ATLAS was moving 3 times more files than during STEP09, while STEP was already 1.5 times higher than foreseen in ATLAS computing model (in terms of files). This sudden unforeseen increase caused delay in statistics generation in ATLAS DDM Dashboard. Statistics generation procedures were modified to use bulk operations and to run logically independent components in parallel. This allowed to solve the problem and cope with high file transfer rate.  CMS Job monitoring example Due to modification of the CRAB server configuration, update rate of the Dashboard job monitoring DB had increased 3-4 times compared to situation in 2009 and beginning of The Dashboard DB collectors were modified to run in multithreaded mode which allowed to handle much higher update rate without delay.  Site Status Board (SSB) SSB performance was considerably improved by running collectors in parallel and by optimizing DB access ES Group Meeting

Progress report. New functionality  On request of the CMS Analysis Support team new Analysis support UI and new API for accessing job failure information had been developed  Added new distributions for CPU and wall clock consumption and CPU efficiency to the historical job monitoring application  Enabled information flow for job monitoring data from GANGA to Dashboard via MSG. Next step is to instrument PANDA jobs in order to get complete picture of ATLAS job processing in Dashboard and to adapt for ATLAS all job monitoring applications developed for CMS  Started migration of the current Dashboard SAM portal to the new SAM structure ES Group Meeting

Experiment Dashboard applications. Future development  Continue to develop generic job monitoring which can be used by any VO. Currently prototyped for ATLAS. Use MSG as a message bus. This includes common library for data reporting, common job monitoring schema, data collectors and set of user interfaces.  Continue migration of the SAM Dashboard portal to the new SAM structure. We call new portal Site Usability portal.  Develop flexible statistics display for ATLAS DDM which will have several additional views and filters. Use existing backend, leverage new frontend using jQuery/JSON  Integrate SSB with Google calendar for site/service downtime  Enable reporting of ALICE data transfer information to MSG. This would allow to make this data available for GridView. ES Group Meeting

Progress report. High level view of the LHC computing activities on the WLCG infrastructure.  Dashboard Google Earth monitor proved to play an important role as a dissemination tool. ES Group Meeting TeV Media Event ATLAS CMS CCC and Openlab

Dashboard Google Earth monitor was demonstrated ES Group Meeting  Event organized by CERN for schoolchildren in Meyrin  WLCG booth at the EGEE User Forum  WLCG demo at The Third International Scalable Computing Challenge (SCALE 2010)  And of course CCC

Dashboard Google Earth clients (last month) ES Group Meeting  CERN  PIC  DESY  JINR  SARA  U. Toronto  U. Melbourne  Merck (the pharmaceutical company!)  Plus connections from ISPs from: New Zealand, Romania, Portugal, Norway, Finland, France,...

Dashboard Google Earth development  Two major releases were done during last two months Main improvements: -Changed the algorithm for computing trajectory of data links shown at the screen. -Enable caching inside the collector is order to increase the performance and decrease load on the information source. -Make collectors safe from the exceptions coming from the VO- specific information sources. -Fix logging of various components  Future development: Code consolidation - Faster - More robust - More generic in order to enable it for other VOs if required ES Group Meeting

Dashboard framework  Migrated from cvs to svn Future plans -Support for SLC5 and multiple versions of python -Create ‘Dashboard host management’ tool ES Group Meeting

Summary  Experiment Dashboard proved to become an important instrument used by the LHC experiments in offline distributed computing. Usage of the Dashboard applications is steadily growing. Current effort aims to provide reliable Dashboard service and to improve it’s performance and scalability.  Some of Dashboard applications are generic and can be used other user communities  Dashboard Google Earth became an important dissemination tool presented at various events to show LHC distributed computing in action ES Group Meeting