DDM FAX Dashboard status and future Luca Magnoni IT/SDC 2 nd June 2014.

Slides:



Advertisements
Similar presentations
Summer Student presentation Changing Dashboard build system to Bamboo Robert Varga IT/SDC
Advertisements

Testing as a Service with HammerCloud Ramón Medrano Llamas CERN, IT-SDC
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES News on monitoring for CMS distributed computing operations Andrea.
LHC Experiment Dashboard Main areas covered by the Experiment Dashboard: Data processing monitoring (job monitoring) Data transfer monitoring Site/service.
Summary of issues and questions raised. FTS workshop for experiment integrators Summary of use  Generally positive response on current state!  Now the.
CERN - IT Department CH-1211 Genève 23 Switzerland t Monitoring the ATLAS Distributed Data Management System Ricardo Rocha (CERN) on behalf.
LHCC Comprehensive Review – September WLCG Commissioning Schedule Still an ambitious programme ahead Still an ambitious programme ahead Timely testing.
Status of WLCG Tier-0 Maite Barroso, CERN-IT With input from T0 service managers Grid Deployment Board 9 April Apr-2014 Maite Barroso Lopez (at)
Input from CMS Nicolò Magini Andrea Sciabà IT/SDC 5 July 2013.
ATLAS Off-Grid sites (Tier-3) monitoring A. Petrosyan on behalf of the ATLAS collaboration GRID’2012, , JINR, Dubna.
OSG Operations and Interoperations Rob Quick Open Science Grid Operations Center - Indiana University EGEE Operations Meeting Stockholm, Sweden - 14 June.
CERN IT Department CH-1211 Geneva 23 Switzerland t The Experiment Dashboard ISGC th April 2008 Pablo Saiz, Julia Andreeva, Benjamin.
Enabling Grids for E-sciencE Overview of System Analysis Working Group Julia Andreeva CERN, WLCG Collaboration Workshop, Monitoring BOF session 23 January.
Network and Transfer WG Metrics Area Meeting Shawn McKee, Marian Babik Network and Transfer Metrics Kick-off Meeting 26 h November 2014.
News from the HEPiX IPv6 Working Group David Kelsey (STFC-RAL) GridPP35, Liverpool 11 Sep 2015.
CMS STEP09 C. Charlot / LLR LCG-DIR 19/06/2009. Réunion LCG-France, 19/06/2009 C.Charlot STEP09: scale tests STEP09 was: A series of tests, not an integrated.
Monitoring the Grid at local, national, and Global levels Pete Gronbech GridPP Project Manager ACAT - Brunel Sept 2011.
CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Job Monitoring for the LHC experiments Irina Sidorova (CERN, JINR) on.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES PhEDEx Monitoring Nicolò Magini CERN IT-ES-VOS For the PhEDEx.
1 Andrea Sciabà CERN Towards a global monitoring system for CMS computing Lothar A. T. Bauerdick Andrea P. Sciabà Computing in High Energy and Nuclear.
Marian Babik, Luca Magnoni SAM Test Framework. Outline  SAM Test Framework  Update on Job Submission Timeouts  Impact of Condor and direct CREAM tests.
Network and Transfer Metrics WG Meeting Shawn McKee, Marian Babik Network and Transfer Metrics WG Meeting 8 th April 2015.
Enabling Grids for E-sciencE System Analysis Working Group and Experiment Dashboard Julia Andreeva CERN Grid Operations Workshop – June, Stockholm.
Network and Transfer Metrics WG Meeting Shawn McKee, Marian Babik perfSONAR Operations Sub-group 22 nd October 2014.
PanDA Update Kaushik De Univ. of Texas at Arlington XRootD Workshop, UCSD January 27, 2015.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Overview of STEP09 monitoring issues Julia Andreeva, IT/GS STEP09 Postmortem.
Efi.uchicago.edu ci.uchicago.edu FAX status developments performance future Rob Gardner Yang Wei Andrew Hanushevsky Ilija Vukotic.
MW Readiness WG Update Andrea Manzi Maria Dimou Lionel Cons 10/12/2014.
1 User Analysis Workgroup Discussion  Understand and document analysis models  Best in a way that allows to compare them easily.
Julia Andreeva, CERN IT-ES GDB Every experiment does evaluation of the site status and experiment activities at the site As a rule the state.
Automated Grid Monitoring for LHCb Experiment through HammerCloud Bradley Dice Valentina Mancinelli.
Network awareness and network as a resource (and its integration with WMS) Artem Petrosyan (University of Texas at Arlington) BigPanDA Workshop, CERN,
Xrootd Monitoring and Control Harsh Arora CERN. Setting Up Service  Monalisa Service  Monalisa Repository  Test Xrootd Server  ApMon Module.
PanDA Status Report Kaushik De Univ. of Texas at Arlington ANSE Meeting, Nashville May 13, 2014.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
HammerCloud Functional tests Valentina Mancinelli IT/SDC 28/2/2014.
XROOTD AND FEDERATED STORAGE MONITORING CURRENT STATUS AND ISSUES A.Petrosyan, D.Oleynik, J.Andreeva Creating federated data stores for the LHC CC-IN2P3,
ATP Future Directions Availability of historical information for grid resources: It is necessary to store the history of grid resources as these resources.
Julia Andreeva on behalf of the MND section MND review.
Andrea Manzi CERN On behalf of the DPM team HEPiX Fall 2014 Workshop DPM performance tuning hints for HTTP/WebDAV and Xrootd 1 16/10/2014.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Monitoring of the LHC Computing Activities Key Results from the Services.
MND review. Main directions of work  Development and support of the Experiment Dashboard Applications - Data management monitoring - Job processing monitoring.
Import XRootD monitoring data from MonALISA Sergey Belov, JINR, Dubna DNG section meeting,
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Data Management Highlights in TSA3.3 Services for HEP Fernando Barreiro Megino,
FTS monitoring work WLCG service reliability workshop November 2007 Alexander Uzhinskiy Andrey Nechaevskiy.
GridView - A Monitoring & Visualization tool for LCG Rajesh Kalmady, Phool Chand, Kislay Bhatt, D. D. Sonvane, Kumar Vaibhav B.A.R.C. BARC-CERN/LCG Meeting.
1 A Scalable Distributed Data Management System for ATLAS David Cameron CERN CHEP 2006 Mumbai, India.
WLCG Information System Use Cases Review WLCG Operations Coordination Meeting 18 th June 2015 Maria Alandes IT/SDC.
Enabling Grids for E-sciencE INFSO-RI Enabling Grids for E-sciencE Gavin McCance GDB – 6 June 2007 FTS 2.0 deployment and testing.
WLCG Operations Coordination Andrea Sciabà IT/SDC 10 th July 2013.
WLCG critical services update Andrea Sciabà WLCG operations coordination meeting December 18, 2014.
CMS: T1 Disk/Tape separation Nicolò Magini, CERN IT/SDC Oliver Gutsche, FNAL November 11 th 2013.
WLCG Transfers Dashboard A unified monitoring tool for heterogeneous data transfers. Alexandre Beche.
ATLAS Off-Grid sites (Tier-3) monitoring A. Petrosyan on behalf of the ATLAS collaboration GRID’2012, , JINR, Dubna.
Streaming Analytics with Spark 1 Magnoni Luca IT-CM-MM 09/02/16EBI - CERN meeting.
SAM Status Update Piotr Nyczyk LCG Management Board CERN, 5 June 2007.
XRootD Monitoring Report A.Beche D.Giordano. Outlines  Talk 1: XRootD Monitoring Dashboard  Context  Dataflow and deployment model  Database: storage.
INFSO-RI Enabling Grids for E-sciencE GOCDB Requirements John Gordon, STFC.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The Dashboard for Operations Cyril L’Orphelin.
WLCG Accounting Task Force Update Julia Andreeva CERN GDB, 8 th of June,
Activities and Perspectives at Armenian Grid site The 6th International Conference "Distributed Computing and Grid- technologies in Science and Education"
Site notifications with SAM and Dashboards Marian Babik SDC/MI Team IT/SDC/MI 12 th June 2013 GDB.
WLCG Transfers monitoring EGI Technical Forum Madrid, 17 September 2013 Pablo Saiz on behalf of the Dashboard Team CERN IT/SDC.
WLCG Operations Coordination Andrea Sciabà IT/SDC GDB 11 th September 2013.
The Grid Information System Maria Alandes Pradillo IT-SDC White Area Lecture, 4th June 2014.
Daniele Bonacorsi Andrea Sciabà
WLCG Transfers Dashboard
FTS Monitoring Ricardo Rocha
Experiment Dashboard overviw of the applications
Discussions on group meeting
Presentation transcript:

DDM FAX Dashboard status and future Luca Magnoni IT/SDC 2 nd June 2014

News  Alexandre left  Thanks for all the work!  New people take over:  Luca Magnoni  Marian Babik  Domenico Giordano (continues Data Popularity)  Cristovao Cordeiro  Collaboration from Dubna and Brunel university 2nd June 2014 DDM FAX dashboard / status update 2

The XRootD Dashboard project  Dashboard project as common ground for monitoring miscellaneous data transfer services  XRootD, FTS, etc.  Same architecture and code /different instances:  ATLAS FAX, CMS AAA, (LHCb) 2nd June 2014 DDM FAX dashboard / status update 3

Architecture (for XRootD ) 2nd June 2014 DDM FAX dashboard / status update 4

FAX dashboard  dashb-atlas-xrootd-transfers.cern.ch  Twiki:   Jira:   Mailing List:  2nd June 2014 DDM FAX dashboard / status update 5

Open Issue / Solution  Issue: Topology resolution  Incorrect accounting for sites on the same domain (e.g. in2p3.fr)  Solution: improve host/domain to site mapping and rely on server-provided site-name  Work already started (Cristovao)  Servers have to report site-name correctly 2nd June 2014 DDM FAX dashboard / status update 6

Open Issue/Solution II  Issue: EOS data missing  EOS data already collected  Statistics generation is the bottleneck  PL/SQL procedure latency proportional to raw data volume  Not matter which statistic method used  Solution: promising results on new storage/processing technology  HDFS/MapReduce  Scale horizontally  ready for other transfer services/protocols  In the framework of the WLCG Analytics project :  Temporary fix:  Try to optimize the PL/SQL procedure, if possible 2nd June 2014 DDM FAX dashboard / status update 7

Open issue/Solution III  Issue: Transfer accounting for for Multi- VO sites  Static mapping between XRootD server/Gled collector/broker topic  Solution: GLED-mix with multi VO filtering in autumn  Others: missing concepts of failures  from XRootD reporting 2nd June 2014 DDM FAX dashboard / status update 8

On data consistency  Comparison between Monalisa and DDM FAX dashboard was done for the last week ( thanks Igor, from Dubna )  very good overall data consistency  (< 1-5% diff)  few exceptions, with major discrepancies  Site not reporting on f-stream  detailed result herehere  Comparison can be easily automated  we are working on a comparison web tool for systematic validation 2nd June 2014 DDM FAX dashboard / status update 9

On the raw data access  No plan to support programmatic way to access raw data  Planning to offer snapshot for data mining  Files  DB dump 2nd June 2014 DDM FAX dashboard / status update 10

Now, Your turn!  What you would like to get from the tool?  Missing data?  If it is on the reporting side, we cannot do much!  If you experience issues, let us know 2nd June 2014 DDM FAX dashboard / status update 11