Physics & Data Quality Monitoring at CMS Emilio Meschi (original design, run-control, mentoring) CL (core functionality, rules & alarms library, tech support)

Presentation transcript:

Physics & Data Quality Monitoring at CMS
Emilio Meschi (original design, run-control, mentoring)
CL (core functionality, rules & alarms library, tech support)
Dimitrios Tsirigkas (Web interface), Giulio Eulisse (Qt-GUI)
Ilaria Segoni (specialized clients, coordination with detector groups)
Online Computing, CHEP 2006 – February 13-17, 2006, Mumbai
Christos Leonidopoulos, CERN, on behalf of the CMS-DQM group

DQM: Outline
Rationale
− Build a product that various CMS groups can use (now & later); functionality that every subdetector needs: save resources
− Provide people with infrastructure (relatively) early in the game
− CMS is in “Magnet Test” mode (magnet & subdetector electronics commissioning): use “real-world” conditions to get feedback
Provide a general, homogeneous monitoring solution at CMS
− Flexibility & customization to be usable across the experiment
− Content: Trigger/Physics performance, subdetector data quality

Monitoring the High Level Trigger
“Filter Farm” at CMS runs the High Level Trigger
− 1000 dual-CPU PC farm replacing traditional L2+L3
− Input from L1: 100 kHz, Output: 150 Hz
− HLT runs all reconstruction algorithms
Monitoring needs at HLT
− “Keep an eye” on 1000 machines
  ○ HLT inputs
  ○ Physics objects
  ○ Trigger rates
− Monitoring should not slow down main application (HLT algorithm)
− Collect & process information “centrally”
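The rates quoted on this slide imply a rejection factor and a rough per-event time budget, which is what makes "monitoring should not slow down the main application" a hard constraint. The arithmetic below is only a back-of-envelope sketch: the even spreading of events over nodes and the one-event-per-CPU model are assumptions, not statements from the talk, and the function name is hypothetical.

```python
# Figures quoted on the slide.
L1_INPUT_HZ = 100_000   # Level-1 accept rate into the Filter Farm
HLT_OUTPUT_HZ = 150     # rate kept after the High Level Trigger
N_NODES = 1000          # PCs in the farm
CPUS_PER_NODE = 2       # "dual-CPU"


def hlt_budget(l1_hz=L1_INPUT_HZ, out_hz=HLT_OUTPUT_HZ,
               nodes=N_NODES, cpus=CPUS_PER_NODE):
    """Return the rejection factor and the per-CPU time budget.

    Assumption (not from the slide): events are spread evenly over the
    farm and each CPU processes one event at a time.
    """
    per_node_hz = l1_hz / nodes
    per_cpu_hz = per_node_hz / cpus
    return {
        "rejection": l1_hz / out_hz,          # events in per event kept
        "per_node_hz": per_node_hz,           # input rate seen by one PC
        "per_cpu_hz": per_cpu_hz,             # input rate seen by one CPU
        "ms_per_event": 1000.0 / per_cpu_hz,  # average time available per event
    }


budget = hlt_budget()
```

Under these assumptions each CPU sees 50 events per second, so the HLT algorithm and any monitoring code together have about 20 ms per event on average, while roughly one event in 667 is kept.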

The Big Picture
DQM principle: use same code to serve different customers
[Diagram labels: CPU; Inputs, Physics objects, Triggers, etc…; Monitoring producers; DQM infrastructure: Collectors/Servers; Monitoring consumers (clients)]

DQM from a client’s perspective
Clear separation of creation of monitoring information from collection, processing
Too much work not to share with rest of CMS!
[Diagram labels: Client; “DQM”; Monitoring information; Configuration, Reference objects, Historic plots, etc…; “Comparison-to-reference”, Collation of similar objects; Database; Tools; “Alarm”; “System ok”; links labeled TCP/IP and TCP/IP or http]

Core Features
Support for all the “usual stuff”: static and dynamic sets of objects
− 1,2,3-D histograms, 1,2-D profiles, integers, floats, strings (ROOT objects behind the scenes)
Support for unix-like directory structures
Support for “monitoring producers” (Publish) & “monitoring consumers” (Subscribe)
− Clients can subscribe to (sub)directories, or “à-la carte”
Support for root-tuples
− Create and save root-tuples with monitoring structure on the fly
Stability
− Be able to handle connecting/disconnecting producers, clients at run-time: robust behavior and support for dynamic lists of nodes
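The publish/subscribe model with unix-like directories can be illustrated with a toy collector. This is a minimal sketch, not the DQM API: the class `Collector`, its methods, and the example paths are all hypothetical, and plain Python values stand in for the ROOT objects used behind the scenes.

```python
class Collector:
    """Toy stand-in for a DQM collector/server (hypothetical names)."""

    def __init__(self):
        self._objects = {}        # full path -> latest published value
        self._subscriptions = {}  # client name -> set of subscribed paths

    def publish(self, path, value):
        """A monitoring producer publishes (or updates) one object."""
        self._objects[path] = value

    def subscribe(self, client, path):
        """Subscribe to a whole (sub)directory or to a single object."""
        self._subscriptions.setdefault(client, set()).add(path.rstrip("/"))

    def view(self, client):
        """Everything currently published that the client subscribed to."""
        wanted = self._subscriptions.get(client, set())
        return {
            path: value
            for path, value in self._objects.items()
            # exact match = "a-la-carte" object, prefix match = directory
            if any(path == w or path.startswith(w + "/") for w in wanted)
        }


collector = Collector()
collector.publish("HLT/Muons/pt", [0, 3, 5])
collector.publish("HLT/Electrons/et", [1, 1, 0])
collector.publish("Tracker/occupancy", 0.02)

collector.subscribe("shifter", "HLT/Muons")         # a directory
collector.subscribe("expert", "Tracker/occupancy")  # one object

# Objects published later show up for existing subscribers.
collector.publish("HLT/Muons/eta", [2, 2, 2])
```

Because subscriptions are matched against whatever is published at the time of the query, producers can appear and disappear at run time without the client list being rebuilt, which is the kind of dynamic behaviour the "Stability" item asks for.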

Tools
“Soft-reset”: reset t < t₀ contents of monitoring content
− Does not permanently erase contents
“Accumulate”: sum up contents over multiple monitoring periods
“Collate”: add multiple monitoring data
− Sum-up (same-format) contributions from different sources
− Interface supports search-strings with wildcards (?, *)
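One way to picture "soft-reset" and "collate" is sketched below. This is an illustration under stated assumptions rather than the DQM implementation: the class `Histogram`, the function `collate`, and the node names are hypothetical, soft-reset is modelled as subtracting a remembered baseline, and wildcard matching uses Python's `fnmatch` rules (where `*` also matches `/`).

```python
from fnmatch import fnmatchcase


class Histogram:
    """Toy 1-D histogram holding only bin contents (hypothetical)."""

    def __init__(self, n_bins):
        self.bins = [0] * n_bins       # everything filled so far
        self._baseline = [0] * n_bins  # contents at the last soft-reset

    def fill(self, i, weight=1):
        self.bins[i] += weight

    def soft_reset(self):
        """Hide contents filled before now (t < t0) without erasing them."""
        self._baseline = list(self.bins)

    def undo_soft_reset(self):
        self._baseline = [0] * len(self.bins)

    def visible(self):
        """Contents as a client sees them after the soft-reset."""
        return [b - r for b, r in zip(self.bins, self._baseline)]


def collate(store, pattern):
    """Sum same-format histograms whose names match a wildcard pattern."""
    matches = [h for name, h in store.items() if fnmatchcase(name, pattern)]
    if not matches:
        raise KeyError(f"nothing matches {pattern!r}")
    if len({len(h.bins) for h in matches}) != 1:
        raise ValueError("histograms must have the same number of bins")
    return [sum(column) for column in zip(*(h.bins for h in matches))]


store = {f"node0{i}/muon_pt": Histogram(3) for i in (1, 2, 3)}
store["node01/muon_pt"].fill(0)
store["node02/muon_pt"].fill(0)
store["node03/muon_pt"].fill(2, weight=4)
total = collate(store, "node0?/muon_pt")
```

"Accumulate" would be the same element-wise sum applied to one source over successive monitoring periods instead of over several sources.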

More toys: Web interface
Select information to visualize on the fly…

Web interface in (real) action
“Monitoring producer” (and collector): CERN
“Monitoring consumers” (clients): one at CERN, one in Florida (US)
You are looking at a web browser running in a Florida office
Live cosmic test data for end-cap muon detector

Even more toys: Qt GUI

Qt GUI in (real) action
Cosmic test data for calorimeter detector (reading from file)

More Tools: Quality Tests
Library with “rules” for assigning “quality” value to tests
− Comparison to reference (χ², Kolmogorov tests)
− Contents within range ([x_min, x_max], [y_min, y_max])
− Exact match
− Mean of (e.g. gaussian) distribution “near” expected value
− Flat occupancy
− Etc…
“Alarm” library
− Warnings & error messages should propagate to all clients downstream
− Group monitoring in sets with links to status, messages
− Create intuitive interface for quick problem spotting (see next slide)
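A few of the listed rules are simple enough to sketch in plain Python. The functions below are illustrative assumptions, not the DQM rules library: all names are hypothetical, histograms are represented as lists of bin contents, the χ² shown is a plain Pearson sum that assumes observed and reference histograms share the same normalization, and the warning/error thresholds are invented for the example.

```python
def fraction_in_range(contents, y_min, y_max):
    """'Contents within range': fraction of bins with y_min <= content <= y_max."""
    inside = sum(1 for c in contents if y_min <= c <= y_max)
    return inside / len(contents)


def histogram_mean(centers, contents):
    """Content-weighted mean of the bin centers."""
    total = sum(contents)
    return sum(x * c for x, c in zip(centers, contents)) / total


def mean_near(centers, contents, expected, tolerance):
    """'Mean near expected value' rule."""
    return abs(histogram_mean(centers, contents) - expected) <= tolerance


def chi2_per_bin(observed, reference):
    """'Comparison to reference': Pearson chi-square per populated reference bin."""
    terms = [(o - e) ** 2 / e for o, e in zip(observed, reference) if e > 0]
    return sum(terms) / len(terms)


def status(quality, warn_below=0.9, error_below=0.5):
    """Map a quality value in [0, 1] to a status string (thresholds invented)."""
    if quality < error_below:
        return "ERROR"
    if quality < warn_below:
        return "WARNING"
    return "OK"


occupancy = [5, 6, 7, 100]                  # one hot channel
quality = fraction_in_range(occupancy, 0, 10)
occupancy_status = status(quality)
```

Returning a numeric quality first and mapping it to a status afterwards keeps the rule itself separate from the alarm policy, so the same rule can feed warnings and errors with different thresholds per detector group.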

Organizing tests, results
[Screenshot: “Data Integrity Checks vs. L1 Accept. Number” panels, each with a “Compare to reference” ON/OFF switch; messages shown with “Move to Hidden Warnings” and “Take Action” controls:]
− ERROR1: L1 Acc Increment =0 (5%)
− WARNING1: L1 Acc Increment =2 (10%)
− WARNING1: DDU Trailer Problems (7%)
− STATUS OK

Time measurements
− 50-bin ROOT histograms (floats)
− 1 Gb/s Ethernet connection
− Mini-farm prototype at CMS site
− ROOT v b
[Plot annotations: “=6.0±0.2 ms”, “N = 1000 histograms”]
Could use this info to adjust update rate on the fly…
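For scale, the raw payload of such a test can be estimated and the idea of adjusting the update rate made concrete. The sketch below rests on several assumptions that are not in the slide: 4 bytes per float bin, no ROOT object or protocol overhead, the reading of the quoted 6.0 ms as the time to ship one update of 1000 histograms, and an invented 1% ceiling on the time spent shipping. Function names are hypothetical.

```python
def wire_time_ms(n_bins, bytes_per_bin, n_histograms, link_bits_per_s):
    """Ideal time to push the bare bin contents through the link, in ms."""
    payload_bits = n_bins * bytes_per_bin * n_histograms * 8
    return 1000.0 * payload_bits / link_bits_per_s


def min_update_period_ms(transfer_ms, max_busy_fraction):
    """Shortest update period keeping transfers below a given time fraction."""
    if not 0.0 < max_busy_fraction <= 1.0:
        raise ValueError("max_busy_fraction must be in (0, 1]")
    return transfer_ms / max_busy_fraction


# 1000 histograms x 50 float bins over 1 Gb/s Ethernet.
ideal_ms = wire_time_ms(n_bins=50, bytes_per_bin=4,
                        n_histograms=1000, link_bits_per_s=1e9)

# Assumed reading of the slide: one update takes 6.0 ms.
# Invented policy: spend at most 1% of the time shipping monitoring data.
period_ms = min_update_period_ms(transfer_ms=6.0, max_busy_fraction=0.01)
```

The bare bin contents would need about 1.6 ms on an ideal link, so a measured figure of a few milliseconds would be dominated by per-object overhead rather than bandwidth; with the assumed 1% policy the update period would come out at about 0.6 s.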

Summary
DQM: a homogeneous monitoring solution for CMS
− Content: Trigger/Physics performance, subdetector data quality
− Environment:
  ○ “HLT” processes in Filter Farm
  ○ Monitoring processes fed by “live streams” or local DAQ
  ○ Batch jobs (potentially “production” validation)
Makes use of general framework and services
− ROOT
− Transfer protocols: TCP/IP, http
− Tools for processing of monitoring information
− Visualization: Web & Qt-GUI
Ongoing development
− Database requirements & interface design
− Organization of alarms & quick problem spotting
− Components for client customization

Backup Slides

High Level Trigger: Event Filter Farm

Subdetectors & The Magnet Test
All subdetector groups have set up test sites (cosmic-ray tests, radioactive sources, laser beams) where monitoring programs are used (custom or not)
All subdetector groups either plan to port/are porting their monitoring code or are already using the DQM infrastructure

Rationale for additional interface
Question: Why not give users direct access to ROOT objects?
An abstract interface does not bind the user access methods to a specific analysis framework. In our particular case, the transfer mechanism and the “true” format (ROOT) could change in the future without breaking the customized programs.
Having an abstract interface that hides the raw monitoring data from the user is good OO practice. The set of allowed operations on the monitoring objects should be defined by the (abstract) user interface, not by the framework used for the implementation.
Additional functionality can be added to the monitoring objects (e.g. alarms) without directly inheriting from ROOT classes.
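The argument for an abstract interface can be sketched as follows. This is a minimal illustration in Python, not the DQM class hierarchy: the names `MonitorElement`, `ListBackedH1`, and `check_not_empty` are hypothetical, and a plain list stands in for the ROOT histogram that would sit behind the interface in practice.

```python
from abc import ABC, abstractmethod


class MonitorElement(ABC):
    """The only operations user code is allowed to rely on (hypothetical)."""

    def __init__(self, name):
        self.name = name
        self.alarm = None  # extra functionality lives here, not in the backend

    @abstractmethod
    def fill(self, x, weight=1.0):
        """Add one entry."""

    @abstractmethod
    def entries(self):
        """Sum of weights inside the histogram range."""

    @abstractmethod
    def reset(self):
        """Clear the contents."""


class ListBackedH1(MonitorElement):
    """One possible backend; a ROOT-backed class could replace it unchanged."""

    def __init__(self, name, n_bins, lo, hi):
        super().__init__(name)
        self._lo, self._hi = lo, hi
        self._bins = [0.0] * n_bins

    def fill(self, x, weight=1.0):
        if self._lo <= x < self._hi:  # out-of-range entries are dropped here
            n = len(self._bins)
            i = int((x - self._lo) / (self._hi - self._lo) * n)
            self._bins[min(i, n - 1)] += weight

    def entries(self):
        return sum(self._bins)

    def reset(self):
        self._bins = [0.0] * len(self._bins)


def check_not_empty(element: MonitorElement):
    """Client code written against the interface, unaware of the backend."""
    element.alarm = None if element.entries() > 0 else f"{element.name}: no entries"
    return element.alarm


pt = ListBackedH1("HLT/Muons/pt", n_bins=10, lo=0.0, hi=100.0)
first_alarm = check_not_empty(pt)   # still empty at this point
pt.fill(25.0)
pt.fill(250.0)                      # outside [0, 100): ignored
second_alarm = check_not_empty(pt)
```

The client function never touches the storage, so swapping the backend or the transfer mechanism would leave it untouched, and the alarm attribute is added at the interface level without inheriting from any backend class.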

DQM documentation
Release notes
Documentation
Archive with presentations from DQM group:
DQM status for subdetector groups
Draft on DQM “requirement & design”
CMS note: Under preparation