Offline shifter training tutorial

Slides:



Advertisements
Similar presentations
CWG10 Control, Configuration and Monitoring Status and plans for Control, Configuration and Monitoring 16 December 2014 ALICE O 2 Asian Workshop
Advertisements

Clara Gaspar on behalf of the LHCb Collaboration, “Physics at the LHC and Beyond”, Quy Nhon, Vietnam, August 2014 Challenges and lessons learnt LHCb Operations.
T1 at LBL/NERSC/OAK RIDGE General principles. RAW data flow T0 disk buffer DAQ & HLT CERN Tape AliEn FC Raw data Condition & Calibration & data DB disk.
June 19, 2002 A Software Skeleton for the Full Front-End Crate Test at BNL Goal: to provide a working data acquisition (DAQ) system for the coming full.
1 Databases in ALICE L.Betev LCG Database Deployment and Persistency Workshop Geneva, October 17, 2005.
Grid and CDB Janusz Martyniak, Imperial College London MICE CM37 Analysis, Software and Reconstruction.
O FFLINE T RIGGER M ONITORING TDAQ Training 30 July 2010 On behalf of the Trigger Offline Monitoring Experts team.
If you are very familiar with SOAR, try these quick links: Principal’s SOAR checklist here here Term 1 tasks – new features in 2010 here here Term 1 tasks.
Central DQM Shift Tutorial Online/Offline. Overview of the CMS DAQ and useful terminology 2 Detector signals are collected through individual data acquisition.
Central DQM Shift Tutorial Online/Offline. Overview of the CMS DAQ and useful terminology 2 Detector signals are collected through individual data acquisition.
Central DQM Shift Tutorial Online/Offline. Overview of the CMS DAQ and useful terminology 2 Detector signals are collected through individual data acquisition.
5. Data Manager 1. Introduction 2. Data Manager Duties 3. Quality Checking 4. Problem Reporting 5. Data Monitoring 6. Histogram Presenter 7. Trend Presenter.
11 CTP Training A.Jusko, M. Krivda and R.Lietava..
Computing Infrastructure Status. LHCb Computing Status LHCb LHCC mini-review, February The LHCb Computing Model: a reminder m Simulation is using.
ALICE Roadmap for 2009/2010 Patricia Méndez Lorenzo (IT/GS) Patricia Méndez Lorenzo (IT/GS) On behalf of the ALICE Offline team Slides prepared by Latchezar.
Real data reconstruction A. De Caro (University and INFN of Salerno) CERN Building 29, December 9th, 2009ALICE TOF General meeting.
Costin Grigoras ALICE Offline. In the period of steady LHC operation, The Grid usage is constant and high and, as foreseen, is used for massive RAW and.
What is expected from ALICE during CCRC’08 in February.
Offline shifter training tutorial L. Betev February 19, 2009.
Production status Preparation for HI running ALICE TF+AF November 14, 2010.
OFFLINE TRIGGER MONITORING TDAQ Training 5 th November 2010 Ricardo Gonçalo On behalf of the Trigger Offline Monitoring Experts team.
A.Golunov, “Remote operational center for CMS in JINR ”, XXIII International Symposium on Nuclear Electronics and Computing, BULGARIA, VARNA, September,
Offline report – 7TeV data taking period (Mar.30 – Apr.6) ALICE SRC April 6, 2010.
© 2006 Cisco Systems, Inc. All rights reserved.1 Connection 7.0 Serviceability Reports Todd Blaisdell.
Run Coordination Summary of the May 14, 2004 Run Coordinator Meeting CT-EC2 Run Preparation Meeting May 17, 2004 Slide 1 Peter Loch University of Arizona.
Planning and status of the Full Dress Rehearsal Latchezar Betev ALICE Offline week, Oct.12, 2007.
CERN – Alice Offline – Thu, 20 Mar 2008 – Marco MEONI - 1 Status of Cosmic Reconstruction Offline weekly meeting.
ALICE Pixel Operational Experience R. Santoro On behalf of the ITS collaboration in the ALICE experiment at LHC.
Part I – Shifter Duties Part II – ACR environment Part III – Run Control & DAQ Part IV – Beam Part V – DCS Part VI – Data Quality Monitoring Part VII.
February 07, 2002 Online Monitoring Meeting Detector Examines Should aid in: 1.Diagnosing problems early and getting it fixed 2.Making decisions on the.
Data & Storage Services CERN IT Department CH-1211 Genève 23 Switzerland t DSS Castor incident (and follow up) Alberto Pace.
Online Monitoring for the CDF Run II Experiment T.Arisawa, D.Hirschbuehl, K.Ikado, K.Maeshima, H.Stadie, G.Veramendi, W.Wagner, H.Wenzel, M.Worcester MAR.
Vendor Bid System (VBS) Seminar. Agenda Vendor Bid System Overview Step-by-Step Advertisement Posting Editing Active Advertisements Recommended Practices.
K. Jon-And, Stockholm University1 Tilecal Team 5 meeting 8 November 2005.
LCG-LHCC mini-review ALICE Latchezar Betev Latchezar Betev for the ALICE collaboration.
Predrag Buncic CERN ALICE Status Report LHCC Referee Meeting 01/12/2015.
ALICE experiences with CASTOR2 Latchezar Betev ALICE.
CSC Shifter Training Course – Global Running Fred Borcherding Reach from CSCOperations Twiki page or directly:
How to complete and submit a Final Report through Mobility Tool+ Technical guidelines Authentication, Completion and Submission 1 Antonia Gogaki IT Officer.
ALICE Physics Data Challenge ’05 and LCG Service Challenge 3 Latchezar Betev / ALICE Geneva, 6 April 2005 LCG Storage Management Workshop.
GGUS summary (3 weeks) VOUserTeamAlarmTotal ALICE7029 ATLAS CMS LHCb Totals
ALICE Full Dress Rehearsal ALICE TF Meeting 02/08/07.
ECAL Shift Duty: A Beginners Guide By Pourus Mehta.
M4 Operations ● Operational model for M4 ● Shifts and Experts ● Documentation and Checklists ● Control Room(s) ● AOB Murrough Landon 24 July 2007.
THIS MORNING (Start an) informal discussion to -Clearly identify all open issues, categorize them and build an action plan -Possibly identify (new) contributing.
Federating Data in the ALICE Experiment
SchoolSuccess for Coordinators
Narragansett Council Online Registration
Central DQM Shift Tutorial Online/Offline
F. Bellini for the DQM core DQM meeting, 04th October 2012
Simulation Production System
WP18, High-speed data recording Krzysztof Wrona, European XFEL
Now every configuration is possible
Supplier Profile Key Data
ALICE Monitoring
LHC experiments Requirements and Concepts ALICE
Status of the CERN Analysis Facility
Patricia Méndez Lorenzo ALICE Offline Week CERN, 13th July 2007
Shift instructions August 16, 2017 Antoni Aduszkiewicz
Offline shifter training tutorial
Philippe Charpentier CERN – LHCb On behalf of the LHCb Computing Group
1 VO User Team Alarm Total ALICE ATLAS CMS
Experience between AMORE/Offline and sub-systems
Central DQM Shift Tutorial Online/Offline
Stephen Burke, PPARC/RAL Jeff Templon, NIKHEF
Offline monitoring, shifter dashboard
CTP offline meeting 16/03/2009 A.Jusko and R.Lietava
DQM for the RPC subdetector
Data Quality 2 (DQ2) & Staff Reporting Webinar
Offline framework for conditions data
Presentation transcript:

Offline shifter training tutorial L. Betev September 23, 2009

Systems and tools (separate talks) The Dashboard Outline Offline shifter basic responsibilities The shifter check list Systems and tools (separate talks) The Dashboard The Shuttle Offline Shifter Information System Event display

Basic responsibilities – RAW data The RAW data path DAQ online buffer @P2 Fast optical link to CERN CC, maximum rates: 500MB/sec (p+p), 1.25GB/sec (Pb+Pb) Step A Reduced 100 MB/sec (p+p) CASTOR2 disk buffer CASTOR2 tape buffer Step B

Step A – Online buffer -> CASTOR buffer Automatic and well-exercised (it almost never goes wrong) At this step, the files are also registered in the AliEn catalogue through a gateway DAQ is nominally responsible for the transfers Offline provides the registration gateway If not working, DAQ/SL notifies the shifter and/or the alice-shift-alarms@cern.ch expert list

Step A – Shifter responsibilities Monitors the fill of the CASTOR buffer (dashboard) Notify the run coordinator/shift leader if more than 80% full Clear disk space following instructions received from the SL Follow the registration of RAW (dashboard) All runs in PHYSICS partition are typically written to CASTOR Follow the run screen and grow suspicious if none of the runs are being registered Contact the SL and ask what is going on

Step B – CASTOR buffer -> Tape storage Selective copying of runs to tape Part of the p+p data stream (depends on the acquisition rate, max 100MB/sec) Full data stream in Pb+Pb (1.25GB/sec) The selection of runs to be copied/removed is provided by the SL Offline shifter is responsible for the copy procedure (dashboard) And for the deletion of data from the CASTOR buffer

Basic responsibilities – Shuttle Covered in Shuttle presentation Here just to put it in the context of the basic responsibilities

Basic responsibilities – event display Covered in Event Display presentation Here just to put it in the context of the basic responsibilities

Basic responsibilities – data replication After RAW is recorded to tape in CASTOR A copy is made to a remote T1 centre (out of 6 possible) for custodial storage and processing The replication is an automatic process, triggered at EoR Progress is displayed on the dashboard, the shifter follows the transfers and reports problems Presently (muon/calibration runs) – automatic replication is disabled

Basic responsibilities – offline processing pass 1 (at CERN T0) After RAW is recorded to tape in CASTOR + Shuttle is done Processing is launched automatically Progress is displayed on the dashboard Automatic processing – only for PHYSICS runs Detector calibration runs are processed on request The Offline shifter (if asked by detector groups/run coordination) collects the run numbers and writes them in the shifter report

Offline shifter check list Registration of RAW (dashboard) Periodic check of status Follow PHYSICS runs (start/stop in DAQ logbook) and registration to CASTOR Ask shift leader in case of doubt Report registration errors to on-call expert (list of experts in aloshi) Run copy and removal procedure (dashboard) Shuttle (dashboard) Follow on processing of all runs + global Shuttle messages In case of preprocessor failures, escalate to (concerned) detector shifters, note in shifter report (aloshi) In case of Shuttle failures first follow the restart/debug procedures, then report to on-call expert

Offline shifter check list (2) Data replication (dashboard) Periodic check of replication status Note ‘stuck’ runs – not replicated 12 hours after registration – in the shifter report pages and sent list to alice-shift-alarms@cern.ch Data processing pass 1 (dashboard) Periodic check of processing status Note ‘stuck’ runs – not processed 12 hours after registration – in the shifter report pages and sent list to alice-shift-alarms@cern.ch Shift report (aloshi) At end of shift – summary of the operation and noteworthy events

System Run Coordination meeting Day shifter only Attend the daily @16:33 System Run Coordination (SRC) meeting Prepare a 24-hour Offline status report Template for the report is given in aloshi

General shifter rules Before pressing the Read the procedures and rules, defined for each error type aloshi has a search feature, use it to look for similar problems and solutions Try out the remedies If all fails, inform the on-call expert

Information sources for the shifter The shifter manual – instructions Shifter interface (http://aloshi.cern.ch) Monitoring – MonALISA (http://alimonitor.cern.ch/) Dashboard Shuttle Processing and data management