FAX UPDATE 12TH AUGUST 2013

Presentation transcript:


Discussion points:
Developments
  FAX failover monitoring and issues
  SSB mailing issues
  Panda re-brokering to FAX
  Monitoring validation
Running issues
  /atlas/dq2/user/gangarbt lookups
  Remaining issues with x509
  Response times
  Deployed dCache versions
Expansion
Ilija Vukotic

FAX FAILOVER
FAX failover works.
We need a way to monitor its effects.
The pilot was changed so that this information is collected. (Thanks to Paul for the quick turnaround in debugging the pilot.)
There were several issues (message formats, message content, SSO problems) with sending info to the PandaMon logger. All are now solved.
On Wednesday the final pilot version, sending to the production server, will be deployed.

FAX FAILOVER
We needed a good UI to investigate the effects of failovers and the reasons why they happen.
A Python plugin to PandaMon was written to create the web pages. (Thanks to Valeri F.)
Can be found here:

FAX FAILOVER
Still open questions:
When do we want to turn on ALL the other sites?
When the pilot comes with Rucio-format file names, will fallback still work?

MAILING FROM SSB
We need information on issues with FAX endpoints/redirectors sent once a day, together with other mail that people do read.
NOW it WORKS! Thanks to Helmut Wolters.
Question: Are there sites that do not get or do not care about these mails?

MAILING FROM SSB
From:
Subject: [ATLAS SSB Notification] Cloud US: Daily Résumé (Fri Aug 09, 2013)
Date: August 9, :30:38 AM CDT
To:
Cc:

Cloud US info:
  WT2
    ggus 95491 State:reopened Date: Info: SLACXRD failing transfers
    FAX: Data unreachable via parent redirector. - More info
  MWT2
    FAX: Data unreachable via parent redirector. - More info
  SWT2_CPB
    FAX: Data unreachable via parent redirector. - More info
  BU_ATLAS_Tier2
    FAX: ATLAS role extension not enabled for access. - More info
  BNL-ATLAS
    FAX: ATLAS role extension not enabled for access. - More info
US cloud savannah
  Date: :53 Info: "NERSC : Transfer blacklisted"
  Date: :53 Info: "NERSC : Transfer blacklisted"

MONITORING VALIDATION
Alexander provided a JSON interface to the Dashboard records for test files.
I need to write code that compares test runs with the info from the Dashboard and publishes the result into SSB.
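The comparison described on this slide could look roughly like the sketch below. It is only an illustration: the record fields ("site", "file"), the sample payload, and the rule "every successful test read should have a matching Dashboard record" are assumptions, not the actual Dashboard JSON schema or the code that was written.

```python
import json


def find_missing_records(test_results, dashboard_records):
    """Return (site, file) pairs that were read successfully by the tests
    but have no matching record in the Dashboard.

    test_results: dict mapping (site, file) -> True/False (read succeeded).
    dashboard_records: list of dicts with hypothetical keys "site" and "file".
    """
    seen = {(rec["site"], rec["file"]) for rec in dashboard_records}
    return sorted(key for key, ok in test_results.items() if ok and key not in seen)


# Hypothetical payload standing in for the JSON interface output.
payload = '[{"site": "MWT2", "file": "test1.root"}]'
records = json.loads(payload)

# Hypothetical outcome of one test run.
tests = {
    ("MWT2", "test1.root"): True,       # read OK and reported
    ("SWT2_CPB", "test1.root"): True,   # read OK but not reported
    ("BNL-ATLAS", "test1.root"): False, # read failed, so no record is expected
}

missing = find_missing_records(tests, records)
```

The list of missing pairs is what would then be turned into a per-site status for SSB.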

PANDA RE-BROKERING
Discussed at the last CERN S&C week.
We agreed to provide Panda with an estimate of the cost of moving data over the WAN, so that it can re-broker jobs from very long queues to sites that have free slots and a good connection to the data.
The cost matrix exists in SSB.
Code that reads it from SSB, applies exponential decay smoothing, and sends the info to AGIS is running.
Have to check the scalability of the AGIS bulk update.
Waiting for Artem to code moving the data from AGIS to schedconfig.
Next step is for Tadashi to make use of that table from schedconfig and actually re-broker.
Finally, we will have to monitor it the same way we do with failover.
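As an illustration of the exponential decay smoothing step, here is a minimal sketch. The smoothing factor `alpha`, the dictionary layout keyed by (source, destination), and the sample numbers are assumptions made for this example. The slide does not say how the production code stores or weights the values.

```python
def smooth_costs(previous, latest, alpha=0.3):
    """Exponentially smooth a cost matrix.

    previous: dict mapping (source, destination) -> smoothed cost so far.
    latest:   dict mapping (source, destination) -> newest measurement.
    alpha:    weight of the newest measurement (assumed value, 0 < alpha <= 1).
    Links seen for the first time simply take the new measurement.
    """
    smoothed = dict(previous)
    for link, value in latest.items():
        if link in smoothed:
            smoothed[link] = alpha * value + (1 - alpha) * smoothed[link]
        else:
            smoothed[link] = value
    return smoothed


# Made-up numbers, only to show the update rule.
old = {("MWT2", "SWT2_CPB"): 10.0}
new = {("MWT2", "SWT2_CPB"): 20.0, ("BNL-ATLAS", "MWT2"): 5.0}
result = smooth_costs(old, new, alpha=0.5)
```

A larger `alpha` makes the published cost follow recent measurements more closely. A smaller one damps short-lived fluctuations, which matters if brokering decisions should not flip on a single bad transfer.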

RUNNING ISSUES
/atlas/dq2/user/gangarbt lookups
  Made half of the federation endpoints inaccessible from upstream redirectors.
  Johannes will explain in more detail.
Remaining issues with x509
  Are there any issues here, or is it just a matter of communicating our wish to get it turned on?
  BU, DESY-HH, FZK, LRZ-LMU, MPPMU, Freiburg, Wuppertal
dCache versions
  We need to at least know which versions are deployed.
  Have to plan the move to 2.6.
  Will ask Simone to present this move as an official ATLAS request.

RESPONSE TIMES
A number of sites do not find the file when asked through the latest version of xrdfs.
Investigating differences between deployed xrootd versions and storage backends.
Changed the SSB test from a “stat” call to a “locate -r” call.
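To make the difference between the two probes concrete, the sketch below builds the corresponding xrdfs command lines and times one call. The endpoint and file path are placeholders, the 60-second timeout is an assumed value, and this is not the actual SSB test code.

```python
import subprocess
import time


def probe_command(endpoint, path, use_locate=True):
    """Build the xrdfs command for the probe.

    use_locate=True  -> "locate -r" (asks the server to refresh its location cache)
    use_locate=False -> "stat"
    """
    action = ["locate", "-r"] if use_locate else ["stat"]
    return ["xrdfs", endpoint, *action, path]


def timed_probe(endpoint, path, use_locate=True, timeout=60):
    """Run the probe and return (succeeded, elapsed_seconds).

    Requires the xrdfs client to be installed. A probe that exceeds the
    timeout (an assumed value) is reported as failed.
    """
    start = time.monotonic()
    try:
        done = subprocess.run(
            probe_command(endpoint, path, use_locate),
            capture_output=True, text=True, timeout=timeout,
        )
        ok = done.returncode == 0
    except subprocess.TimeoutExpired:
        ok = False
    return ok, time.monotonic() - start


# Placeholder endpoint and path, for illustration only.
cmd = probe_command("redirector.example.org:1094", "/atlas/path/to/testfile")
```

Calling `timed_probe` once per endpoint gives both the found or not found answer and the response time that this slide is concerned with.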

EXPANSION
Australia?
CC-IN2P3?