Presentation is loading. Please wait.

Presentation is loading. Please wait.

FAX UPDATE 12 TH AUGUST 2013. Discussion points: Developments FAX failover monitoring and issues SSB Mailing issues Panda re-brokering to FAX Monitoring.

Similar presentations


Presentation on theme: "FAX UPDATE 12 TH AUGUST 2013. Discussion points: Developments FAX failover monitoring and issues SSB Mailing issues Panda re-brokering to FAX Monitoring."— Presentation transcript:

1 FAX UPDATE 12 TH AUGUST 2013

2 Discussion points: Developments FAX failover monitoring and issues SSB Mailing issues Panda re-brokering to FAX Monitoring validation Running issues /atlas/dq2/user/gangarbt lookups Remaining issues with x509 Response times Deployed dCache versions Expansion Ilija Vukotic ivukotic@uchicago.edu 2

3 FAX FAILOVER FAX failover works. Need a way to monitor it’s effects. Pilot changed so information is collected. (Thanks to Paul for quick turnaround in debugging pilot) Several issues (message formats, message content, SSO problems) with sending info to PandaMon logger. Finally all solved. On Wednesday final pilot version sending to production server will be deployed. Ilija Vukotic ivukotic@uchicago.edu 3

4 FAX FAILOVER Needed a nice UI to investigate effects of failover and reasons why they happen. A python plugin to PandaMon written to create web pages. (thanks to Valeri F.) Can be found here: http://pandamon.cern.ch/fax/failoverhttp://pandamon.cern.ch/fax/failover Ilija Vukotic ivukotic@uchicago.edu 4

5 FAX FAILOVER Still open question: When do we want to turn on ALL the other sites. When pilot comes with rucio format file names will fallback work? Ilija Vukotic ivukotic@uchicago.edu 5

6 MAILING FROM SSB We need information on issues with FAX endpoints/redirectors sent once a day together with other mail that people do read. NOW it WORKS! Thanks to: Helmut Wolters helmut@lip.pthelmut@lip.pt Question: Are there sites that do not get/care about these mails? Ilija Vukotic ivukotic@uchicago.edu 6

7 MAILING FROM SSB From: atlas-ssb-notifications-noreply@cern.ch> Subject: [ATLAS SSB Notification] Cloud US: Daily Résumé (Fri Aug 09, 2013) Date: August 9, 2013 7:30:38 AM CDT To: atlas-support-cloud-US@cern.ch> Cc: atlas-adc-ssb-devs@listbox.cern.ch> Cloud US info: WT2 ggus 95491 State:reopened Date:2013-07-07 Info:SLACXRD failing transfers FAX: Data unreachable via parent redirector. - More info95491 State:reopened Date:2013-07-07 Info:SLACXRD failing transfersinfo MWT2 FAX: Data unreachable via parent redirector. - More infoinfo SWT2_CPB FAX: Data unreachable via parent redirector. - More infoinfo BU_ATLAS_Tier2 FAX: ATLAS role extension not enabled for access. - More infoinfo BNL-ATLAS FAX: ATLAS role extension not enabled for access. - More infoinfo US cloud savannah 138943 Date:2013-07-27 07:53 Info:"NERSC : Transfer blacklisted”138943 Date:2013-07-27 07:53 Info:"NERSC : Transfer blacklisted” Ilija Vukotic ivukotic@uchicago.edu 7

8 MONITORING VALIDATION Alexander provided a json interface to dashboard records for test files. I need to write code comparing tests runs with info from Dashboard, publish into SSB. Ilija Vukotic ivukotic@uchicago.edu 8

9 PANDA RE-BROKERING Discussed at last CERN S&C week We agreed on providing an estimate of cost to move data in WAN to PANDA, so it could re-broker jobs from very long queues to sites with free slots that have good connection to data. Cost matrix exist in SSB. Code reading it from SSB doing exponential decay smoothing runs and sends info to AGIS. Have to check scalability of AGIS bulk update. Waiting for Artem to code moving data from AGIS to schedconfig. Next step is Tadashi making use of that table from schedconfig and actually re-broker. Finally we’ll have to monitor it the same way we do with Failover. Ilija Vukotic ivukotic@uchicago.edu 9

10 RUNNING ISSUES /atlas/dq2/user/gangarbt lookups Made half of federation endpoints not accessible from upstream redirectors. will be more explained by Johannes. Remaining issues with x509 Are there any issues here or just communicating our wish to get it turned on BU, DESY-HH, FZK, LRZ-LMU, MPPMU, Freiburg, Wuppertal dCache versions We need to at least know what are deployed versions Have to plan move to 2.6. Will ask Simone to present this move as an official ATLAS request Ilija Vukotic ivukotic@uchicago.edu 10

11 Ilija Vukotic ivukotic@uchicago.edu 11 RESPONSE TIMES A number of sites does not find file when asked through latest version of xrdfs. Investigating differences between deployed xrootd versions, storage backends. Changed SSB test from “stat” call to “locate -r” call.

12 EXPANSION Australia? CC-IN2P3 ? Ilija Vukotic ivukotic@uchicago.edu 12


Download ppt "FAX UPDATE 12 TH AUGUST 2013. Discussion points: Developments FAX failover monitoring and issues SSB Mailing issues Panda re-brokering to FAX Monitoring."

Similar presentations


Ads by Google