Network integration with PanDA Artem Petrosyan PanDA UTA, 03.09.2013.

Slides:



Advertisements
Similar presentations
ETIS+: European Transport Policy Information System - Development and Implementation of Data Collection Methodology for EU Transport Modelling Funded by.
Advertisements

MS CRM Integration WhosOn Service Integration Presentation MS CRM User Group.
1 ECM System Monitor in the CMOD Environment. © 2013 IBM Corporation Enterprise Content Management IBM ECM System Monitor Improve Availability / Lower.
Basic concept Technologies we have used The Design Problems, challenges & solutions Educational Gain.
ERAU Weather Archive Status Update 6 October 2009.
Implementing ISA Server Caching. Caching Overview ISA Server supports caching as a way to improve the speed of retrieving information from the Internet.
LHCbPR V2 Sasha Mazurov, Amine Ben Hammou, Ben Couturier 5th LHCb Computing Workshop
MultiJob PanDA Pilot Oleynik Danila 28/05/2015. Overview Initial PanDA pilot concept & HPC Motivation PanDA Pilot workflow at nutshell MultiJob Pilot.
Distributed Databases
1 SMT On Demand Read March 12, 2012 Processes and Storyboards ‘Access, Control & Convenience’
Load Test Planning Especially with HP LoadRunner >>>>>>>>>>>>>>>>>>>>>>
LHC Experiment Dashboard Main areas covered by the Experiment Dashboard: Data processing monitoring (job monitoring) Data transfer monitoring Site/service.
Page 1 ISMT E-120 Desktop Applications for Managers Introduction to Microsoft Access.
CERN - IT Department CH-1211 Genève 23 Switzerland t Monitoring the ATLAS Distributed Data Management System Ricardo Rocha (CERN) on behalf.
ATLAS Off-Grid sites (Tier-3) monitoring A. Petrosyan on behalf of the ATLAS collaboration GRID’2012, , JINR, Dubna.
ATLAS DQ2 Deletion Service D.A. Oleynik, A.S. Petrosyan, V. Garonne, S. Campana (on behalf of the ATLAS Collaboration)
Test Of Distributed Data Quality Monitoring Of CMS Tracker Dataset H->ZZ->2e2mu with PileUp - 10,000 events ( ~ 50,000 hits for events) The monitoring.
FAX UPDATE 1 ST JULY Discussion points: FAX failover summary and issues Mailing issues Panda re-brokering to sites using FAX cost and access Issue.
FAX UPDATE 26 TH AUGUST Running issues FAX failover Moving to new AMQ server Informing on endpoint status Monitoring developments Monitoring validation.
Some Design Notes Iteration - 2 Method - 1 Extractor main program Runs from an external VM Listens for RabbitMQ messages Starts a light database engine.
Network and Transfer Metrics WG Meeting Shawn McKee, Marian Babik Network and Transfer Metrics WG Meeting 8 th April 2015.
What is Sure Stats? Sure Stats is an add-on for SAP that provides Organizations with detailed Statistical Information about how their SAP system is being.
Optimizer Deployment Centralized Database module on Optimizer hub server Each monitored server has an instance of optimizer installed.
New perfSonar Dashboard Andy Lake, Tom Wlodek. What is the dashboard? I assume that everybody is familiar with the “old dashboard”:
PanDA Monitor Development ATLAS S&C Workshop by V.Fine (BNL)
PanDA Update Kaushik De Univ. of Texas at Arlington XRootD Workshop, UCSD January 27, 2015.
Efi.uchicago.edu ci.uchicago.edu FAX status developments performance future Rob Gardner Yang Wei Andrew Hanushevsky Ilija Vukotic.
WLCG infrastructure monitoring proposal Pablo Saiz IT/SDC/MI 16 th August 2013.
Silberschatz, Galvin and Gagne ©2009 Operating System Concepts – 8 th Edition File System Implementation.
MultiJob pilot on Titan. ATLAS workloads on Titan Danila Oleynik (UTA), Sergey Panitkin (BNL) US ATLAS HPC. Technical meeting 18 September 2015.
Network awareness and network as a resource (and its integration with WMS) Artem Petrosyan (University of Texas at Arlington) BigPanDA Workshop, CERN,
EMI is partially funded by the European Commission under Grant Agreement RI SA2 – Development Tools Andres Abad Rodriguez SA2.4 Tools Activity Leader.
Efi.uchicago.edu ci.uchicago.edu FAX status report Ilija Vukotic on behalf of the atlas-adc-federated-xrootd working group S&C week Jun 2, 2014.
Biomedical Informatics Research Network BIRN Workflow Portal.
PanDA Status Report Kaushik De Univ. of Texas at Arlington ANSE Meeting, Nashville May 13, 2014.
FAX PERFORMANCE TIM, Tokyo May PERFORMANCE TIM, TOKYO, MAY 2013ILIJA VUKOTIC 2  Metrics  Data Coverage  Number of users.
PERFORMANCE AND ANALYSIS WORKFLOW ISSUES US ATLAS Distributed Facility Workshop November 2012, Santa Cruz.
Tier3 monitoring. Initial issues. Danila Oleynik. Artem Petrosyan. JINR.
Distributed Logging Facility Castor External Operation Workshop, CERN, November 14th 2006 Dennis Waldron CERN / IT.
Conclusions on Monitoring CERN A. Read ADC Monitoring1.
1 Project Status Update “Postcards from the Edge”: The Cache-and-Forward Architecture FIND Wireless BBN Sept 27, 2007 Dipankar Raychaudhuri,
FAX UPDATE 12 TH AUGUST Discussion points: Developments FAX failover monitoring and issues SSB Mailing issues Panda re-brokering to FAX Monitoring.
HPC pilot code. Danila Oleynik 18 December 2013 from.
Global ADC Job Monitoring Laura Sargsyan (YerPhI).
Pavel Nevski DDM Workshop BNL, September 27, 2006 JOB DEFINITION as a part of Production.
Accounting in DataGrid HLR software demo Andrea Guarise Milano, September 11, 2001.
May Charles Dunbar, Ben Kallal, Ankit Patel, Peter Purcell, Kody Reynolds Advisor: Tom Daniels Client: Lisa Hein-Iowa Natural Heritage Foundation.
TAG and iELSSI Progress Elisabeth Vinek, CERN & University of Vienna on behalf of the TAG developers group.
SUM like functionality with WLCG-MON Ivan Dzhunov.
ATLAS Off-Grid sites (Tier-3) monitoring A. Petrosyan on behalf of the ATLAS collaboration GRID’2012, , JINR, Dubna.
PanDA & Networking Kaushik De Univ. of Texas at Arlington ANSE Workshop, CalTech May 6, 2013.
Dynamicpartnerconnections.com Development for performance Oleksandr Katrusha, Program manager
PanDA Configurator and Network Aware Brokerage Fernando Barreiro Megino, Kaushik De, Tadashi Maeno 14 March 2015, US ATLAS Distributed Facilities Meeting,
Bigtable A Distributed Storage System for Structured Data.
CERN IT Department CH-1211 Genève 23 Switzerland t Load testing & benchmarks on Oracle RAC Romain Basset – IT PSS DP.
WLCG Transfers monitoring EGI Technical Forum Madrid, 17 September 2013 Pablo Saiz on behalf of the Dashboard Team CERN IT/SDC.
PanDA & Networking Kaushik De Univ. of Texas at Arlington UM July 31, 2013.
Development of a GIS based software
WLCG Transfers Dashboard
The Client-Server Model
Complex Geometry Visualization TOol
BDII Performance Tests
Monitoring Of XRootD Federation
Discussions on group meeting
Performance Load Testing Case Study – Agilent Technologies
Continuous Performance Engineering
You’ve lost the hope in it.
Selected Topics: External Sorting, Join Algorithms, …
Initial job submission and monitoring efforts with JClarens
Presentation transcript:

Network integration with PanDA Artem Petrosyan PanDA UTA,

AGIS side PanDA uses AGIS as information system, network metrics were put into source-destination matrix for ATLASSites in AGIS: dev.cern.ch/agis/close_sites/atlassites_links/ dev.cern.ch/agis/close_sites/atlassites_links/ RESTful API was prepared to bulk fill the data: For bootstrap AGIS was instrumented with SSB collector To transfer data to PanDA, export JSON API was prepared: dev.cern.ch/request/site/query/list_atlassites_links/?json dev.cern.ch/request/site/query/list_atlassites_links/?json 2

SchedConfig side SchedConfig controller application serves software releases updates, SchedConfig parameters, etc. Network metrics collector part implemented as part of SchedConfig controller Metrics part downloads data and mapping from AGIS, prepares and then inserts network data into SchedConfigDB 3

Metrics collected Sonar small files, deviation Sonar medium files, deviation Sonar large files, deviation Perfsonar transfer speed average Xrdcp transfer speed average Each with last update datetime Optimization ideas: – Do we need them all? 4

Tests, AGIS Executed from lxplus AGIS update metrics for all source-destination pairs, one client Full cycle - 100sec/45Hz No real bulk update, each record updated separately Keep in mind that updates can be executed from several clients Optimization ideas – Try real bulk update? – Use AGIS collectors? 5

Tests, controller Development machine voatlas142 (2 cores, 4Gb RAM) Get all data from AGIS – 2.2Mb source-destination matrix with data – 5sec – 220Kb ATLAS sites-to-PanDA sites mapping – 1sec Build panda sites source-destination matrix with data - 35sec Bulk insert into db (one transaction for all) - 20sec/325Hz Full cycle – 1min10sec Optimization ideas: – Reduce data volume by using filters to download the latest data or only desired metrics – Better work with data structures 6

How network data should be used? Raw data is data collected from sources Processed data is data after weight calculation Should we keep raw data in the database or should we calculate and keep only weights? Who, how and when will retrieve info from the database and use it for task brokerage and decision making? 7

Usage scenario Extend current brokerage implementation Start from xrdcp data For each request containing source Prepare 5 best destinations Maximum network weight should not exceed 0.5 in general brokerage formula so that these xrdcp-enabled sites to be used only when normally selected sites are unavailable 8

Status & plan Transport layer is ready: data is being delivered from AGIS to SchedConfig DB Tuning is available: – Check possibility of optimization data bulk update on AGIS side – Add filters to reduce data volume downloaded from AGIS – More sophisticated work with data structures on collector’ side Start filling PerfSonar data (waiting for green light from Ale) Web UI for monitoring network data development Move tables to production database Update SchedConfig controller on prod machines Implementation of decision making algorithm These steps can be done till next ATLAS S&C Workshop 9