PIPE Dreams Trouble Shooting Network Performance for Production Science Data Grids Presented by Warren Matthews at CHEP’03, San Diego March 24-28, 2003.

Slides:



Advertisements
Similar presentations
Web100 at SLAC Presented at the Web100 Workshop, Boulder, CO, August 2002.
Advertisements

HEPiX Edinburgh 28 May 2004 LCG les robertson - cern-it-1 Data Management Service Challenge Scope Networking, file transfer, data management Storage management.
1 High Performance Active End-to- end Network Monitoring Les Cottrell, Connie Logg, Warren Matthews, Jiri Navratil, Ajay Tirumala – SLAC Prepared for the.
1 IEPM-BWIEPM-BW Warren Matthews (SLAC) Presented at the UCL Monitoring Infrastructure Workshop, London, May 15-16, 2003.
MAGGIE Monitoring and Analysis for the Global Grid and Internet End-to-end performance Warren Matthews (SLAC) Presented at the Measurement SIG ESCC/Internet2.
1 End-to-end Monitoring of High Performance Network Paths Les Cottrell, Connie Logg, Jerrod Williams SLAC, for the ESCC meeting, Columbus Ohio, July 2004.
1 SLAC Site Report By Les Cottrell for UltraLight meeting, Caltech October 2005.
1 Traceanal: a tool for analyzing and representing traceroutes Les Cottrell, Connie Logg, Ruchi Gupta, Jiri Navratil SLAC, for the E2Epi BOF, Columbus.
1 Internet End-to-end Monitoring Project at SLAC Les Cottrell, Connie Logg, Jerrod Williams, Gary Buhrmaster Site visit to SLAC by DoE program managers.
1 SLAC Internet Measurement Data Les Cottrell, Jerrod Williams, Connie Logg, Paola Grosso SLAC, for the ISMA Workshop, SDSC June,
CAIDA Bandwidth Estimation Meeting San Diego June 2002 R. Hughes-Jones Manchester 1 EU DataGrid - Network Monitoring Richard Hughes-Jones, University of.
MAGGIE NIIT- SLAC On Going Projects Measurement & Analysis of Global Grid & Internet End to end performance.
1 Terapaths: Datagrid WAN Network Monitoring Infrastructure Les Cottrell, Connie Logg, Jerrod Williams SLAC, for the DoE 2004 PI Network Research Meeting,
1 IEPM-BW a new network/application throughput performance measurement infrastructure Les Cottrell – SLAC Presented at the GGF4 meeting, Toronto Feb 20-21,
Measurement and Fault-Finding Using MAGGIE and PIPES. Presented at the HENP SIG Internet2 Members Meeting, Indianapolis, October Paola Grosso (SLAC)
Network Monitoring grid network performance measurement, simulation & analysis Presented by Warren Matthews at the Performance.
Network Performance Measurement Atlas Tier 2 Meeting at BNL December Joe Metzger
1 ESnet Network Measurements ESCC Feb Joe Metzger
What we have learned from developing and running ABwE Jiri Navratil, Les R.Cottrell (SLAC)
Monitoring: Grid, Fabric, Network Jennifer M. Schopf, Argonne National Lab PPDG Review 28 April 2003, Fermilab.
1 End-to-end Monitoring of High Performance Network Paths Les Cottrell, Connie Logg, Jerrod Williams, Jiri Navratil, SLAC, for the ESCC meeting, Columbus.
LAN and WAN Monitoring at SLAC Connie Logg September 21, 2005.
27-Jan-2005 Internet2 Activities Toward a Global Measurement Infrastructure Matt Zekauskas Network Performance Measurement and Monitoring APAN19.
1 Using Netflow data for forecasting Les Cottrell SLAC and Fawad Nazir NIIT, Presented at the CHEP06 Meeting, Mumbai India, February
PPDG and ATLAS Particle Physics Data Grid Ed May - ANL ATLAS Software Week LBNL May 12, 2000.
Data Grid projects in HENP R. Pordes, Fermilab Many HENP projects are working on the infrastructure for global distributed simulated data production, data.
DoE SciDAC high-performance networking research project: INCITE INCITE.rice.edu 2004 Technical Challenges INCITE R. Baraniuk, E. Knightly, R. Nowak, R.
1 Status Report on US networks at the Turn of the Century Les Cottrell – SLAC & Stanford U.
DataGrid Wide Area Network Monitoring Infrastructure (DWMI) Connie Logg February 13-17, 2005.
Measurement & Analysis of Global Grid & Internet End to end performance (MAGGIE) Network Performance Measurement.
1 Overview of IEPM-BW - Bandwidth Testing of Bulk Data Transfer Tools Connie Logg & Les Cottrell – SLAC/Stanford University Presented at the Internet 2.
1 Network Measurement Summary ESCC, Feb Joe Metzger ESnet Engineering Group Lawrence Berkeley National Laboratory.
Measuring End-to-end Bandwidth with Iperf using Web100 Presented by Warren Matthews (SLAC) on behalf of Ajay Tirumala (U of Illinois), Les Cottrell (SLAC)
1 Internet End-to-end Monitoring Project - Overview Les Cottrell – SLAC/Stanford University Partially funded by DOE/MICS Field Work Proposal on Internet.
February 6-8, 2006[Joint Techs] Albuquerque, NM Performance Tool Development: NLANR Network Performance Advisor J. W. Ferguson NCSA.
1 SLAC IEPM PingER and BW monitoring & tools PingER Presented by Les Cottrell, SLAC At LBNL, Jan 21,
IEPM. Warren Matthews (SLAC) Presented at the ESCC Meeting Miami, FL, February 2003.
PPDGLHC Computing ReviewNovember 15, 2000 PPDG The Particle Physics Data Grid Making today’s Grid software work for HENP experiments, Driving GRID science.
13-Oct-2003 Internet2 End-to-End Performance Initiative: piPEs Eric Boyd, Matt Zekauskas, Internet2 International.
1 IEPM/PingER Project Les Cottrell, SLAC DoE 2004 PI Network Research Meeting, FNAL Sep ‘04
Storage and Data Movement at FNAL D. Petravick CHEP 2003.
DoE SciDAC high-performance networking research project: INCITE INCITE.rice.edu 2004 Technical Challenges INCITE R. Baraniuk, E. Knightly, R. Nowak, R.
1 MAGGIE Monitoring and Analysis for the Global Grid and Internet End-to-end performance Warren Matthews Stanford Linear Accelerator Center (SLAC)
Internet Connectivity and Performance for the HEP Community. Presented at HEPNT-HEPiX, October 6, 1999 by Warren Matthews Funded by DOE/MICS Internet End-to-end.
1 WAN Monitoring Prepared by Les Cottrell, SLAC, for the Joint Engineering Taskforce Roadmap Workshop JLab April 13-15,
Advanced Network Diagnostic Tools Richard Carlson EVN-NREN workshop.
1 Particle Physics Data Grid (PPDG) project Les Cottrell – SLAC Presented at the NGI workshop, Berkeley, 7/21/99.
1 Deploying Measurement Systems in ESnet Joint Techs, Feb Joseph Metzger ESnet Engineering Group Lawrence Berkeley National Laboratory.
Toward a Measurement Infrastructure. Warren Matthews (SLAC) Presented at the e2e Workshop Miami, FL, February 2003.
IEPM-BW (or PingER on steroids) and the PPDG
Milestones/Dates/Status Impact and Connections
High Speed File Replication
Warren Matthews and Les Cottrell (SLAC)
Wide Area Networking at SLAC, Feb ‘03
High Performance Active End-to-end Network Monitoring
Connie Logg February 13 and 17, 2005
Experiences in Traceroute and Available Bandwidth Change Analysis
Experiences in Traceroute and Available Bandwidth Change Analysis
Network Performance Measurement
E2E piPEs Overview Eric L. Boyd Internet2 24 February 2019.
SLAC monitoring Web Services
Advanced Networking Collaborations at SLAC
IEPM. Warren Matthews (SLAC)
Wide-Area Networking at SLAC
MAGGIE NIIT- SLAC On Going Projects
PIPE Dreams Trouble Shooting Network Performance for Production Science Data Grids Presented by Warren Matthews at CHEP’03, San Diego March 24-28, 2003.
Interoperable Measurement Frameworks: Internet2 E2E piPEs and NLANR Advisor Eric L. Boyd Internet2 17 April 2019.
Internet2 E2E piPEs Project
Net Rat Network Reliability and Troubleshooting.
Warren Matthews (SLAC) Presented at the PIPEfitters Breakfast,
Presentation transcript:

PIPE Dreams Trouble Shooting Network Performance for Production Science Data Grids Presented by Warren Matthews at CHEP’03, San Diego March 24-28, 2003

AbstractAbstract The vision of science grids allocating resources to analyze huge quantities of HENP data clearly depends on reliable network performance. Tools developed at SLAC in conjunction with the Internet2 PIPES project will help to ensure this. In this talk, these tools will be discussed and the procedure for publishing performance data, in particular using the Globus toolkit's MDS and web services will be reviewed. The subsequent analysis and trouble-shooting methodology will be discussed with real world examples from the particle physics data grid (PPDG) and the European data grid (EDG).

OverviewOverview What is the problem ? What is PIPES ? Network performance monitoring Problem identification

Resource Broker Farm Farm Farm Data Data Data requestor The Network Network Monitoring for the Grid The Data Grid consists of many components that must interoperate requestor

Resource Broker Farm Data Data Data requestor The Network Allocate Resources The resource broker must be fully informed Measurement is required ! requestor 12% pkt loss OC4880% Utilization

What is PIPES ? Internet2 End-to-end performance initiative PI Performance Evaluation System (PIPES) PIPES Monitoring Platform (PMP) Overlap with goals of HENP Tremendous resources

IEPM-BWIEPM-BW Package developed at SLAC –Measurement Engine Iperf, bbftp, bbcp, ping, traceroute Abwe, owamp, udpmon, gridftp –Job Manager –Data Storage and data server –Analysis Engine

SNV SLAC CHI ESnet NY Stanford CalREN NERSC LANL JLAB TRIUMF KEK Abilene SLAC SNV FNAL ANL NIKHEF CERN IN2P3 CERN CALTECH SDSC BNL JAnet HSTN SEA ATL CLV IPLS RAL UCL UManc DL NNW NY Rice UTDallas NCSA UMich I2 SOX UFL APAN RIKEN INFN-Roma INFN-Milan CESnet APAN Geant EDG PPDG/GriPhyN Monitoring Site ORNL Stanford UTAH DNVR ORNL NASA WASH Imperial INFN-Padua

SLAC Manchester Bristol Dresden IN2P3 RAL Stanford Calren Abilene Renater DFN Janet NNW TVN SWERN ESnet BaBar Grid Geant 622Mbps 2.5 Gbps 1 Gbps 10 Gbps

Problem Identification Typical Scenario –User complains file transfer is slow –Net admin runs ping, traceroute, iperf test –Complain to upstream provider Proactive –What do we mean by throughput? –How do we know there was a performance hit? –Our approach is diurnal changes

Alarms Too much to keep track of Rather not wait for complaints Automated Alarms Rolling average à la RIPE-TT –May not be the best approach AMP Automated Detection System

LimitationsLimitations Could be over an hour before alarm is generated More frequent measurements impact the network and measurements overlap Low impact tools allow finer grained measurement –Use NWS multi-variate method –Use SCIDAC ABwE tool –Use PingER, OWAMP

PublishingPublishing Many monitoring projects, publish data to allow them to inter-operate MDS –EDG NM Schema Web Services –GLUE NE Schema GGF NMWG –Hierarchy Doc –Tools Doc./get_data

Net Rat Alarm System –Multiple tools –Multiple measurement points –Trigger further measurements –Cross reference off site stats Informant database No measurement is ‘authoritative’ –Cannot even believe a measurement

LogLog 03/20/ :13:46 ALARM pcgiga throughput= ctresh= athresh= /20/ :13:48 TRACE no change in route detected 03/20/ :16:07 CALM Throughput within acceptable limits. ALARM CANCELLED

Toward a Monitoring Infrastructure MAGGIE –Measurement and Analysis package built on NIMI/Akenti EDEE –production-quality Data Grid for Europe

More Information IEPM Home Page IEPM-BW I2 E2E and PIPES RIPE-TT AMP Automated Event Detection NWS ABWE

EndEnd This talk made possible by the IEPM team at SLAC (Les Cottrell, Connie Logg, Jiri Navratil, Jerrod Williams, Fabrizio Coccetti), and the many developers and maintainers around the world.