1 WAN Monitoring Prepared by Les Cottrell, SLAC, for the Joint Engineering Taskforce Roadmap Workshop JLab April 13-15, 2004 www.slac.stanford.edu/grp/scs/net/talk03/jet-apr04.ppt.

Presentation transcript:

1 WAN Monitoring Prepared by Les Cottrell, SLAC, for the Joint Engineering Taskforce Roadmap Workshop JLab April 13-15, 2004. Partially funded by DOE/MICS Field Work Proposal on Internet End-to-end Performance Monitoring (IEPM), also supported by IUPAP

2 Why (Can’t manage what you can’t measure)
Need measurements for both production networks & testbeds:
– Planning, setting expectations, policy/funding
– Trouble-shooting: reliability & performance
  Problems may not be logical, e.g. most Internet problems caused by operator error (Sci Am Jun’03), most LAN problems are Ethernet duplex, host config, bugs
  Made hard by transparency, size & rate of change of network
  “A distributed system is one in which I can’t get my work done because a computer I never heard of has failed.” Butler Lampson
– Application steering (e.g. Grid data replication)
E2E performance problem is THE critical user metric

3 E.g. Policy - trends
S.E. Europe, Russia: catching up
Latin Am., Mid East, China: keeping up
India, Africa: falling behind
C. Asia, Russia, S.E. Europe, L. America, M. East, China: 4-5 yrs behind
India, Africa: 7 yrs behind
Important for policy makers

4 E.g. Changes in network topology (BGP) result in dramatic change in performance
[Figure: snapshot of traceroute summary table; samples of traceroute trees generated from the table; ABwE measurements, one per minute for 24 hours, Thurs Oct 9 9:00am to Fri Oct 10 9:01am. Plot axes: hour vs Mbits/s for the remote host. Series: Dynamic BW capacity (DBC), Cross-traffic (XT), Available BW = (DBC-XT).]
Figure annotations:
– Drop in performance (from original path SLAC-CENIC-Caltech to SLAC-ESnet-LosNettos (100 Mbits/s)-Caltech)
– Back to original path
– Changes detected by IEPM-Iperf and ABwE
Notes:
1. Caltech misrouted via Los-Nettos 100Mbps commercial net 14:00-17:00
2. ESnet/GEANT working on routes from 2:00 to 14:00
3. A previous occurrence went un-noticed for 2 months
4. Next step is to auto detect and notify
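
The relation Available BW = (DBC-XT) and the "auto detect" next step can be illustrated with a small sketch. This is not the IEPM or ABwE algorithm, which the slides do not describe. It is a simple sliding-window comparison, and the function names, window size, and threshold are assumptions chosen for illustration.

```python
from statistics import mean


def available_bw(dbc, xt):
    """Available bandwidth per sample: capacity minus cross-traffic, floored at 0."""
    return [max(d - x, 0) for d, x in zip(dbc, xt)]


def detect_drops(series, window=5, threshold=0.5):
    """Return indices where a sustained drop in the series begins.

    A drop is flagged at index i when every one of the next `window` samples
    is below (1 - threshold) times the mean of the previous `window` samples.
    `window` and `threshold` are illustrative defaults, not tuned values.
    """
    events = []
    i = window
    while i + window <= len(series):
        baseline = mean(series[i - window:i])
        limit = (1 - threshold) * baseline
        if baseline > 0 and all(v < limit for v in series[i:i + window]):
            events.append(i)
            i += window  # move past the change so it is reported once
        else:
            i += 1
    return events


# Synthetic example: capacity falls from 900 to 100 Mbits/s at sample 10,
# loosely mimicking a reroute onto a 100 Mbits/s segment.
dbc = [900] * 10 + [100] * 10
xt = [100] * 10 + [20] * 10
abw = available_bw(dbc, xt)
drops = detect_drops(abw)
```

Requiring all samples in the following window to be low trades detection delay for fewer false alarms from single noisy measurements; a production detector would also need to handle gradual changes and recoveries.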

5 Methods
Active Measurement probes:
– Include: Ping, traceroute, owamp, pathload/abwe, major apps (e.g. bbftp, bbcp, GridFTP…)
– Typically used for end-to-end testing
– Inject data into network
Passive tools:
– Include: SNMP, NetFlow, OCxMon, NetraMet, cflowd, SCNM
– Typically used at border or inside backbones
  SNMP heavily used for utilization, errors on LAN & backbones
  Flows for traffic characterization and intrusion detection
– Need access to network devices (e.g. routers, taps)
Need to put together data from multiple sources
– Different probes, different source & destinations, network-centric & end-to-end
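
As an illustration of what an active, ping-style probe yields for end-to-end testing, the sketch below reduces one batch of probe results to loss and round-trip-time statistics. It does not send packets and is not PingER code. The input convention (a list of RTTs in milliseconds, with None for a lost probe) and the function name are assumptions.

```python
from statistics import mean


def summarize_probes(rtts_ms):
    """Summarize one batch of ping-style probes.

    rtts_ms: list with one entry per probe sent; a float RTT in ms if a
    reply arrived, or None if the probe was lost (hypothetical convention).
    """
    sent = len(rtts_ms)
    if sent == 0:
        raise ValueError("no probes in batch")
    replies = [r for r in rtts_ms if r is not None]
    summary = {
        "sent": sent,
        "received": len(replies),
        "loss_pct": 100.0 * (sent - len(replies)) / sent,
        "min_ms": None,
        "avg_ms": None,
        "max_ms": None,
    }
    if replies:  # RTT statistics are undefined when every probe was lost
        summary["min_ms"] = min(replies)
        summary["avg_ms"] = mean(replies)
        summary["max_ms"] = max(replies)
    return summary


batch = summarize_probes([10.0, None, 12.0, 14.0])
```

Keeping lost probes in the list, rather than dropping them, is what lets loss be computed alongside RTT from the same batch.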

6 Some Challenges for Active monitoring
Bandwidth used, e.g. iperf etc. & apps
For TCP tools: configuring windows at clients/servers and optimizing windows, streams
Some lightweight tools (e.g. packet pairs) not effective at >> 1Gbits/s
Many tools tuned for shared TCP/IP nets not for dedicated circuits
Simplifying use and understanding for end-user, automating problem detection & resolution, need close collaboration today
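
The point about configuring TCP windows can be made concrete with the standard bandwidth-delay product: to keep a path full, the total window must cover roughly bandwidth times round-trip time. The sketch below computes that figure and splits it across parallel streams. The function names and the example path (1000 Mbits/s, 70 ms RTT) are illustrative assumptions, not values from the talk.

```python
import math


def bdp_bytes(rate_mbps, rtt_ms):
    """Bandwidth-delay product in bytes.

    rate_mbps * 1e6 bits/s * rtt_ms / 1e3 s / 8 bits per byte
    simplifies to rate_mbps * rtt_ms * 125.
    """
    return rate_mbps * rtt_ms * 125


def window_per_stream(rate_mbps, rtt_ms, streams):
    """Window each of `streams` parallel TCP streams needs to fill the path together."""
    if streams < 1:
        raise ValueError("streams must be at least 1")
    return math.ceil(bdp_bytes(rate_mbps, rtt_ms) / streams)


total = bdp_bytes(1000, 70)                # one stream on a 1000 Mbits/s, 70 ms path
per_stream = window_per_stream(1000, 70, 4)  # same path split over 4 streams
```

This is only the first-order sizing rule; real transfers are also limited by host buffers, loss, and congestion control behaviour, which is part of why tuning remains a challenge.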

7 Infrastructures
Many measurement projects with different emphases, different communities
– Passive (usually requires network control, used at borders and on backbones, e.g. MICSmon/Netflow, ISP/SNMP, SCNM)
– Active
  Lightweight (PingER, AMP, Surveyor, RIPE …)
  Medium weight (PiPES, NWS, IEPM-Lite …)
  Heavy weight/hi-perf (IEPM-BW, NTAF)
– End-to-end vs net centric (skitter, macroscopic views)
– Repetitive (PingER, AMP, IEPM, PiPES, NWS, NTAF, …)
– On demand, or non-production (NDT, NIMI, PiPES …)
– Dedicated hardware (AMP, RIPE, NDT, PlanetLab …)
– Hierarchical (e.g. AMP) vs Full mesh (e.g. PingER)
For a table comparing 13 public domain infrastructures, see:

8 NMI challenges
Sustaining deployment/operation in multi-agency / international world
Scaling beyond hundreds of hosts very hard over the long term:
– Hosts change, upgrade, new OS
  No control over shared hosts
  Depend on friendly admin contacts who may be busy, uninterested, have moved etc.
  Policy/fears at remote site can make dedicated changes painful
  web100 upgrades not coordinated with Linux upgrades
  New TCP kernel upgrades not coordinated with OS upgrades
– Hosts age, become measurement bottleneck
  Need constant upgrades for dedicated hosts
– Access policies change (pings & ports filtered)
– Probes (iperf etc.) change: new features, patches
Appropriate security

9 So Recognize
Unrealistic to think multiple admin domains will all deploy one and the same infrastructure
– Scaling and interests make unrealistic
Multiple-domain, multi-infrastructures will be deployed
Need to tie together heterogeneous collection of monitoring systems
– Create a federation of existing NMIs
– Infrastructures work together
– Share data with peer infrastructures and others using a common set of protocols for describing, exchanging & locating monitoring data (e.g. GGF NMWG)
– Enables much improved overall view of network using multiple measurement types from multiple sources
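
The idea of sharing data through a common description can be sketched as follows: records from two differently shaped sources are normalized into one record type and grouped per path. This is not the GGF NMWG schema. The record fields, the two input layouts, and all function names are hypothetical and exist only to show the shape of the approach.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Measurement:
    """Hypothetical common record; field names are illustrative only."""
    source: str
    destination: str
    metric: str          # e.g. "rtt_ms" or "throughput_mbps"
    value: float
    timestamp: int       # seconds since the epoch
    infrastructure: str  # which monitoring system produced the record


def from_ping_record(rec):
    # Assumed input layout: {"src", "dst", "avg_rtt_ms", "time"}
    return Measurement(rec["src"], rec["dst"], "rtt_ms",
                       float(rec["avg_rtt_ms"]), int(rec["time"]), "ping-style")


def from_bw_record(rec):
    # Assumed input layout: {"from", "to", "mbps", "epoch"}
    return Measurement(rec["from"], rec["to"], "throughput_mbps",
                       float(rec["mbps"]), int(rec["epoch"]), "bw-style")


def merge_by_path(measurements):
    """Group records by (source, destination), each group ordered by time."""
    paths = defaultdict(list)
    for m in measurements:
        paths[(m.source, m.destination)].append(m)
    return {path: sorted(ms, key=lambda m: m.timestamp)
            for path, ms in paths.items()}


merged = merge_by_path([
    from_bw_record({"from": "a", "to": "b", "mbps": 90, "epoch": 200}),
    from_ping_record({"src": "a", "dst": "b", "avg_rtt_ms": "12.5", "time": 100}),
])
```

Once records share one shape, a route change seen by one infrastructure can be lined up in time against a throughput drop seen by another, which is the "improved overall view" the slide refers to.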

10 MAGGIE Proposal
Measurement and Analysis for the Global Grid and Internet End-to-end performance
Contribute to, utilize the GGF NMWG naming hierarchy and the schema definitions for network measurements
Develop tools to allow sharing
– Web services based
– Integrate information from multiple sources
Brings together several major infrastructure participants: LBNL (NTAP, SCNM), SLAC (IEPM-PingER/BW), Internet2 (PiPES, NDT), NCSC (NIMI), U Delaware, ESnet
Will work with others, e.g. MonALISA, AMP, UltraLight, PPDG, StarLight, UltraScienceNet

11 Federation goals
Appropriate security
Interoperable
Useful for applications, network engineers, scientists & end users
Easy to deploy & configure
As un-intrusive as possible
As accurate & timely as possible
Identify most useful features of each NMI to improve each NMI faster than working alone

12 NMI Challenges: Reduce “Wizard gap”
Applications cross agency AND international funding boundaries (includes Digital Divide)
Incent multi-disciplinary teams, including people close to scientists, operational teams
– Make sure what is produced is used, tested in real environment, include deployment in proposals
Network management research historically underfunded, because it is difficult to get funding bodies to recognize it as legitimate networking research (IAB)
Without excellent trouble-shooting capabilities, the Grid vision will fail

13 More Information
Some Measurement Infrastructures:
– CAIDA list:
– AMP: amp.nlanr.net/, PMA amp.nlanr.net/
– IEPM/PingER home site: www-iepm.slac.stanford.edu/
– IEPM-BW site: www-iepm.slac.stanford.edu/bw
– NIMI: ncne.nlanr.net/nimi/
– RIPE:
– NWS: nws.cs.ucsb.edu/
– Internet2 PiPES: e2epi.internet2.edu/
Tools
– CAIDA measurement taxonomy:
– SLAC Network Tools:
Internet research needs: