Presentation is loading. Please wait.

Presentation is loading. Please wait.

1 IEPM/PingER Project Les Cottrell, SLAC DoE 2004 PI Network Research Meeting, FNAL Sep 15- 17 ‘04 www.slac.stanford.edu/grp/scs/net/talk03/scidac-pinger-sep04.ppt.

Similar presentations


Presentation on theme: "1 IEPM/PingER Project Les Cottrell, SLAC DoE 2004 PI Network Research Meeting, FNAL Sep 15- 17 ‘04 www.slac.stanford.edu/grp/scs/net/talk03/scidac-pinger-sep04.ppt."— Presentation transcript:

1 1 IEPM/PingER Project Les Cottrell, SLAC DoE 2004 PI Network Research Meeting, FNAL Sep 15- 17 ‘04 www.slac.stanford.edu/grp/scs/net/talk03/scidac-pinger-sep04.ppt

2 2 Outline PingER –Purpose etc. –Methodology –Results PingER-NG ≡ IEPM-BW –Low network impact bandwidth tool (INCITE) –Traceroute viz –Topology (INCITE)

3 3 PingER Uses ping to provides lightweight performance monitoring: –< 100bits/s per pair measured –No software to install at remote sites –Measures loss, RTT, reachability, jitter For planning, trouble shooting Originally (1990s) for HENP sites More recently also to characterize the Digital Divide –ICFA/SCIC, Internet2 Hard to Reach Places, WSIS, ICTP/eJDS

4 4 Methodology Use ubiquitous ping Each 30 minutes from monitoring site to target : –1 ping to prime caches –by default send11x100Byte pkts followed by 10x1000Byte pkts Low network impact + no software to install / configure / maintain at remote sites + no passwords / accounts needed = good for developing sites / regions Record loss & RTT, (+ reorders, duplicates) Derive throughput, jitter, unreachability …

5 5 Architecture Hierarchical vs. full mesh WWW Archive Monitoring Remote FNAL Reports & Data Cache Monitoring SLAC Ping HTTP Archive 1 monitor host remote host pair ~35 ~550

6 6 Coverage In last 9 months added: –Several sites in Russia (thanks GLORIAD) –Many hosts in Africa (5=>36 now in 27 out of 54 countries) –Monitoring sites in Pakistan and Brazil (Sao Paolo and Rio) Now monitoring 650 sites in 115 countries Working to install monitoring host in Bangalore, India Monitoring site Remote site

7 7 World View S.E. Europe, Russia: catching up Latin Am., Mid East, China: keeping up India, Africa: falling behind C. Asia, Russia, S.E. Europe, L. America, M. East, China: 4- 5 yrs behind India, Africa: 7 yrs behind Important for policy makers

8 8 View from CERN Confirms view from N. America From the PingER project August 2004.

9 9 From Developing Regions As expected Brazil to L. America is good Actually dominated by Brazil to Brazil To Chile & Uruguay poor since goes via US Brazil (Sao Paolo) Novosibirsk NSK to Moscow used to be OK but loss went up in Sep. 2003 GLORIAD may help Novosibirsk

10 10 Technology Achievement Index (TAI) TAI captures how well a country is creating and diffusing technology and building a human skills base. TAI from UNDP hdr.undp.org/reports/global/2001/en/pdf/techindex.pdf hdr.undp.org/reports/global/2001/en/pdf/techindex.pdf TAI top 12 Finland 0.744 US 0.733 Sweden 0.703 Japan0.698 Korea Rep. of 0.666 Netherlands0.630 UK 0.606 Canada0.589 Australia 0.587 Singapore0.585 Germany0.583 Norway0.579 US & Canada off-scale

11 11 PingER-NG = IEPM-BW Need measurement tools for high-performance paths/applications –BER 10 -8 takes > day to see 1 loss –Ping losses ≠ TCP losses Build infrastructure to –Measure with: Iperf (TCP mem-to-mem), GridFTP, bbftp Lightweight packet pair dispersion –Evaluate measurement tools

12 12 Low impact bandwidth measurement Goals: –Make a measurement in < second rather than tens of seconds –Injects little network traffic –Provide reasonable agreement with more intense methods (e.g. iperf) Enables: –Measurements of low performance links (e.g. to developing countries) –Helps avoid need for scheduling –More frequent measurements (minutes vs. hours) –Lower impact more friendly

13 13 Low impact Bandwidth Use 20 packet pairs to roughly estimate dynamic bw Capacity & Xtraffic, then Available = Capacity – Xtraffic –Capacity  min pair separation; Xtraffic  packet pair dispersion Dynamic bandwidth capacity (DBC) Available bandwidth = DBC – X-traffic Cross-traffic Iperf ABwE SLAC to Caltech Mar 19, 2004

14 14 Achievable throughput & file transfer IEPM-BW –High impact (iperf, bbftp, GridFTP …) measurements 90+-15 min intervals Select focal area Fwd route change Rev route change Min RTT Iperf bbftp iperf1 abing Avg RTT

15 15 Anomalous Event Detection Too many graphs to scan by hand, need to automate –SLAC Caltech link performance dropped by factor 5 for ~ month before noticed, fixed within 4 hours of reporting Looking for long-term step down changes in bandwidth Use modified “plateau” algorithm from NLANR –Divide data into history & trigger buffer –If y <  h –  *  h then trigger, else history (  When trigger buffer fills: if  t <  *  h, then have an event

16 16 Route table Example Compact so can see many routes at once History navigation Multiple route changes (due to GEANT), later restored to original route Available bandwidth Raw traceroute logs for debugging Textual summary of traceroutes for email to ISP Description of route numbers with date last seen User readable (web table) routes for this host for this day Route # at start of day, gives idea of root stability Mouseover for hops & RTT

17 17 Another example TCP probe type Host not pingable Intermediate router does not respond ICMP checksum error Level change Get AS information for routes

18 18 Topology Choose times and hosts and submit request DL CLRCCLRC CLRC IN2P3 CESnet ESnet JAnet GEANT Nodes colored by ISP Mouseover shows node names Click on node to see subroutes Click on end node to see its path back Also can get raw traceroutes with AS’ Alternate rt SLAC Alternate route Hour of day

19 19 Putting it together     ESnet CENIC Abilene SLAC Supernet SOX

20 20 New features in works (with NIIT) Improve new site set-up tools Improve management –Discover non working links faster Improve access to data and meta data –Provide data base with lat/long, country etc. –Add web services access Improve visualization: –Provide map with drill down to node information –Automate production of long term trend plots for regions –More node selection capabilities Traceroute measurement and analysis

21 21 More PingER Project –http://www-iepm.slac.stanford.edu/pinger/http://www-iepm.slac.stanford.edu/pinger/ –IEEE Communications Magazine on Network Traffic Measurements and Experiments. ICFA/SCIC Network Monitoring report, Jan ‘04 –http://www.slac.stanford.edu/xorg/icfa/icfa-net- paper-jan04/http://www.slac.stanford.edu/xorg/icfa/icfa-net- paper-jan04/ IEPM-BW –http://www-iepm.slac.stanford.edu/http://www-iepm.slac.stanford.edu/


Download ppt "1 IEPM/PingER Project Les Cottrell, SLAC DoE 2004 PI Network Research Meeting, FNAL Sep 15- 17 ‘04 www.slac.stanford.edu/grp/scs/net/talk03/scidac-pinger-sep04.ppt."

Similar presentations


Ads by Google