Pushing Up Performance for Everyone Matt Mathis 7-Dec-99.


Why do so few people get good network performance?
  Context and history
  Architectural origins
  Approaches

The Wizard Gap

Past Performance Evolution
  Wizards wrote standards
    –Standard TCP could not go fast (1988)
  Wizards enhanced systems
    –Stock systems could not go fast (1995)
  Gurus tune systems (today)
    –Fast TCP is present
    –Badly mistuned by default

Ongoing Performance Evolution
  More disciples tune and debug (tomorrow)
    –All netadmins and sysadmins?
  Systems are tuned by default (future)
    –Web100...
  Debugging will become “easy” (?)

Architecture
  The Good news
    –TCP hides the net from the application
  The Bad news
    –TCP hides the net

Architecture
  The Good news
    –TCP hides the net from the application
  The Bad news
    –TCP hides the net ... including ALL bugs everywhere.
  The only legal symptom is less than expected performance

You get poor performance if:
    –The application is inefficient
    –TCP is buggy
    –TCP is mistuned
    –The path is buggy
    –The path is congested
    –Routing is suboptimal
  Especially on a long path.
    –Think: weakest link of an invisible chain

Closing the Wizard gap
  Share the expertise
    –Train more disciples
  Require less expertise
    –Systems should tune themselves
  Better observability
    –Focused and efficient debugging
  Documentation
    –Show that the world is improving

Share the expertise
  Joint Techs meetings
  TCP Tuning
    –In-depth presentation by Matt Mathis
  DAST Application tutorials
    –See: dast.nlanr.net

Require less expertise
  TCP Autotuning
    –Presentation by Matt Mathis
  Web100
    –Presentation by Basil Irwin
  Online TCP debugging resources
    –See

Better Observability (Instrumentation)
  Network Instrumentation and Visualization
    –Presentation by Mark Gates
  Trace Analysis and Auto-Diagnosis
    –Presentation by Kathy Benninger
  Better TCP instrumentation (Web100)
    –Just ask TCP why it is slow

Better Observability (Debugging methods)
  Sweden - Pittsburgh path
    –Presentation by Greg Miller & Jerry Sobieski
  iPerf tool
    –Presentation by Mark Gates
  Existing tools and tool repositories
    –See:
  Still insufficient

Better Observability (Measurement)
  Measurements from Seattle I2 Meeting
    –Presentation by Matt Zekauskas
  Advanced Research and Engineering Atlas
    –Presentation by John Jamison
  Many distributed measurement efforts
    –AMP, Surveyor, NIMI, etc.

Documentation
  vBNS stats and measurement
    –Tutorial by Rick Wilder
  NLANR MOAT vBNS traffic on NAI
    –See: moat.nlanr.net
  Many benchmark efforts
    –Surveyor, AMP, NIMI, Web100...
  HPC host census (?)

Conclusion
  We need to find every bug that TCP hides
    –Now and always
  We need to eliminate all irrelevant controls
    –Autotune TCP (and RED, etc.)

Debugging flowchart
  Look at a trace and click to study symptoms
  Ongoing evolution

Testrig kit
  "Foolproof" TCP diagnosis starter kit with:
    –Simple diagnostic application
    –TCP trace collection tools
    –Visualization tools
    –Pointer to the debugging flowchart
  With wrapper scripts around everything

TCP Debugging In-depth
  Draft done at CAIDA this summer
  Future NCNE On-site
    –1, 2.5 and 5 hour versions
  Basis for the debugging flowchart
  Update from flowchart as it evolves
  Interactive - Uses magicpoint/xplot

Trace Analysis and Auto-Diagnosis (TAAD)
  Scan GigaPop traffic for mistuned TCP connections
    –that fail to meet the model rate = (MSS/RTT) * (C/sqrt(p))
  Running prototype
  Use to direct other resources
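
The model on this slide can be evaluated directly. The sketch below is only an illustration, not the TAAD prototype: it reads MSS as the segment size in bytes, RTT as the round-trip time in seconds, and p as the loss probability, takes C = sqrt(3/2) as an assumed value (the slide gives no number), and uses a hypothetical 50% tolerance to decide when a connection "fails to meet" the model.

```python
import math

# C is a constant of order 1. sqrt(3/2) is an assumed value for illustration;
# the slide does not state which constant is used.
C = math.sqrt(3.0 / 2.0)


def model_rate_bps(mss_bytes, rtt_s, loss_prob, c=C):
    """Model rate in bits per second: (MSS/RTT) * (C/sqrt(p))."""
    if rtt_s <= 0 or not 0 < loss_prob <= 1:
        raise ValueError("need rtt_s > 0 and 0 < loss_prob <= 1")
    return (mss_bytes * 8 / rtt_s) * (c / math.sqrt(loss_prob))


def fails_model(observed_bps, mss_bytes, rtt_s, loss_prob, tolerance=0.5):
    """True when the observed rate is below `tolerance` times the model rate.

    The 0.5 tolerance is a hypothetical threshold, not a TAAD setting.
    """
    return observed_bps < tolerance * model_rate_bps(mss_bytes, rtt_s, loss_prob)


if __name__ == "__main__":
    # Hypothetical connection: 1460-byte MSS, 70 ms RTT, 0.01% loss.
    rate = model_rate_bps(1460, 0.070, 1e-4)
    print(f"model rate: {rate / 1e6:.1f} Mbit/s")
    print("flag for follow-up:", fails_model(2e6, 1460, 0.070, 1e-4))
```

A connection that runs well below the model rate for its measured RTT and loss is probably limited by something other than the path, which makes it a candidate for tuning help.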

Autotuning
  Make TCP “do the right thing” by default
  No unneeded user controls
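
The slide does not say which controls autotuning would remove. One plausible candidate, assumed here, is the socket buffer size, which manual tuning usually sets to at least the bandwidth-delay product of the path. A minimal sketch of that arithmetic, with hypothetical function names and example numbers:

```python
def bdp_bytes(bandwidth_bps, rtt_s):
    """Bandwidth-delay product in bytes: the data in flight on a full path."""
    return round(bandwidth_bps * rtt_s / 8)


def window_limited_rate_bps(buffer_bytes, rtt_s):
    """Upper bound on throughput when at most one buffer is sent per RTT."""
    return buffer_bytes * 8 / rtt_s


if __name__ == "__main__":
    # Hypothetical path: 622 Mbit/s (roughly OC-12) with a 70 ms RTT.
    needed = bdp_bytes(622e6, 0.070)
    print(f"buffer needed to fill the path: {needed} bytes")

    # A 64 KiB buffer (an assumed untuned default) on the same path.
    limit = window_limited_rate_bps(64 * 1024, 0.070)
    print(f"rate with a 64 KiB buffer: {limit / 1e6:.1f} Mbit/s")
```

The gap between the two numbers shows why a fixed small default keeps a fast long path mostly idle, and why the user should not have to compute the right value by hand.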

Generate data points (AMP)
  Nearly 100 systems already
  Kernel TCP bug
    –Need to upgrade to FreeBSD 3.3
  Easy to create 100x1 data points
  Can create 100x100 data points
  Opportunity for NIMI

Generate OC-12 data points
  Max Okumoto working at PSC for SDSC
  Will start tuning selected paths

HPC Host Census
  Use existing data from MCI OC-Xmon
  Patterned after HWB big flow detection
  Measure the number of fast hosts
  Words needed to generalize to all of JET
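
Counting fast hosts from flow data could look roughly like the sketch below. Everything in it is an assumption made for illustration: the `Flow` record layout, the 10 Mbit/s and 1 MB thresholds, and the choice to count senders only. It does not reflect the actual OC-Xmon data format or the HWB big flow detection.

```python
from collections import namedtuple

# Hypothetical flow record; real monitor output will differ.
Flow = namedtuple("Flow", "src dst nbytes duration_s")


def fast_hosts(flows, min_rate_bps=10e6, min_bytes=1_000_000):
    """Return the set of senders with at least one large, fast flow.

    Flows under `min_bytes` are skipped because a short burst says little
    about what the host can sustain. Both thresholds are assumed values.
    """
    hosts = set()
    for f in flows:
        if f.duration_s <= 0 or f.nbytes < min_bytes:
            continue
        if f.nbytes * 8 / f.duration_s >= min_rate_bps:
            hosts.add(f.src)
    return hosts


if __name__ == "__main__":
    sample = [
        Flow("hostA", "hostX", 10_000_000, 1.0),    # 80 Mbit/s, counted
        Flow("hostB", "hostX", 10_000_000, 100.0),  # 0.8 Mbit/s, too slow
        Flow("hostC", "hostX", 1_000, 0.00001),     # too small to judge
    ]
    print("fast hosts:", len(fast_hosts(sample)))
```

Repeating the count over successive measurement periods would give a simple trend line for the "show that the world is improving" goal.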