PERT – EVN-NREN – Amsterdam 28/1/05 – Toby Rodwell: The PERT and Network Performance Monitoring, EVN-NREN, Amsterdam, 28/01/05.

Presentation transcript:

The PERT and Network Performance Monitoring
EVN-NREN, Amsterdam, 28/01/05
Toby Rodwell, Network Engineer, DANTE

Network Performance Problems
Historically, long-distance circuits (the "wide area") have been the bottleneck in a network.
In recent years, the capacity of long-distance circuits has increased significantly.
End-to-end performance bottlenecks may now occur at any point in a system: end-system (application, OS, hardware), LAN or WAN.
As a result, it is becoming more and more difficult for a non-expert end-user to diagnose network performance issues.

Origins of the PERT
Conception of the PERT: Jan 01 Internet2 meeting
  – Performance Enhancement and Response Team
  – To provide a support structure to investigate and resolve problems in the performance of applications over computer networks
  – Comparable to CERT structure
Realization of the PERT: Dec 2002 TERENA meeting
  – GARR, TERENA, DANTE, SWITCH, CESNET, HEAnet and UKERNA committed to a practical trial of a basic PERT

The GEANT PERT
PERT
  – Informal, unregulated access to PERT; anybody can request PERT's help
  – PERT communicated via e-mail list
  – Primary purpose of investigation was to improve PERT's knowledge and experience
  – Problems were addressed on a best-efforts basis
  – No dedicated monitoring tools
  – Roundup tracking system (off-the-shelf) used

GEANT2 PERT
A development of the existing PERT
Pilot phase Nov 04 – Feb 05
Fully operational from Mar 05
A virtual team consisting of
  – Case Managers, who receive new requests and manage unresolved issues
  – Subject specialists, who can be called upon to help resolve complex issues
Monitoring tools
  – During the course of the GEANT2 project a monitoring infrastructure will be developed and deployed, which should be of particular help with performance troubleshooting

PERT Staff
Case Managers
  – Part-time staff provided by GEANT2 project participants
  – On a roster to ensure continuous cover during normal working hours (once PERT fully operational)
  – Cross-discipline experts who are capable of identifying the locations of performance bottlenecks
Subject Matter Experts
  – Unfunded volunteers from a potentially wide variety of organizations who provide help on a best-efforts basis
  – Have specialist knowledge in one or more subjects and so can precisely diagnose the cause of a given problem and help the end-users resolve it

Pilot PERT Systems
Issue Tracker
  – Record of PERT issues (cases) and their investigation
  – Uses open-source "Roundup" software
  – Publicly accessible at (eVLBI performance case issue4)
PERT Diary
  – For assessing the performance of the PERT and highlighting issues
  – Uses TWiki open-source software (user-editable website)
  – Publicly accessible at

PERT Systems
PERT Ticket System
  – Similar to Trouble Ticket systems used by NOCs
  – Optimised for the collaborative nature of PERT investigations (will collect and record e-mails and Instant Messaging threads)
  – May directly contact SMEs who have expressed interest in a particular subject
Knowledge Base
  – Known performance issues, with possible ways to address them
  – Successful diagnostic strategies

Lessons Learned to Date
Identify a technical contact at each end
Determine the scope of testing possible
  – If production machines are involved, some configuration changes may not be acceptable for testing purposes
Wherever possible, use methods that minimise the number of variables
  – e.g. sink data to /dev/null; transfer memory to memory, not to disk
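The "memory to memory" advice can be illustrated with a minimal sketch. It is not a PERT tool: the function name `memory_to_memory_test` and its parameters are hypothetical, and it runs over the loopback interface only so that it is self-contained. A real test would place sender and receiver on the two end hosts. The sender reads from a buffer in memory and the receiver counts and discards the data, so no disk is involved at either end.

```python
import socket
import threading
import time


def memory_to_memory_test(total_bytes: int = 8 * 1024 * 1024,
                          chunk: int = 64 * 1024) -> float:
    """Send total_bytes from memory over TCP; the receiver discards them.

    Returns the measured throughput in megabits per second.
    """
    server = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    server.bind(("127.0.0.1", 0))      # port 0: let the OS pick a free port
    server.listen(1)
    port = server.getsockname()[1]
    received = [0]

    def sink() -> None:
        conn, _ = server.accept()
        with conn:
            while True:
                data = conn.recv(chunk)
                if not data:
                    break
                received[0] += len(data)   # counted, then thrown away

    receiver = threading.Thread(target=sink)
    receiver.start()

    payload = bytes(chunk)             # zero-filled buffer held in memory
    start = time.perf_counter()
    with socket.create_connection(("127.0.0.1", port)) as client:
        sent = 0
        while sent < total_bytes:
            n = min(chunk, total_bytes - sent)
            client.sendall(payload[:n])
            sent += n
    receiver.join()                    # wait until everything has been read
    elapsed = time.perf_counter() - start
    server.close()

    if received[0] != total_bytes:
        raise RuntimeError("receiver did not get all the data")
    return total_bytes * 8 / elapsed / 1_000_000


if __name__ == "__main__":
    print(f"{memory_to_memory_test():.0f} Mbps (loopback, memory to memory)")
```

Because disk I/O is excluded on both sides, any shortfall measured this way between two real hosts points at the network path, the TCP settings or the hosts' network stacks rather than at storage.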

Contacting the PERT
Normally via NREN
Selected pan-European projects (including EVN) may contact PERT directly
  – Because the PERT does not offer a 24x7 quick response, suspected network failures are best reported to NREN/GEANT NOCs
address –

GEANT Network Monitoring

Monitoring Tools
GEANT status monitoring
  – 5-minute polling: state of equipment, circuits and services
  – Failed hardware or circuits detected within 10 minutes and action taken by GEANT NOC, 24x7
GEANT traffic statistics collection
  – 5-minute polling of router interface counters (default and customised)
  – Collected data stored in a Round Robin Database (RRD), which is kept at a constant size by aggregating data as it ages
GEANT traffic statistics display
  – For quick, real-time view: Weathermap
  – For back history and specialist counters: Taksometro
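The collection scheme above (poll counters every 5 minutes, store in a database that stays a constant size by aggregating old data) can be sketched as follows. This is an illustration of the idea only. It is not the RRDtool file format or the GEANT collector, and the names `counter_rate_bps` and `TinyRoundRobinStore` and all slot counts are hypothetical.

```python
from collections import deque


def counter_rate_bps(prev: int, curr: int, interval_s: float,
                     counter_bits: int = 64) -> float:
    """Bits per second from two octet-counter readings.

    The modulo handles a single counter wrap between the two polls.
    """
    delta = (curr - prev) % (2 ** counter_bits)
    return delta * 8 / interval_s


class TinyRoundRobinStore:
    """Bounded store: recent samples at full resolution, older ones averaged."""

    def __init__(self, recent_slots: int, archive_slots: int,
                 samples_per_archive: int) -> None:
        self.recent = deque(maxlen=recent_slots)    # fine-grained samples
        self.archive = deque(maxlen=archive_slots)  # coarse averages
        self.samples_per_archive = samples_per_archive
        self._pending = []                          # aged samples awaiting averaging

    def add(self, value: float) -> None:
        if len(self.recent) == self.recent.maxlen:
            # The oldest fine-grained sample is about to be evicted:
            # fold it into the coarse archive instead of losing it.
            self._pending.append(self.recent[0])
            if len(self._pending) == self.samples_per_archive:
                self.archive.append(sum(self._pending) / len(self._pending))
                self._pending.clear()
        self.recent.append(value)

    def size(self) -> int:
        """Number of stored values; never exceeds recent + archive slots."""
        return len(self.recent) + len(self.archive)


if __name__ == "__main__":
    # One day of 5-minute samples kept in full, older data as hourly averages.
    store = TinyRoundRobinStore(recent_slots=288, archive_slots=168,
                                samples_per_archive=12)
    for i in range(1000):
        store.add(float(i))
    print(store.size())
```

The trade-off is the one the slide describes: storage never grows, at the cost of losing fine detail for older periods.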

Monitoring Tools – Taksometro

Monitoring Tools – 'Weathermap' Kairos

Monitoring Tools – 'Weathermap' Kairos
Hyperlinked traffic chart

Monitoring Tools – Synagon (GEANT Ops only)
Before / After

Any Questions? Thank you.

Example Case
… from last year. The project has since moved on, but the sequence of events is still instructive
EVN throughput test
  – Test the download of a 430MB file from the JIVE website in Dwingeloo to the University of Oxford
  – Problems with the systems in Oxford, therefore the test was done between JIVE and a GÉANT workstation

Example Case
Initial transfer test:
  – Via HTTP, using wget
  – Took 5 minutes to complete the 430MB transfer (approximately 10Mbps throughput)
PERT case opened
Potential causes
  – Ethernet interfaces not in full-duplex mode
  – Insufficiently large TCP buffers
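The "approximately 10Mbps" figure follows directly from the file size and the transfer time. As a quick check, assuming 1 MB means 10^6 bytes and the transfer took exactly 5 minutes (the slide gives only rounded values):

```python
def throughput_mbps(size_megabytes: float, seconds: float) -> float:
    """Average throughput in megabits per second (1 MB taken as 10**6 bytes)."""
    bits = size_megabytes * 1_000_000 * 8
    return bits / seconds / 1_000_000


rate = throughput_mbps(430, 5 * 60)
print(f"{rate:.1f} Mbps")  # 430 MB * 8 / 300 s
```

This gives roughly 11.5 Mbps, consistent with the rounded figure on the slide.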

Example Case
  – The maximum TCP receive buffer size on the GEANT workstations was reasonable
  – wget uses the default TCP buffer size
TCP default buffer size increased on two receivers (ws4.uk: Linux -> 8MB, ws1.de: Unix -> 196kB)
Dramatic improvement: 40Mbps
  – Could not access the JIVE webserver to increase the Tx buffer (critical production machine)
  – Access was granted to the JIVE FTP server, where the Tx buffer was increased to 2MB
Improvement: 90Mbps
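Why buffer sizes matter so much can be sketched with the standard bound that a single TCP connection cannot move more than one window of data per round trip. The slides do not give the round-trip time of the JIVE path, so the 20 ms used below is purely an assumed value for illustration, and the function names are hypothetical. The last function shows the per-socket way an application can ask for a larger receive buffer. That is an alternative to raising the system default as was done in the case, and the operating system may cap or adjust the granted size.

```python
import socket


def max_tcp_throughput_mbps(window_bytes: int, rtt_seconds: float) -> float:
    """Upper bound on throughput: one window of data per round trip."""
    return window_bytes * 8 / rtt_seconds / 1_000_000


def required_window_bytes(target_mbps: float, rtt_seconds: float) -> int:
    """Bandwidth-delay product: the window needed to sustain target_mbps."""
    return round(target_mbps * 1_000_000 * rtt_seconds / 8)


def request_receive_buffer(size_bytes: int) -> int:
    """Ask the OS for a receive buffer on one socket; return what was granted."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.setsockopt(socket.SOL_SOCKET, socket.SO_RCVBUF, size_bytes)
        return sock.getsockopt(socket.SOL_SOCKET, socket.SO_RCVBUF)


if __name__ == "__main__":
    ASSUMED_RTT = 0.020  # seconds; hypothetical, not taken from the slides
    for window in (64 * 1024, 2 * 1024 * 1024):
        bound = max_tcp_throughput_mbps(window, ASSUMED_RTT)
        print(f"window {window} bytes -> at most {bound:.1f} Mbps")
    print("window needed for 100 Mbps:",
          required_window_bytes(100, ASSUMED_RTT), "bytes")
    print("granted receive buffer:", request_receive_buffer(1 << 20), "bytes")
```

The same bound applies to the sender's transmit buffer, which is consistent with the further gain seen once the Tx buffer on the JIVE FTP server was raised.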