Presentation is loading. Please wait.

Presentation is loading. Please wait.

Mark Leese (Daresbury Laboratory) Paul Mealor (University College London) 1st EGEE Conference Cork, April 2004 Network Monitoring: The GGF Perspective.

Similar presentations


Presentation on theme: "Mark Leese (Daresbury Laboratory) Paul Mealor (University College London) 1st EGEE Conference Cork, April 2004 Network Monitoring: The GGF Perspective."— Presentation transcript:

1 Mark Leese (Daresbury Laboratory) Paul Mealor (University College London) 1st EGEE Conference Cork, April 2004 Network Monitoring: The GGF Perspective Mark Leese Paul Mealor

2 Mark Leese (Daresbury Laboratory) Paul Mealor (University College London) Contents Simple really: uUse cases - why this is important uWhat GGF is doing

3 Mark Leese (Daresbury Laboratory) Paul Mealor (University College London) The Grid? uBasic Grid principle: uUser applications (Grid apps) submit their work to the middleware which selects the “best” resources available to runs the job. uNetwork performance information is essential...because… Grid App Middleware Grid App Resource (SE) Resource (CE) Network Grid App Uzbekistan CERN

4 Mark Leese (Daresbury Laboratory) Paul Mealor (University College London) uResource Brokers (RBs) are responsible for finding the best resource (Computing Element, CE) to be used for a job, e.g.: uRun job at B, using copy of data from A, then store results at C uAll other things being equal, take into account the data access requirements of the job uOut of the list of CEs capable of running the job, use network cost function to identify the CE with the “best” data access: uConsider “best” combination of data sources and sinks, e.g. IF source data = 10 GB AND resulting data will = 100 GB THEN pick CE based on performance to result storing SE (Storage Element). uEuropean Data Grid does something along these lines (Please, no one tell me that this is wrong) Use Case 1: Resource Selection Network Cost Function Estimated transfer time File source & destination File size

5 Mark Leese (Daresbury Laboratory) Paul Mealor (University College London) Use Case 2: Replica Selection uFile replication = proven technique for improving data access uSpread multiple copies of same file across the Grid Do you really want to get everything from CERN, everytime? Do you really want to get everything from your geographically nearest site everytime? uA file has Logical File Name (LFN) which maps to 1 or more PFNs (physicals) uReplica Manager should include Replica Selection Service which uses network performance data (from somewhere) to find “best” replica. 5. GridFTP commands Replica Catalogue Replica Selection Grid App 2. Multiple locations (PFNs) 1.LFN 4. Selected replica (PFN) Net Mon Service 3. Get performance data/ predictions GGF looking at formally defining these (and other) use cases

6 Mark Leese (Daresbury Laboratory) Paul Mealor (University College London) uPatience ;-) First we must look at web services. uEssentially, an online application accessed using XML... u…which makes it easier for other apps to use yours… u…which allows the Grid middleware to access our data How are GGF addressing problem? UDDI registry WSPClient 3. Client requests WSDL doc 4. WSDL tells client how to interact 1. WSP registers service with registry 2. Client locates suitable service using registry 5. Service and client interact using XML messages, sent via SOAP

7 Mark Leese (Daresbury Laboratory) Paul Mealor (University College London) uBy producing standards relating to network monitoring services. uFirst with the Network Measurements Working Group (NM-WG): Defining XML schemas for requesting tests and historic data, and publishing network measurements Aims: to standardise communication, and… …use XML, for web services and OGSI model Simple use case… uAll request & result messages can be formatted using standardised schemas = truly powerful combination How are GGF addressing problem? Network Monitoring Service test request (request schema) tests results (publication schema) DANTE, Internet2, SLAC etc. already using NM-WG work.

8 Mark Leese (Daresbury Laboratory) Paul Mealor (University College London) Standard measurements? uSchemas based on NM-WG proposed measurement classification system: describes a set of network characteristics and their classification hierarchy used for creating common schemata for describing network monitoring data using a standard classification maximises data portability description + hierarchy

9 Mark Leese (Daresbury Laboratory) Paul Mealor (University College London) So what can you ask for 1? uInitial schema requirements set. Four sections: what, where, when, how uWhat: Use GGF metric names, e.g. path.delay.oneWay Can request statistical data, with a specified sample interval, e.g. daily averages for one-way delay over the last month After some “discussion”, multiple statistics in same request Can limit number of returned results to avoid overload uWhere: Source and destination Flexible: IPv4|6, hostnames, or textual names such as “core router” and “edge router” (e.g. for security)

10 Mark Leese (Daresbury Laboratory) Paul Mealor (University College London) uWhen: The primary means of specifying the time period we are interested in (for tests or data retrieval) is: target Time (an absolute time or “now”) relative +ve and -ve time tolerances… -ve time tolerance= 600 secs target_time= 14:00 -ve time tolerance= 600 secs So what can you ask for 2? = 13:50-14:10

11 Mark Leese (Daresbury Laboratory) Paul Mealor (University College London) So what can you ask for 3? Setting limit on number of results controls possibilities: when number of results = “all”: supply all matching measurements in given time period when number of results = 1: time data defines the period for which a measurement is considered to be acceptable, e.g. 14:00 +/- 10 minutes Can also give start & end time if you wish, but values are mapped to target_time & number of results will = all “testing interval” controls how often tests are run

12 Mark Leese (Daresbury Laboratory) Paul Mealor (University College London) So what can you ask for 4096? uHow: Can supply values to act as parameters for tests, or filters for querying past data, including tool name. Uses param specific tags or list of parameters: -a –b 10 -c Possible to set ranges for parameters… 4194304 1048576 …and orders of preference. Unspecified params use receiving system’s defaults Can request reporting of actual param values used Can control whether a test is ever run 4194304 1048576

13 Mark Leese (Daresbury Laboratory) Paul Mealor (University College London) Is that all GGF is doing? uNo, GGF Grid High Performance Networking Research Group also hard at work, modelling the network as a Grid resource so they can perform “advance reservation” etc. uComputing, storage and interconnecting network are all resources: Easier to manage All can be reserved Capability discovery Exploit commonalities Forms integrated stack computing middleware Grid applications networkstorage “advance reservation”

14 Mark Leese (Daresbury Laboratory) Paul Mealor (University College London) The network as a resource uTo be achieved with set of network sub- services forming holistic network service. uCan't say more as this probably going to change quite a lot. uWant to know more? Then get involved!

15 Mark Leese (Daresbury Laboratory) Paul Mealor (University College London) Network monitoring service uHistoric measurement data uPredictions uAllow clients to run scheduled tests uOn-demand (real-time) tests uProvide less-frequently monitored information (network route, topology…) uEvent notifications, for all of the above uAcross multiple administrative domains for all of the above Network Monitoring Service Network domain Y Grid Middleware Grid Applications Automated Test Systems GOC/NOC Admin Software Grid/Net Operations Other Network Services Other Network Services Other Network Services Network Monitoring Service Network domain Z Network Monitoring Service Network domain X Diagram shows potential clients: numerous and varied

16 Mark Leese (Daresbury Laboratory) Paul Mealor (University College London) Will this be easy? uProbably not, but like all good car salespeople, I won’t tell you about the problems. uBut the potential benefits are worth the effort!

17 Mark Leese (Daresbury Laboratory) Paul Mealor (University College London) Conclusion uGrid network monitoring crucial to the Grid But you all know that already! uGHPN: looking at network services, inc. monitoring service uNM-WG: looking at how to interface to network monitoring services uAmbitious, but potential benefits justify efforts! JRA4 SHOULD be involved!

18 Mark Leese (Daresbury Laboratory) Paul Mealor (University College London) ? ? ? ? ? Questions m.j.leese@dl.ac.ukm.j.leese@dl.ac.uk pdm@hep.ucl.ac.ukpdm@hep.ucl.ac.uk GET INVOLVED! http://www-didc.lbl.gov/NMWG/ http://forge.gridforum.org/projects/ghpn-rg The End


Download ppt "Mark Leese (Daresbury Laboratory) Paul Mealor (University College London) 1st EGEE Conference Cork, April 2004 Network Monitoring: The GGF Perspective."

Similar presentations


Ads by Google