NM functions Configuration, Performance, Fault, Accounting, Security.

Configuration Management Middle and long range activities for controlling  Physical, electrical and logical inventories  Maintaining vendor files and trouble tickets  Supporting provisioning and order processing  Defining and supervising service level agreements  Managing changes  Distributing software

Configuration management is central to all other network management functions  All other management functions are supported by configuration details  Enhances control over configuring the network and devices  Quick access to vital configuration data  Helps initialization, maintenance and shutdown of individual components and logical subsystems

Primary Information Actual configuration Attributes of network elements Generated configuration Status indicators of network elements Vendor data Change requests and record Order data Actual inventory Status of service-level indicators

Secondary Information Traffic Volumes More details on indicators Performance indicators of the network elements etc

Configuration management functions Inventory management Network topology services Service Level agreements Designing, implementing and processing trouble tickets Order processing and provisioning Change Management

Inventory management Automated inventory – online record of  currently implemented components and spares,  vendor contacts,  location of components,  maintenance requirements for certain equipment classes,  service statistics like number of outages, response time for repair, repair time distribution

Good Inventory Management Less redundancy  Storing the same information in different databases wastes resources and the processing time needed to back up the databases Synchronized change management Unique names and addresses  Helps during troubleshooting Efficient troubleshooting Better capacity and contingency planning

Network Topology Services Requires current and historical configurations Layered configuration displays at network and component level of  Electrical layouts  Physical  Logical

Display of configuration details

Network details – click on icon

Protocol level

Auto-discovery tool An auto-discovery tool can discover devices on the network (periodically) Auto-mapping produces the network map Executing all this takes up bandwidth

SLA Need to evaluate long-term service levels Consistency in customer service level Increased planning and decreased crisis management Service levels  Responsiveness, accuracy, availability Performance reporting  Planned and actual workload characteristics and service levels during report period

Trouble tickets Linking trouble tickets Information in a trouble ticket  Time reported  Time received by responsible group  Time network service restored  Time vendor notified  Time vendor responded  Time vendor restored service  Total vendor time  Total user non-availability  Total service outage
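As a rough illustration, the timestamps on this slide could be held in a small record from which the three totals are derived. The field names, the sample times, and the way each total is computed (vendor time from notification to vendor restoration, outage from first report to service restoration) are assumptions made for this sketch, not definitions from the slides.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class TroubleTicket:
    # Timestamps named on the slide; the field names are illustrative.
    reported: datetime
    received_by_group: datetime
    vendor_notified: datetime
    vendor_responded: datetime
    vendor_restored: datetime
    service_restored: datetime

    def total_vendor_time(self) -> timedelta:
        # Assumption: vendor time runs from notification to vendor restoration.
        return self.vendor_restored - self.vendor_notified

    def total_service_outage(self) -> timedelta:
        # Assumption: the outage runs from the first report to restoration.
        return self.service_restored - self.reported


# Made-up sample values, for illustration only.
ticket = TroubleTicket(
    reported=datetime(2024, 1, 1, 9, 0),
    received_by_group=datetime(2024, 1, 1, 9, 10),
    vendor_notified=datetime(2024, 1, 1, 9, 30),
    vendor_responded=datetime(2024, 1, 1, 10, 0),
    vendor_restored=datetime(2024, 1, 1, 11, 30),
    service_restored=datetime(2024, 1, 1, 11, 45),
)
vendor_time = ticket.total_vendor_time()
outage = ticket.total_service_outage()
```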

Change Management

Tools for configuration management Simple tools  Provide simple storage for all network related information  Manually collecting and entering data Complex tools  Automatically gather data – latest information on configuration  Compare current configuration with stored configuration  Change a device’s configuration while running  Specify configuration errors that should generate warning messages
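The "compare current configuration with stored configuration" step can be sketched as a dictionary diff. This is a minimal sketch that assumes a configuration is a flat mapping of setting names to values; `diff_config` and the sample settings are hypothetical and do not correspond to any particular management product.

```python
def diff_config(stored: dict, current: dict) -> dict:
    """Return the settings that were added, removed, or changed."""
    added = {k: current[k] for k in current.keys() - stored.keys()}
    removed = {k: stored[k] for k in stored.keys() - current.keys()}
    changed = {
        k: (stored[k], current[k])
        for k in stored.keys() & current.keys()
        if stored[k] != current[k]
    }
    return {"added": added, "removed": removed, "changed": changed}


# Made-up device settings, for illustration only.
stored = {"hostname": "router1", "mtu": 1500, "snmp_community": "public"}
current = {"hostname": "router1", "mtu": 9000, "ntp_server": "10.0.0.1"}
report = diff_config(stored, current)
```

A tool could raise a warning whenever `report["changed"]` or `report["removed"]` is non-empty.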

Performance Management Activities required to continuously evaluate principal performance indicators to check  Service level maintenance  Identify potential bottlenecks  Establish trend reports  Network utilization and error rates

Contd.. Involves  Collection of data on current utilization of network devices and links  Analyze data to discern high utilization trends  Setting utilization thresholds  Using off-line simulation and/or analytical studies on how to maximize performance

Primary Information Actual Configuration Generated configuration Performance indicators in real-time or in near-real-time  Response time  Congested channels  Resource utilization Selected vendor data Performance histories for selected facilities Operational procedures

Performance Indicators Availability Response time Throughput Utilization – channel occupancy Grade of service Transmission volumes Offered load Accuracy

Indicators Service oriented indicators  Have priority Efficiency oriented indicators

Service Oriented Indicators Availability  Customer's perspective  depends on technical reliability of components  Redundancy? Cost benefit  Total costs = cost of redundancy + cost of consequences

Availability

$$\text{Availability} = \frac{\mathrm{MTBF}}{\mathrm{MTBF} + \mathrm{MTTD} + \mathrm{MTTR} + \mathrm{MTOR}}$$

MTBF – Mean time between failures MTTD – Mean time to diagnose MTTR – Mean time to Repair (or report) MTOR – Mean time of Repair For better availability, keep MTTD, MTTR and MTOR low
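The availability ratio on this slide can be evaluated directly. The function below is a small sketch; the sample durations are made up, and all four inputs are assumed to be in the same time unit.

```python
def availability(mtbf: float, mttd: float, mttr: float, mtor: float) -> float:
    """Availability = MTBF / (MTBF + MTTD + MTTR + MTOR)."""
    return mtbf / (mtbf + mttd + mttr + mtor)


# Made-up sample durations in hours.
a = availability(mtbf=1000.0, mttd=2.0, mttr=3.0, mtor=5.0)

# Shrinking the diagnosis and repair terms raises availability.
a_faster = availability(mtbf=1000.0, mttd=1.0, mttr=1.0, mtor=1.0)
```

With these sample numbers `a` is 1000/1010, roughly 0.990, and `a_faster` is higher, which matches the slide's advice to keep MTTD, MTTR and MTOR low.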

Response Time Propagation Delays, Processing delays, Transmission delays, Protocol delays

Contd.. Total Response Time Network delays Processing delays Protocol delays – time outs Response time considerations depend on  Protocols and their behavior  Job priorities  Loads in the system

Accuracy Accuracy can be affected by  Erroneous transmission (wireless & fiber)  Characters transmitted but not delivered  Characters received which were not sent  Characters duplicated

Residual Error Rate

$$\mathrm{RER} = \frac{CH_E + CH_V + CH_N + CH_D}{CH_T}$$

CH_E = erroneous characters  due to media & processing CH_V = transmitted but not received CH_N = extra characters received CH_D = duplicated characters CH_T = total characters
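A minimal sketch of the residual error rate calculation follows. The parameter names map onto the slide's counters (erroneous, transmitted but not received, extra, duplicated, total); the sample counts are made up.

```python
def residual_error_rate(erroneous: int, lost: int, extra: int,
                        duplicated: int, total: int) -> float:
    """(CH_E + CH_V + CH_N + CH_D) / CH_T."""
    if total <= 0:
        raise ValueError("total character count must be positive")
    return (erroneous + lost + extra + duplicated) / total


# Made-up counts: 5 faulty characters out of 100,000 transmitted.
rer = residual_error_rate(erroneous=3, lost=1, extra=0, duplicated=1,
                          total=100_000)
```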

Efficiency oriented indicators Represent the interest of the organization Service oriented monitoring and efficiency oriented monitoring  conflicts?

Efficiency vs service

Throughput Measure of a server’s capacity - MIPS Line throughput – kilobits/sec Application oriented  Number of transactions per unit time  Number of customer sessions per application  Number of calls serviced  Number of jobs provided by a node

Utilization Dynamic measure of resources used Puts a practical limit on the throughput under operational conditions Helps study overlap among component processing, mutual waits etc.

Utilization Utilization vs Accuracy Utilization vs throughput Utilization vs Goodput

Overlap effects

Availability Availability of system depends on availability of individual components (Very difficult to measure and report on availability)  Check on each component and compare with configuration  Depends on how components are connected

Example Each component availability = 0.98 Availability of the serial combination is 0.98 * 0.98 = 0.9604 ≈ 0.96 Example: 2 modems, serial processing of data

Prob 1 link is not available = 0.02 Prob both links are not available is 0.02 * 0.02 = 0.0004 Availability = 1 - 0.0004 = 0.9996
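The two examples above (components in series, then two redundant links) can be checked with a short sketch. It assumes component failures are independent, which the slides do not state explicitly; the function names are illustrative.

```python
from math import prod


def series_availability(components: list[float]) -> float:
    """All components must work: multiply their availabilities."""
    return prod(components)


def parallel_availability(components: list[float]) -> float:
    """Service fails only if every redundant component fails."""
    return 1.0 - prod(1.0 - a for a in components)


serial = series_availability([0.98, 0.98])      # two modems in series
parallel = parallel_availability([0.98, 0.98])  # two redundant links
```

Here `serial` comes out at about 0.9604 and `parallel` at about 0.9996, so redundancy raises availability while chaining components lowers it.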

Performance measurements Data Gathering  Exhaustive  Statistical Distribution for sampling times Correlation effects Performance Analysis  Data presentation  Interpretation

Contd.. Historical trends Real time trends Graphical presentation and comparison Linking different performance indicators  Then set thresholds

Simulation studies To improve the performance or identify bottlenecks –  model the network and components – (primary)  Study effects of changes in the model  Target Optimal performance  Requires Synthetic traffic generation Analytical and simulation tools

Simple tools for PM Provides real-time information on network components  Graphical – bars, histograms Can help find bottlenecks Main information  Processor utilization  Memory utilization  Link – pkts/sec, bits/sec  Bit error rates

Complex Tools Set threshold Take action once thresholds are exceeded  Alarm  Enable backup Near threshold warning Store historical data

A complex tool at work Performance problem Brief periods of interrupted service between systems – no information passes through – at 3 pm and 12 am
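The threshold behaviour listed for complex tools could look roughly like the sketch below. The 90% warning band, the 80% utilization threshold, and the sample readings are assumptions chosen for illustration; a real tool would also trigger the backup action instead of only labelling the sample.

```python
def classify(value: float, threshold: float, warn_fraction: float = 0.9) -> str:
    """Label a sample as 'alarm', 'warning' (near threshold), or 'ok'."""
    if value > threshold:
        return "alarm"
    if value >= warn_fraction * threshold:
        return "warning"
    return "ok"


# Store historical data alongside the classification of each sample.
history = []
for sample in (42.0, 75.0, 95.0):  # made-up utilization readings in percent
    history.append((sample, classify(sample, threshold=80.0)))
```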

PM tool at work Check error rates in the network  Normal Check utilization  Peaks at 3 pm and 12 am – times of backup Check Gatsby and Daisy utilization  Peaked to 100% at the specified times Check for processor intensive applications  negative

Contd.. Check network traffic type  Located an unknown protocol packet  Flooding the network – locating servers  Check originator  Send message to him  Or block his traffic

Fault management Activities needed to dynamically maintain the network service level High network availability

Primary Information Actual configuration Generated configuration Event reports and alarms Status indicators of network elements Performance indicators Spare components and their status Backup routes and their status Vendor data for problem dispatch Global traffic volumes Progress of trouble resolution

Steps in FM Identify the occurrence of fault Isolate the cause of fault Correct the fault if possible First is difficult, second is very difficult!

Network Status Supervision Layered configuration maps (status) (Tightly coupled to topology display) Zoom in on parts to isolate problems Real time traffic status displays Good monitoring devices/sensors Monitored information to be passed on to agents, or management elements Process and distribute messages, events and alarms

Status Is a measurement of the behavior of an object at a specific instant in time  Represented by a set of status information items and their values at a specific time

Event Change in the status of the element – which justifies notification, i.e. significant to fault management Event report can be generated  Type of event  Change in status  Time stamp  Reporting entity – object or process that generated event  Managed object whose status changed  Managed object information  Probable cause  Effect of event on the managed object
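The fields of an event report map naturally onto a record type. The sketch below is one possible layout; the attribute names and the sample values are hypothetical and only mirror the items listed on the slide.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone


@dataclass
class EventReport:
    event_type: str        # type of event
    status_change: str     # change in status
    reporting_entity: str  # object or process that generated the event
    managed_object: str    # managed object whose status changed
    probable_cause: str
    effect: str            # effect of the event on the managed object
    managed_object_info: dict = field(default_factory=dict)
    # Time stamp defaults to the moment the report is created (UTC).
    timestamp: datetime = field(
        default_factory=lambda: datetime.now(timezone.utc)
    )


# Made-up example values.
report = EventReport(
    event_type="link_failure",
    status_change="up -> down",
    reporting_entity="Pepper",
    managed_object="link to LAN3",
    probable_cause="carrier absent",
    effect="inhibited",
)
```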

Event Filtering Multi-layered filtering

Filtering Process

Global filtering  First process applied to an event – is the event serious and does it have to be processed?  Use a set of criteria for this assessment  Cannot be function specific

Filtering Process Distribution Filtering  An event processor selects the event it wishes to receive  There are various event processes running simultaneously Event process filtering  Filtering done by the event processor  Specific to the functional
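The three filtering layers (global, distribution, event process) can be sketched as a small pipeline. The severity scale, the event kinds, and the class and function names are assumptions invented for this illustration; the slides do not prescribe any concrete criteria.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Event:
    kind: str      # e.g. "link_down"
    severity: int  # hypothetical scale: higher is more serious
    source: str


def global_filter(event: Event, min_severity: int = 3) -> bool:
    # Global filtering: function-independent check of seriousness.
    return event.severity >= min_severity


class EventProcessor:
    def __init__(self, name, kinds, sources=None):
        self.name = name
        self.kinds = set(kinds)
        self.sources = set(sources) if sources is not None else None
        self.accepted = []

    def wants(self, event: Event) -> bool:
        # Distribution filtering: the processor selects the events it receives.
        return event.kind in self.kinds

    def process(self, event: Event) -> None:
        # Event process filtering: done inside the processor itself.
        if self.sources is None or event.source in self.sources:
            self.accepted.append(event)


def dispatch(events, processors) -> None:
    for event in events:
        if not global_filter(event):
            continue
        for processor in processors:
            if processor.wants(event):
                processor.process(event)


fault = EventProcessor("fault", kinds={"link_down"})
perf = EventProcessor("performance", kinds={"high_utilization"},
                      sources={"Sergeant"})
dispatch(
    [
        Event("link_down", 5, "Pepper"),
        Event("high_utilization", 4, "Sergeant"),
        Event("high_utilization", 4, "Daisy"),
        Event("link_down", 1, "Pepper"),  # dropped by the global filter
    ],
    [fault, perf],
)
```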

Event Processor Examine and process event reports Passive processing  Sampling and logging Proactive processing  Takes automatic corrective action

Process for filtering

Event effect Permanent – external action required Temporary – will correct automatically Impending – will result in failure soon Impaired – services can be provided at reduced levels Inhibited – services stopped

Dynamic Troubleshooting Opens trouble tickets, links them, dispatches to the proper vendors, checks on-line progress of trouble tickets Problem detection –  Is something wrong? Problem determination  What is wrong and where is the problem in the network? Problem diagnosis & resolution  To isolate, fix or provide backup and fix

End-to-end testing To verify dynamically correct network operation  Conducted during normal network operation, without affecting it Can we have over-head free testing? What components should be tested? How should tasks be assigned?  Local sites  Central sites

Contd.. When to monitor and test?  Continually, periodically, on demand How to monitor and test?  Disruptive, non-disruptive What indicators to monitor and test?  Service level, efficiency, loops, circuits What instruments to use?  Hw, sw, analog, digital What reports are to be generated?  Standard, ad hoc with special evaluations What are the triggering events?  Time, single or combined events, alarms

Types of faults Unobservable  Deadlocks between processes  Instrument not capable of recording the events Partially observable  Node failure – actual reason – low level protocol Uncertainty in observation  Lack of device response Device is down, network partitioned, congestion delays, local timer faulty

Issues in isolating faults Multiple potential faults  Number of elements failing Too many related observations  One fault manifests itself as various events Interference between diagnosis and local recovery procedures  Error recovery sets in before diagnosis Absence of automated tools

Example FM Problem scenario – Sergeant fails due to buffer overflow

Contd.. Buffer in Sergeant is well provisioned  Fails due to traffic surge Pepper reports link failure to LAN3  Message sent to NM system NMS asks Pepper to check on carrier presence in link to LAN3  Carrier absence reported NMS asks Pepper to perform loopback on link3  ok

Contd.. NM resets Sergeant? Actual reason for failure not identified This could have been avoided if Sergeant had sent an event when its utilization exceeded 80% or 90%

Simple tool Points out problem existence  E.g. ICMP ping tells you whether a system is reachable Complex tool may perform all functions shown in the previous example