Published by Andy Henshaw. Modified about 1 year ago.

1
Optimal redundancy allocation for information technology disaster recovery in the network economy
Benjamin B.M. Shao
IEEE Transactions on Dependable and Secure Computing, Vol. 2, No. 3, July-September 2005
Presented by: Derek KD Jiang 江坤道

2
Agenda Introduction Redundancy for IT disaster recovery Redundancy allocation model Solution procedure Examples Conclusion

3
Introduction
Modern organizations have become increasingly reliant on IT to facilitate business operations. How to strengthen IT capability so that a company can prevent disasters, or recover from them quickly, has become a serious concern.

4
Introduction
Perform an impact analysis to:
–Identify the disasters likely to occur in the environment.
–Evaluate the degree to which IT functions are vulnerable to those disasters.
–Take the necessary measures to protect those IT functions according to their importance.
This paper incorporates redundancy into critical IT functions and aims to maximize their survivability against potential disasters.

5
Introduction
Adopting a cluster-centric approach, this paper concentrates on managing resources around independent clusters of IT functions, where each cluster is assigned its own dedicated solutions. An optimization model is proposed that takes into account the significance of IT functions, the cost of IT solutions, and the availability of resources, subject to a budget limit.

6
Agenda Introduction Redundancy for IT disaster recovery Redundancy allocation model Solution procedure Examples Conclusion

7
Redundancy for IT disaster recovery
Redundancy is the design principle of having one or more backup systems in case the main system fails. Using redundancy to prepare for disasters offers two potential advantages:
–Proactive prevention
–Reactive recovery

8
Redundancy for IT disaster recovery
The objective is to select among competing alternatives for the redundancy level and reap the best returns from a limited budget. A quantitative model can provide guidelines for allocating optimal redundancy levels to the critical IT functions that need protection.

9
Agenda Introduction Redundancy for IT disaster recovery Redundancy allocation model Solution procedure Examples Conclusion

10
Redundancy allocation model
Suppose an organization is planning redundancy measures under a limited budget. Several possible disasters have been identified that could affect IT functions and cause business discontinuity. How should redundancy be allocated to IT functions so that survivability is maximized while the cost remains within budget?

11
Redundancy allocation model

12
The redundancy allocation problem (RAP) is formulated below

13
Redundancy allocation model
Survivability S_{mid} in this context is defined as the likelihood that IT asset i withstands disaster d and keeps IT function m operational. IT function m fails against disaster d only when all of its selected solutions fail at the same time. In other words, as long as one of the selected solutions survives the disaster, the IT function remains in operation.
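The "fails only when all selected solutions fail" rule can be illustrated with a minimal sketch. It assumes that the selected solutions fail independently of one another; the function name is hypothetical, and the two survival rates are borrowed from the bridge data in the example slides.

```python
from math import prod

def function_survivability(solution_survival_rates):
    """Probability that at least one selected solution survives a disaster.

    Assumes independent failures: the IT function fails only if every
    selected solution fails at the same time.
    """
    all_fail = prod(1.0 - s for s in solution_survival_rates)
    return 1.0 - all_fail

# One bridge alone versus two redundant bridges against the same disaster.
single = function_survivability([0.21])
paired = function_survivability([0.09, 0.21])
print(single, paired)
```

Adding the second, weaker solution still raises the function's survivability, because the joint failure probability 0.91 × 0.79 is smaller than 0.79 alone.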

14
Redundancy allocation model
The first constraint ensures that at least one solution is selected and allocated to each IT function; notably, an IT function without redundancy (a single solution) is allowed. The second constraint indicates that the total cost cannot exceed the budget limit B.
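The slide equations did not survive in this transcript. A formulation consistent with the definitions above can be sketched as follows; it is a reconstruction, not a quotation of the paper. It assumes x_{mi} is a 0-1 variable that selects solution i for IT function m, C_{mi} is that solution's cost, w_m is the weight of function m, p_d is the probability of disaster scenario d, n_m is the number of solutions available to function m, and D (a symbol introduced here) is the number of scenarios.

```latex
\begin{align}
\max_{x}\quad & S = \sum_{m=1}^{M} w_m \sum_{d=1}^{D} p_d
  \left[ 1 - \prod_{i=1}^{n_m} \left(1 - S_{mid}\right)^{x_{mi}} \right] \\
\text{s.t.}\quad & \sum_{i=1}^{n_m} x_{mi} \ge 1, \qquad m = 1, \dots, M \\
& \sum_{m=1}^{M} \sum_{i=1}^{n_m} C_{mi}\, x_{mi} \le B \\
& x_{mi} \in \{0, 1\}
\end{align}
```

The product term is the probability that every selected solution of function m fails against disaster d, so the bracket is the survivability of that function under that disaster.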

15
Redundancy allocation model

16
Agenda Introduction Redundancy for IT disaster recovery Redundancy allocation model Solution procedure Examples Conclusion

17
Solution procedure
The proposed model is a 0-1 integer programming problem with a nonlinear objective function. Because of this nonlinearity, LR cannot be employed to tackle the problem. Instead, a partial enumeration procedure based on probabilistic dynamic programming is presented.

18
Solution procedure
The sum of the failure probabilities of each IT function due to any disaster.

19
Solution procedure
We define the state of the system, T, as the available budget and the stage, m, as the IT function. Let F_m(T) be the failure rate of the system composed of IT functions m, m+1, …, M. The recursive formula applies for m < M.

20
Solution procedure
For stage (IT function) m, the state (budget) T cannot exceed the total available budget B minus the minimum costs to be allocated to stages 1, …, m-1. T must also be at least equal to the cost of the least expensive solution in the current stage, to ensure at least one solution for IT function m. For T outside this range, F_m(T) is defined as 1, so that it will not be chosen.
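The range rule for T can be sketched in a few lines. The helper name and the list-of-lists cost layout are assumptions made for illustration; the cost figures and the budget are taken from the example slides.

```python
def valid_budget_range(costs_by_function, budget, stage):
    """Return (low, high) bounds on the state T for a 1-indexed stage.

    low  = cheapest solution of the current stage
    high = budget minus the cheapest solution of every earlier stage
    """
    low = min(costs_by_function[stage - 1])
    reserved = sum(min(costs) for costs in costs_by_function[:stage - 1])
    return low, budget - reserved

# Costs of the bridges (LAN1) and switches (LAN2) from the example, B = 14.
costs = [[8, 2, 4, 6], [4, 6, 5]]
stage2 = valid_budget_range(costs, 14, 2)
stage1 = valid_budget_range(costs, 14, 1)
print(stage2, stage1)
```

For stage 2 this gives the bounds 4 and 12, which agrees with the values T = 4, …, 12 used in the example.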

21
Solution procedure
F_m(T) of (4) deals with the risks of disaster occurrence and involves calculating the expected failure rate of IT function m given the remaining budget T. The initial stage is m = M.
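The recursion and its initial stage are not preserved in this transcript. One form consistent with the surrounding definitions is sketched below as a reconstruction, not a quotation. It assumes v_{mid} = 1 - S_{mid} is the failure probability of solution i, that the weights w_m and the scenario probabilities p_d each sum to 1, and it introduces f_m(x_m) as shorthand for the weighted expected failure rate of function m under selection x_m.

```latex
\begin{align}
f_m(x_m) &= w_m \sum_{d} p_d \prod_{i=1}^{n_m} v_{mid}^{\,x_{mi}},
  \qquad v_{mid} = 1 - S_{mid} \\
F_m(T) &= \min_{\substack{x_m:\ \sum_i x_{mi} \ge 1 \\ \sum_i C_{mi} x_{mi} \le T}}
  \left\{ f_m(x_m) + F_{m+1}\!\left(T - \sum_{i=1}^{n_m} C_{mi}\, x_{mi}\right) \right\},
  \qquad m < M \\
F_M(T) &= \min_{\substack{x_M:\ \sum_i x_{Mi} \ge 1 \\ \sum_i C_{Mi} x_{Mi} \le T}}
  f_M(x_M)
\end{align}
```

Under the stated assumptions, minimizing the total failure rate is equivalent to maximizing survivability, since the two add up to 1.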

22
Solution procedure
The optimal objective function value F* is obtained as F_1(B), representing the minimum overall failure rate of the whole system composed of all M IT functions with a budget of B. The original maximum overall survivability S* of the RAP is then equal to 1 - F_1(B).

23
Agenda Introduction Redundancy for IT disaster recovery Redundancy allocation model Solution procedure Examples Conclusion

24
Example
Two LANs (M = 2) have weights w_1 = 0.3 and w_2 = 0.7, respectively. A flooding disaster occurs with a likelihood of 0.05 (i.e., p_1 = 0.05, and p_2 = 0.95 for no disaster). The organization considers incorporating redundant bridges into LAN1 and redundant switches into LAN2 with a budget of B = 14.

25
Example
For LAN1
–Four types of bridges are available (n_1 = 4), with C_{11} = 8, C_{12} = 2, C_{13} = 4, and C_{14} = 6.
–The survival rates are S_{111} = 0.1, S_{121} = 0.09, S_{131} = 0.15, and S_{141} = 0.21 (i.e., v_{111} = 0.9, v_{121} = 0.91, v_{131} = 0.85, v_{141} = 0.79).
–Their availabilities when no disaster occurs are S_{112} = 0.9999, S_{122} = 0.9993, S_{132} = 0.9997, and S_{142} = 0.9995 (i.e., v_{112} = 0.0001, v_{122} = 0.0007, v_{132} = 0.0004, v_{142} = 0.0005).

26
Example
For LAN2
–Three types of switches are available (n_2 = 3), with C_{21} = 4, C_{22} = 6, and C_{23} = 5.
–The survival rates are S_{211} = 0.06, S_{221} = 0.1, and S_{231} = 0.2 (i.e., v_{211} = 0.94, v_{221} = 0.9, v_{231} = 0.8).
–Their availabilities when no disaster occurs are S_{212} = 0.9994, S_{222} = 0.9990, and S_{232} = 0.9996 (i.e., v_{212} = 0.0006, v_{222} = 0.0010, v_{232} = 0.0004).

27
Example

28
Start with stage m = 2. Since the least expensive switch for LAN2 has cost C_{21} = 4, and the least expensive bridge for LAN1 has cost C_{12} = 2, the valid range for T is 4 ≤ T ≤ 12. Equation (6) then calculates F_2(T) for T = 4, …, 12. Take F_2(6) for example: the candidate selections are (X_{21}, X_{22}, X_{23}) = (0, 0, 1), (0, 1, 0), and (1, 0, 0). The minimum F_2(6) is associated with (0, 0, 1).
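The F_2(6) step can be checked with a short sketch. It assumes that F_2 is the weighted expected failure rate of LAN2, that is, w_2 times the sum over scenarios of p_d times the joint failure probability of the selected switches; this objective form is inferred from the slide definitions rather than copied from the slide equation, and the names below are illustrative.

```python
W2 = 0.7                      # weight of LAN2
P = (0.05, 0.95)              # scenario probabilities: flooding, no disaster
COSTS = (4, 6, 5)             # switch costs C_21, C_22, C_23
# Failure probabilities v_2id per switch: (flooding, no disaster).
V = ((0.94, 0.0006), (0.90, 0.0010), (0.80, 0.0004))

def expected_failure(selection):
    """Weighted expected failure rate of LAN2 for a 0-1 selection tuple."""
    total = 0.0
    for d, p in enumerate(P):
        joint = 1.0
        for chosen, v in zip(selection, V):
            if chosen:
                joint *= v[d]  # all selected switches must fail together
        total += p * joint
    return W2 * total

def cost(selection):
    return sum(c for c, chosen in zip(COSTS, selection) if chosen)

candidates = [(1, 0, 0), (0, 1, 0), (0, 0, 1)]
feasible = [x for x in candidates if cost(x) <= 6]
best = min(feasible, key=expected_failure)
print(best, expected_failure(best))
```

With a budget of 6 no pair of switches is affordable, so only the three single-switch selections compete, and the third switch has the lowest failure rate under flooding.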

29
Example

30
Next, we proceed to find the optimal solution F_1(14) in the final stage m = 1. The minimum F_1(14) is associated with (X_{11}, X_{12}, X_{13}, X_{14}) = (0, 1, 0, 1) and is obtained using F_2(6). The maximum survivability S* against flooding then equals 1 - F*.
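The whole two-stage example can be reproduced with a small dynamic-programming sketch. Several points are assumptions: the objective is the weighted expected failure rate inferred from the slide definitions, each stage is enumerated exhaustively instead of using the paper's partial enumeration, and infinity replaces the slides' sentinel value of 1 for infeasible budgets. All names are illustrative, and the failure probabilities are the v values as listed in the example.

```python
from functools import lru_cache
from itertools import product

WEIGHTS = (0.3, 0.7)                       # w_1 (LAN1), w_2 (LAN2)
P = (0.05, 0.95)                           # flooding, no disaster
COSTS = ((8, 2, 4, 6), (4, 6, 5))          # C_mi per IT function
V = (                                      # v_mid: (flooding, no disaster)
    ((0.90, 0.0001), (0.91, 0.0007), (0.85, 0.0004), (0.79, 0.0005)),
    ((0.94, 0.0006), (0.90, 0.0010), (0.80, 0.0004)),
)
BUDGET = 14

def stage_failure(m, x):
    """Weighted expected failure rate of function m under selection x."""
    total = 0.0
    for d, p in enumerate(P):
        joint = 1.0
        for chosen, v in zip(x, V[m]):
            if chosen:
                joint *= v[d]
        total += p * joint
    return WEIGHTS[m] * total

@lru_cache(maxsize=None)
def solve(m, budget):
    """Return (minimum failure rate, selections) for functions m onward."""
    best = (float("inf"), ())              # infeasible unless improved
    for x in product((0, 1), repeat=len(COSTS[m])):
        spent = sum(c for c, chosen in zip(COSTS[m], x) if chosen)
        if not any(x) or spent > budget:
            continue                       # need >= 1 solution, within budget
        if m == len(COSTS) - 1:
            tail_value, tail_plan = 0.0, ()
        else:
            tail_value, tail_plan = solve(m + 1, budget - spent)
        value = stage_failure(m, x) + tail_value
        if value < best[0]:
            best = (value, (x,) + tail_plan)
    return best

f_star, plan = solve(0, BUDGET)
s_star = 1.0 - f_star
print(plan, f_star, s_star)
```

Under these assumptions the sketch selects (0, 1, 0, 1) for LAN1 and (0, 0, 1) for LAN2, the same selections the slides report. The resulting F* and S* are computed values under the assumed objective, not figures quoted from the slides.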

31
Agenda Introduction Redundancy for IT disaster recovery Redundancy allocation model Solution procedure Examples Conclusion

32
Contributions
–It presents one of the earliest quantitative studies on allocating redundancy for recovery planning.
–An exact solution method based on probabilistic dynamic programming is presented to help obtain the optimal redundancy allocation.
–Through sensitivity analysis, the model can further help IT managers make better decisions.

33
Conclusion
IT plays an extremely important role in modern business operations; nevertheless, it has potential vulnerabilities to disasters. The RAP redundancy allocation model proposed in this paper can fulfill the need for a structured decision analysis of recovery planning.

34
Conclusion
For future research, assets can be further categorized into hardware, software, and other types to examine the impact of each asset type on redundancy allocation decisions. Specific assumptions about dependent IT functions or shared solutions can be made to address a different set of IT disaster recovery problems.

35
Thanks for your patience
