Presentation is loading. Please wait.

Presentation is loading. Please wait.

GridNets 2006 – October 1 st Grid Resource Management by means of Ant Colony Optimization Gustavo Sousa Pavani and Helio Waldman Optical Networking Laboratory.

Similar presentations


Presentation on theme: "GridNets 2006 – October 1 st Grid Resource Management by means of Ant Colony Optimization Gustavo Sousa Pavani and Helio Waldman Optical Networking Laboratory."— Presentation transcript:

1 GridNets 2006 – October 1 st Grid Resource Management by means of Ant Colony Optimization Gustavo Sousa Pavani and Helio Waldman Optical Networking Laboratory – OptiNet Communications Department – Decom School of Electrical and Computer Engineering – FEEC State University of Campinas – UNICAMP. http://www.optinet.fee.unicamp.br

2 GridNets 2006 – October 1 st Gustavo Sousa Pavani 2 Introduction Resource management in grids is a very challenging task, since the resources to be shared are very heterogeneous: CPU time, storage space or data, and network bandwidth Need for networks that are dynamically configurable in order to support large transfers of data – Use of a GMPLS control plane for managing network resources Use of the Ant Colony Optimization meta-heuristics to tackle the problem of grid scheduling, by the adaptation of the AntNet framework, which is a ACO-based algorithm, to this problem – Use of ant-like mobile agents to act on behalf of the user to make scheduling decisions – Considered the allocation of processing power and provisioning of lightpaths

3 GridNets 2006 – October 1 st Gustavo Sousa Pavani 3 Proposed Algorithm – ACO ACO is used to refer to the class of algorithms that are inspired in the process of foraging for food by natural ants for the optimization of hard-to-solve problems It is characterized by ant-like mobile agents that cooperate and stochastically explore a network, iteratively building solutions based on their own memory and on the traces (pheromone levels) left by other agents The AntNet framework is an ACO-based algorithm used for routing of packets in telecommunication networks – The original AntNet framework uses the delay introduced by each hop as the metric for routing. This is not applicable in circuit-switching networks, where the main metric is generally the number of hops. Indeed, the minimization of the number of hops of a connection following some criteria is a very good heuristics to reduce the overall blocking probability in a network

4 GridNets 2006 – October 1 st Gustavo Sousa Pavani 4 Proposed Algorithm – Outline At regular intervals, a forward ant is launched from a random source node to another random destination node. In its trip, the forward ant will select the next hop i using a random scheme that accounts the path selection probabilities, given by the pheromone levels in each neighbor link, and a heuristics value, calculated from the congestion of each neighbor links During its trip, the forward ant gathers the label of each node where it passes by, putting it in its memory. When it arrives at the destination node, it becomes a backward ant, collects the availability information and returns to the source node using the same path followed by the forward ant, but in the opposite direction. At each intermediate node i, it updates the routing table, comparing its memory with the local parametric model If the ant does not reach its destination node in a number of pre-established hops it is dropped. This avoids lost ants circulating forever in the network In a hop-by-hop basis, the lightpath is routed evaluating the entry of the pheromone routing table that matches the destination. The neighbor with the higher level of pheromone is chosen as the next hop

5 GridNets 2006 – October 1 st Gustavo Sousa Pavani 5 Proposed Algorithm – Adaptations to AntNet Data Structures 1 At each node i, we have the following data structures: – Pheromone-routing table T i : It is a matrix containing a row for each destination of the network and a column for each neighbor node, for storing the pheromone values. The sum of each row must be equal to 1 Example of the pheromone routing table of node 4: 12 34 5 325 1 0.10.50.4 2 0.20.30.5 3 0.90.10.0 5 0.50.25 Destinations Neighbor Nodes

6 GridNets 2006 – October 1 st Gustavo Sousa Pavani 6 Proposed Algorithm – Adaptations to AntNet Data Structures 2 – Statistical parametric model M i : It is a matrix containing the triplet for each destination d of the network. These values are updated using an exponential model within the non-sliding window of w observations Example of the statistical parametric model of node 4: 12 34 5 dd dd EdEd 1 2.20.52.0 2 1.30.31.0 3 1.20.41.0 5 1.40.51.0 Destinations Neighbor Nodes

7 GridNets 2006 – October 1 st Gustavo Sousa Pavani 7 Proposed Algorithm – Adaptations to AntNet Data Structures 3 – Availability vector A i : It is a vector containing an availability metric for each destination of the network. The rationale behind this structure is to allow for the load-balancing of the server farms Example of the availability vector of node 4: 12 34 5 Availability 1 0.1 2 0.2 3 0.9 5 0.5 Destinations

8 GridNets 2006 – October 1 st Gustavo Sousa Pavani 8 Proposed Algorithm – Routing of Forward Ants Congestion level for each neighbor link: Probability of a forward ant to choose the neighbor node: –  gives the emphasis between pheromone level (long-term memory) and instantaneous congestion state (short-term memory)

9 GridNets 2006 – October 1 st Gustavo Sousa Pavani 9 Proposed Algorithm – Updating by Backward Ants 1 At each intermediate node i through source node, the backward ant updates the parametric model M i and routing table T i of the intermediate nodes k  [( i +1), d [, i.e., it treats these nodes as destinations. However, this is done only if the evaluation of sub-path traced by the ant is good enough, i.e., its number of hops is less than the superior estimate I sup. This allows for the updating of good paths found by ants that are not intended to those destinations. The update is always done for the destination node d at i

10 GridNets 2006 – October 1 st Gustavo Sousa Pavani 10 Proposed Algorithm – Updating by Backward Ants 2 First, the parametric model M i k is updated. After, an adaptive reinforcement r is calculated for updating the routing table: The obtained r is limited to 0.9 to avoid stagnation and its value is “squashed” Now, if the neighbor node m is on the path, then it receives a positive reinforcement: The other nodes receives a negative reinforcement:

11 GridNets 2006 – October 1 st Gustavo Sousa Pavani 11 Proposed Algorithm – Loop Avoidance and Management Memory of the forward ant works as tabu list for selecting the next hop Dead-end: all neighbors of the node where the forward ant is being processed are already visited – Ignore the heuristic correction given by the congestion information Use only pheromone values – Remove all nodes that belongs to the cycle from the ant’s memory

12 GridNets 2006 – October 1 st Gustavo Sousa Pavani 12 Proposed Algorithm – Path Computation A B C D PATH B D C 0.6 0.4 C D C 0.8 0.2 Calculated hop-by-hop!

13 GridNets 2006 – October 1 st Gustavo Sousa Pavani 13 NSFNet Network 14 nodes and 21 bi-directional links Node 1 as user, other nodes as resources 12 1 13 2 3 4 5 6 9 7 8 10 11 14

14 GridNets 2006 – October 1 st Gustavo Sousa Pavani 14 Simulation Homogeneous Poissonian traffic with uniform spatial profile – Node 1 as the only source of lightpath requests Fixed duration of the lightpath: 14.4 sec (equivalent to a 1.8 GB file transfer at a 10 Gbps rate) Fixed processing time of each job: 12 hours – Homogeneous processors – Same number of processors per node First-fit wavelength assignment algorithm Blocking can be caused by lack of resources on the: – Optical network – Server farm

15 GridNets 2006 – October 1 st Gustavo Sousa Pavani 15 Results – Blocking Probability

16 GridNets 2006 – October 1 st Gustavo Sousa Pavani 16 Results – Total Grid Workload W = 8 Load = 4 Erlangs The optical network can limit the total grid workload (W = 2 and 4) for higher loads

17 GridNets 2006 – October 1 st Gustavo Sousa Pavani 17 Results – The Effect of the Ant Launching Rate The increase on the ant launching rate can enhance the system performance in terms of blocking probability

18 GridNets 2006 – October 1 st Gustavo Sousa Pavani 18 Results – The Average Overhead on Control Channels due to the Ants

19 GridNets 2006 – October 1 st Gustavo Sousa Pavani 19 Conclusions Alternative for grid scheduling when the optical network resources have to be dynamically controlled. Integrated management of processing and network resources. Load balancing of the: – Processing workload – Optical network Use of a GMPLS control plane for lightpath provisioning – Advantages of the proposed algorithm over link-state protocols Some scenarios proposed where the performance of the grid was limited by either networking resources or processing power

20 GridNets 2006 – October 1 st Gustavo Sousa Pavani 20 Any questions? pavani@decom.fee.unicamp.br http://www.optinet.fee.unicamp.br/~pavani


Download ppt "GridNets 2006 – October 1 st Grid Resource Management by means of Ant Colony Optimization Gustavo Sousa Pavani and Helio Waldman Optical Networking Laboratory."

Similar presentations


Ads by Google