The Organic Grid: Self- Organizing Computation on a Peer-to-Peer Network Presented by : Xuan Lin.

Slides:



Advertisements
Similar presentations
Universidade do Minho A Framework for Multi-Class Based Multicast Routing TNC 2002 Maria João Nicolau, António Costa, Alexandre Santos {joao, costa,
Advertisements

Distributed Systems Major Design Issues Presented by: Christopher Hector CS8320 – Advanced Operating Systems Spring 2007 – Section 2.6 Presentation Dr.
Resource Management §A resource can be a logical, such as a shared file, or physical, such as a CPU (a node of the distributed system). One of the functions.
Decentralized Reactive Clustering in Sensor Networks Yingyue Xu April 26, 2015.
Silberschatz and Galvin  Operating System Concepts Module 16: Distributed-System Structures Network-Operating Systems Distributed-Operating.
Walter Binder University of Lugano, Switzerland Niranjan Suri IHMC, Florida, USA Green Computing: Energy Consumption Optimized Service Hosting.
Playback delay in p2p streaming systems with random packet forwarding Viktoria Fodor and Ilias Chatzidrossos Laboratory for Communication Networks School.
Guang Tan, Stephen A. Jarvis, and Anne-Marie Kermarrec IEEE Transactions on Mobile Computing, VOL. 8, NO.6, JUNE Yun-Jung Lu.
Gossip Algorithms and Implementing a Cluster/Grid Information service MsSys Course Amar Lior and Barak Amnon.
Madhavi W. SubbaraoWCTG - NIST Dynamic Power-Conscious Routing for Mobile Ad-Hoc Networks Madhavi W. Subbarao Wireless Communications Technology Group.
Termination Detection. Goal Study the development of a protocol for termination detection with the help of invariants.
Network Operating Systems Users are aware of multiplicity of machines. Access to resources of various machines is done explicitly by: –Logging into the.
Applying Genetic Algorithms to Decision Making in Autonomic Computing Systems Authors: Andres J. Ramirez, David B. Knoester, Betty H.C. Cheng, Philip K.
1 Complexity of Network Synchronization Raeda Naamnieh.
Scribe: A Large-Scale and Decentralized Application-Level Multicast Infrastructure Miguel Castro, Peter Druschel, Anne-Marie Kermarrec, and Antony L. T.
SplitStream: High-Bandwidth Multicast in Cooperative Environments Marco Barreno Peer-to-peer systems 9/22/2003.
A New Approach for the Construction of ALM Trees using Layered Coding Yohei Okada, Masato Oguro, Jiro Katto Sakae Okubo International Conference on Autonomic.
Spotlighting Decentralized P2P File Sharing Archie Kuo and Ethan Le Department of Computer Science San Jose State University.
Dept. of Computer Science & Engineering, CUHK1 Trust- and Clustering-Based Authentication Services in Mobile Ad Hoc Networks Edith Ngai and Michael R.
Fault-tolerant Adaptive Divisible Load Scheduling Xuan Lin, Sumanth J. V. Acknowledge: a few slides of DLT are from Thomas Robertazzi ’ s presentation.
1 A Distributed Delay-Constrained Dynamic Multicast Routing Algorithm Quan Sun and Horst Langendorfer Telecommunication Systems Journal, vol.11, p.47~58,
A Mobile Agent Infrastructure for QoS Negotiation of Adaptive Distributed Applications Roberto Speicys Cardoso & Fabio Kon University of São Paulo – USP.
Online Data Gathering for Maximizing Network Lifetime in Sensor Networks IEEE transactions on Mobile Computing Weifa Liang, YuZhen Liu.
Strategies for Implementing Dynamic Load Sharing.
AgentOS: The Agent-based Distributed Operating System for Mobile Networks Salimol Thomas Department of Computer Science Illinois Institute of Technology,
16: Distributed Systems1 DISTRIBUTED SYSTEM STRUCTURES NETWORK OPERATING SYSTEMS The users are aware of the physical structure of the network. Each site.
P2P Course, Structured systems 1 Introduction (26/10/05)
Algorithms for Self-Organization and Adaptive Service Placement in Dynamic Distributed Systems Artur Andrzejak, Sven Graupner,Vadim Kotov, Holger Trinks.
The Organic Grid : Self Organizing Computation on a Peer to Peer Network Vikram Negi.
A User Experience-based Cloud Service Redeployment Mechanism KANG Yu.
FiWi Integrated Fiber-Wireless Access Networks
Information Technology Lecture No 3 By: Khurram Shahid.
Publisher Mobility in Distributed Publish/Subscribe Systems Vinod Muthusamy, Milenko Petrovic, Dapeng Gao, Hans-Arno Jacobsen University of Toronto June.
Network Aware Resource Allocation in Distributed Clouds.
1 A Mutual Exclusion Algorithm for Ad Hoc Mobile networks Presentation by Sanjeev Verma For COEN th Nov, 2003 J. E. Walter, J. L. Welch and N. Vaidya.
1 Distributed Energy-Efficient Scheduling for Data-Intensive Applications with Deadline Constraints on Data Grids Cong Liu and Xiao Qin Auburn University.
Chapter 11 Heap. Overview ● The heap is a special type of binary tree. ● It may be used either as a priority queue or as a tool for sorting.
Locating Mobile Agents in Distributed Computing Environment.
Load-Balancing Routing in Multichannel Hybrid Wireless Networks With Single Network Interface So, J.; Vaidya, N. H.; Vehicular Technology, IEEE Transactions.
The Alternative Larry Moore. 5 Nodes and Variant Input File Sizes Hadoop Alternative.
2007/03/26OPLAB, NTUIM1 A Proactive Tree Recovery Mechanism for Resilient Overlay Network Networking, IEEE/ACM Transactions on Volume 15, Issue 1, Feb.
1 An Adaptive File Distribution Algorithm for Wide Area Network Takashi Hoshino, Kenjiro Taura, Takashi Chikayama University of Tokyo.
 Distributed file systems having transaction facility need to support distributed transaction service.  A distributed transaction service is an extension.
Complex Contagions Models in Opportunistic Mobile Social Networks Yunsheng Wang Dept. of Computer Science, Kettering University Jie Wu Dept. of Computer.
A Membership Management Protocol for Mobile P2P Networks Mohamed Karim SBAI, Emna SALHI, Chadi BARAKAT.
A Data Stream Publish/Subscribe Architecture with Self-adapting Queries Alasdair J G Gray and Werner Nutt School of Mathematical and Computer Sciences,
SocialVoD: a Social Feature-based P2P System Wei Chang, and Jie Wu Presenter: En Wang Temple University, PA, USA IEEE ICPP, September, Beijing, China1.
On Reducing Mesh Delay for Peer- to-Peer Live Streaming Dongni Ren, Y.-T. Hillman Li, S.-H. Gary Chan Department of Computer Science and Engineering The.
Design Issues of Prefetching Strategies for Heterogeneous Software DSM Author :Ssu-Hsuan Lu, Chien-Lung Chou, Kuang-Jui Wang, Hsiao-Hsi Wang, and Kuan-Ching.
Energy-Efficient Wake-Up Scheduling for Data Collection and Aggregation Yanwei Wu, Member, IEEE, Xiang-Yang Li, Senior Member, IEEE, YunHao Liu, Senior.
AVL Trees and Heaps. AVL Trees So far balancing the tree was done globally Basically every node was involved in the balance operation Tree balancing can.
Efficient Resource Allocation for Wireless Multicast De-Nian Yang, Member, IEEE Ming-Syan Chen, Fellow, IEEE IEEE Transactions on Mobile Computing, April.
An Adaptive Zone-based Storage Architecture for Wireless Sensor Networks Thang Nam Le, Dong Xuan and *Wei Yu Department of Computer Science and Engineering,
Data Consolidation: A Task Scheduling and Data Migration Technique for Grid Networks Author: P. Kokkinos, K. Christodoulopoulos, A. Kretsis, and E. Varvarigos.
Ad Hoc On-Demand Distance Vector Routing (AODV) ietf
Antidio Viguria Ann Krueger A Nonblocking Quorum Consensus Protocol for Replicated Data Divyakant Agrawal and Arthur J. Bernstein Paper Presentation: Dependable.
By Nitin Bahadur Gokul Nadathur Department of Computer Sciences University of Wisconsin-Madison Spring 2000.
A Stable Broadcast Algorithm Kei Takahashi Hideo Saito Takeshi Shibata Kenjiro Taura (The University of Tokyo, Japan) 1 CCGrid Lyon, France.
1 Roie Melamed, Technion AT&T Labs Araneola: A Scalable Reliable Multicast System for Dynamic Wide Area Environments Roie Melamed, Idit Keidar Technion.
On Detecting Termination in Cognitive Radio Networks Shantanu Sharma 1 and Awadhesh Kumar Singh 2 1 Ben-Gurion University of the Negev, Israel 2 National.
Lecture 4 CPU scheduling. Basic Concepts Single Process  one process at a time Maximum CPU utilization obtained with multiprogramming CPU idle :waiting.
CPU scheduling.  Single Process  one process at a time  Maximum CPU utilization obtained with multiprogramming  CPU idle :waiting time is wasted 2.
Powerpoint Templates Data Communication Muhammad Waseem Iqbal Lecture # 07 Spring-2016.
The Biologically Inspired Distributed File System: An Emergent Thinker Instantiation Presented by Dr. Ying Lu.
Mingze Zhang, Mun Choon Chan and A. L. Ananda School of Computing
Authors: Jiang Xie, Ian F. Akyildiz
Kevin Lee & Adam Piechowicz 10/10/2009
Database System Architectures
Presentation transcript:

The Organic Grid: Self- Organizing Computation on a Peer-to-Peer Network Presented by : Xuan Lin

Outline Introduction Motivation Organic Scheduling Scheme Experiment Evaluation Conclusion

Outline Introduction Motivation Organic Scheduling Scheme Experiment Evaluation Conclusion

Introduction Scientific Computations require large scale distributed computing. Traditional Grid vs. Desktop Grid Centralized vs. Decentralized Mobile agent. (Weak mobility, Strong mobility, Forced Mobility)

Outline Introduction Motivation Organic Scheduling Scheme Experiment Evaluation Conclusion

Motivation Many previous schemes assume reliable network. Centralized schemes suffer from poor scalability. Traditional scheduling schemes assume sufficient system information. Inspired by Local Activation, Long-range Inhibition (LALI)

Outline Introduction Motivation Organic Scheduling Scheme Experiment Evaluation Conclusion

Assumptions Independent-task application, data initially resides at one location. Each node initially has a “friend lists”.

A. General Approach Tree-structured overlay network is selected as the desirable pattern of execution. Empirically determined the simplest behavior that would organize the communication and task distribution among mobile agents. Augmented the basic behavior by introducing other desirable properties.

B. Basic Agent Behavior A computational task is encapsulated in an agent. A user starts the computation agent on his/her machine. (root of the tree) The agent starts one thread for computation. At the same time, the agent is prepared to receive requests.

B. Basic Agent Behavior (con’t) -when get a request The agent dispatches a clone when get requests. (The requester will be a child). The clone will ask for its parent for subtasks.

B. Basic Agent Behavior (con’t) -requester A thread begins to compute. Other threads are created-when required- to communicate with parents or other machines. If a requests is received, this ‘child’ sends its own clone to the requester. It will become the parent of the requester. The requester will be a child of this node. …… Thus, the computation spreads.

B. Basic Agent Behavior (con’t) An agent requests its parent for more subtasks if it completes its own subtasks. Every time a node obtain r results, it sends them to its parent.

B. Basic Agent Behavior (con’t)

C. Maintenance of Child-lists Up to c active children and up to p potential children. (balance of deep and width of the tree) Active nodes are ranked by their performance (the rate the node sends result). Potential children are the ones which the current node has not yet been able to evaluate. A potential child is added to the active child-list once it has sent enough results to the current node.

C. Maintenance of Child-lists (con’t) When the node has more than c active children, the slowest node (sc) will be kicked out. The sc is then given a list of other nodes, which it can contact to try and get back to the tree. The sc will also be put into a list which records o former children. (Avoid thrashing )

C. Maintenance of Child-lists (con’t)

D. Restructuring of the Overlay Network Philosophy: Having best nodes close to the top enhances the extraction of subtasks from the root and minimizes the communication delay. The overlay network is constantly being restructured so that the nodes with the highest throughput migrate toward the root.

D. Restructuring of the Overlay Network (How to achieve that?) A node periodically informs its parent about its best-performing child.

D. Restructuring of the Overlay Network (con’t) A sc is not simply discarded. The parent sends a list of its children in descending order of performance. The sc attempts to contact these nodes in turn.

E. Size of Result Burst R result-burst intervals r results (R+1)* r If r and R are too large, it will take too much time for the network to update.

F. Fault Tolerance What can we do when nodes lost connection? Every node keeps track of unfinished subtasks that were sent to children. Each node keeps a list of a ancestors.

F. Fault Tolerance (con’t)

G. Cycles Failure could cause cycles. (How to find the cycle?) Every node checks its ancestor list on receiving it from its parents to see if itself is in the ancestor. (How to break the cycle?) Try to obtaining the address of some other agent on its data distribution or communication overlays.

G. Cycles (starvation) May cause starvation. If the agent is starved of work for more than a specified time, it self-destructs.

H. Termination Root sends out termination messages. The messages will spread down to leaves. Two scenarios: 1. If a node does not get such message, the situation will be the same as F. 2. n2 does not get the termination messages but it is in n1’s friend-list. n1 terminate when it get informed. n2 will clone itself to n1 when it is informed by n1 ?????

I. Self-adjust of Task List Size In an ITA-type application, the utilization of a high-performance machine may be poor because it is only requesting a fixed number of subtasks at a time. So, agents request more or less according to its performance. (compare to last run) i(t), d(t)

J. Prefetching Motivation: A potential cause of slowdown in the basic scheduling scheme described earlier is the delay at each node due to its waiting for new subtasks. Using the self-adjustment function i(t) to prefetch. However, excessively prefetching will degrade the performance since prefetch will increase the amount of data that needs to be transferred at a time.

Outline Introduction Motivation Scheduling Scheme Experiment Evaluation Conclusion

Metric Total Computation Time Ramp-up Time The time required for subtasks to reach every single node. Topology Fast nodes should migrate to the root as close as possible.

Experiment Configuration Application: NCBI’s nucleotide-nucleotide BLAST, the gene sequence similarity search tool. ( Match a 256KB sequence against 320 data chunks) A cluster of eighteen heterogeneous machines Introduced Delays in the application code. The machines ran the Aglets weak mobility agent environment on top of either Linux or Solaris.

Initial Topology

Initial Parameter

A. Comparison with Knowledge- Based Scheme

A. Comparison with Knowledge- Based Scheme (con’t)

B. Effects of Child Propagation

B. Effects of Child Propagation (con’t) 32% improvement in the running time

C. Result-Burst Size There is a qualitative improvement in the child- lists as the result-burst size increases. However, with very large result-bursts, it takes longer for the tree overlay to form and adapt, thus slowing down the experiment.

C. Result-Burst Size (con’t)

D. Effects of prefetching Ramp-up Time is affected by prefetching and the minimum number of subtasks that each node requests.

D. Effects of prefetching (con’t)

Prefecthing degrades the throughput when the No. of subtasks increases.

D. Effects of prefetching (con’t)

E. Self-Adjustment

F. Number of Children

Two experiments: good initial configuration and star topology The total time are approximately the same. Children have to wait for a longer time for their requests to be satisfied.

Outline Introduction Motivation Scheduling Scheme Experiment Evaluation Conclusion

An autonomic scheduling algorithm in which multithreaded agents with strong mobility form a tree-structured overlay network. The approach can be adapted to many applications. Future work includes more experiments, and good design of initial friend-list.