Example: Sorting on Distributed Computing Environment Apr 20, 2009 1.

Slides:



Advertisements
Similar presentations
How does a network identify computers and transmissions?
Advertisements

TCP/IP MODEL Maninder Kaur
CoMPI: Enhancing MPI based applications performance and scalability using run-time compression. Rosa Filgueira, David E.Singh, Alejandro Calderón and Jesús.
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Emery Berger University of Massachusetts Amherst Operating Systems CMPSCI 377 Lecture.
UNIT-IV Computer Network Network Layer. Network Layer Prepared by - ROHIT KOSHTA In the seven-layer OSI model of computer networking, the network layer.
Introduction Future wireless systems will be characterized by their heterogeneity - availability of multiple access systems in the same physical space.
CS 582 / CMPE 481 Distributed Systems Communications.
Department of Computer Engineering University of California at Santa Cruz Networking Systems (1) Hai Tao.
Protocols and the TCP/IP Suite Chapter 4 (Stallings Book)
The Design Philosophy of the DARPA Internet Protocols D. D. Clark.
Protocols and the TCP/IP Suite
Reliable Networking Systems The goals: Implement a reliable network application of a file sharing network. Implement a reliable network application of.
Computer Network Architecture and Programming
Spring EE4272 Switch vs. Router Switch:  Def. 1: A network node that forwards packets from inputs to outputs based on header information in each.
Networks 1 CS502 Spring 2006 Network Input & Output CS-502 Operating Systems Spring 2006.
EECE 411: Design of Distributed Software Applications What is a Distributed System? You know when you have one … … when the failure of a computer you’ve.
Review on Networking Technologies Linda Wu (CMPT )
CCNA 1 v3.1 Module 2 Review. 2 Which types of networks pass a token sequentially to each host? Token Ring FDDI.
CS-3013 & CS-502, Summer 2006 Network Input & Output1 CS-3013 & CS-502, Summer 2006.
Internetworking Fundamentals (Lecture #2) Andres Rengifo Copyright 2008.
COE 342: Data & Computer Communications (T042) Dr. Marwan Abu-Amara Chapter 2: Protocols and Architecture.
TCP/IP Reference Model Host To Network Layer Transport Layer Application Layer Internet Layer.
 The Open Systems Interconnection model (OSI model) is a product of the Open Systems Interconnection effort at the International Organization for Standardization.
Fundamentals of Computer Networks ECE 478/578 Lecture #2 Instructor: Loukas Lazos Dept of Electrical and Computer Engineering University of Arizona.
Network Management Concepts and Practice Author: J. Richard Burke Presentation by Shu-Ping Lin.
1: Introduction1 Part I: Introduction Goal: r get context, overview, “feel” of networking r more depth, detail later in course r approach: m descriptive.
Protocols and the TCP/IP Suite Chapter 4. Multilayer communication. A series of layers, each built upon the one below it. The purpose of each layer is.
Data Communications and Networks
Review: – computer networks – topology: pair-wise connection, point-to-point networks and broadcast networks – switching techniques packet switching and.
Presentation on Osi & TCP/IP MODEL
What is a Protocol A set of definitions and rules defining the method by which data is transferred between two or more entities or systems. The key elements.
Protocol Architectures. Simple Protocol Architecture Not an actual architecture, but a model for how they work Similar to “pseudocode,” used for teaching.
Network Technologies & Principles 1 Communication Subsystem. Types of Network. Principles of Network. Distributed Protocols.
Protocols and the TCP/IP Suite
Lec4: TCP/IP, Network management model, Agent architectures
Mukesh N. Tekwani Elphinstone College Mumbai
Department of Electronic Engineering City University of Hong Kong EE3900 Computer Networks Introduction Slide 1 A Communications Model Source: generates.
CS 6401 Internetworking Outline Internet Architecture Best Effort Service Model.
1 Next Few Classes Networking basics Protection & Security.
Distributed Systems Principles and Paradigms Chapter 02 Communication 00 – 1.
Transport Layer OSI Model. The transport layer is responsible for the segmentation and the delivery of a message from one process to another.
FALL 2005CSI 4118 – UNIVERSITY OF OTTAWA1 Part 2.5 Internetworking Chapter 25 (Transport Protocols, UDP and TCP, Protocol Port Numbers)
 What is a network and how does it function with computer systems? It is a collection of computers and devices that communicate with one another over.
Spring 2006Computer Networks1 Chapter 2 Network Models.
CSP Implementing a network 1 Implementing a network Lecturer: Smilen Dimitrov Cross-sensorial processing – MED7.
Loosely Coupled Parallelism: Clusters. Context We have studied older archictures for loosely coupled parallelism, such as mesh’s, hypercubes etc, which.
1 Networking Chapter Distributed Capabilities Communications architectures –Software that supports a group of networked computers Network operating.
OSI Model Data Communications. 7 layer “research” model ApplicationPresentationSessionTransportNetworkLinkPhysical.
Network Essentials -III Prof. Sujata Rao Lesson 12.
Spring 2006Computer Networks1 Chapter 2 Network Models.
William Stallings Data and Computer Communications
Networking Fundamentals Network Protocols. Protocol Rule for how networks communicate Each OSI layer handled by one or more protocols Protocol Suites.
Prepared by: Azara Prakash L.. Contents:-  Data Transmission  Introduction  Socket Description  Data Flow Diagram  Module Design Specification.
CHAPTER 4 PROTOCOLS AND THE TCP/IP SUITE Acknowledgement: The Slides Were Provided By Cory Beard, William Stallings For Their Textbook “Wireless Communication.
1 Chapter 4. Protocols and the TCP/IP Suite Wen-Shyang Hwang KUAS EE.
Reading TCP/IP Protocol. Training target: Read the following reading materials and use the reading skills mentioned in the passages above. You may also.
Sy Le CS 147.  Process of transferring information between computers virtually anywhere in the world.
J. Liebeher (modified by M. Veeraraghavan) 1 Introduction Complexity of networking: An example Layered communications The TCP/IP protocol suite.
Client-server communication Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
For more course tutorials visit NTC 406 Entire Course NTC 406 Week 1 Individual Assignment Network Requirements Analysis Paper NTC 406.
Protocols and the TCP/IP Suite
Distributed Systems.
Network Architecture Layered system with alternative abstractions available at a given layer.
Client-Server Interaction
Chapter 16: Distributed System Structures
Protocols and the TCP/IP Suite
Telecommunications and Networking
ECEN “Internet Protocols and Modeling”
CPEG514 Advanced Computer Networkst
Protocols and the TCP/IP Suite
Presentation transcript:

Example: Sorting on Distributed Computing Environment Apr 20,

About this presentation Example for starting preparation of the "Enshu" part of this class Give a talk about the problem attached to you according to the theme of the day. Does not show complete presentation. Just shows the points to be studied, explained and solved. 2

Sorting Large Number of Data Data size > Memory size of single computer Ex. 1~100trillion integer numbers Distributed Parallel Sort: Distribute the data into multiple computers on a network and sort. ⇒ Use multiple computational power ⇒ Requires communication among computers 3

Computational Infrastructures Case 1: PCs in a computer room Use all of the PCs on holidays or in midnights ~100 PCs (200~400GB of memory in total) Case 2: Supercomputers in Japan Enable "Ultra Large Scale Computation" by using supercomputers all over Japan 10~20 supercomputers Speed: 10TFLOPS ~ 100TFLOPS / each Memory: 10TB ~ 100TB / each 4

Network Infrastructures Case 1: Ethernet Switch Bandwidth: 100Mbps ~ 1Gbps Latency: 0.05~0.1msec Case 2: SINET3 (Academic Network in Japan) Backbone Bandwidth: 10~40Gbps Bandwidth per computer: ~10Gbps Latency: 10~100msec depends on the length of physical networks 5

Bandwidth? Latency? Bandwidth: Available speed of data transfer (bit/sec) on the network Latency: Minimum time required for each data transfer Estimation of the cost for a data transfer: T = L + S / B L: Latency, B: Bandwidth, S: Data size (bit) 6

System Infrastructure Case 1: Network environment can be "Reliable", since no other user is using the system. Implementation of the program will be easier by installing MPI(Message Passing Interface). Case 2: Network environment may be "Unreliable" since many users share the network routes. Usage of MPI is difficult, since the environment is "Heterogeneous" Different architectures and OSs 7

Implementation on Internet Everything can be built on "Application Layer" Choose a protocol for internet: TCP or UDP Case 1: UDP (or MPI over UDP) Case 2: TCP Choose a parallel algorithm of sorting Parallel Algorithm: algorithm for solving a problem by dividing it into multiple tasks and running them concurrently 8

"Layers" of networks OSI model: divides facilities of network devices into 7 layers Application Layer Presentation Layer Session Layer Transport Layer Network Layer Data Link Layer Physical Layer 9

TCP or UDP TCP (Transmission Control Protocol) Guarantees the completion of data transfer. Slow but reliable. UDP(User Datagram Protocol) No guarantee about data transfer. Fast but unreliable. Sorting requires every data to be correctly transferred. ⇒ TCP is preferred. On reliable networks such as Case 1, UDP can be used. MPI is an interface over UDP (or TCP). Guarantees data transfer even over UDP. 10

Detailed Implementation of Softing Program Implementation of parallel algorithm: Cost of computation? Cost of communication? Requirements of Memory? Policies for distributing computation and data affects the performance. 11

Characteristics of each case Case 1: Low latency and narrow bandwidth Total amount of computational power and memory is small No need for load-balancing Case 2: High latency and wide bandwidth (Possibly) Large amount of computational power and memory Requires load-balancing according to the computational power of each machine. 12

Implementation for Case 1. Distribute same amount of computation and data on each computer Consider the number of PCs to be used: Communication cost increases according to the number of PCs If the target data is large enough, it will achieve sufficient speedup by parallelization even with 100PCs. 13

Implementation for Case 2 The amount of computation and data depends on the relative performance of each computer. Accurate analysis of the performance of each machine and network is important. It must be difficult to obtain sufficient effect of parallelization with large number of nodes. Performance degradation by load unbalance and communication cost. 14

To complete the presentation of your solution Detailed information about the infrastructure. Detailed information of implementation: Parallel Algorithm? Policy for distributing data and computation? Estimate computation and communication time, and find the optimal distribution. How to distribute the target data? and how to gather the results? Management of multiple jobs. Standardization of the solution Relationship with the future networks 15

Exercise Find existing parallel algorithms For example: Sorting Note: Algorithms for "distributed memory parallel" computing environment = Each computer has its own memory => Requires explicit communication Consider how to implement them on computers connected with Internet. 16