Message Passing Models

Message Passing Models
Miodrag Bolic

Overview
- Hardware model
- Programming model
- Message Passing Interface

Generic Model Of A Message-Passing Multicomputer [5]
[Figure: a set of nodes connected by a direct message-passing interconnection network. After Gyula Fehér.]

Generic Node Architecture [5]
[Figure: each node couples a node processor (processor + local memory + ...) with a router/communication switch through internal channels; external channels connect the node to the network. After Gyula Fehér.]
- Fat node: powerful processor, large memory, many chips; costly per node, moderate parallelism
- Thin node: small processor, small memory, one to a few chips; cheap per node, high parallelism

Generic Organization Model [5]
[Figure: processor + memory (P+M) nodes attach to the switching network through communication processors (CP) and switches (S); the organization can be decentralized or centralized. After Gyula Fehér.]

Message Passing Properties [1]
- Complete computer as building block, including I/O
- Programming model: directly access only private address space (local memory)
- Communication via explicit messages (send/receive)
- Communication integrated at I/O level, not memory system, so no special hardware
- Resembles a network of workstations (which can actually be used as a multiprocessor system)

Message Passing Program [1]
Problem: Sum all of the elements of an array of size n.

INITIALIZE;                              //assign proc_num and num_procs
if (proc_num == 0)                       //processor with a proc_num of 0 is the master,
{                                        //which sends out messages and sums the result
    read_array(array_to_sum, size);      //read the array and array size from file
    size_to_sum = size/num_procs;
    for (current_proc = 1; current_proc < num_procs; current_proc++)
    {
        lower_ind = size_to_sum * current_proc;
        upper_ind = size_to_sum * (current_proc + 1);
        SEND(current_proc, size_to_sum);
        SEND(current_proc, array_to_sum[lower_ind:upper_ind]);
    }
    //master node sums its part of the array
    sum = 0;
    for (k = 0; k < size_to_sum; k++)
        sum += array_to_sum[k];
    //collect the partial sums from the slaves
    global_sum = sum;
    for (current_proc = 1; current_proc < num_procs; current_proc++)
    {
        RECEIVE(current_proc, local_sum);
        global_sum += local_sum;
    }
    printf("sum is %d", global_sum);
}
else                                     //any processor other than proc_num = 0 is a slave
{
    RECEIVE(0, size_to_sum);
    RECEIVE(0, array_to_sum[0 : size_to_sum]);
    //slave sums its part of the array
    sum = 0;
    for (k = 0; k < size_to_sum; k++)
        sum += array_to_sum[k];
    SEND(0, sum);
}
END;

Message Passing Program (cont.) [1]
Multiprocessor software functions provided:
- INITIALIZE – assigns a number (proc_num) to each processor in the system and assigns the total number of processors (num_procs).
- SEND(receiving_processor_number, data) – sends data to another processor.
- RECEIVE(sending_processor_number, data) – receives data from another processor.
- BARRIER(n_procs) – when a BARRIER is encountered, a processor waits at that BARRIER until n_procs processors reach the BARRIER; then execution can proceed.
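These primitives map naturally onto MPI. The following is a minimal sketch of that mapping, not the original program: the array size N, its contents, and the assumption that N divides evenly by the number of processes are illustrative simplifications (a real program would read the array from a file, as read_array does above).

#include <stdio.h>
#include <stdlib.h>
#include "mpi.h"

#define N 1024   /* assumed array size, chosen so it divides evenly among processes */

int main(int argc, char **argv)
{
    int proc_num, num_procs;
    MPI_Init(&argc, &argv);                         /* INITIALIZE */
    MPI_Comm_rank(MPI_COMM_WORLD, &proc_num);
    MPI_Comm_size(MPI_COMM_WORLD, &num_procs);

    int size_to_sum = N / num_procs;
    long sum = 0, global_sum = 0;

    if (proc_num == 0) {                            /* master */
        int *array_to_sum = malloc(N * sizeof(int));
        for (int i = 0; i < N; i++) array_to_sum[i] = 1;    /* stand-in for read_array() */

        /* send one chunk to each slave; the master keeps the first chunk */
        for (int p = 1; p < num_procs; p++)
            MPI_Send(&array_to_sum[p * size_to_sum], size_to_sum, MPI_INT,
                     p, 0, MPI_COMM_WORLD);         /* SEND */

        for (int k = 0; k < size_to_sum; k++) sum += array_to_sum[k];

        global_sum = sum;
        for (int p = 1; p < num_procs; p++) {
            long local_sum;
            MPI_Recv(&local_sum, 1, MPI_LONG, p, 0,
                     MPI_COMM_WORLD, MPI_STATUS_IGNORE);    /* RECEIVE */
            global_sum += local_sum;
        }
        printf("sum is %ld\n", global_sum);
        free(array_to_sum);
    } else {                                        /* slave */
        int *chunk = malloc(size_to_sum * sizeof(int));
        MPI_Recv(chunk, size_to_sum, MPI_INT, 0, 0,
                 MPI_COMM_WORLD, MPI_STATUS_IGNORE);        /* RECEIVE */
        for (int k = 0; k < size_to_sum; k++) sum += chunk[k];
        MPI_Send(&sum, 1, MPI_LONG, 0, 0, MPI_COMM_WORLD);  /* SEND */
        free(chunk);
    }

    MPI_Finalize();
    return 0;
}

Unlike the pseudocode, this sketch fixes the chunk size at compile time instead of sending it as a separate message; the same pattern can be written more compactly with the collective routines discussed later (MPI_Scatter and MPI_Reduce).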

Advantages and Disadvantages [1]
Advantages:
- Easier to build than scalable shared memory machines
- Easy to scale (but topology is important)
- Programming model more removed from basic hardware operations
- Coherency and synchronization are the responsibility of the user, so the system designer need not worry about them
Disadvantages:
- Large overhead: copying of buffers requires large data transfers (this will kill the benefits of multiprocessing if not kept to a minimum)
- Programming is more difficult
- Blocking nature of SEND/RECEIVE can cause increased latency and deadlock issues

Message-Passing Interface – MPI [3]
- Standardization - MPI is the only message passing library which can be considered a standard. It is supported on virtually all HPC platforms. Practically, it has replaced all previous message passing libraries.
- Portability - There is no need to modify your source code when you port your application to a different platform that supports the MPI standard.
- Performance Opportunities - Vendor implementations should be able to exploit native hardware features to optimize performance.
- Functionality - Over 115 routines are defined.
- Availability - A variety of implementations are available, both vendor and public domain.

MPI basics [3]
- Start processes
- Send messages
- Receive messages
- Synchronize
With these four capabilities, you can construct any program, as the sketch below illustrates.
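A minimal, self-contained sketch that exercises all four capabilities; the tag value 0 and the token value 42 are arbitrary choices for illustration.

#include <stdio.h>
#include "mpi.h"

int main(int argc, char **argv)
{
    int rank, size, token = 0;

    MPI_Init(&argc, &argv);                      /* start processes */
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    if (size > 1) {
        if (rank == 0) {
            token = 42;
            MPI_Send(&token, 1, MPI_INT, 1, 0, MPI_COMM_WORLD);   /* send */
        } else if (rank == 1) {
            MPI_Recv(&token, 1, MPI_INT, 0, 0, MPI_COMM_WORLD,
                     MPI_STATUS_IGNORE);                          /* receive */
            printf("rank 1 received %d from rank 0\n", token);
        }
    }

    MPI_Barrier(MPI_COMM_WORLD);                 /* synchronize */
    MPI_Finalize();
    return 0;
}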

Communicators [3]
MPI uses objects called communicators and groups to define which collection of processes may communicate with each other.
- Provide a named set of processes for communication: the system allocates unique tags to processes, and all processes can be numbered from 0 to n-1
- Allow construction of libraries: an application creates its own communicators; MPI_COMM_WORLD is the predefined communicator containing all processes
- Functions (split, duplicate, ...) create new communicators from other communicators (see the sketch below)
- Functions (size, my_rank, ...) find out about all processes within a communicator
- Communication within a communicator can be blocking or non-blocking (covered on the following slides)
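A minimal sketch of creating a communicator from MPI_COMM_WORLD and querying it, using only standard calls (MPI_Comm_split, MPI_Comm_rank, MPI_Comm_size); splitting the world into even and odd ranks is an arbitrary choice for illustration.

#include <stdio.h>
#include "mpi.h"

int main(int argc, char **argv)
{
    int world_rank, world_size;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &world_rank);
    MPI_Comm_size(MPI_COMM_WORLD, &world_size);

    /* split MPI_COMM_WORLD into two communicators: even ranks and odd ranks */
    int color = world_rank % 2;
    MPI_Comm half_comm;
    MPI_Comm_split(MPI_COMM_WORLD, color, world_rank, &half_comm);

    /* each process is renumbered 0..n-1 inside its new communicator */
    int half_rank, half_size;
    MPI_Comm_rank(half_comm, &half_rank);
    MPI_Comm_size(half_comm, &half_size);
    printf("world rank %d is rank %d of %d in communicator %d\n",
           world_rank, half_rank, half_size, color);

    MPI_Comm_free(&half_comm);
    MPI_Finalize();
    return 0;
}

Because each new communicator renumbers its members from 0 to n-1, a library can run its own communication on its own communicator without interfering with the application's traffic on MPI_COMM_WORLD.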

Hello world example [3]

#include <stdio.h>
#include "mpi.h"

int main(int argc, char** argv)
{
    int my_PE_num;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &my_PE_num);
    printf("Hello from %d.\n", my_PE_num);
    MPI_Finalize();
    return 0;
}

Hello world example [3] – sample output (the order in which ranks print is nondeterministic):
Hello from 5.
Hello from 3.
Hello from 1.

MPMD [3]
Use MPI_Comm_rank to select a different routine on each process:

if (my_PE_num == 0)
    Routine1
else if (my_PE_num == 1)
    Routine2
else if (my_PE_num == 2)
    Routine3
. . .

Blocking Sending and Receiving Messages [3]

#include <stdio.h>
#include "mpi.h"

int main(int argc, char** argv)
{
    int my_PE_num, numbertoreceive, numbertosend = 42;
    MPI_Status status;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &my_PE_num);

    if (my_PE_num == 0) {
        MPI_Recv(&numbertoreceive, 1, MPI_INT, MPI_ANY_SOURCE, MPI_ANY_TAG,
                 MPI_COMM_WORLD, &status);
        printf("Number received is: %d\n", numbertoreceive);
    }
    else
        MPI_Send(&numbertosend, 1, MPI_INT, 0, 10, MPI_COMM_WORLD);

    MPI_Finalize();
    return 0;
}

Non-Blocking Message Passing Routines [4]

#include "mpi.h"
#include <stdio.h>

int main(int argc, char *argv[])
{
    int numtasks, rank, next, prev, buf[2], tag1 = 1, tag2 = 2;
    MPI_Request reqs[4];
    MPI_Status stats[4];

    MPI_Init(&argc, &argv);
    MPI_Comm_size(MPI_COMM_WORLD, &numtasks);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    /* determine left and right neighbours in a ring */
    prev = rank - 1;
    next = rank + 1;
    if (rank == 0) prev = numtasks - 1;
    if (rank == (numtasks - 1)) next = 0;

    MPI_Irecv(&buf[0], 1, MPI_INT, prev, tag1, MPI_COMM_WORLD, &reqs[0]);
    MPI_Irecv(&buf[1], 1, MPI_INT, next, tag2, MPI_COMM_WORLD, &reqs[1]);
    MPI_Isend(&rank, 1, MPI_INT, prev, tag2, MPI_COMM_WORLD, &reqs[2]);
    MPI_Isend(&rank, 1, MPI_INT, next, tag1, MPI_COMM_WORLD, &reqs[3]);

    /* do some work while the communication proceeds */

    MPI_Waitall(4, reqs, stats);

    MPI_Finalize();
    return 0;
}

Collective Communications [3]
- The communicator specifies the process group that participates in a collective communication
- MPI implements various optimized functions: barrier synchronization, broadcast, and reduction operations (with one destination, or with every process in the group as a destination); see the sketch below
- Collective operations are blocking
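A minimal sketch that uses the three collective patterns named above on MPI_COMM_WORLD; summing the integers 1..n is just a stand-in workload, and the value n = 1000 is arbitrary.

#include <stdio.h>
#include "mpi.h"

int main(int argc, char **argv)
{
    int rank, size, n = 0;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    if (rank == 0) n = 1000;             /* root decides a work parameter */

    /* broadcast: every process in the communicator receives the value of n */
    MPI_Bcast(&n, 1, MPI_INT, 0, MPI_COMM_WORLD);

    /* each process sums its own interleaved slice of 1..n */
    long local = 0, total = 0;
    for (int i = rank + 1; i <= n; i += size) local += i;

    /* reduction with one destination: rank 0 receives the sum of all local sums */
    MPI_Reduce(&local, &total, 1, MPI_LONG, MPI_SUM, 0, MPI_COMM_WORLD);

    /* barrier synchronization: all processes wait here before continuing */
    MPI_Barrier(MPI_COMM_WORLD);

    if (rank == 0) printf("sum of 1..%d = %ld\n", n, total);
    MPI_Finalize();
    return 0;
}

MPI_Reduce leaves the result only on the root; MPI_Allreduce is the "all in group" variant that delivers the result to every rank. Every process in the communicator must make the collective call, consistent with the blocking semantics noted above.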

Comparison MPI vs. OpenMP

Features                             | OpenMP              | MPI
Apply parallelism in steps           | yes                 | no
Scale to large number of processors  | maybe               | yes
Code complexity                      | Small increase      | Major increase
Runtime environment                  | Expensive compilers | Free
Cost of hardware                     | Very expensive      | Cheap

References
[1] J. Kowalczyk, "Multiprocessor Systems," Xilinx, 2003.
[2] D. Culler and J. P. Singh, Parallel Computer Architecture: A Hardware/Software Approach, Morgan Kaufmann, 1999.
[3] MPI Basics.
[4] Message Passing Interface (MPI).
[5] D. Sima, T. Fountain, and P. Kacsuk, Advanced Computer Architectures – A Design Space Approach, Pearson, 1997.