MPI 2, Part II
NPACI Parallel Computing Institute, August 2002
San Diego Supercomputer Center
National Partnership for Advanced Computational Infrastructure

Outline

Review
–6 Basic MPI Calls
–Data Types
–Wildcards
–Using Status
Probing
Asynchronous Communication
Collective Communications
Advanced Topics
–"V" operations
–Derived data types
–Communicators

Review: Six Basic MPI Calls

MPI_Init – Initialize MPI
MPI_Comm_rank – Get the processor rank
MPI_Comm_size – Get the number of processors
MPI_Send – Send data to another processor
MPI_Recv – Get data from another processor
MPI_Finalize – Finish MPI

Simple Send and Receive Program

#include <stdio.h>
#include "mpi.h"

int main(int argc, char *argv[])
{
    int myid, numprocs, tag, source, destination, count, buffer;
    MPI_Status status;

    MPI_Init(&argc, &argv);
    MPI_Comm_size(MPI_COMM_WORLD, &numprocs);
    MPI_Comm_rank(MPI_COMM_WORLD, &myid);
    tag = 1234;
    source = 0;
    destination = 1;
    count = 1;
    if (myid == source) {
        buffer = 5678;
        MPI_Send(&buffer, count, MPI_INT, destination, tag, MPI_COMM_WORLD);
        printf("processor %d sent %d\n", myid, buffer);
    }
    if (myid == destination) {
        MPI_Recv(&buffer, count, MPI_INT, source, tag, MPI_COMM_WORLD, &status);
        printf("processor %d got %d\n", myid, buffer);
    }
    MPI_Finalize();
    return 0;
}

MPI Data Types

MPI has many different predefined data types
–All your favorite C data types are included
MPI data types can be used in any communication operation

MPI Predefined Data Types in C

(This slide showed a table of MPI predefined data types and their corresponding C types.)
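The table itself did not survive the transcript. As a partial reference (not the slide's full table), the most commonly used predefined MPI types and the C types they correspond to are:

/* Common MPI predefined data types and their C equivalents (partial list) */
MPI_CHAR        /* signed char      */
MPI_SHORT       /* signed short int */
MPI_INT         /* signed int       */
MPI_LONG        /* signed long int  */
MPI_UNSIGNED    /* unsigned int     */
MPI_FLOAT       /* float            */
MPI_DOUBLE      /* double           */
MPI_LONG_DOUBLE /* long double      */
MPI_BYTE        /* untyped bytes    */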

Wildcards

Wildcards enable the programmer to avoid having to specify a tag and/or source.
MPI_ANY_SOURCE and MPI_ANY_TAG are wildcards.
The status structure is used to get the wildcard values.

Example:

MPI_Status status;
int buffer[5];
int error;
error = MPI_Recv(&buffer[0], 5, MPI_INT, MPI_ANY_SOURCE,
                 MPI_ANY_TAG, MPI_COMM_WORLD, &status);

Status

The status parameter returns additional information for some MPI routines:
–additional error status information
–additional information with wildcard parameters

C declaration (a predefined struct):
  MPI_Status status;
Fortran declaration (an array is used instead):
  INTEGER STATUS(MPI_STATUS_SIZE)

Accessing Status Information

The tag of a received message
–C: status.MPI_TAG
–Fortran: STATUS(MPI_TAG)
The source of a received message
–C: status.MPI_SOURCE
–Fortran: STATUS(MPI_SOURCE)
The error code of the MPI call
–C: status.MPI_ERROR
–Fortran: STATUS(MPI_ERROR)
Other uses...
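A minimal sketch (not from the slides) of reading these fields after a wildcard receive; MPI_Get_count, covered with MPI_Probe below, reports how many elements actually arrived:

/* Sketch: receive from any source/tag, then inspect the status object. */
MPI_Status status;
int buffer[5];
int actual_source, actual_tag, actual_count;

MPI_Recv(buffer, 5, MPI_INT, MPI_ANY_SOURCE, MPI_ANY_TAG,
         MPI_COMM_WORLD, &status);

actual_source = status.MPI_SOURCE;               /* who sent the message  */
actual_tag    = status.MPI_TAG;                  /* which tag was used    */
MPI_Get_count(&status, MPI_INT, &actual_count);  /* how many ints arrived */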

MPI_Probe

MPI_Probe allows incoming messages to be checked without actually receiving them.
–The user can then decide how to receive the data.
–Useful when different action needs to be taken depending on the "who, what, and how much" information of the message.

MPI_Probe

C:
  MPI_Probe(source, tag, comm, &status)
Fortran:
  MPI_PROBE(SOURCE, TAG, COMM, STATUS, IERROR)

Parameters
–Source: source rank, or MPI_ANY_SOURCE
–Tag: tag value, or MPI_ANY_TAG
–Comm: communicator
–Status: status object

MPI_Probe Example

#include <stdio.h>
#include "mpi.h"

/* Program shows how to use probe and get_count to find the size */
/* of an incoming message */
int main(int argc, char *argv[])
{
    int myid, numprocs;
    MPI_Status status;
    int i, mytag, ierr, icount;

    MPI_Init(&argc, &argv);
    MPI_Comm_size(MPI_COMM_WORLD, &numprocs);
    MPI_Comm_rank(MPI_COMM_WORLD, &myid);
    /* print out my rank and this run's PE size */
    printf("Hello from %d\n", myid);
    printf("Numprocs is %d\n", numprocs);

MPI_Probe Example (cont.)

    mytag = 123;
    i = 0;
    icount = 0;
    if (myid == 0) {
        i = 100;
        icount = 1;
        ierr = MPI_Send(&i, icount, MPI_INT, 1, mytag, MPI_COMM_WORLD);
    }
    if (myid == 1) {
        ierr = MPI_Probe(0, mytag, MPI_COMM_WORLD, &status);
        ierr = MPI_Get_count(&status, MPI_INT, &icount);
        printf("getting %d\n", icount);
        ierr = MPI_Recv(&i, icount, MPI_INT, 0, mytag, MPI_COMM_WORLD, &status);
        printf("i= %d\n", i);
    }
    MPI_Finalize();
    return 0;
}

MPI_Barrier

Blocks the caller until all members in the communicator have called it.
Used as a synchronization tool.

C:
  MPI_Barrier(comm)
Fortran:
  CALL MPI_BARRIER(COMM, IERROR)

Parameter
–Comm: communicator (often MPI_COMM_WORLD)
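A minimal sketch (not from the slides) of MPI_Barrier as a synchronization point, here so that a timing measurement starts only after every rank has finished its setup; myid is the rank from MPI_Comm_rank as in the earlier examples:

/* Sketch: synchronize all ranks before and after a timed section. */
double t_start, t_end;

/* ... each rank does its own setup work ... */

MPI_Barrier(MPI_COMM_WORLD);   /* no rank proceeds until all have arrived */
t_start = MPI_Wtime();

/* ... timed parallel work ... */

MPI_Barrier(MPI_COMM_WORLD);   /* wait until every rank has finished */
t_end = MPI_Wtime();
if (myid == 0) printf("elapsed time = %f seconds\n", t_end - t_start);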

Asynchronous Communication

Asynchronous send: the send call returns immediately; the send actually occurs later.
Asynchronous receive: the receive call returns immediately; when the received data is needed, call a wait routine.
Asynchronous communication is used to overlap communication with computation.
It can also help prevent deadlock (but beware!).
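A minimal sketch (not from the slides) of the deadlock point: if two ranks exchange messages and both call a blocking MPI_Send before MPI_Recv, each can stall waiting for the other; posting the send as a non-blocking MPI_Isend first avoids this. MPI_Isend and MPI_Wait are introduced on the next slides.

/* Sketch: ranks 0 and 1 exchange one integer without risk of deadlock. */
int sendbuf = myid, recvbuf;
int partner = 1 - myid;            /* 0 talks to 1, 1 talks to 0 */
MPI_Request request;
MPI_Status  status;

MPI_Isend(&sendbuf, 1, MPI_INT, partner, 99, MPI_COMM_WORLD, &request);
MPI_Recv(&recvbuf, 1, MPI_INT, partner, 99, MPI_COMM_WORLD, &status);
MPI_Wait(&request, &status);       /* sendbuf is safe to reuse after this */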

Asynchronous Send with MPI_Isend

C:
  MPI_Request request;
  MPI_Isend(&buffer, count, datatype, dest, tag, comm, &request)
Fortran:
  INTEGER REQUEST
  MPI_ISEND(BUFFER, COUNT, DATATYPE, DEST, TAG, COMM, REQUEST, IERROR)

Request is a new output parameter.
Don't change the data until the communication is complete.

Asynchronous Receive with MPI_Irecv

C:
  MPI_Request request;
  MPI_Irecv(&buf, count, datatype, source, tag, comm, &request)
Fortran:
  INTEGER REQUEST
  MPI_IRECV(BUFFER, COUNT, DATATYPE, SOURCE, TAG, COMM, REQUEST, IERROR)

Parameter changes
–Request: communication request
–The status parameter is missing
Don't use the data until the communication is complete.

MPI_Wait: Used to Complete Communication

The request from Isend or Irecv is input.
–The completion of a send operation indicates that the sender is now free to update the data in the send buffer.
–The completion of a receive operation indicates that the receive buffer contains the received message.
MPI_Wait blocks until the message specified by "request" completes.

MPI_Wait Usage

C:
  MPI_Request request;
  MPI_Status status;
  MPI_Wait(&request, &status)
Fortran:
  INTEGER REQUEST
  INTEGER STATUS(MPI_STATUS_SIZE)
  MPI_WAIT(REQUEST, STATUS, IERROR)

MPI_Wait blocks until the message specified by "request" completes.

MPI_Test

Similar to MPI_Wait, but does not block.
The value of flag signifies whether the operation identified by request has completed.

C:
  int flag;
  MPI_Test(&request, &flag, &status)
Fortran:
  LOGICAL FLAG
  MPI_TEST(REQUEST, FLAG, STATUS, IER)

Non-Blocking Send Example

      call MPI_Isend(buffer, count, datatype, dest, tag, comm, request, ierr)
 10   continue
      ! Do other work ...
      call MPI_Test(request, flag, status, ierr)
      if (.not. flag) goto 10
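For comparison, a rough C sketch of the same test-until-complete pattern (not from the slides); buffer, count, dest, and tag are assumed to be set up as in the earlier send examples:

/* Sketch: post a non-blocking send, then poll with MPI_Test between other work. */
MPI_Request request;
MPI_Status  status;
int flag = 0;

MPI_Isend(&buffer, count, MPI_INT, dest, tag, MPI_COMM_WORLD, &request);
while (!flag) {
    /* ... do other work ... */
    MPI_Test(&request, &flag, &status);  /* flag becomes nonzero on completion */
}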

Asynchronous Send and Receive

Write a parallel program to send and receive data using MPI_Isend and MPI_Irecv:
–Initialize MPI
–Have processor 0 send an integer to processor 1
–Have processor 1 receive an integer from processor 0
–Both processors check on message completion
–Quit MPI

#include <stdio.h>
#include "mpi.h"

/************************************************************
 This is a simple isend/irecv program in MPI
 ************************************************************/
int main(int argc, char *argv[])
{
    int myid, numprocs;
    int tag, source, destination, count;
    int buffer;
    MPI_Status status;
    MPI_Request request;

    MPI_Init(&argc, &argv);
    MPI_Comm_size(MPI_COMM_WORLD, &numprocs);
    MPI_Comm_rank(MPI_COMM_WORLD, &myid);
    tag = 1234;
    source = 0;
    destination = 1;
    count = 1;
    request = MPI_REQUEST_NULL;

    if (myid == source) {
        buffer = 5678;
        MPI_Isend(&buffer, count, MPI_INT, destination, tag,
                  MPI_COMM_WORLD, &request);
    }
    if (myid == destination) {
        MPI_Irecv(&buffer, count, MPI_INT, source, tag,
                  MPI_COMM_WORLD, &request);
    }
    MPI_Wait(&request, &status);
    if (myid == source) {
        printf("processor %d sent %d\n", myid, buffer);
    }
    if (myid == destination) {
        printf("processor %d got %d\n", myid, buffer);
    }
    MPI_Finalize();
    return 0;
}

Broadcast Operation: MPI_Bcast

All nodes call MPI_Bcast.
One node (the root) sends the message; all others receive it.

C:
  MPI_Bcast(&buffer, count, datatype, root, communicator);
Fortran:
  call MPI_Bcast(buffer, count, datatype, root, communicator, ierr)

Root is the node that sends the message.

Broadcast Example

Write a parallel program to broadcast data using MPI_Bcast:
–Initialize MPI
–Have processor 0 broadcast an integer
–Have all processors print the data
–Quit MPI

/************************************************************
 This is a simple broadcast program in MPI
 ************************************************************/
#include <stdio.h>
#include "mpi.h"

int main(int argc, char *argv[])
{
    int i, myid, numprocs;
    int source, count;
    int buffer[4];
    MPI_Status status;
    MPI_Request request;

    MPI_Init(&argc, &argv);
    MPI_Comm_size(MPI_COMM_WORLD, &numprocs);
    MPI_Comm_rank(MPI_COMM_WORLD, &myid);

    source = 0;
    count = 4;
    if (myid == source) {
        for (i = 0; i < count; i++)
            buffer[i] = i;
    }
    MPI_Bcast(buffer, count, MPI_INT, source, MPI_COMM_WORLD);
    for (i = 0; i < count; i++)
        printf("%d ", buffer[i]);
    printf("\n");
    MPI_Finalize();
    return 0;
}

Reduction Operations

Used to combine partial results from all processors.
The result is returned to the root processor.
Several types of operations are available.
Works on single elements and arrays.

MPI_Reduce

C:
  int MPI_Reduce(&sendbuf, &recvbuf, count, datatype, operation, root, communicator)
Fortran:
  call MPI_Reduce(sendbuf, recvbuf, count, datatype, operation, root, communicator, ierr)

Parameters
–Like MPI_Bcast, a root is specified.
–Operation is a type of mathematical operation.

Operations for MPI_Reduce

MPI_MAX      Maximum
MPI_MIN      Minimum
MPI_PROD     Product
MPI_SUM      Sum
MPI_LAND     Logical and
MPI_LOR      Logical or
MPI_LXOR     Logical exclusive or
MPI_BAND     Bitwise and
MPI_BOR      Bitwise or
MPI_BXOR     Bitwise exclusive or
MPI_MAXLOC   Maximum value and location
MPI_MINLOC   Minimum value and location
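A minimal sketch (not from the slides) of MPI_MAXLOC: each rank contributes a (value, index) pair using one of MPI's paired datatypes, here MPI_DOUBLE_INT, and the root receives the largest value together with the rank that owns it; my_local_result stands for whatever partial result each rank has computed.

/* Sketch: find the global maximum value and which rank holds it. */
struct {
    double value;   /* the value being compared        */
    int    index;   /* conventionally the owner's rank */
} local, global;

local.value = my_local_result;   /* assumed: this rank's partial result */
local.index = myid;

MPI_Reduce(&local, &global, 1, MPI_DOUBLE_INT, MPI_MAXLOC,
           0, MPI_COMM_WORLD);

if (myid == 0)
    printf("max = %f on rank %d\n", global.value, global.index);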

Global Sum with MPI_Reduce

C:
  double sum_partial, sum_global;
  sum_partial = ...;
  ierr = MPI_Reduce(&sum_partial, &sum_global, 1,
                    MPI_DOUBLE, MPI_SUM,
                    root, MPI_COMM_WORLD);

Fortran:
  double precision sum_partial, sum_global
  sum_partial = ...
  call MPI_Reduce(sum_partial, sum_global, 1,
                  MPI_DOUBLE_PRECISION, MPI_SUM,
                  root, MPI_COMM_WORLD, ierr)

Global Sum Example with MPI_Reduce

Write a program to sum data from all processors.

#include <stdio.h>
#include <stdlib.h>
#include "mpi.h"

/*
 ! This program shows how to use MPI_Scatter and MPI_Reduce.
 ! Each processor gets different data from the root processor
 ! by way of MPI_Scatter. The data is summed and then sent back
 ! to the root processor using MPI_Reduce. The root processor
 ! then prints the global sum.
 */

/* globals */
int numnodes, myid, mpi_err;
#define mpi_root 0
/* end globals */

void init_it(int *argc, char ***argv);

void init_it(int *argc, char ***argv)
{
    mpi_err = MPI_Init(argc, argv);
    mpi_err = MPI_Comm_size(MPI_COMM_WORLD, &numnodes);
    mpi_err = MPI_Comm_rank(MPI_COMM_WORLD, &myid);
}

int main(int argc, char *argv[])
{
    int *myray, *send_ray, *back_ray;
    int count;
    int size, mysize, i, k, j, total, gtotal;

    init_it(&argc, &argv);
    /* each processor will get count elements from the root */
    count = 4;
    myray = (int *)malloc(count * sizeof(int));

    /* create the data to be sent on the root */
    if (myid == mpi_root) {
        size = count * numnodes;
        send_ray = (int *)malloc(size * sizeof(int));
        back_ray = (int *)malloc(numnodes * sizeof(int));
        for (i = 0; i < size; i++)
            send_ray[i] = i;
    }
    /* send different data to each processor */
    mpi_err = MPI_Scatter(send_ray, count, MPI_INT,
                          myray, count, MPI_INT,
                          mpi_root, MPI_COMM_WORLD);
    /* each processor does a local sum */
    total = 0;
    for (i = 0; i < count; i++)
        total = total + myray[i];
    printf("myid= %d total= %d\n", myid, total);

    /* send the local sums back to the root */
    mpi_err = MPI_Reduce(&total, &gtotal, 1, MPI_INT,
                         MPI_SUM, mpi_root, MPI_COMM_WORLD);
    /* the root prints the global sum */
    if (myid == mpi_root) {
        printf("results from all processors= %d \n", gtotal);
    }
    mpi_err = MPI_Finalize();
    return 0;
}