Parallel Graph Algorithms

Slides:

Advertisements

Similar presentations

Ananth Grama, Anshul Gupta, George Karypis, and Vipin Kumar

Advertisements

Graph Algorithms Carl Tropper Department of Computer Science McGill University.

Graph Theory Arnold Mesa. Basic Concepts n A graph G = (V,E) is defined by a set of vertices and edges v3 v1 v2Vertex (v1) Edge (e1) A Graph with 3 vertices.

* Bellman-Ford: single-source shortest distance * O(VE) for graphs with negative edges * Detects negative weight cycles * Floyd-Warshall: All pairs shortest.

Parallel Graph Algorithms

Data Structures Using C++

Discussion #34 1/17 Discussion #34 Warshall’s and Floyd’s Algorithms.

 2004 SDU Lecture11- All-pairs shortest paths. Dynamic programming Comparing to divide-and-conquer 1.Both partition the problem into sub-problems 2.Divide-and-conquer.

Graph II MST, Shortest Path. Graph Terminology Node (vertex) Edge (arc) Directed graph, undirected graph Degree, in-degree, out-degree Subgraph Simple.

1 Graphs: shortest paths & Minimum Spanning Tree(MST) Fundamental Data Structures and Algorithms Ananda Guna April 8, 2003.

Greedy Algorithms Reading Material: Chapter 8 (Except Section 8.5)

Shortest Path Problems Directed weighted graph. Path length is sum of weights of edges on path. The vertex at which the path begins is the source vertex.

Algorithms All pairs shortest path

Greedy Algorithms Like dynamic programming algorithms, greedy algorithms are usually designed to solve optimization problems Unlike dynamic programming.

Parallel Programming – Graph Algorithms David Monismith CS599 Notes are primarily based upon Introduction to Parallel Programming, Second Edition by Grama,

Graph Algorithms. Definitions and Representation An undirected graph G is a pair (V,E), where V is a finite set of points called vertices and E is a finite.

Parallel graph algorithms Antonio-Gabriel Sturzu, SCPD Adela Diana Almasi, SCPD Adela Diana Almasi, SCPD Iulia Alexandra Floroiu, ISI Iulia Alexandra Floroiu,

1 The Floyd-Warshall Algorithm Andreas Klappenecker.

Graph Algorithms Ananth Grama, Anshul Gupta, George Karypis, and Vipin Kumar Adapted for 3030 To accompany the text ``Introduction to Parallel Computing'',

The all-pairs shortest path problem (APSP) input: a directed graph G = (V, E) with edge weights goal: find a minimum weight (shortest) path between every.

Graph Algorithms Ananth Grama, Anshul Gupta, George Karypis, and Vipin Kumar To accompany the text ``Introduction to Parallel Computing'', Addison Wesley,

1 Prim’s algorithm. 2 Minimum Spanning Tree Given a weighted undirected graph G, find a tree T that spans all the vertices of G and minimizes the sum.

Graphs. Graphs Similar to the graphs you’ve known since the 5 th grade: line graphs, bar graphs, etc., but more general. Those mathematical graphs are.

Graph Algorithms Gayathri R To accompany the text ``Introduction to Parallel Computing'', Addison Wesley, 2003.

Graph Theory. undirected graph node: a, b, c, d, e, f edge: (a, b), (a, c), (b, c), (b, e), (c, d), (c, f), (d, e), (d, f), (e, f) subgraph.

Parallel Graph Algorithms Sathish Vadhiyar. Graph Traversal  Graph search plays an important role in analyzing large data sets  Relationship between.

Minimum-Cost Spanning Tree weighted connected undirected graph cost of spanning tree is sum of edge costs find spanning tree that has minimum cost Network.

Proof of correctness of Dijkstra’s algorithm: Basically, we need to prove two claims. (1)Let S be the set of vertices for which the shortest path from.

1 Chapter 6 : Graph – Part 2 교수 : 이상환 강의실 : 113,118 호, 324 호 연구실 : 과학관 204 호 Home :

All Pairs Shortest Path Algorithms Aditya Sehgal Amlan Bhattacharya.

Shortest Path -Prim’s -Djikstra’s. PRIM’s - Minimum Spanning Tree -A spanning tree of a graph is a tree that has all the vertices of the graph connected.

Shortest Paths C B A E D F Shortest Paths

Shortest Paths and Minimum Spanning Trees

Computing Connected Components on Parallel Computers

Shortest Paths C B A E D F Shortest Paths

Sieve of Eratosthenes.

Discrete Optimization Lecture 1

Introduction to Graphs

Shortest Paths C B A E D F Shortest Paths

Shortest Path Problems

CS330 Discussion 6.

Shortest Path Graph represents highway system Edges have weights

Shortest Paths C B A E D F

Minimum Spanning Trees

Graph Algorithm.

Graphs Representation, BFS, DFS

Shortest Paths C B A E D F Shortest Paths

Shortest Path Problems

Shortest Path Algorithms

Shortest Paths and Minimum Spanning Trees

CSCI2100 Data Structures Tutorial

Dynamic Programming.

Using compiler-directed approach to create MPI code automatically

Patterns Paraguin Compiler Version 2.1.

Single-Source All-Destinations Shortest Paths With Negative Costs

Chapter 24: Single-Source Shortest Paths

Near-neighbor or Mesh Based Paradigm

CS 584 Project Write up Poster session for final Due on day of final

Weighted Graphs & Shortest Paths

Dynamic Programming.

Chapter 24: Single-Source Shortest Paths

CSE 417: Algorithms and Computational Complexity

Parallel Graph Algorithms

Quiz Questions How does one execute code in parallel in Paraguin?

Chapter 9 Graph algorithms

Single-Source All-Destinations Shortest Paths With Negative Costs

Directed Graphs (Part II)

INTRODUCTION A graph G=(V,E) consists of a finite non empty set of vertices V , and a finite set of edges E which connect pairs of vertices .

Ananth Grama, Anshul Gupta, George Karypis, and Vipin Kumar

More Graphs Lecture 19 CS2110 – Fall 2009.

Presentation transcript:

Parallel Graph Algorithms

Graph Algorithms Minimum Spanning Tree (Prim’s Algorithm) Single-Source Shortest Path (Dijkstra’s Algorithm) All-Pairs Shortest Paths (Dijkstra’s and Floyd’s Algorithm)

Adjacency Matrix An adjacency matrix represent the edges of a graph

Adjacency Matrix Example 1 2 3 4

Prim’s Algorithm for Minimum Spanning Tree Prim_MST(V, E, A, r) { VT = {r}; d[r] = 0; for all v in (V – VT) d[v] = Ar,v; while (VT != V) { Find a vertex u such that d[u] = min(d[v] for all v in (V – VT)); VT = VT + {u}; for all v in (V – VT) { d[v] = min(d[v], Au,v); } V – set of vertices VT – set of vertices in the MST E – set of edges A – adjacency matrix r – root node d – minimum distance from MST to any vertex Complexity = O(n2)

Root is node b (Prim’s) Initialize 3 Initialize 1 f 3 b 5 c 5 1 1 2 d e 4 Since d[3] = 1, add the edge b to d and consider node d next

Next consider node d (Prim’s) a 3 Take Minimums except for b and d 1 f 3 b 5 c 5 1 1 2 d e 4 Since d[0] = 1, add the edge b to a and consider node a next

Next consider node a (Prim’s) 3 1 f 3 b 5 c 5 1 1 2 d e 4 Since d[2] = 2, add the edge d to c and consider node c next

Next consider node c (Prim’s) a 3 1 f 3 b 5 c 5 1 1 2 d e 4 Since d[4] = 1, add the edge c to e and consider node e next

Next consider node e (Prim’s) a 3 1 f 3 b 5 c 5 1 1 2 d e 4 Since d[5] = 3, add the edge a to f and consider node f next

Next consider node f (Prim’s) a 3 1 f 3 b 5 c 5 1 1 2 d e 4 VT= V so stop

Parallelizing Prim’s Algorithm We can’t just simply execute the while loop in parallel because the d[] array changes with each selection of a vertex We have to update values in d[] from all processors after each iteration Suppose we have n vertices in the graph and p processors

Parallelizing Prim’s Algorithm Partition and adjacency matrix and the distance array (d) across processors d[ ] n A 1 2 p-1

Parallelizing Prim’s Algorithm Each processor computes the next vertex from among its vertices A reduction is done on the distance array (d) to find the minimum The result is broadcast out to all the processors

Which pattern does this fit?

Prim’s Algorithm (Parallel) Prim_MST(V, E, A, r) { ... // Initialize d as before #pragma paraguin begin_parallel while (VT != V) { Find a vertex u such that d[u] = min(d[v] for all v in (V – VT)); VT = VT + {u}; #pragma paraguin forall for v in V if (v  VT) d[v] = min(d[v], Au,v); #pragma paraguin reduce min d #pragma paraguin bcast d } #pragma paraguin end_parallel

Prim’s Algorithm (Parallel) Complexity of Parallel algorithm: Each reduction and broadcast takes log p time, but we have to do up to n of them. Communication Computation

Dijkstra’s Algorithm for Single-Source Shortest Path Given a source node, what is the shortest distance to each other node The minimum spanning tree gives is this information

Dijkstra’s Algorithm Complexity = O(n2) Dijkstra_SP(V, E, A, r) { VT = {r}; d[r] = 0; for v in (V – VT) d[v] = Ar,v; while (VT != V) { Find a vertex v such that d[u] = min(d[v] for all v in (V – VT)); VT = VT + {u}; d[v] = min(d[v], d[u] + Au,v); } V – set of vertices VT – set of vertices in the MST E – set of edges A – adjacency matrix r – root node d – minimum distance from root to any vertex Complexity = O(n2) This is the only thing different

Source Node is node b (Dijkstra’s) 3 Initialize 1 f 3 b 5 c 5 1 1 2 d e 4 Since d[3] = 1, consider node d next

Next consider node d (Dijkstra’s) 3 1 f 3 b 5 c 5 1 1 2 d e 4 Since l[0] = 1, consider node a next

Next consider node a (Dijkstra’s) 3 1 f 3 b 5 c 5 1 1 2 d e 4 Since l[2] = 3, consider node c next

Next consider node c (Dijkstra’s) 3 1 f 3 b 5 c 5 1 1 2 d e 4 Since l[4] = 4, consider node e next

Next consider node e (Dijkstra’s) 3 1 f 3 b 5 c 5 1 1 2 d e 4 Since d[5] = 4, add the edge a to f and consider node f next

Next consider node f (Dijkstra’s) 3 1 f 3 b 5 c 5 1 1 2 d e 4 VT= V so stop

Parallelizing Dijkstra’s Algorithm Since Dijkstra’s Algorithm and Prim’s Algorithm are essentially the same, we can parallelize them the same way: Complexity of Parallel algorithm: If we have n processors, this becomes: Communication Computation

All Pairs Shortest Path Dijkstra’s Algorithm gives us the shortest path from a particular node to all the others For All Paris Shortest Path, we want to find the shortest path between all pairs of vertices We can apply Dijkstra’s Algorithm to every pair of vertices Complexity = O(n3)

All Pairs using Dijkstra’s Algorithm Dijkstra_APSP(V, E, A) { for r in V { VT = {r}; d[N] = {0, … }; for all v in (V – VT) d[r][v] = Ar,v; while (VT != V) { Find a vertex u such that d[r][u] = min(d[r][v] for all v in (V – VT)); VT = VT + {u}; for v in (V – VT) d[r][v] = min(d[v], d[u] + Au,v); } V – set of vertices VT – set of vertices in the MST E – set of edges A – adjacency matrix r – root node d – minimum distance from root to any vertex Complexity = O(n3)

All Pairs Shortest Path We can parallelize the outermost loop Each processors assumes a different node vi and computes the shortest path to all nodes No communication if needed Complexity is O(n3/p) If we have n processors, complexity is O(n2) If we have n2 processors, we can use n processors for each vertex. Complexity becomes O(nlogn)

Floyd’s Algorithm for All Pairs Shortest Path Floyd’s Algorithm works off of this observation: Consider a subset of V: Let be the weight of the shortest path from vi to vj that includes one of the vertices in If vk is not in the shortest path from vi to vj, then Otherwise, the shortest path is

Floyd’s Algorithm for All Pairs Shortest Path This leads to the following recurrence: We can implement this using iteration and not recursion

k = 0 (Floyds’s) d0 is just the distance matrix A a 3 1 f 3 b 5 c 5 1 2 d e 4 d0 is just the distance matrix A

k = 1 (consider node a) (Floyds’s) 3 i 1 f 3 b 5 c 5 1 1 2 d e 4 b to c and b to f is shorter by going through a

k = 1 (consider node a) (Floyds’s) 3 1 f 3 i b 5 c 5 1 1 2 d e 4 c to b and c to f is shorter by going through a

k = 1 (consider node a) (Floyds’s) 3 1 f 3 b 5 i c 5 1 i 1 2 d e 4 Neither d nor e can get to a, so move on

k = 1 (consider node a) (Floyds’s) 3 1 f 3 b 5 c 5 1 1 2 i d e 4 f to b and f to c is shorter by going through a

k = 2 (consider node b) (Floyds’s) a 3 1 f 3 b 5 c 5 1 1 2 d e 4 a to d is shorter by going through b

k = 2 (consider node b) (Floyds’s) a 3 1 f 3 b 5 i c 5 1 1 2 d e 4 a to d is shorter by going through b

k = 2 (consider node b) (Floyds’s) a 3 1 f 3 b 5 c 5 1 1 2 i d e 4 a to d is shorter by going through b

All Pairs using Floyd’s Algorithm Floyd_APSP(V, E, A) { d0i,j = Ai,j for all i,j for k = 1 to n for i = 1 to n for j = 1 to n d(k)i,j = min(d(k-1)i,j , d(k-1)i,k + d(k-1)k,j ) We don’t need n copies of the d matrix. We only need one. In fact, we can do it with only one matrix V – set of vertices E – set of edges A – adjacency matrix Complexity = O(n3)

Partitioning of the d matrix … We divide the d matrix into p blocks of size n/√p Each processor is responsible for n2/√p elements of the d matrix

Partitioning of the d matrix k column j column However, we have to send data between processors k row i row

Which pattern does this fit?

Communication Pattern …

Analysis of Floyd’s Algorithm Each processor has to send its block to all processors on the same row and column. If we use a broadcast, then the time for communication is The synchronization step requires The time to compute the new values for each step is

Analysis of Floyd’s Algorithm So the complexity for each step is: And finally, the complexity for n steps (of the k loop) is: Communication Computation

A faster version of Floyd’s Algorithm We can do a pipeline of values moving through the matrix. The reason is because once processor pi, j computes the value of it can then send it to the processors pi, j-1 , pi, j+1 , pi+1, j , and pi-1, j

Consider the movement of the value computed by processor 4 Time t t+1 t+2 t+3 t+4 1 2 3 4 5 6 7 8 Processors

Analysis of Floyd’s Algorithm with pipelining The net complexity of the algorithm using pipelining is: Communication Computation

Row Partitioning of the d matrix … We divide the d matrix into p rows instead of blocks Each processor is responsible for n2/p elements of the d matrix

Partitioning of the d matrix k column j column Now we are only sending between rows But this still requires broadcasting or a pipeline k row i row

Communication Pattern … …

All Pairs using Floyd’s Algorithm Floyd_APSP(V, E, A) { d0i,j = Ai,j for all i,j for k = 1 to n #pragma omp parallel for private (i,j) for i = 1 to n for j = 1 to n d(k)i,j = min(d(k-1)i,j , d(k-1)i,k + d(k-1)k,j ) Given the amount of communication, this algorithm would best be done on a shared- memory system V – set of vertices E – set of edges A – adjacency matrix

Questions