Elementary Graph Algorithms Many of the slides are from Prof. Plaisted’s resources at University of North Carolina at Chapel Hill.

Slides:



Advertisements
Similar presentations
Comp 122, Spring 2004 Graph Algorithms – 2. graphs Lin / Devi Comp 122, Fall 2004 Identification of Edges Edge type for edge (u, v) can be identified.
Advertisements

Comp 122, Fall 2004 Elementary Graph Algorithms. graphs Lin / Devi Comp 122, Fall 2004 Graphs  Graph G = (V, E) »V = set of vertices »E = set of.
Tirgul 7 Review of graphs Graph algorithms: –DFS –Properties of DFS –Topological sort.
Introduction to Algorithms Second Edition by Cormen, Leiserson, Rivest & Stein Chapter 22.
Elementary Graph Algorithms Depth-first search.Topological Sort. Strongly connected components. Chapter 22 CLRS.
Lecture 16: DFS, DAG, and Strongly Connected Components Shang-Hua Teng.
Graph Traversals. For solving most problems on graphs –Need to systematically visit all the vertices and edges of a graph Two major traversals –Breadth-First.
ALGORITHMS THIRD YEAR BANHA UNIVERSITY FACULTY OF COMPUTERS AND INFORMATIC Lecture eight Dr. Hamdy M. Mousa.
More Graphs COL 106 Slides from Naveen. Some Terminology for Graph Search A vertex is white if it is undiscovered A vertex is gray if it has been discovered.
David Luebke 1 5/9/2015 CS 332: Algorithms Graph Algorithms.
Graphs Breadth First Search & Depth First Search by Shailendra Upadhye.
1 Graph Programming Gordon College. 2 Graph Basics A graph G = (V, E) –V = set of vertices, E = set of edges –Dense graph: |E|  |V| 2 ; Sparse graph:
Graph traversals / cutler1 Graph traversals Breadth first search Depth first search.
Tirgul 11 DFS Properties of DFS Topological sort.
CS 473Lecture 151 CS473-Algorithms I Lecture 15 Graph Searching: Depth-First Search and Topological Sort.
CPSC 311, Fall CPSC 311 Analysis of Algorithms Graph Algorithms Prof. Jennifer Welch Fall 2009.
UMass Lowell Computer Science Analysis of Algorithms Prof. Karen Daniels Spring, 2001 Makeup Lecture Chapter 23: Graph Algorithms Depth-First SearchBreadth-First.
1 Data Structures DFS, Topological Sort Dana Shapira.
CSE 780 Algorithms Advanced Algorithms Graph Alg. DFS Topological sort.
Tirgul 11 BFS,DFS review Properties Use. Breadth-First-Search(BFS) The BFS algorithm executes a breadth search over the graph. The search starts at a.
Lecture 10 Graph Algorithms. Definitions Graph is a set of vertices V, with edges connecting some of the vertices (edge set E). An edge can connect two.
Lecture 15: Depth First Search Shang-Hua Teng. Graphs G= (V,E) B E C F D A B E C F D A Directed Graph (digraph) –Degree: in/out Undirected Graph –Adjacency.
Data Structures, Spring 2006 © L. Joskowicz 1 Data Structures – LECTURE 14 Strongly connected components Definition and motivation Algorithm Chapter 22.5.
Tirgul 7 Review of graphs Graph algorithms: – BFS (next tirgul) – DFS – Properties of DFS – Topological sort.
Graph Algorithms Introduction to Algorithms Graph Algorithms CSE 680 Prof. Roger Crawfis Partially from io.uwinnipeg.ca/~ychen2.
Review of Graphs A graph is composed of edges E and vertices V that link the nodes together. A graph G is often denoted G=(V,E) where V is the set of vertices.
November 6, Algorithms and Data Structures Lecture XI Simonas Šaltenis Aalborg University
COSC 3101A - Design and Analysis of Algorithms 10
Elementary Graph Algorithms CSc 4520/6520 Fall 2013 Slides adapted from David Luebke, University of Virginia and David Plaisted, University of North Carolina.
Spring 2015 Lecture 10: Elementary Graph Algorithms
Sept Elementary Graph Algorithms Graph representation Graph traversal -Breadth-first search -Depth-first search Parenthesis theorem.
1 Depth-First Search Idea: –Starting at a node, follow a path all the way until you cannot move any further –Then backtrack and try another branch –Do.
Elementary Graph Algorithms CLRS Chapter 22. Graph A graph is a structure that consists of a set of vertices and a set of edges between pairs of vertices.
CSC 413/513: Intro to Algorithms Graph Algorithms DFS.
Lecture 11 Algorithm Analysis Arne Kutzner Hanyang University / Seoul Korea.
Jan Topological Order and SCC Edge classification Topological order Recognition of strongly connected components.
1 Chapter 22 Elementary Graph Algorithms. 2 Introduction G=(V, E) –V = vertex set –E = edge set Graph representation –Adjacency list –Adjacency matrix.
 2004 SDU Lectrue4-Properties of DFS Properties of DFS Classification of edges Topological sort.
Graph Algorithms Searching. Review: Graphs ● A graph G = (V, E) ■ V = set of vertices, E = set of edges ■ Dense graph: |E|  |V| 2 ; Sparse graph: |E|
Chapter 22: Elementary Graph Algorithms
Analysis of Algorithms CS 477/677 Instructor: Monica Nicolescu Lecture 20.
David Luebke 1 1/6/2016 CS 332: Algorithms Graph Algorithms.
Graph. Graphs G = (V,E) V is the vertex set. Vertices are also called nodes and points. E is the edge set. Each edge connects two different vertices.
CS 2133: Algorithms Intro to Graph Algorithms (Slides created by David Luebke)
David Luebke 1 1/25/2016 CSE 207: Algorithms Graph Algorithms.
Shahed University Dr. Shahriar Bijani May  A path is a sequence of vertices P = (v 0, v 1, …, v k ) such that, for 1 ≤ i ≤ k, edge (v i – 1, v.
ALGORITHMS THIRD YEAR BANHA UNIVERSITY FACULTY OF COMPUTERS AND INFORMATIC Lecture nine Dr. Hamdy M. Mousa.
November 19, Algorithms and Data Structures Lecture XI Simonas Šaltenis Nykredit Center for Database Research Aalborg University
G RAPH A LGORITHMS Dr. Tanzima Hashem Assistant Professor CSE, BUET.
G RAPH A LGORITHMS Dr. Tanzima Hashem Assistant Professor CSE, BUET.
CSC317 1 At the same time: Breadth-first search tree: If node v is discovered after u then edge uv is added to the tree. We say that u is a predecessor.
Chapter 22: Elementary Graph Algorithms Overview: Definition of a graph Representation of graphs adjacency list matrix Elementary search algorithms breadth-first.
Graph Algorithms – 2. graphs Parenthesis Theorem Theorem 22.7 For all u, v, exactly one of the following holds: 1. d[u] < f [u] < d[v] < f [v] or.
CS138A Elementary Graph Algorithms Peter Schröder.
Introduction to Algorithms
Elementary Graph Algorithms
Chapter 22 Elementary Graph Algorithms
CS200: Algorithm Analysis
Elementary Graph Algorithms
Graph Algorithms – 2 DAGs Topological order
Intro to Graph Algorithms (Slides originally created by David Luebke)
Graph Representation Adjacency list representation of G = (V, E)
Lecture 10 Algorithm Analysis
Finding Shortest Paths
Graph Algorithms – 2.
Advanced Algorithms Analysis and Design
Advanced Algorithms Analysis and Design
Basic Graph Algorithms
Elementary Graph Algorithms
CSC 325: Algorithms Graph Algorithms David Luebke /24/2019.
Presentation transcript:

Elementary Graph Algorithms Many of the slides are from Prof. Plaisted’s resources at University of North Carolina at Chapel Hill

Graphs  Graph G = (V, E) »V = set of vertices »E = set of edges  (V  V)  Types of graphs »Undirected: edge (u, v) = (v, u); for all v, (v, v)  E (No self loops.) »Directed: (u, v) is edge from u to v, denoted as u  v. Self loops are allowed. »Weighted: each edge has an associated weight, given by a weight function w : E  R. »Dense: |E|  |V| 2. »Sparse: |E| << |V| 2.  |E| = O(|V| 2 )

Graphs  If (u, v)  E, then vertex v is adjacent to vertex u.  Adjacency relationship is: »Symmetric if G is undirected. »Not necessarily so if G is directed.  If G is connected: »There is a path between every pair of vertices. »|E|  |V| – 1. »Furthermore, if |E| = |V| – 1, then G is a tree.  Other definitions in Appendix B (B.4 and B.5) as needed.

Representation of Graphs  Two standard ways. »Adjacency Lists. »Adjacency Matrix. a dc b a b c d b a d dc c ab ac a dc b

Adjacency Lists  Consists of an array Adj of |V| lists.  One list per vertex.  For u  V, Adj[u] consists of all vertices adjacent to u. a dc b a b c d b c d dc a dc b a b c d b a d dc c ab ac If weighted, store weights also in adjacency lists.

Storage Requirement  For directed graphs: »Sum of lengths of all adj. lists is  out-degree(v) = |E| v  V »Total storage:  (V+E)  For undirected graphs: »Sum of lengths of all adj. lists is  degree(v) = 2|E| v  V »Total storage:  (V+E) No. of edges leaving v No. of edges incident on v. Edge (u,v) is incident on vertices u and v.

Pros and Cons: adj list  Pros »Space-efficient, when a graph is sparse. »Can be modified to support many graph variants.  Cons »Determining if an edge (u,v)  G is not efficient. Have to search in u’s adjacency list.  (degree(u)) time.  (V) in the worst case.

Adjacency Matrix  |V|  |V| matrix A.  Number vertices from 1 to |V| in some arbitrary manner.  A is then given by: a dc b a dc b A = A T for undirected graphs.

Space and Time  Space:  ( V 2 ). »Not memory efficient for large graphs.  Time: to list all vertices adjacent to u:  ( V ).  Time: to determine if ( u, v)  E:  ( 1 ).  Can store weights instead of bits for weighted graph.

Graph-searching Algorithms  Searching a graph: »Systematically follow the edges of a graph to visit the vertices of the graph.  Used to discover the structure of a graph.  Standard graph-searching algorithms. »Breadth-first Search (BFS). »Depth-first Search (DFS).

Breadth-first Search  Input: Graph G = ( V, E ), either directed or undirected, and source vertex s  V.  Output: »d[v] = distance (smallest # of edges, or shortest path) from s to v, for all v  V. d[v] =  if v is not reachable from s. »  [v] = u such that ( u, v ) is last edge on shortest path s v. u is v’s predecessor. »Builds breadth-first tree with root s that contains all reachable vertices. Definitions: Path between vertices u and v: Sequence of vertices (v 1, v 2, …, v k ) such that u=v 1 and v =v k, and (v i,v i+1 )  E, for all 1  i  k-1. Length of the path: Number of edges in the path. Path is simple if no vertex is repeated.

Breadth-first Search  Expands the frontier between discovered and undiscovered vertices uniformly across the breadth of the frontier. »A vertex is “discovered” the first time it is encountered during the search. »A vertex is “finished” if all vertices adjacent to it have been discovered.  Colors the vertices to keep track of progress. »White – Undiscovered. »Gray – Discovered but not finished. »Black – Finished. Colors are required only to reason about the algorithm. Can be implemented without colors.

BFS for Shortest Paths Finished Discovered Undiscovered S S S

BFS(G,s) 1.for each vertex u in V[G] – {s} 2do color[u]  white 3 d[u]   4  [u]  nil 5color[s]  gray 6d[s]  0 7  [s]  nil 8Q   9enqueue(Q,s) 10while Q   11do u  dequeue(Q) 12for each v in Adj[u] 13do if color[v] = white 14then color[v]  gray 15 d[v]  d[u]  [v]  u 17 enqueue(Q,v) 18color[u]  black BFS(G,s) 1.for each vertex u in V[G] – {s} 2do color[u]  white 3 d[u]   4  [u]  nil 5color[s]  gray 6d[s]  0 7  [s]  nil 8Q   9enqueue(Q,s) 10while Q   11do u  dequeue(Q) 12for each v in Adj[u] 13do if color[v] = white 14then color[v]  gray 15 d[v]  d[u]  [v]  u 17 enqueue(Q,v) 18color[u]  black white: undiscovered gray: discovered black: finished Q: a queue of discovered vertices color[v]: color of v d[v]: distance from s to v  [u]: predecessor of v

Example (BFS)  0       r s t u v w x y Q: s 0 (Courtesy of Prof. Jim Anderson)

Example (BFS)      r s t u v w x y Q: w r 1 1

Example (BFS)  2   r s t u v w x y Q: r t x 1 2 2

Example (BFS)  2  2 r s t u v w x y Q: t x v 2 2 2

Example (BFS)  r s t u v w x y Q: x v u 2 2 3

Example (BFS) r s t u v w x y Q: v u y 2 3 3

Example (BFS) r s t u v w x y Q: u y 3 3

Example (BFS) r s t u v w x y Q: y 3

Example (BFS) r s t u v w x y Q: 

Example (BFS) r s t u v w x y BF Tree

Analysis of BFS  Initialization takes O(V).  Traversal Loop »After initialization, each vertex is enqueued and dequeued at most once, and each operation takes O(1). So, total time for queuing is O(V). »The adjacency list of each vertex is scanned at most once. The sum of lengths of all adjacency lists is  (E).  Summing up over all vertices => total running time of BFS is O(V+E), linear in the size of the adjacency list representation of graph.  Correctness Proof »We omit for BFS and DFS. »Will do for later algorithms.

Breadth-first Tree  For a graph G = (V, E) with source s, the predecessor subgraph of G is G  = (V , E  ) where » V  ={v  V :  [v]  NIL }  {s} » E  ={(  [v],v)  E : v  V  - {s}}  The predecessor subgraph G  is a breadth-first tree if: » V  consists of the vertices reachable from s and » for all v  V , there is a unique simple path from s to v in G  that is also a shortest path from s to v in G.  The edges in E  are called tree edges. |E  | = |V  | - 1.

Depth-first Search (DFS)  Explore edges out of the most recently discovered vertex v.  When all edges of v have been explored, backtrack to explore other edges leaving the vertex from which v was discovered (its predecessor).  “Search as deep as possible first.”  Continue until all vertices reachable from the original source are discovered.  If any undiscovered vertices remain, then one of them is chosen as a new source and search is repeated from that source.

Depth-first Search  Input: G = ( V, E ), directed or undirected. No source vertex given!  Output: » 2 timestamps on each vertex. Integers between 1 and 2|V|. d[v] = discovery time (v turns from white to gray) f [v] = finishing time (v turns from gray to black) »  [v] : predecessor of v = u, such that v was discovered during the scan of u’s adjacency list.  Uses the same coloring scheme for vertices as BFS.

Pseudo-code DFS(G) 1. for each vertex u  V[G] 2. do color[u]  white 3.  [u]  NIL 4. time  0 5. for each vertex u  V[G] 6. do if color[u] = white 7. then DFS-Visit(u) DFS(G) 1. for each vertex u  V[G] 2. do color[u]  white 3.  [u]  NIL 4. time  0 5. for each vertex u  V[G] 6. do if color[u] = white 7. then DFS-Visit(u) Uses a global timestamp time. DFS-Visit(u) 1.color[u]  GRAY  White vertex u has been discovered 2.time  time d[u]  time 4. for each v  Adj[u] 5. do if color[v] = WHITE 6. then  [v]  u 7. DFS-Visit(v) 8. color[u]  BLACK  Blacken u; it is finished. 9. f[u]  time  time + 1 DFS-Visit(u) 1.color[u]  GRAY  White vertex u has been discovered 2.time  time d[u]  time 4. for each v  Adj[u] 5. do if color[v] = WHITE 6. then  [v]  u 7. DFS-Visit(v) 8. color[u]  BLACK  Blacken u; it is finished. 9. f[u]  time  time + 1

Example (DFS) 1/ u v w x y z (Courtesy of Prof. Jim Anderson)

Example (DFS) 1/ 2/ u v w x y z

Example (DFS) 1/ 3/ 2/ u v w x y z

Example (DFS) 1/ 4/ 3/ 2/ u v w x y z

Example (DFS) 1/ 4/ 3/ 2/ u v w x y z B

Example (DFS) 1/ 4/5 3/ 2/ u v w x y z B

Example (DFS) 1/ 4/5 3/6 2/ u v w x y z B

Example (DFS) 1/ 4/5 3/6 2/7 u v w x y z B

Example (DFS) 1/ 4/5 3/6 2/7 u v w x y z B F

Example (DFS) 1/8 4/5 3/6 2/7 u v w x y z B F

Example (DFS) 1/8 4/5 3/6 2/7 9/ u v w x y z B F

Example (DFS) 1/8 4/5 3/6 2/7 9/ u v w x y z B F C

Example (DFS) 1/8 4/5 3/6 10/ 2/7 9/ u v w x y z B F C

Example (DFS) 1/8 4/5 3/6 10/ 2/7 9/ u v w x y z B F C B

Example (DFS) 1/8 4/5 3/6 10/11 2/7 9/ u v w x y z B F C B

Example (DFS) 1/8 4/5 3/6 10/11 2/7 9/12 u v w x y z B F C B

Analysis of DFS  Loops on lines 1-2 & 5-7 take  (V) time, excluding time to execute DFS-Visit.  DFS-Visit is called once for each white vertex v  V when it’s painted gray the first time. Lines 3-6 of DFS- Visit is executed |Adj[v]| times. The total cost of executing DFS-Visit is  v  V |Adj[v]| =  (E)  Total running time of DFS is  (V+E).

Parenthesis Theorem Theorem 22.7 For all u, v, exactly one of the following holds: 1. d[u] < f [u] < d[v] < f [v] or d[v] < f [v] < d[u] < f [u] and neither u nor v is a descendant of the other. 2. d[u] < d[v] < f [v] < f [u] and v is a descendant of u. 3. d[v] < d[u] < f [u] < f [v] and u is a descendant of v. Theorem 22.7 For all u, v, exactly one of the following holds: 1. d[u] < f [u] < d[v] < f [v] or d[v] < f [v] < d[u] < f [u] and neither u nor v is a descendant of the other. 2. d[u] < d[v] < f [v] < f [u] and v is a descendant of u. 3. d[v] < d[u] < f [u] < f [v] and u is a descendant of v.  So d[u] < d[ v ] < f [u] < f [ v ] cannot happen.  Like parentheses:  OK: ( ) [ ] ( [ ] ) [ ( ) ]  Not OK: ( [ ) ] [ ( ] ) Corollary v is a proper descendant of u if and only if d[u] < d[ v ] < f [ v ] < f [u].

Example (Parenthesis Theorem) 3/6 4/5 7/8 12/13 2/9 1/10 y z s x w v BF 14/15 11/16 u t C C C C B (s (z (y (x x) y) (w w) z) s) (t (v v) (u u) t)

Depth-First Trees  Predecessor subgraph defined slightly different from that of BFS.  The predecessor subgraph of DFS is G  = (V, E  ) where E  ={(  [v],v) : v  V and  [v]  NIL}. »How does it differ from that of BFS? » The predecessor subgraph G  forms a depth-first forest composed of several depth-first trees. The edges in E  are called tree edges. Definition: Forest: An acyclic graph G that may be disconnected.

White-path Theorem Theorem 22.9 v is a descendant of u if and only if at time d[u], there is a path u v consisting of only white vertices. (Except for u, which was just colored gray.) Theorem 22.9 v is a descendant of u if and only if at time d[u], there is a path u v consisting of only white vertices. (Except for u, which was just colored gray.)

DFS: Kinds of edges  DFS introduces an important distinction among edges in the original graph: »Tree edge: encounter new (white) vertex »Back edge: from descendent to ancestor »Forward edge: from ancestor to descendent »Cross edge: between a tree or subtrees  Note: tree & back edges are important; most algorithms don’t distinguish forward & cross

Classification of Edges  Tree edge: in the depth-first forest. Found by exploring ( u, v ).  Back edge: ( u, v ), where u is a descendant of v ( in the depth-first tree ). (include self-loop)  Forward edge: ( u, v ), where v is a descendant of u, but not a tree edge.  Cross edge: any other edge. Can go between vertices in same depth-first tree or in different depth-first trees. Theorem: In DFS of an undirected graph, we get only tree and back edges. No forward or cross edges. Theorem: In DFS of an undirected graph, we get only tree and back edges. No forward or cross edges.

Identification of Edges  Edge type for edge (u, v) can be identified when it is first explored by DFS.  Identification is based on the color of v. »White – tree edge. »Gray – back edge. »Black – forward or cross edge.

Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.

DFS: Kinds Of Edges  If G is undirected, a DFS produces only tree and back edges  Proof by contradiction: »Assume there’s a forward edge But F? edge must actually be a back edge (why?) source F?

DFS: Kinds Of Edges  If G is undirected, a DFS produces only tree and back edges  Proof by contradiction: »Assume there’s a cross edge But C? edge cannot be cross: must be explored from one of the vertices it connects, becoming a tree vertex, before other vertex is explored So in fact the picture is wrong…both lower tree edges cannot in fact be tree edges source C?

Directed Acyclic Graph  DAG – Directed graph with no cycles.  Good for modeling processes and structures that have a partial order: »a > b and b > c  a > c. »But may have a and b such that neither a > b nor b > a.  Can always make a total order (either a > b or b > a for all a  b) from a partial order.

Example DAG of dependencies for putting on goalie equipment. socksshorts hose pants skates leg pads T-shirt chest pad sweater mask catch glove blocker batting glove

Characterizing a DAG Proof:   : Show that back edge  cycle. »Suppose there is a back edge ( u, v). Then v is ancestor of u in depth-first forest. »Therefore, there is a path v u, so v u v is a cycle. Lemma A directed graph G is acyclic iff a DFS of G yields no back edges. Lemma A directed graph G is acyclic iff a DFS of G yields no back edges. vu TTT B

Characterizing a DAG Proof (Contd.):   : Show that a cycle implies a back edge. »c : cycle in G, v : first vertex discovered in c, ( u, v) : v’s preceding edge in c. »At time d[v], vertices of c form a white path v u. Why? »By white-path theorem, u is a descendent of v in depth-first forest. »Therefore, ( u, v) is a back edge. Lemma A directed graph G is acyclic iff a DFS of G yields no back edges. Lemma A directed graph G is acyclic iff a DFS of G yields no back edges. vu TTT B

Topological Sort Want to “sort” a directed acyclic graph (DAG). B E D C A C E D A B Think of original DAG as a partial order. Want a total order that extends this partial order.

Topological Sort  Performed on a DAG.  Linear ordering of the vertices of G such that if (u, v)  E, then u appears somewhere before v. Topological-Sort ( G ) 1.call DFS ( G ) to compute finishing times f [ v ] for all v  V 2.as each vertex is finished, insert it onto the front of a linked list 3.return the linked list of vertices Topological-Sort ( G ) 1.call DFS ( G ) to compute finishing times f [ v ] for all v  V 2.as each vertex is finished, insert it onto the front of a linked list 3.return the linked list of vertices Time:  ( V + E ).

Example Linked List: A B D C E 1/ (Courtesy of Prof. Jim Anderson)

Example Linked List: A B D C E 1/ 2/

Example Linked List: A B D C E 1/ 2/3 E

Example Linked List: A B D C E 1/4 2/3 E 1/4 D

Example Linked List: A B D C E 1/4 2/3 E 1/4 D 5/

Example Linked List: A B D C E 1/4 2/3 E 1/4 D 5/ 6/

Example Linked List: A B D C E 1/4 2/3 E 1/4 D 5/ 6/7 C

Example Linked List: A B D C E 1/4 2/3 E 1/4 D 5/8 6/7 C 5/8 B

Example Linked List: A B D C E 1/4 2/3 E 1/4 D 5/8 6/7 C 5/8 B 9/

Example Linked List: A B D C E 1/4 2/3 E 1/4 D 5/8 6/7 C 5/8 B 9/10 A

Correctness Proof  Just need to show if ( u, v)  E, then f [ v ] < f [u].  When we explore ( u, v), what are the colors of u and v ? »u is gray. »Is v gray, too? No, because then v would be ancestor of u.  ( u, v) is a back edge.  contradiction of Lemma (dag has no back edges). »Is v white? Then becomes descendant of u. By parenthesis theorem, d[u] < d[ v ] < f [ v ] < f [u]. »Is v black? Then v is already finished. Since we’re exploring ( u, v), we have not yet finished u. Therefore, f [ v ] < f [u].

Strongly Connected Components  G is strongly connected if every pair (u, v) of vertices in G is reachable from one another.  A strongly connected component (SCC) of G is a maximal set of vertices C  V such that for all u, v  C, both u v and v u exist.

Component Graph  G SCC = ( V SCC, E SCC ).  V SCC has one vertex for each SCC in G.  E SCC has an edge if there’s an edge between the corresponding SCC’s in G.  G SCC for the example considered:

G SCC is a DAG Proof:  Suppose there is a path v v in G.  Then there are paths u u v and v v u in G.  Therefore, u and v are reachable from each other, so they are not in separate SCC’s. Lemma Let C and C be distinct SCC’s in G, let u, v  C, u, v  C, and suppose there is a path u u in G. Then there cannot also be a path v v in G. Lemma Let C and C be distinct SCC’s in G, let u, v  C, u, v  C, and suppose there is a path u u in G. Then there cannot also be a path v v in G.

Transpose of a Directed Graph  G T = transpose of directed G. »G T = ( V, E T ), E T = {( u, v ) : ( v, u )  E }. »G T is G with all edges reversed.  Can create G T in Θ ( V + E ) time if using adjacency lists.  G and G T have the same SCC’s. (u and v are reachable from each other in G if and only if reachable from each other in G T.)

Algorithm to determine SCCs SCC ( G ) 1.call DFS ( G ) to compute finishing times f [u] for all u 2.compute G T 3.call DFS ( G T ), but in the main loop, consider vertices in order of decreasing f [u] (as computed in first DFS) 4.output the vertices in each tree of the depth-first forest formed in second DFS as a separate SCC SCC ( G ) 1.call DFS ( G ) to compute finishing times f [u] for all u 2.compute G T 3.call DFS ( G T ), but in the main loop, consider vertices in order of decreasing f [u] (as computed in first DFS) 4.output the vertices in each tree of the depth-first forest formed in second DFS as a separate SCC Time:  ( V + E ).

Example 13/14 12/15 3/4 2/7 11/16 1/10 a b c e f g 5/6 8/9 h d (Courtesy of Prof. Jim Anderson) G

Example a b c e f g h d GTGT

cd h fg abe

How does it work?  Idea: »By considering vertices in second DFS in decreasing order of finishing times from first DFS, we are visiting vertices of the component graph in topologically sorted order. »Because we are running DFS on G T, we will not be visiting any v from a u, where v and u are in different components.  Notation: »d[u] and f [u] always refer to first DFS. »Extend notation for d and f to sets of vertices U  V: »d ( U ) = min u  U { d[u] } (earliest discovery time) »f ( U ) = max u  U { f [u] } (latest finishing time)

SCCs and DFS finishing times Proof:  Case 1: d ( C ) < d ( C ) »Let x be the first vertex discovered in C. »At time d[x], all vertices in C and C are white. Thus, there exist paths of white vertices from x to all vertices in C and C. »By the white-path theorem, all vertices in C and C are descendants of x in depth- first tree. »By the parenthesis theorem, f [x] = f ( C ) > f ( C ). Lemma Let C and C be distinct SCC’s in G = ( V, E ). Suppose there is an edge ( u, v)  E such that u  C and v  C. Then f ( C ) > f ( C ). Lemma Let C and C be distinct SCC’s in G = ( V, E ). Suppose there is an edge ( u, v)  E such that u  C and v  C. Then f ( C ) > f ( C ). C C u v x

SCCs and DFS finishing times Proof:  Case 2: d ( C ) > d ( C ) »Let y be the first vertex discovered in C. »At time d[y], all vertices in C are white and there is a white path from y to each vertex in C  all vertices in C become descendants of y. Again, f [y] = f ( C ). »At time d[y], all vertices in C are also white. »By earlier lemma, since there is an edge ( u, v), we cannot have a path from C to C. »So no vertex in C is reachable from y. »Therefore, at time f [y], all vertices in C are still white. »Therefore, for all w  C, f [ w ] > f [y], which implies that f ( C ) > f ( C ). Lemma Let C and C be distinct SCC’s in G = ( V, E ). Suppose there is an edge ( u, v)  E such that u  C and v  C. Then f ( C ) > f ( C ). Lemma Let C and C be distinct SCC’s in G = ( V, E ). Suppose there is an edge ( u, v)  E such that u  C and v  C. Then f ( C ) > f ( C ). C C u v y x

SCCs and DFS finishing times Proof:  ( u, v)  E T  (v, u )  E.  Since SCC’s of G and G T are the same, f ( C ) > f ( C ), by Lemma Corollary Let C and C be distinct SCC’s in G = ( V, E ). Suppose there is an edge ( u, v)  E T, where u  C and v  C. Then f ( C ) < f ( C ). Corollary Let C and C be distinct SCC’s in G = ( V, E ). Suppose there is an edge ( u, v)  E T, where u  C and v  C. Then f ( C ) < f ( C ).

Correctness of SCC  When we do the second DFS, on G T, start with SCC C such that f ( C ) is maximum. »The second DFS starts from some x  C, and it visits all vertices in C. »Corollary says that since f ( C ) > f ( C ) for all C  C, there are no edges from C to C in G T. »Therefore, DFS will visit only vertices in C. »Which means that the depth-first tree rooted at x contains exactly the vertices of C.

Correctness of SCC  The next root chosen in the second DFS is in SCC C such that f ( C ) is maximum over all SCC’s other than C. »DFS visits all vertices in C, but the only edges out of C go to C, which we’ve already visited. »Therefore, the only tree edges will be to vertices in C.  We can continue the process.  Each time we choose a root for the second DFS, it can reach only »vertices in its SCC—get tree edges to these, »vertices in SCC’s already visited in second DFS—get no tree edges to these.