Copyright 2000, Kevin Wayne 1

Similar documents
Undirected Graphs. V = { 1, 2, 3, 4, 5, 6, 7, 8 } E = { 1-2, 1-3, 2-3, 2-4, 2-5, 3-5, 3-7, 3-8, 4-5, 5-6 } n = 8 m = 11

Chapter 3. Graphs. Slides by Kevin Wayne. Copyright 2005 Pearson-Addison Wesley. All rights reserved.

3.1 Basic Definitions and Applications

3.1 Basic Definitions and Applications. Chapter 3. Graphs. Undirected Graphs. Some Graph Applications

CSE 417: Algorithms and Computational Complexity. 3.1 Basic Definitions and Applications. Goals. Chapter 3. Winter 2012 Graphs and Graph Algorithms

Chapter 3. Graphs CLRS Slides by Kevin Wayne. Copyright 2005 Pearson-Addison Wesley. All rights reserved.

Copyright 2000, Kevin Wayne 1

CS781 Lecture 2 January 13, Graph Traversals, Search, and Ordering

12/5/17. trees. CS 220: Discrete Structures and their Applications. Trees Chapter 11 in zybooks. rooted trees. rooted trees

3.1 Basic Definitions and Applications

Algorithm Design and Analysis

Algorithms: Lecture 10. Chalmers University of Technology

CS 220: Discrete Structures and their Applications. graphs zybooks chapter 10

Chapter 3. Graphs CLRS Slides by Kevin Wayne. Copyright 2005 Pearson-Addison Wesley. All rights reserved.

3. GRAPHS. Lecture slides by Kevin Wayne Copyright 2005 Pearson-Addison Wesley Copyright 2013 Kevin Wayne. Last updated on Sep 8, :19 AM

Graph Algorithms. Imran Rashid. Jan 16, University of Washington

3.4 Testing Bipartiteness

Goals! CSE 417: Algorithms and Computational Complexity!

Lecture 3: Graphs and flows

CSE 421 DFS / Topological Ordering

Algorithm Design and Analysis

Plan. CMPSCI 311: Introduction to Algorithms. Recall. Adjacency List Representation. DFS Descriptions. BFS Description

Graph Traversals. CS200 - Graphs 1

CS 310 Advanced Data Structures and Algorithms

Graph: representation and traversal

Graphs. Part I: Basic algorithms. Laura Toma Algorithms (csci2200), Bowdoin College

Graph Algorithms. Definition

Graph Algorithms Using Depth First Search

Scribes: Romil Verma, Juliana Cook (2015), Virginia Williams, Date: May 1, 2017 and Seth Hildick-Smith (2016), G. Valiant (2017), M.

Outline. Graphs. Divide and Conquer.

W4231: Analysis of Algorithms

CS 491 CAP Introduction to Graphs and Search

Objec&ves. Wrap up: Implemen&ng BFS and DFS Graph Applica&on: Bipar&te Graphs. Get out your BFS implementa&on handouts. Feb 2, 2018 CSCI211 - Sprenkle

Graph. Vertex. edge. Directed Graph. Undirected Graph

Graph Representations and Traversal

implementing the breadth-first search algorithm implementing the depth-first search algorithm

Searching in Graphs (cut points)

Graph Algorithms. Chapter 22. CPTR 430 Algorithms Graph Algorithms 1

Lecture 2 - Graph Theory Fundamentals - Reachability and Exploration 1

Graph Representation

Graphs. A graph is a data structure consisting of nodes (or vertices) and edges. An edge is a connection between two nodes

Strongly connected: A directed graph is strongly connected if every pair of vertices are reachable from each other.

Trees. Arash Rafiey. 20 October, 2015

Depth-First Search Depth-first search (DFS) is another way to traverse the graph.

Figure 1: A directed graph.

COL351: Analysis and Design of Algorithms (CSE, IITD, Semester-I ) Name: Entry number:

MA/CSSE 473 Day 12. Questions? Insertion sort analysis Depth first Search Breadth first Search. (Introduce permutation and subset generation)

CS4800: Algorithms & Data Jonathan Ullman

Algorithm Design and Analysis

LECTURE NOTES OF ALGORITHMS: DESIGN TECHNIQUES AND ANALYSIS

Graph Algorithms. Andreas Klappenecker. [based on slides by Prof. Welch]

CS 125 Section #6 Graph Traversal and Linear Programs 10/13/14

CSE 421 Applications of DFS(?) Topological sort

Announcements. HW3 is graded. Average is 81%

Applications of BDF and DFS

Depth First Search (DFS)

Fundamental Graph Algorithms Part Three

Solutions to relevant spring 2000 exam problems

CSE 331: Introduction to Algorithm Analysis and Design Graphs

DHANALAKSHMI COLLEGE OF ENGINEERING, CHENNAI. Department of Computer Science and Engineering CS6301 PROGRAMMING DATA STRUCTURES II

CS Lunch. Grace Hopper??? O()? Slides06 BFS, DFS, Bipartite.key - October 3, 2016

Elementary Graph Algorithms. Ref: Chapter 22 of the text by Cormen et al. Representing a graph:

CS200: Graphs. Prichard Ch. 14 Rosen Ch. 10. CS200 - Graphs 1

Adjacent: Two distinct vertices u, v are adjacent if there is an edge with ends u, v. In this case we let uv denote such an edge.

Computer Science & Engineering 423/823 Design and Analysis of Algorithms

Algorithms: Graphs. Amotz Bar-Noy. Spring 2012 CUNY. Amotz Bar-Noy (CUNY) Graphs Spring / 95

Fundamental Graph Algorithms Part Four

Lecture 26: Graphs: Traversal (Part 1)

Lecture 22 Tuesday, April 10

Lecture 9 Graph Traversal

CSI 604 Elementary Graph Algorithms

BACKGROUND: A BRIEF INTRODUCTION TO GRAPH THEORY

Chapter 3. Graphs. 3.1 Basic Definitions and Applications

Chapter 5. Decrease-and-Conquer. Copyright 2007 Pearson Addison-Wesley. All rights reserved.

CSE101: Design and Analysis of Algorithms. Ragesh Jaiswal, CSE, UCSD

Lecture 10. Elementary Graph Algorithm Minimum Spanning Trees

Lecture 10 Graph algorithms: testing graph properties

CSE 417: Algorithms and Computational Complexity

CS521 \ Notes for the Final Exam

Algorithm Design and Analysis

CS 580: Algorithm Design and Analysis. Jeremiah Blocki Purdue University Spring 2019

Introduction to Graphs. CS2110, Spring 2011 Cornell University

Algorithms. Graphs. Algorithms

Homework 2. Sample Solution. Due Date: Thursday, May 31, 11:59 pm

Section Summary. Introduction to Trees Rooted Trees Trees as Models Properties of Trees

Computer Science & Engineering 423/823 Design and Analysis of Algorithms

CS 161 Lecture 11 BFS, Dijkstra s algorithm Jessica Su (some parts copied from CLRS) 1 Review

Basic Graph Algorithms (CLRS B.4-B.5, )

Strongly Connected Components

Introduction to Algorithms

GRAPHS Lecture 17 CS2110 Spring 2014

COMPSCI 311: Introduction to Algorithms First Midterm Exam, October 3, 2018

COP 4531 Complexity & Analysis of Data Structures & Algorithms

Basic Graph Definitions

DFS & STRONGLY CONNECTED COMPONENTS

(Re)Introduction to Graphs and Some Algorithms

22.3 Depth First Search

UNIT III TREES. A tree is a non-linear data structure that is used to represents hierarchical relationships between individual data items.

Definition f(n) = O(g(n)) if there exists n 0, c such that for all n n 0, f(n) cg(n). g is an asymptotic upper bound on f.

1 Connected components in undirected graphs

Transcription:

Chapter 3 - Graphs Undirected Graphs Undirected graph. G = (V, E) V = nodes. E = edges between pairs of nodes. Captures pairwise relationship between objects. Graph size parameters: n = V, m = E. Directed Graphs Directed graph. G = (V, E) Edge (u, v) goes from node u to node v. V = { 1, 2, 3, 4, 5, 6, 7, 8 } E = { (1,2), (1,3), (2,3), (2,4), (2,5), (3,5), (3,7), (3,8), (4,5), (5,6) } n = 8 m = 11 Example. Web graph - hyperlink points from one web page to another. Modern web search engines exploit hyperlink structure to rank web pages by importance. http://datamining.typepad.com/gallery/blog-map-gallery.html 2 3 Examples Social network graph World Wide web Node: person web page Edge: undirected directed, link relationship A graph of blogosphere links Google maps Transportation graph. Nodes: street addresses Edges: streets/highways http://people.oii.ox.ac.uk/hogan/ 2010/01/new-pinwheel-network-layout/ http://googlesystem.blogspot.com/2007/05/worldwide-web-as-seen-by-google.html http://datamining.typepad.com/gallery/blog-map-gallery.html 4 5 6 Copyright 2000, Kevin Wayne 1

Graph transportation Graph Applications Nodes Edges street intersections highways communication computers Network connections World Wide Web web pages hyperlinks social people relationships scheduling tasks precedence Graph Representation: Adjacency Matrix Adjacency matrix. n-by-n matrix with A uv = 1 if (u, v) is an edge. For undirected graphs, each edge is represented twice. Space proportional to n 2. Checking if (u, v) is an edge takes Θ(1) time. Identifying all outgoing edges from a node takes Θ(n) time. 1 2 3 4 5 6 7 8 1 0 1 1 0 0 0 0 0 2 1 0 1 1 1 0 0 0 3 1 1 0 0 1 0 1 1 4 0 1 0 1 1 0 0 0 5 0 1 1 1 0 1 0 0 6 0 0 0 0 1 0 0 0 7 0 0 1 0 0 0 0 1 8 0 0 1 0 0 0 1 0 Graph Representation: Adjacency List Adjacency list. Node indexed array of lists. For undirected graphs, each edge represented twice. Space proportional to m + n. Checking if (u, v) is an edge takes O(deg(u)) time. Identifying all outgoing edges from a node takes O(deg(u)) time Identifying all edges takes Θ(m + n) time. 1 2 3 2 3 4 2 5 5 6 7 3 8 8 1 3 4 5 1 2 5 7 8 2 3 4 6 5 3 7 degree = number of neighbors of u 7 8 9 Which Implementation Which implementation best supports common graph operations: Is there an edge between vertex i and vertex j? Find all vertices adjacent to vertex j Which best uses space? Paths and Connectivity Def. A path in an undirected graph G = (V, E) is a sequence P of nodes v 1, v 2,, v k-1, v k with the property that each consecutive pair v i, v i+1 is joined by an edge in G. Def. An undirected graph is connected if for every pair of nodes u and v, there is a path between u and v. Cycles Def. A cycle is a path v 1, v 2,, v k-1, v k in which v 1 = v k, k > 2, and the first k-1 nodes are distinct. cycle C = 1-2-4-5-3-1 10 11 12 Copyright 2000, Kevin Wayne 2

Trees Def. An undirected graph is a tree if it is connected and does not contain a cycle. How many edges does a tree have? Trees Def. An undirected graph is a tree if it is connected and does not contain a cycle. How many edges does a tree have? Given a set of nodes, build a tree step wise every time you add an edge, you must add a new node to the growing tree. WHY? how many edges to connect n nodes? Trees Def. An undirected graph is a tree if it is connected and does not contain a cycle. Theorem. Let G be an undirected graph on n nodes. Any two of the following statements imply the third. G is connected. G does not contain a cycle. G has n-1 edges. 13 14 15 Rooted Trees Phylogenetic Trees Traversing a Rooted Binary Tree Rooted tree. Given a tree T, choose a root node r and orient each edge away from r. Models hierarchical structure. root r parent of v Phylogeny. Describes the evolutionary history of species. Pre order visit the node go left go right In order go left visit the node go right A Post order go left go right visit the node Level order / breadth first for d = 0 to height visit nodes at level d v B C child of v D E F a tree the same tree, rooted at 1 G H I 16 http://www.whozoo.org/mammals/carnivores/cat_phylogeny.htm 17 Copyright 2000, Kevin Wayne 3

Traversal Examples Traversal Implementation recursive implementation of preorder 3.2 Graph Traversal D B A E C F Pre order A B D G H C E F I In order G D H B A E C F I Post order G H D B E I F C A The steps: visit node preorder(left child) preorder(right child) What changes need to be made for in-order, postorder? How would you implement level order? G H I Level order A B C D E F G H I Connectivity s-t connectivity problem. Given two nodes s and t, is there a path between s and t? s-t shortest path problem. Given two node s and t, what is the length of the shortest path between s and t? Graph Traversal What makes it different from rooted tree traversal? Graph Traversal What makes it different from rooted tree traversal: cycles What to do about it? 22 Copyright 2000, Kevin Wayne 4

Graph Traversal Breadth First Search Breadth First Search What makes it different from rooted tree traversal: cycles What to do about it? mark the nodes BFS intuition. Explore outward from s, adding nodes one "layer" at a time. BFS algorithm. L 0 = { s }. s L 1 L 2 L n-1 L 1 = all neighbors of L 0. L 2 = all nodes that do not belong to L 0 or L 1, and that have an edge to a node in L 1. L i+1 = all nodes that do not belong to an earlier layer, and that have an edge to a node in L i. Property. Let T be a BFS tree of G, and let (x, y) be an edge of G. Then the level of x and y differ by at most 1. L 0 Theorem. For each i, L i consists of all nodes at distance exactly i from s. There is a path from s to t iff t appears in some layer. L 1 L 2 L 3 26 27 BFS - implementation bfs(v) : q queue of nodes to be processed q.enque(v) while(q is non empty) : u = q.dequeue() for (each node v adjacent to u) : if v is unexplored : q.enqueue(v) Claim: this implementation explores nodes in order of their appearance in BFS layers BFS - implementation bfs(v) : q queue of nodes to be processed q.enque(v) while(q is non empty) : u = q.dequeue() for (each node v adjacent to u) : if v is unexplored : q.enqueue(v) How would you compute distances using this implementation? Breadth First Search: Analysis bfs(v) : q queue of nodes to be processed q.enque(v) while(q is non empty) : u = q.dequeue() for (each node v adjacent to u) : if v is unexplored : q.enqueue(v) Theorem. The above implementation of BFS runs in O(m + n) time if the graph is given by its adjacency list representation. Proof: when we consider node u, there are deg(u) incident edges (u, v) total time processing edges is Σ u V deg(u) = 2m each edge (u, v) is counted exactly twice in sum: once in deg(u) and once in deg(v) 30 Copyright 2000, Kevin Wayne 5

Connected Components Connected graph. There is a path between any pair of nodes. Connected component of a node s. The set of all nodes reachable from s. Connected Components Connected component of a node s. The set of all nodes reachable from s. Given two nodes s, and t, what can you say about their connected components? Connected Components Connected component of a node s. The set of all nodes reachable from s. Given two nodes s, and t, their connected components are either identical or disjoint. Connected component containing node 1 = { 1, 2, 3, 4, 5, 6, 7, 8 }. 31 32 33 The facebook graph u 721 million active accounts u 68.7 billion friendship edges (median number of friends = 99) u The largest connected component of facebook users contains 99.9% of the users u Average distance between any pair of users: 4.7 Connected Components A generic algorithm for finding connected components: R = {s} # the connected component of s is initially s. while there is an edge (u,v) where u is in R and v is not in R: add v to R s R u v DFS: Depth First Search Explores edges from the most recently discovered node; backtracks when reaching a dead-end. source: http://arxiv.org/pdf/1111.4503v1.pdf 34 Upon termination, R is the connected component containing s. BFS: explore in order of distance from s. DFS: explores edges from the most recently discovered node; backtracks when reaching a dead-end. 35 Copyright 2000, Kevin Wayne 6

DFS: Depth First Search DFS - nonrecursively DFS - Analysis Explores edges from the most recently discovered node; backtracks when reaching a dead-end. Recursively: DFS(u): mark u as Explored and add u to R for each edge (u,v) : if v is not marked Explored : DFS(v) DFS(u): mark u as Explored and add u to R for each edge (u,v) : if v is not marked Explored : DFS(v) DFS(v) : s stack of nodes to be processed mark v as Explored s.push(v) while(s is non empty) : u = s.pop() for (each node v adjacent to u) : if v is not Explored : mark v as Explored s.push(v) 38 DFS(v) : s stack of nodes to be processed s.push(v) mark v as Explored while(s is non empty) : u = s.pop() for (each node v adjacent to u) : if v is not Explored : mark v as Explored s.push(v) Theorem. The above implementation of DFS runs in O(m + n) time if the graph is given by its adjacency list representation. Proof: Same as in BFS 39 Bipartite Graphs Testing Bipartiteness 3.4 Testing Bipartiteness Def. An undirected graph G = (V, E) is bipartite if the nodes can be colored red or blue such that every edge has one red end and one blue end. Applications. Scheduling: machines = red, jobs = blue. Testing bipartiteness. Given a graph G, is it bipartite? Many graph problems become tractable if the underlying graph is bipartite (independent set) A graph is bipartite if it is 2-colorable v 2 v 3 v 2 v 1 v 4 v 6 v 5 v 4 v 3 v 5 v 6 v 7 v 1 v 7 a bipartite graph a bipartite graph G another drawing of G 41 42 Copyright 2000, Kevin Wayne 7

q q q q q Algorithm for testing if a graph is bipartite Pick a node s and color it blue Its neighbors must be colored red. Their neighbors must be colored blue. Proceed until the graph is colored. Check that there is no edge whose ends are the same color. An Obstacle to Bipartiteness Which of these graphs is 2-colorable? An Obstacle to Bipartiteness Lemma. If a graph G is bipartite, it cannot contain an odd cycle. Proof. Not possible to 2-color the odd cycle, let alone G. bipartite (2-colorable) not bipartite (not 2-colorable) L 1 L 2 L 3 43 44 45 Bipartite Graphs Lemma. Let G be a connected graph, and let L 0,, L k be the layers produced by BFS starting at node s. Exactly one of the following holds. (i) No edge of G joins two nodes of the same layer. G is bipartite. (ii) An edge of G joins two nodes of the same layer. G contains an odd-length cycle (and hence is not bipartite). Bipartite Graphs Lemma. Let G be a connected graph, and let L 0,, L k be the layers produced by BFS starting at node s. Exactly one of the following holds. (i) No edge of G joins two nodes of the same layer. G is bipartite. (ii) An edge of G joins two nodes of the same layer. G contains an odd-length cycle (and hence is not bipartite). Proof. (i) Suppose no edge joins two nodes in the same layer. I.e. all edges join nodes on adjacent layers. Bipartition: red = nodes on odd levels, blue = nodes on even levels. Bipartite Graphs Lemma. Let G be a connected graph, and let L 0,, L k be the layers produced by BFS starting at node s. Exactly one of the following holds. (i) No edge of G joins two nodes of the same layer. G is bipartite. (ii) An edge of G joins two nodes of the same layer. G contains an odd-length cycle (and hence is not bipartite). Proof. (ii) Suppose (x, y) is an edge with x, y in same level L j. Let z = lca(x, y) = lowest common ancestor. Let L i be level containing z. Consider cycle that takes edge from x to y, then path from y to z, then path from z to x. Its length is 1 + (j-i) + (j-i), which is odd. z = lca(x, y) L 1 L 2 L 3 Case (i) L 1 L 2 L 3 Case (ii) L 1 L 2 L 3 Case (i) (x, y) path from y to z path from z to x 46 47 48 Copyright 2000, Kevin Wayne 8

3.5 Connectivity in Directed Graphs Directed Graphs Directed graph. G = (V, E) Edge (u, v) goes from node u to node v. Example. Web graph - hyperlink points from one web page to another. Modern web search engines exploit hyperlink structure to rank web pages by importance. Graph Search Directed reachability. Given a node s, find all nodes reachable from s. Web crawler. Start from web page s. Find all web pages linked from s, either directly or indirectly. BFS and DFS extend naturally to directed graphs. Given a path from s to t, not guaranteed there is a path from t to s. 50 51 Strong Connectivity Def. Nodes u and v are mutually reachable if there is a path from u to v and also a path from v to u. Def. A graph is strongly connected if every pair of nodes is mutually reachable. Lemma. Let s be any node. G is strongly connected iff every node is reachable from s, and s is reachable from every node. Proof. Follows from definition. Proof. Path from u to v: concatenate u-s path with s-v path. Path from v to u: concatenate v-s path with s-u path. Strong Connectivity: Algorithm Theorem. Can determine if G is strongly connected in O(m + n) time. Proof. Pick any node s. reverse orientation of every edge in G Run BFS from s in G. Run BFS from s in G rev. Return true iff all nodes reached in both BFS executions. Correctness follows immediately from previous lemma. 3.6 DAGs and Topological Ordering s u strongly connected not strongly connected v 52 53 Copyright 2000, Kevin Wayne 9

DAGs and Topological Ordering Examples: Graphs Describing Precedence prerequisites for a set of courses dependencies between programs dependencies between jobs Order of putting your clothes on Precedence constraints. Edge (v i, v j ) means task v i must occur before v j. Topological ordering: a total ordering of the nodes that respects the precedence relation Example: An ordering of CS courses Graphs describing precedence must not contain cycles. Why? Def. A Directed Acyclic Graph (DAG) is a directed graph that contains no directed cycles. Def. A topological order of a directed graph G is an ordering of its nodes as v 1, v 2,, v n so that for every edge (v i, v j ) we have i < j. v 2 v 3 v 6 v 5 v 4 v 7 v 1 a DAG v 1 v 2 v 3 v 4 v 5 v 6 v 7 a topological ordering Example Is the topological order unique? Batman images are from the book Introduction to bioinformatics algorithms 56 Lemma. If G has a topological order, then G is a DAG. Lemma. If G has a topological order, then G is a DAG. Proof. (by contradiction) Suppose that G has a topological order v 1,, v n and that G also has a directed cycle C. Let v i be the lowest-indexed node in C, and let v j be the node just before v i ; thus (v j, v i ) is an edge. By our choice of i, we have i < j. On the other hand, since (v j, v i ) is an edge and v 1,, v n is a topological order, we must have j < i, a contradiction. the directed cycle C the directed cycle C v 1 v i v j v n v 1 v i v j v n the supposed topological order: v 1,, v n the supposed topological order: v 1,, v n 59 60 Copyright 2000, Kevin Wayne 10

Lemma. If G has a topological order, then G is a DAG. Q. Does every DAG have a topological ordering? Q. If so, how do we compute one? Lemma. If G is a DAG, then G has a node with no incoming edges. Lemma. If G is a DAG, then G has a node with no incoming edges. Proof. (by contradiction) Suppose that G is a DAG and every node has at least one incoming edge. Pick any node v, and begin following edges backward from v. Repeat. After n + 1 steps we will have visited a node, say w, twice. Let C denote the sequence of nodes encountered between successive visits to w. C is a cycle - contradiction. w x u v w x u v 61 62 63 Lemma. If G is a DAG, then G has a topological ordering. Proof. (by induction on n) Base case: true if n = 1. Given a DAG with n > 1 nodes, find a node v with no incoming edges. G - { v } is a DAG, since deleting v cannot create cycles. By induction hypothesis, G - { v } has a topological ordering. Place v first in topological ordering; append nodes of G - { v } in topological order. Algorithm: Topological Sort: Algorithm keep track of # incoming edges per node while (nodes left) : extract one with 0 incoming subtract one from all its adjacent nodes Running time? Better way? Topological Sort: Algorithm Running Time Theorem. Algorithm can run in O(m + n) time. Proof. Maintain the following information: count[w] = remaining number of incoming edges S = set of nodes with no incoming edges Initialization: O(m + n) via single scan through graph. Update: pick a node v in S remove v from S for each edge (v, w) : decrement count[w] and add w to S if count[w] hits 0 this is O(1) per edge 64 65 66 Copyright 2000, Kevin Wayne 11