Class 2: Graph theory and basic terminology Learning the language Network Science: Graph Theory 2012 Prof. Albert-László Barabási Dr. Baruch Barzel, Dr.

Slides:



Advertisements
Similar presentations
CSE 211 Discrete Mathematics
Advertisements

Chapter 9 Graphs.
Lecture 5 Graph Theory. Graphs Graphs are the most useful model with computer science such as logical design, formal languages, communication network,
22C:19 Discrete Math Graphs Fall 2010 Sukumar Ghosh.
22C:19 Discrete Math Graphs Fall 2014 Sukumar Ghosh.
Midwestern State University Department of Computer Science Dr. Ranette Halverson CMPS 2433 – CHAPTER 4 GRAPHS 1.
Graph-02.
Graph Theory: Euler Circuits Christina Mende Math 480 April 15, 2013.
CS 206 Introduction to Computer Science II 03 / 27 / 2009 Instructor: Michael Eckmann.
Data Structures Using C++
Midwestern State University Department of Computer Science Dr. Ranette Halverson CMPS 2433 CHAPTER 4 - PART 2 GRAPHS 1.
BY: MIKE BASHAM, Math in Scheduling. The Bridges of Konigsberg.
Networks. Graphs (undirected, unweighted) has a set of vertices V has a set of undirected, unweighted edges E graph G = (V, E), where.
CS 206 Introduction to Computer Science II 11 / 11 / Veterans Day Instructor: Michael Eckmann.
C++ Programming: Program Design Including Data Structures, Third Edition Chapter 21: Graphs.
Mining and Searching Massive Graphs (Networks)
Prof. Amr Goneid, AUC1 Analysis & Design of Algorithms (CSCE 321) Prof. Amr Goneid Department of Computer Science, AUC Part R5. Graphs.
Introduction to Graphs
Graph: Relations There are many kinds of social relations. For example: Role-based : brother of, father of, sister of, etc. : friend of, acquaintance of,
Chapter 11 Graphs and Trees This handout: Terminology of Graphs Eulerian Cycles.
Discrete Mathematics Lecture 9 Alexander Bukharovich New York University.
22C:19 Discrete Math Graphs Spring 2014 Sukumar Ghosh.
Network Measures Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Network Measures Klout.
Graph Essentials Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Graph Essentials Networks A network is a graph. – Elements of the network.
Social Media Mining Graph Essentials.
Discrete Math Round, Round, Get Around… I Get Around Mathematics of Getting Around.
GRAPH Learning Outcomes Students should be able to:
Data Structures Using C++ 2E
Programming for Geographical Information Analysis: Advanced Skills Online mini-lecture: Introduction to Networks Dr Andy Evans.
Graph Theoretic Concepts. What is a graph? A set of vertices (or nodes) linked by edges Mathematically, we often write G = (V,E)  V: set of vertices,
Chapter 2 Graph Algorithms.
Network properties Slides are modified from Networks: Theory and Application by Lada Adamic.
5.1  Routing Problems: planning and design of delivery routes.  Euler Circuit Problems: Type of routing problem also known as transversability problem.
1 CS104 : Discrete Structures Chapter V Graph Theory.
Lecture 5: Mathematics of Networks (Cont) CS 790g: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Nyhoff, ADTs, Data Structures and Problem Solving with C++, Second Edition, © 2005 Pearson Education, Inc. All rights reserved Graphs.
Mathematics of Networks (Cont)
Graphs.  Definition A simple graph G= (V, E) consists of vertices, V, a nonempty set of vertices, and E, a set of unordered pairs of distinct elements.
Week 11 - Monday.  What did we talk about last time?  Binomial theorem and Pascal's triangle  Conditional probability  Bayes’ theorem.
Graphs What are Graphs? General meaning in everyday math: A plot or chart of numerical data using a coordinate system. Technical meaning in discrete.
Data Structures & Algorithms Graphs
Basic Notions on Graphs. The House-and-Utilities Problem.
COSC 2007 Data Structures II Chapter 14 Graphs I.
Most of contents are provided by the website Graph Essentials TJTSD66: Advanced Topics in Social Media.
The Structure of the Web. Getting to knowing the Web How big is the web and how do you measure it? How many people use the web? How many use search engines?
Graphs Edge(arc) Vertices can be even or odd or undirected (two-way) Edges can be directed (one-way) This graph is connected. Degree(order) = 3 Odd vertex.
Graph theory and networks. Basic definitions  A graph consists of points called vertices (or nodes) and lines called edges (or arcs). Each edge joins.
GRAPHS. Graph Graph terminology: vertex, edge, adjacent, incident, degree, cycle, path, connected component, spanning tree Types of graphs: undirected,
MAT 2720 Discrete Mathematics Section 8.2 Paths and Cycles
Discrete Structures CISC 2315 FALL 2010 Graphs & Trees.
Class 2: Graph Theory IST402. The Bridges of Konigsberg Section 1.
Class 2: Graph Theory IST402. Can one walk across the seven bridges and never cross the same bridge twice? Network Science: Graph Theory THE BRIDGES OF.
Chapter 9: Graphs.
Introduction to Graph Theory By: Arun Kumar (Asst. Professor) (Asst. Professor)
Class 2: Graph Theory IST402.
Chapter 20: Graphs. Objectives In this chapter, you will: – Learn about graphs – Become familiar with the basic terminology of graph theory – Discover.
Department of Computer and IT Engineering University of Kurdistan Social Network Analysis Graph Theory By: Dr. Alireza Abdollahpouri.
Graphs Rosen, Chapter 8. NOT ONE OF THESE! One of these!
Week 11 - Wednesday.  What did we talk about last time?  Graphs  Paths and circuits.
1 GRAPH Learning Outcomes Students should be able to: Explain basic terminology of a graph Identify Euler and Hamiltonian cycle Represent graphs using.
Lecture 20. Graphs and network models 1. Recap Binary search tree is a special binary tree which is designed to make the search of elements or keys in.
1 Data Structures and Algorithms Graphs. 2 Graphs Basic Definitions Paths and Cycles Connectivity Other Properties Representation Examples of Graph Algorithms:
Complex Networks Analysis Information Systems Engineering ‎ (2013 A) Instructor: Rami Puzis TA: Luiza
Hiroki Sayama NECSI Summer School 2008 Week 2: Complex Systems Modeling and Networks Network Models Hiroki Sayama
Data Structures Graphs - Terminology
Graph theory Definitions Trees, cycles, directed graphs.
Network analysis.
Community detection in graphs
CS120 Graphs.
Network Science: A Short Introduction i3 Workshop
Presentation transcript:

Class 2: Graph theory and basic terminology Learning the language Network Science: Graph Theory 2012 Prof. Albert-László Barabási Dr. Baruch Barzel, Dr. Mauro Martino

Can one walk across the seven bridges and never cross the same bridge twice? What if the trip has to start and end at the same location? THE BRIDGES OF KONIGSBERG Network Science: Graph Theory 2012

Can one walk across the seven bridges and never cross the same bridge twice? THE BRIDGES OF KONIGSBERG : Leonhard Euler’s theorem: (a)If a graph has nodes of odd degree, there is no path that starts and ends at the same location (b)If a graph is connected and has no odd degree nodes, it has at least one path. Network Science: Graph Theory 2012

COMPONENTS OF A COMPLEX SYSTEM  components: nodes, vertices N  interactions: links, edges L  system: network, graph (N,L) Network Science: Graph Theory 2012

network often refers to real systems www, social network metabolic network. Language: (Network, node, link) graph: mathematical representation of a network web graph, social graph (a Facebook term) Language: (Graph, vertex, edge) We will try to make this distinction whenever it is appropriate, but in most cases we will use the two terms interchangeably. NETWORKS OR GRAPHS? Network Science: Graph Theory 2012

A COMMON LANGUAGE Peter Mary Albert co-worker friend brothers friend Protein 1 Protein 2 Protein 5 Protein 9 Movie 1 Movie 3 Movie 2 Actor 3 Actor 1 Actor 2 Actor 4 N=4 L=4 Network Science: Graph Theory 2012

The choice of the proper network representation determines our ability to use network theory successfully. In some cases there is a unique, unambiguous representation. In other cases, the representation is by no means unique. For example, the way we assign the links between a group of individuals will determine the nature of the question we can study. CHOOSING A PROPER REPRESENTATION Network Science: Graph Theory 2012 How do you define or frame the problem

If you connect individuals that work with each other, you will explore the professional network. CHOOSING A PROPER REPRESENTATION Network Science: Graph Theory 2012

If you connect those that have a romantic and sexual relationship, you will be exploring the sexual networks. CHOOSING A PROPER REPRESENTATION Network Science: Graph Theory 2012

If you connect individuals based on their first name (all Peters connected to each other), you will be exploring what? It is a network, nevertheless. CHOOSING A PROPER REPRESENTATION Network Science: Graph Theory 2012

Links: undirected (symmetrical) Graph: Directed links : URLs on the www phone calls metabolic reactions UNDIRECTED VS. DIRECTED NETWORKS UndirectedDirected A B D C L M F G H I Links: directed (arcs). Digraph = directed graph: Undirected links : coauthorship links Actor network protein interactions An undirected link is the superposition of two opposite directed links. A G F B C D E Network Science: Graph Theory 2012

Node degree: the number of links connected to the node. NODE DEGREES Undirected In directed networks we can define an in-degree and out-degree. The (total) degree is the sum of in- and out-degree. Source: a node with k in = 0; Sink: a node with k out = 0. Directed A G F B C D E A B

We have a sample of values x 1,..., x N Average (a.k.a. mean) : typical value = (x 1 + x x N )/N = Σ i x i /N A BIT OF STATISTICS Network Science: Graph Theory 2012

N – the number of nodes in the graph AVERAGE DEGREE Undirected Directed A F B C D E j i Network Science: Graph Theory 2012

The maximum number of links a network of N nodes can have is: A graph with degree L=L max is called a complete graph, and its average degree is =N-1 COMPLETE GRAPH Network Science: Graph Theory 2012

Most networks observed in real systems are sparse: L << L max or <<N-1. WWW (ND Sample): N=325,729;L= L max =10 12 =4.51 Protein (S. Cerevisiae): N= 1,870;L=4,470L max =10 7 =2.39 Coauthorship (Math): N= 70,975; L= L max = =3.9 Movie Actors: N=212,250; L= L max = =28.78 (Source: Albert, Barabasi, RMP2002) REAL NETWORKS ARE SPARSE Network Science: Graph Theory 2012

The maximum number of links a network of N nodes can have is: METCALFE’S LAW Network Science: Graph Theory 2012

A ij =1 if there is a link between node i and j A ij =0 if nodes i and j are not connected to each other. ADJACENCY MATRIX Note that for a directed graph (right) the matrix is not symmetric Network Science: Graph Theory 2012

a b c d e f g h a b c d e f g h ADJACENCY MATRIX b e g a c f h d Network Science: Graph Theory 2012

ADJACENCY MATRIX AND NODE DEGREES Undirected Directed Network Science: Graph Theory 2012

3 GRAPHOLOGY 1 UndirectedDirected Actor network, protein-protein interactionsWWW, citation networks Network Science: Graph Theory 2012

GRAPHOLOGY 2 Unweighted (undirected) Weighted (undirected) protein-protein interactions, wwwCall Graph, metabolic networks Network Science: Graph Theory 2012

GRAPHOLOGY 3 Self-interactionsMultigraph (undirected) Protein interaction network, wwwSocial networks, collaboration networks Network Science: Graph Theory 2012

GRAPHOLOGY 4 Complete Graph (undirected) Actor network, protein-protein interactions Network Science: Graph Theory 2012

GRAPHOLOGY: Real networks can have multiple characteristics WWW > directed multigraph with self-interactions Protein Interactions > undirected unweighted with self-interactions Collaboration network > undirected multigraph or weighted. Mobile phone calls > directed, weighted. Facebook Friendship links > undirected, unweighted. Network Science: Graph Theory 2012

bipartite graph (or bigraph) is a graph whose nodes can be divided into two disjoint sets U and V such that every link connects a node in U to one in V; that is, U and V are independent sets.graph disjoint setsindependent sets Examples: Hollywood actor network (actors vs. movies) Collaboration networks (authors vs. papers) Disease network (diseasome) BIPARTITE GRAPHS Network Science: Graph Theory 2012

Gene network GENOME PHENOME DISEASOME Disease network Goh, Cusick, Valle, Childs, Vidal & Barabási, PNAS (2007) GENE NETWORK – DISEASE NETWORK Network Science: Graph Theory 2012

HUMAN DISEASE NETWORK

Network Science: Graph Theory 2012

A path is a sequence of nodes in which each node is adjacent to the next one P i0,in of length n between nodes i 0 and i n is an ordered collection of n+1 nodes and n links A path can intersect itself and pass through the same link repeatedly. Each time a link is crossed, it is counted separately A legitimate path on the graph on the right: ABCBCADEEBA In a directed network, the path can follow only the direction of an arrow. PATHS A B C D E Network Science: Graph Theory 2012

The distance (shortest path, geodesic path) between two nodes is defined as the number of edges along the shortest path connecting them. *If the two nodes are disconnected, the distance is infinity. In directed graphs each path needs to follow the direction of the arrows. Thus in a digraph the distance from node A to B (on an AB path) is generally different from the distance from node B to A (on a BCA path). DISTANCE IN A GRAPH Shortest Path, Geodesic Path D C A B D C A B Network Science: Graph Theory 2012

N ij,number of paths between any two nodes i and j: Length n=1: If there is a link between i and j, then A ij =1 and A ij =0 otherwise. Length n=2: If there is a path of length two between i and j, then A ik A kj =1, and A ik A kj =0 otherwise. The number of paths of length 2: Length n: In general, if there is a path of length n between i and j, then A ik …A lj =1 and A ik …A lj =0 otherwise. The number of paths of length n between i and j is * * holds for both directed and undirected networks. NUMBER OF PATHS BETWEEN TWO NODES Adjacency Matrix Network Science: Graph Theory 2012

Distance between node 1 and node 4: 1.Start at 1. FINDING DISTANCES: BREADTH FIRST SEATCH Network Science: Graph Theory 2012

Distance between node 1 and node 4: 1.Start at 1. 2.Find the nodes adjacent to 1. Mark them as at distance 1. Put them in a queue. FINDING DISTANCES: BREADTH FIRST SEATCH Network Science: Graph Theory 2012

Distance between node 1 and node 4: 1.Start at 1. 2.Find the nodes adjacent to 1. Mark them as at distance 1. Put them in a queue. 3.Take the first node out of the queue. Find the unmarked nodes adjacent to it in the graph. Mark them with the label of 2. Put them in the queue. FINDING DISTANCES: BREADTH FIRST SEATCH Network Science: Graph Theory 2012

Distance between node 1 and node 4: 1.Repeat until you find node 4 or there are no more nodes in the queue. 2.The distance between 1 and 4 is the label of 4 or, if 4 does not have a label, infinity. FINDING DISTANCES: BREADTH FIRST SEATCH Network Science: Graph Theory 2012

Diameter: d max the maximum distance (shortest path) between any pair of nodes in the graph. Average path length/distance,, for a connected graph: where d ij is the distance from node i to node j In an undirected graph d ij =d ji, so we only need to count them once: NETWORK DIAMETER AND AVERAGE DISTANCE Network Science: Graph Theory 2012

Can one walk across the seven bridges and never cross the same bridge twice? THE BRIDGES OF KONIGSBERG Euler PATH or CIRCUIT: return to the starting point by traveling each link of the graph once and only once. Network Science: Graph Theory 2012

Every vertex of this graph has an even degree, therefore this is an Eulerian graph. Following the edges in alphabetical order gives an Eulerian circuit/cycle. EULERIAN GRAPH: it has an Eulerian path Network Science: Graph Theory 2012

If a digraph is strongly connected and the in- degree of each node is equal to its out-degree, then there is an Euler circuit Otherwise there is no Euler circuit. in a circuit we need to enter each node as many times as we leave it. A B C D E F G EULER CIRCUITS IN DIRECTED GRAPHS Network Science: Graph Theory 2012

PATHOLOGY: summary Path Shortest Path A sequence of nodes such that each node is connected to the next node along the path by a link. The path with the shortest length between two nodes (distance). Network Science: Graph Theory 2012

PATHOLOGY: summary Diameter Average Path Length The longest shortest path in a graph The average of the shortest paths for all pairs of nodes. Network Science: Graph Theory 2012

PATHOLOGY: summary Cycle Self-avoiding Path A path with the same start and end node. A path that does not intersect itself. Network Science: Graph Theory 2012

PATHOLOGY: summary Eulerian Path Hamiltonian Path A path that visits each node exactly once. A path that traverses each link exactly once. Network Science: Graph Theory 2012

Connected (undirected) graph: any two vertices can be joined by a path. A disconnected graph is made up by two or more connected components. Bridge: if we erase it, the graph becomes disconnected. Largest Component: Giant Component The rest: Isolates CONNECTIVITY OF UNDIRECTED GRAPHS D C A B F F G D C A B F F G Network Science: Graph Theory 2012

The adjacency matrix of a network with several components can be written in a block- diagonal form, so that nonzero elements are confined to squares, with all other elements being zero: Figure after Newman, 2010 CONNECTIVITY OF UNDIRECTED GRAPHS Adjacency Matrix Network Science: Graph Theory 2012

Strongly connected directed graph: has a path from each node to every other node and vice versa (e.g. AB path and BA path). Weakly connected directed graph: it is connected if we disregard the edge directions. Strongly connected components (scc) can be identified, but not every node is part of a nontrivial strongly connected component. In-component : nodes that can reach the scc, Out-component : nodes that can be reached from the scc. CONNECTIVITY OF DIRECTED GRAPHS D C A B F G E E C A B G F D Network Science: Graph Theory 2012

Degree distribution p k THREE CENTRAL QUANTITIES IN NETWORK SCIENCE Average path length Clustering coefficient C Network Science: Graph Theory 2012

We have a sample of values x 1,..., x N Distribution of x (a.k.a. PDF): probability that a randomly chosen value is x P(x) = (# values x) / N Σ i P(x i ) = 1 always! STATISTICS REMINDER Histograms >>> Network Science: Graph Theory 2012

Degree distribution P(k): probability that a randomly chosen vertex has degree k N k = # nodes with degree k P(k) = N k / N ➔ plot k P(k) DEGREE DISTRIBUTION Network Science: Graph Theory 2012

discrete representation: p k is the probability that a node has degree k. continuum description: p(k) is the pdf of the degrees, where represents the probability that a node’s degree is between k 1 and k 2. Normalization condition: where K min is the minimal degree in the network. DEGREE DISTRIBUTION Network Science: Graph Theory 2012

Clustering coefficient: what portion of your neighbors are connected? Node i with degree k i C i in [0,1] CLUSTERING COEFFICIENT Network Science: Graph Theory 2012

Degree distribution: P(k) Path length: Clustering coefficient: THREE CENTRAL QUANTITIES IN NETWORK SCIENCE Network Science: Graph Theory 2012

A. Degree distribution: p k B. Path length: C. Clustering coefficient: THREE CENTRAL QUANTITIES IN NETWORK SCIENCE Network Science: Graph Theory 2012

The average path-length varies as Constant degree, constant clustering coefficient. ONE DIMENSIONAL LATTICE: nodes on a line Network Science: Graph Theory 2012

In general, the average distance varies as where D is the dimensionality of the lattice. Constant degree (coordination number), constant clustering coefficient. TWO DIMENSIONAL LATTICE Network Science: Graph Theory 2012