Random Dot Product Graphs Ed Scheinerman Applied Mathematics & Statistics Johns Hopkins University IPAM Intelligent Extraction of Information from Graphs.

Slides:



Advertisements
Similar presentations
Network analysis Sushmita Roy BMI/CS 576
Advertisements

1 Dynamics of Real-world Networks Jure Leskovec Machine Learning Department Carnegie Mellon University
Connectivity - Menger’s Theorem Graphs & Algorithms Lecture 3.
Algorithmic and Economic Aspects of Networks Nicole Immorlica.
Analysis and Modeling of Social Networks Foudalis Ilias.
Week 5 - Models of Complex Networks I Dr. Anthony Bonato Ryerson University AM8002 Fall 2014.
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
Online Social Networks and Media. Graph partitioning The general problem – Input: a graph G=(V,E) edge (u,v) denotes similarity between u and v weighted.
Models of Network Formation Networked Life NETS 112 Fall 2013 Prof. Michael Kearns.
Information Networks Small World Networks Lecture 5.
Advanced Topics in Data Mining Special focus: Social Networks.
CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.
Complex Networks Third Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
On the Spread of Viruses on the Internet Noam Berger Joint work with C. Borgs, J.T. Chayes and A. Saberi.
Networks. Graphs (undirected, unweighted) has a set of vertices V has a set of undirected, unweighted edges E graph G = (V, E), where.
Future directions in computer science research John Hopcroft Department of Computer Science Cornell University CINVESTAV-IPN Dec 2,2013.
Scale-free networks Péter Kómár Statistical physics seminar 07/10/2008.
Mining and Searching Massive Graphs (Networks)
Network Design IS250 Spring 2010 John Chuang. 2 Questions  What does the Internet look like? -Why do we care?  Are there any structural invariants?
CS 728 Lecture 4 It’s a Small World on the Web. Small World Networks It is a ‘small world’ after all –Billions of people on Earth, yet every pair separated.
L15:Microarray analysis (Classification) The Biological Problem Two conditions that need to be differentiated, (Have different treatments). EX: ALL (Acute.
Peer-to-Peer and Grid Computing Exercise Session 3 (TUD Student Use Only) ‏
Common Properties of Real Networks. Erdős-Rényi Random Graphs.
L15:Microarray analysis (Classification). The Biological Problem Two conditions that need to be differentiated, (Have different treatments). EX: ALL (Acute.
Clustering (Part II) 11/26/07. Spectral Clustering.
Advanced Topics in Data Mining Special focus: Social Networks.
SDSC, skitter (July 1998) A random graph model for massive graphs William Aiello Fan Chung Graham Lincoln Lu.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 7 May 14, 2006
TELCOM2125: Network Science and Analysis
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Statistical Properties of Massive Graphs (Networks) Networks and Measurements.
Computer Science 1 Web as a graph Anna Karpovsky.
Decomposing Networks and Polya Urns with the Power of Choice Joint work with Christos Amanatidis, Richard Karp, Christos Papadimitriou, Martha Sideri Presented.
Random Graph Models of Social Networks Paper Authors: M.E. Newman, D.J. Watts, S.H. Strogatz Presentation presented by Jessie Riposo.
The Erdös-Rényi models
Information Networks Power Laws and Network Models Lecture 3.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
Soon-Hyung Yook, Sungmin Lee, Yup Kim Kyung Hee University NSPCS 08 Unified centrality measure of complex networks.
Popularity versus Similarity in Growing Networks Fragiskos Papadopoulos Cyprus University of Technology M. Kitsak, M. Á. Serrano, M. Boguñá, and Dmitri.
Network properties Slides are modified from Networks: Theory and Application by Lada Adamic.
July The Mathematical Challenge of Large Networks László Lovász Eötvös Loránd University, Budapest
Flexible Graph Models for Complex Networks Complex Networks: Internet and its applications: WWW, content sharing, social online Other: further social,
CS240A: Measurements of Graphs slides under construction – see the Matlab transcript for what I actually did in class.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
CS 219: Sparse Matrix Algorithms
1 Burning a graph as a model of social contagion Anthony Bonato Ryerson University Institute of Software Chinese Academy of Sciences.
Complex Networks First Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Social Network Analysis Prof. Dr. Daning Hu Department of Informatics University of Zurich Mar 5th, 2013.
October Large networks: a new language for science László Lovász Eötvös Loránd University, Budapest
Soon-Hyung Yook, Sungmin Lee, Yup Kim Kyung Hee University NSPCS 08 Unified centrality measure of complex networks: a dynamical approach to a topological.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: ML and Simple Regression Bias of the ML Estimate Variance of the ML Estimate.
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
How Do “Real” Networks Look?
1 Friends and Neighbors on the Web Presentation for Web Information Retrieval Bruno Lepri.
 In the previews parts we have seen some kind of segmentation method.  In this lecture we will see graph cut, which is a another segmentation method.
Community structure in graphs Santo Fortunato. More links “inside” than “outside” Graphs are “sparse” “Communities”
Class 2: Graph Theory IST402.
Biao Wang 1, Ge Chen 1, Luoyi Fu 1, Li Song 1, Xinbing Wang 1, Xue Liu 2 1 Shanghai Jiao Tong University 2 McGill University
On the behaviour of an edge number in a power-law random graph near a critical points E. V. Feklistova, Yu.
Response network emerging from simple perturbation Seung-Woo Son Complex System and Statistical Physics Lab., Dept. Physics, KAIST, Daejeon , Korea.
Lecture 5 Graph Theory prepped by Lecturer ahmed AL tememe 1.
How Do “Real” Networks Look?
Lecture 18: Uniformity Testing Monotonicity Testing
How Do “Real” Networks Look?
How Do “Real” Networks Look?
How Do “Real” Networks Look?
Topic models for corpora and for graphs
Graphs G = (V, E) V are the vertices; E are the edges.
Shan Lu, Jieqi Kang, Weibo Gong, Don Towsley UMASS Amherst
Network Models Michael Goodrich Some slides adapted from:
Presentation transcript:

Random Dot Product Graphs Ed Scheinerman Applied Mathematics & Statistics Johns Hopkins University IPAM Intelligent Extraction of Information from Graphs & High Dimensional Data July 26, 2005

Coconspirators Libby Beer John Conroy (IDA) Paul Hand (Columbia) Miro Kraetzl (DSTO) Christine Nickel Carey Priebe Kim Tucker Stephen Young (Georgia Tech)

Overview Mathematical context Modeling networks Random dot product model The inverse problem

Mathematical Context

Graphs I Have Loved Interval graphs & intersection graphs Random graphs Random intersection graphs Threshold graphs & dot product graphs

Interval Graphs

Intersection Graphs {1} {1,2} {2}

Random Graphs Erdös-Rényi style… p1 – p Randomness is “in” the edges. Vertices are “dumb” placeholders.

Random Intersection Graphs Assign random sets to vertices. Two vertices are adjacent iff their sets intersect. Randomness is “in” the vertices. Edges reflect relationships between vertices.

Threshold Graphs

Dot Product Graphs [1 0] [2 0] [1 1] [0 1] Fractional intersection graphs

Communication Networks

Physical Networks Telephone Local area network Power grid Internet

Social Networks Alice Bob A B

Social Network Graphs Vertices (Actors)Edges (Dyads) TelephonesCalls addressesMessages ComputersIP Packets Human beingsAcquaintance AcademiciansCoauthorship

Example: at HP 485 employees 185,000 s Social network (who s whom) identified 7 “communities”, validated by interviews with employees.

Properties of Social Networks Clustering Low diameter Power law

Properties of Social Networks Clustering Low diameter Power law a b c

Properties of Social Networks Clustering Low diameter Power law “Six degrees of separation”

Properties of Social Networks Clustering Low diameter Power law log d log N(d) Degree Histogram

Degree Histogram Example vertices degree Number of vertices

Degree Histogram Example vertices degree Number of vertices

Random Graph Models Goal: Simple and realistic random graph models of social networks.

Erdös-Rényi? Low diameter! No clustering: P[a~c]=P[a~c|a~b~c]. No power-law degree distribution. Not a good model.

Model by Fan Chung et al Consider only those graphs with with all such graphs equally likely.

People as Vectors Sports Politics Movies Graph theory

Shared Interests Alice and Bob are more likely to communicate when they have more shared interests.

Selecting the Function

Random Dot Product Graphs, I

Generalize Erdös-Rényi

Generalize Intersection Graphs

Whence the Vectors? Vectors are given in advance. Vectors chosen (iid) from some distribution.

Random Dot Product Graphs, II Step 1: Pick the vectors  Given by fiat.  Chosen from iid a distribution. Step 2: For all i<j  Let p=f(x i x j ).  Insert an edge from i to j with probability p.

Megageneralization Generalization of:  Intersection graphs (ordinary & random)  Threshold graphs  Dot product graphs  Erdös-Rényi random graphs Randomness is “in” both the vertices and the edges. P[a~b] independent of P[c~d] when a,b,c,d are distinct.

Results in Dimension 1

Probability/Number of Edges

Clustering

Power Law

Power Law Example

Isolated Vertices Thus, the graph is not connected, but…

“Mostly” Connected “Giant” connected component A “few” isolated vertices

Six Degrees of Separation Diameter ≤ 6

Attached Attached pair Diameter ≤ 6 Proof Outline Diameter = 2 Isolated

Diameter ≤ 6 Proof Outline

Graphs to Vectors The Inverse Problem

Given Graphs, Find Vectors Given: A graph, or a series of graphs, on a common vertex set. Problem: Find vectors to assign to vertices that “best” model the graph(s).

Maximum Likelihood Method Feasible in dimension 1. Awful d>1. Nice results for f(t) = t / (1+t).

Gram Matrix Approach

Wrong Best Solution

Real Problem

Iterative Algorithm

Convergence

iteration diagonal entries

Convergence iteration diagonal entries

Convergence iteration diagonal entries

Convergence iteration diagonal entries

Convergence iteration diagonal entries

Convergence iteration diagonal entries

Enron example

Applications Network Change/Anomaly Detection Clustering

Change/Anomaly Detection

Graph Clustering

Synthetic Lethality Graphs Vertices are genes in yeast Edge between u and v iff  Deleting one of u or v does not kill, but  Deleting both is lethal.

SL Graph Status Yeast has about 6000 genes. Full graph known on 126 “query” genes (about 1300 edges). Partial graph known on 1000 “library” genes.

What Next?

Random Dot Product Graphs Extension to higher dimension  Cube  Unit ball intersect positive orthant Small world measures: clustering coefficient Other random graph properties

Vector Estimation MLE method  Computationally efficient?  More useful? Eigenvalue method  Understand convergence  Prove that it globally minimizes  Extension to missing data Validate against real data

Network Evolution Communication influences interests:

Rapid Generation Can we generate a sparse random dot product graph with n vertices and m edges in time O(n+m)? Partial answer: Yes, but.

The End