Lecture 21 Network evolution

Slides:



Advertisements
Similar presentations
1 Dynamics of Real-world Networks Jure Leskovec Machine Learning Department Carnegie Mellon University
Advertisements

Jurij Leskovec, CMU Jon Kleinberg, Cornell Christos Faloutsos, CMU
1 Realistic Graph Generation and Evolution Using Kronecker Multiplication Jurij Leskovec, CMU Deepay Chakrabarti, CMU/Yahoo Jon Kleinberg, Cornell Christos.
Modeling Blog Dynamics Speaker: Michaela Götz Joint work with: Jure Leskovec, Mary McGlohon, Christos Faloutsos Cornell University Carnegie Mellon University.
Analysis and Modeling of Social Networks Foudalis Ilias.
Jure Leskovec, CMU Lars Backstrom, Cornell Ravi Kumar, Yahoo! Research Andrew Tomkins, Yahoo! Research.
Week 5 - Models of Complex Networks I Dr. Anthony Bonato Ryerson University AM8002 Fall 2014.
Lecture 21 Network evolution Slides are modified from Jurij Leskovec, Jon Kleinberg and Christos Faloutsos.
Models of Network Formation Networked Life NETS 112 Fall 2013 Prof. Michael Kearns.
What did we see in the last lecture?. What are we going to talk about today? Generative models for graphs with power-law degree distribution Generative.
SILVIO LATTANZI, D. SIVAKUMAR Affiliation Networks Presented By: Aditi Bhatnagar Under the guidance of: Augustin Chaintreau.
Jure Leskovec, CMU Kevin Lang, Anirban Dasgupta, Michael Mahoney Yahoo! Research.
CS 599: Social Media Analysis University of Southern California1 The Basics of Network Analysis Kristina Lerman University of Southern California.
Complex Networks Third Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
CS728 Lecture 5 Generative Graph Models and the Web.
Masters Thesis Defense Amit Karandikar Advisor: Dr. Anupam Joshi Committee: Dr. Finin, Dr. Yesha, Dr. Oates Date: 1 st May 2007 Time: 9:30 am Place: ITE.
The structure of the Internet. How are routers connected? Why should we care? –While communication protocols will work correctly on ANY topology –….they.
CS Lecture 6 Generative Graph Models Part II.
Sampling from Large Graphs. Motivation Our purpose is to analyze and model social networks –An online social network graph is composed of millions of.
Graphs over time: densification laws, shrinking diameters and possible explanations 1.
Advanced Topics in Data Mining Special focus: Social Networks.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 7 May 14, 2006
Measurement and Evolution of Online Social Networks Review of paper by Ophir Gaathon Analysis of Social Information Networks COMS , Spring 2011,
Online Social Networks and Media Network models. What is a network model? Informally, a network model is a process (radomized or deterministic) for generating.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
Survey on Evolving Graphs Research Speaker: Chenghui Ren Supervisors: Prof. Ben Kao, Prof. David Cheung 1.
Weighted Graphs and Disconnected Components Patterns and a Generator IDB Lab 현근수 In KDD 08. Mary McGlohon, Leman Akoglu, Christos Faloutsos.
On-line Social Networks - Anthony Bonato 1 Dynamic Models of On-Line Social Networks Anthony Bonato Ryerson University WAW’2009 February 13, 2009 nt.
Class 9: Barabasi-Albert Model-Part I
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Slides are modified from Lada Adamic and Jure Leskovec
School of Information University of Michigan Unless otherwise noted, the content of this course material is licensed under a Creative Commons Attribution.
RTM: Laws and a Recursive Generator for Weighted Time-Evolving Graphs Leman Akoglu, Mary McGlohon, Christos Faloutsos Carnegie Mellon University School.
Jure Leskovec, CMU Lars Backstrom, Cornell Ravi Kumar, Yahoo! Research Andrew Tomkins, Yahoo! Research.
Cmpe 588- Modeling of Internet Emergence of Scale-Free Network with Chaotic Units Pulin Gong, Cees van Leeuwen by Oya Ünlü Instructor: Haluk Bingöl.
The simultaneous evolution of author and paper networks
Network (graph) Models
Lecture 23: Structure of Networks
Graph Models Class Algorithmic Methods of Data Mining
Topics In Social Computing (67810)
Modeling networks using Kronecker multiplication
NetMine: Mining Tools for Large Graphs
How Do “Real” Networks Look?
Lecture 23: Structure of Networks
CS224W: Social and Information Network Analysis
MST in Log-Star Rounds of Congested Clique
How Do “Real” Networks Look?
How Do “Real” Networks Look?
Models of Network Formation
Lecture 13 Network evolution
Models of Network Formation
Peer-to-Peer and Social Networks Fall 2017
Models of Network Formation
How Do “Real” Networks Look?
Models of Network Formation
Dynamics of Real-world Networks
Graph and Tensor Mining for fun and profit
Peer-to-Peer and Social Networks
Lecture 23: Structure of Networks
Modelling and Searching Networks Lecture 2 – Complex Networks
Modelling and Searching Networks Lecture 5 – Random graphs
Modelling and Searching Networks Lecture 6 – PA models
Network Science: A Short Introduction i3 Workshop
Discrete Mathematics and its Applications Lecture 5 – Random graphs
Diffusion in Networks
Network Models Michael Goodrich Some slides adapted from:
Advanced Topics in Data Mining Special focus: Social Networks
Discrete Mathematics and its Applications Lecture 6 – PA models
Advanced Topics in Data Mining Special focus: Social Networks
What did we see in the last lecture?
Presentation transcript:

Lecture 21 Network evolution Slides are modified from Jurij Leskovec, Jon Kleinberg and Christos Faloutsos

What can we do with graphs? Introduction What can we do with graphs? What patterns or “laws” hold for most real-world graphs? How do the graphs evolve over time? Can we generate synthetic but “realistic” graphs? “Needle exchange” networks of drug users

Evolution of the Graphs How do graphs evolve over time? Conventional Wisdom: Constant average degree: the number of edges grows linearly with the number of nodes Slowly growing diameter: as the network grows the distances between nodes grow Findings: Densification Power Law: networks are becoming denser over time Shrinking Diameter: diameter is decreasing as the network grows

Why is all this important? Gives insight into the graph formation process: Anomaly detection – abnormal behavior, evolution Predictions – predicting future from the past Simulations of new algorithms Graph sampling – many real world graphs are too large to deal with

Graph models: Random Graphs How can we generate a realistic graph? given the number of nodes N and edges E Random graph [Erdos & Renyi, 60s]: Pick 2 nodes at random and link them Does not obey Power laws No community structure

Graph models: Preferential attachment Preferential attachment [Albert & Barabasi, 99]: Add a new node, create M out-links Probability of linking a node is proportional to its degree Examples: Citations: new citations of a paper are proportional to the number it already has Rich get richer phenomena Explains power-law degree distributions But, all nodes have equal (constant) out-degree

Graph models: Copying model Copying model [Kleinberg, Kumar, Raghavan, Rajagopalan and Tomkins, 99]: Add a node and choose the number of edges to add Choose a random vertex and “copy” its links (neighbors) Generates power-law degree distributions Generates communities

Temporal Evolution of the Graphs Densification Power Law networks are becoming denser over time the number of edges grows faster than the number of nodes – average degree is increasing a … densification exponent Densification exponent: 1 ≤ a ≤ 2: a=1: linear growth – constant out-degree (assumed in the literature so far) a=2: quadratic growth – clique or equivalently

Evolution of the Diameter Prior work on Power Law graphs hints at Slowly growing diameter: diameter ~ O(log N) diameter ~ O(log log N) However, Diameters shrinks over the time As the network grows the distances between nodes slowly decrease There are several factors that could influence the Shrinking diameter Effective Diameter: Distance at which 90% of pairs of nodes is reachable Problem of “Missing past” How do we handle the citations outside the dataset? Disconnected components ….

Densification – Possible Explanation Existing graph generation models do not capture the Densification Power Law and Shrinking diameters Can we find a simple model of local behavior, which naturally leads to observed phenomena? Yes! Community Guided Attachment obeys Densification Forest Fire model obeys Densification, Shrinking diameter and Power Law degree distribution

Let’s assume the community structure One expects many within-group friendships and fewer cross-group ones How hard is it to cross communities? University Science Arts CS Math Drama Music Animation: first faces, then comminities, then edges Self-similar university community structure

Fundamental Assumption If the cross-community linking probability of nodes at tree-distance h is scale-free cross-community linking probability: where: c ≥ 1 … the Difficulty constant h … tree-distance Animation with communities: f(1), f(2), … f(h)

Densification Power Law (1) Theorem: The Community Guided Attachment leads to Densification Power Law with exponent a … densification exponent b … community structure branching factor c … difficulty constant Animation for a,b,c

Gives any non-integer Densification exponent Difficulty Constant Theorem: Gives any non-integer Densification exponent If c = 1: easy to cross communities Then: a=2, quadratic growth of edges near clique If c = b: hard to cross communities Then: a=1, linear growth of edges constant out-degree

Dynamic Community Guided Attachment The community tree grows At each iteration a new level of nodes gets added New nodes create links among themselves as well as to the existing nodes in the hierarchy Based on the value of parameter c we get: Densification with heavy-tailed in-degrees Constant average degree and heavy-tailed in-degrees Constant in- and out-degrees But: Community Guided Attachment still does not obey the shrinking diameter property

Community Guided Attachment explains Densification Power Law Room for Improvement Community Guided Attachment explains Densification Power Law Issues: Requires explicit Community structure Does not obey Shrinking Diameters

“Forest Fire” model – Wish List Want no explicit Community structure Shrinking diameters and: “Rich get richer” attachment process, to get heavy-tailed in-degrees “Copying” model, to lead to communities Community Guided Attachment, to produce Densification Power Law Animation with people connected New person Add hierarchy Connect Remove hierarchy

“Forest Fire” model – Intuition (1) How do authors identify references? Find first paper and cite it Follow a few citations, make citations Continue recursively From time to time use bibliographic tools (e.g. CiteSeer) and chase back-links

“Forest Fire” model – Intuition (2) How do people make friends in a new environment? Find first a person and make friends Follow a friend of his/her friends Continue recursively From time to time get introduced to his friends Forest Fire model imitates exactly this process

“Forest Fire” – the Model A node arrives Randomly chooses an “ambassador” Starts burning nodes (with probability p) and adds links to burned nodes “Fire” spreads recursively End the red line

Nodes arrive one at a time Forest Fire – the Model 2 parameters: p … forward burning probability r … backward burning ratio Nodes arrive one at a time New node v attaches to a random node – the ambassador Then v begins burning ambassador’s neighbors: Burn X links, where X is binomially distributed with mean p/(1-p) Choose in-links with probability r times less than out-links with mean rp/(1-rp) Fire spreads recursively Node v attaches to all nodes that got burned

Forest Fire in Action (1) Forest Fire generates graphs that Densify and have Shrinking Diameter E(t) densification diameter 1.21 diameter N(t) N(t)

Forest Fire in Action (2) Forest Fire also generates graphs with heavy-tailed degree distribution in-degree out-degree Label the axis count vs. in-degree count vs. out-degree

Forest Fire – Phase plots Exploring the Forest Fire parameter space Shrinking diameter Dense graph Sparse graph Increasing diameter

Forest Fire model – Justification Densification Power Law: Similar to Community Guided Attachment The probability of linking decays exponentially with the distance Densification Power Law Power law out-degrees: From time to time we get large fires Power law in-degrees: The fire is more likely to burn hubs Communities: Newcomer copies neighbors’ links Shrinking diameter

Forest Fire – Extensions Orphans: isolated nodes that eventually get connected into the network Example: citation networks Orphans can be created in two ways: start the Forest Fire model with a group of nodes new node can create no links Diameter decreases even faster Multiple ambassadors: Example: following paper citations from different fields Faster decrease of diameter

we can sometimes predict where new edges will form wrap up networks evolve we can sometimes predict where new edges will form e.g. social networks tend to display triadic closure friends introduce friends to other friends network structure as a whole evolves densification: edges are added at a greater rate than nodes e.g. papers today have longer lists of references