Lecture 21 Network evolution Slides are modified from Jurij Leskovec, Jon Kleinberg and Christos Faloutsos.

Slides:



Advertisements
Similar presentations
1 Dynamics of Real-world Networks Jure Leskovec Machine Learning Department Carnegie Mellon University
Advertisements

Jurij Leskovec, CMU Jon Kleinberg, Cornell Christos Faloutsos, CMU
1 Realistic Graph Generation and Evolution Using Kronecker Multiplication Jurij Leskovec, CMU Deepay Chakrabarti, CMU/Yahoo Jon Kleinberg, Cornell Christos.
Analysis and Modeling of Social Networks Foudalis Ilias.
Jure Leskovec, CMU Lars Backstrom, Cornell Ravi Kumar, Yahoo! Research Andrew Tomkins, Yahoo! Research.
Week 5 - Models of Complex Networks I Dr. Anthony Bonato Ryerson University AM8002 Fall 2014.
Kronecker Graphs: An Approach to Modeling Networks Jure Leskovec, Deepayan Chakrabarti, Jon Kleinberg, Christos Faloutsos, Zoubin Ghahramani Presented.
Information Networks Generative processes for Power Laws and Scale-Free networks Lecture 4.
Models of Network Formation Networked Life NETS 112 Fall 2013 Prof. Michael Kearns.
What did we see in the last lecture?. What are we going to talk about today? Generative models for graphs with power-law degree distribution Generative.
SILVIO LATTANZI, D. SIVAKUMAR Affiliation Networks Presented By: Aditi Bhatnagar Under the guidance of: Augustin Chaintreau.
Jure Leskovec, CMU Kevin Lang, Anirban Dasgupta, Michael Mahoney Yahoo! Research.
CS 599: Social Media Analysis University of Southern California1 The Basics of Network Analysis Kristina Lerman University of Southern California.
4. PREFERENTIAL ATTACHMENT The rich gets richer. Empirical evidences Many large networks are scale free The degree distribution has a power-law behavior.
The influence of search engines on preferential attachment Dan Li CS3150 Spring 2006.
1 A Random-Surfer Web-Graph Model (Joint work with Avrim Blum & Hubert Chan) Mugizi Rwebangira.
Complex Networks Third Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
CS728 Lecture 5 Generative Graph Models and the Web.
Masters Thesis Defense Amit Karandikar Advisor: Dr. Anupam Joshi Committee: Dr. Finin, Dr. Yesha, Dr. Oates Date: 1 st May 2007 Time: 9:30 am Place: ITE.
Network Models Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Network Models Why should I use network models? In may 2011, Facebook.
Small Worlds Presented by Geetha Akula For the Faculty of Department of Computer Science, CALSTATE LA. On 8 th June 07.
The Barabási-Albert [BA] model (1999) ER Model Look at the distribution of degrees ER ModelWS Model actorspower grid www The probability of finding a highly.
Modeling Real Graphs using Kronecker Multiplication
The structure of the Internet. How are routers connected? Why should we care? –While communication protocols will work correctly on ANY topology –….they.
Web as Graph – Empirical Studies The Structure and Dynamics of Networks.
Peer-to-Peer and Grid Computing Exercise Session 3 (TUD Student Use Only) ‏
CS Lecture 6 Generative Graph Models Part II.
Sampling from Large Graphs. Motivation Our purpose is to analyze and model social networks –An online social network graph is composed of millions of.
Graphs over time: densification laws, shrinking diameters and possible explanations 1.
Advanced Topics in Data Mining Special focus: Social Networks.
CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 7 May 14, 2006
Computer Science 1 Web as a graph Anna Karpovsky.
Measurement and Evolution of Online Social Networks Review of paper by Ophir Gaathon Analysis of Social Information Networks COMS , Spring 2011,
Online Social Networks and Media Network models. What is a network model? Informally, a network model is a process (radomized or deterministic) for generating.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
Survey on Evolving Graphs Research Speaker: Chenghui Ren Supervisors: Prof. Ben Kao, Prof. David Cheung 1.
Weighted Graphs and Disconnected Components Patterns and a Generator IDB Lab 현근수 In KDD 08. Mary McGlohon, Leman Akoglu, Christos Faloutsos.
Jure Leskovec Computer Science Department Cornell University / Stanford University Joint work with: Jon Kleinberg (Cornell), Christos.
Social Network Analysis Prof. Dr. Daning Hu Department of Informatics University of Zurich Mar 5th, 2013.
On-line Social Networks - Anthony Bonato 1 Dynamic Models of On-Line Social Networks Anthony Bonato Ryerson University WAW’2009 February 13, 2009 nt.
Class 9: Barabasi-Albert Model-Part I
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Most of contents are provided by the website Network Models TJTSD66: Advanced Topics in Social Media (Social.
Slides are modified from Lada Adamic and Jure Leskovec
School of Information University of Michigan Unless otherwise noted, the content of this course material is licensed under a Creative Commons Attribution.
Du, Faloutsos, Wang, Akoglu Large Human Communication Networks Patterns and a Utility-Driven Generator Nan Du 1,2, Christos Faloutsos 2, Bai Wang 1, Leman.
RTM: Laws and a Recursive Generator for Weighted Time-Evolving Graphs Leman Akoglu, Mary McGlohon, Christos Faloutsos Carnegie Mellon University School.
Jure Leskovec, CMU Lars Backstrom, Cornell Ravi Kumar, Yahoo! Research Andrew Tomkins, Yahoo! Research.
Cmpe 588- Modeling of Internet Emergence of Scale-Free Network with Chaotic Units Pulin Gong, Cees van Leeuwen by Oya Ünlü Instructor: Haluk Bingöl.
Network (graph) Models
Graph Models Class Algorithmic Methods of Data Mining
Topics In Social Computing (67810)
Modeling networks using Kronecker multiplication
Models of Network Formation
Lecture 13 Network evolution
Models of Network Formation
Peer-to-Peer and Social Networks Fall 2017
Models of Network Formation
Models of Network Formation
Graph and Tensor Mining for fun and profit
Lecture 21 Network evolution
Modelling and Searching Networks Lecture 5 – Random graphs
Modelling and Searching Networks Lecture 6 – PA models
Discrete Mathematics and its Applications Lecture 5 – Random graphs
Network Models Michael Goodrich Some slides adapted from:
Discrete Mathematics and its Applications Lecture 6 – PA models
Advanced Topics in Data Mining Special focus: Social Networks
What did we see in the last lecture?
Presentation transcript:

Lecture 21 Network evolution Slides are modified from Jurij Leskovec, Jon Kleinberg and Christos Faloutsos

2 “Needle exchange” networks of drug users Introduction What can we do with graphs? What patterns or “laws” hold for most real-world graphs? How do the graphs evolve over time? Can we generate synthetic but “realistic” graphs?

3 Evolution of the Graphs How do graphs evolve over time? Conventional Wisdom: Constant average degree: the number of edges grows linearly with the number of nodes Slowly growing diameter: as the network grows the distances between nodes grow Findings: Densification Power Law: networks are becoming denser over time Shrinking Diameter: diameter is decreasing as the network grows

4 Why is all this important? Gives insight into the graph formation process: Anomaly detection – abnormal behavior, evolution Predictions – predicting future from the past Simulations of new algorithms Graph sampling – many real world graphs are too large to deal with

5 Graph models: Random Graphs How can we generate a realistic graph? given the number of nodes N and edges E Random graph [Erdos & Renyi, 60s]: Pick 2 nodes at random and link them Does not obey Power laws No community structure

6 Graph models: Preferential attachment Preferential attachment [Albert & Barabasi, 99]: Add a new node, create M out-links Probability of linking a node is proportional to its degree Examples: Citations: new citations of a paper are proportional to the number it already has Rich get richer phenomena Explains power-law degree distributions But, all nodes have equal (constant) out-degree

7 Graph models: Copying model Copying model [Kleinberg, Kumar, Raghavan, Rajagopalan and Tomkins, 99]: Add a node and choose the number of edges to add Choose a random vertex and “copy” its links (neighbors) Generates power-law degree distributions Generates communities

8 Temporal Evolution of the Graphs Densification Power Law networks are becoming denser over time the number of edges grows faster than the number of nodes – average degree is increasing a … densification exponent Densification exponent: 1 ≤ a ≤ 2: a=1: linear growth – constant out-degree (assumed in the literature so far) a=2: quadratic growth – clique or equivalently

9 Evolution of the Diameter Prior work on Power Law graphs hints at Slowly growing diameter: diameter ~ O(log N) diameter ~ O(log log N) However, Diameters shrinks over the time As the network grows the distances between nodes slowly decrease There are several factors that could influence the Shrinking diameter Effective Diameter: Distance at which 90% of pairs of nodes is reachable Problem of “Missing past” How do we handle the citations outside the dataset? Disconnected components ….

Densification – Possible Explanation Existing graph generation models do not capture the Densification Power Law and Shrinking diameters Can we find a simple model of local behavior, which naturally leads to observed phenomena? Yes! Community Guided Attachment obeys Densification Forest Fire model obeys Densification, Shrinking diameter and Power Law degree distribution

Community structure Let’s assume the community structure One expects many within-group friendships and fewer cross-group ones How hard is it to cross communities? Self-similar university community structure CS Math DramaMusic Science Arts University

If the cross-community linking probability of nodes at tree-distance h is scale-free cross-community linking probability: where: c ≥ 1 … the Difficulty constant h … tree-distance Fundamental Assumption

Densification Power Law (1) Theorem: The Community Guided Attachment leads to Densification Power Law with exponent a … densification exponent b … community structure branching factor c … difficulty constant

Theorem: Gives any non-integer Densification exponent If c = 1: easy to cross communities Then: a=2, quadratic growth of edges near clique If c = b: hard to cross communities Then: a=1, linear growth of edges constant out-degree Difficulty Constant

15 Dynamic Community Guided Attachment The community tree grows At each iteration a new level of nodes gets added New nodes create links among themselves as well as to the existing nodes in the hierarchy Based on the value of parameter c we get: a) Densification with heavy-tailed in-degrees b) Constant average degree and heavy-tailed in-degrees c) Constant in- and out-degrees But: Community Guided Attachment still does not obey the shrinking diameter property

Room for Improvement Community Guided Attachment explains Densification Power Law Issues: Requires explicit Community structure Does not obey Shrinking Diameters

“Forest Fire” model – Wish List Want no explicit Community structure Shrinking diameters and: “Rich get richer” attachment process, to get heavy-tailed in-degrees “Copying” model, to lead to communities Community Guided Attachment, to produce Densification Power Law

“Forest Fire” model – Intuition (1) How do authors identify references? 1. Find first paper and cite it 2. Follow a few citations, make citations 3. Continue recursively 4. From time to time use bibliographic tools (e.g. CiteSeer) and chase back-links

“Forest Fire” model – Intuition (2) How do people make friends in a new environment? 1. Find first a person and make friends 2. Follow a friend of his/her friends 3. Continue recursively 4. From time to time get introduced to his friends Forest Fire model imitates exactly this process

“Forest Fire” – the Model A node arrives Randomly chooses an “ambassador” Starts burning nodes (with probability p) and adds links to burned nodes “Fire” spreads recursively

21 Forest Fire – the Model 2 parameters: p … forward burning probability r … backward burning ratio Nodes arrive one at a time New node v attaches to a random node – the ambassador Then v begins burning ambassador’s neighbors: Burn X links, where X is binomially distributed with mean p/(1-p) Choose in-links with probability r times less than out-links with mean rp/(1-rp) Fire spreads recursively Node v attaches to all nodes that got burned

22 Forest Fire in Action (1) Forest Fire generates graphs that Densify and have Shrinking Diameter densification diameter 1.21 N(t) E(t) N(t) diameter

Forest Fire in Action (2) Forest Fire also generates graphs with heavy-tailed degree distribution in-degreeout-degree count vs. in-degreecount vs. out-degree

24 Forest Fire – Phase plots Exploring the Forest Fire parameter space Sparse graph Dense graph Increasing diameter Shrinking diameter

Forest Fire model – Justification Densification Power Law: Similar to Community Guided Attachment The probability of linking decays exponentially with the distance Densification Power Law Power law out-degrees: From time to time we get large fires Power law in-degrees: The fire is more likely to burn hubs Communities: Newcomer copies neighbors’ links Shrinking diameter

26 Forest Fire – Extensions Orphans: isolated nodes that eventually get connected into the network Example: citation networks Orphans can be created in two ways: start the Forest Fire model with a group of nodes new node can create no links Diameter decreases even faster Multiple ambassadors: Example: following paper citations from different fields Faster decrease of diameter

wrap up networks evolve we can sometimes predict where new edges will form e.g. social networks tend to display triadic closure friends introduce friends to other friends network structure as a whole evolves densification: edges are added at a greater rate than nodes e.g. papers today have longer lists of references