Power Law and Its Generative Models Bo Young Kim 2010-03-16.

Slides:

Advertisements

Similar presentations

Paradigms of complex systems

Advertisements

Algorithmic and Economic Aspects of Networks Nicole Immorlica.

Hierarchical Dirichlet Processes

Week 5 - Models of Complex Networks I Dr. Anthony Bonato Ryerson University AM8002 Fall 2014.

Lecture 21 Network evolution Slides are modified from Jurij Leskovec, Jon Kleinberg and Christos Faloutsos.

Information Networks Generative processes for Power Laws and Scale-Free networks Lecture 4.

Generative Models for the Web Graph José Rolim. Aim Reproduce emergent properties: –Distribution site size –Connectivity of the Web –Power law distriubutions.

Information Retrieval Lecture 8 Introduction to Information Retrieval (Manning et al. 2007) Chapter 19 For the MSc Computer Science Programme Dell Zhang.

On Power-Law Relationships of the Internet Topology Michalis Faloutsos Petros Faloutsos Christos Faloutsos.

Web Graph Characteristics Kira Radinsky All of the following slides are courtesy of Ronny Lempel (Yahoo!)

4. PREFERENTIAL ATTACHMENT The rich gets richer. Empirical evidences Many large networks are scale free The degree distribution has a power-law behavior.

The influence of search engines on preferential attachment Dan Li CS3150 Spring 2006.

School of Information University of Michigan SI 614 Random graphs & power law networks preferential attachment Lecture 7 Instructor: Lada Adamic.

1 A Random-Surfer Web-Graph Model (Joint work with Avrim Blum & Hubert Chan) Mugizi Rwebangira.

Entropy Rates of a Stochastic Process

Alon Arad Alon Arad Hurst Exponent of Complex Networks.

Web as Graph – Empirical Studies The Structure and Dynamics of Networks.

CS Lecture 6 Generative Graph Models Part II.

On Power-Law Relationships of the Internet Topology CSCI 780, Fall 2005.

1 Pertemuan 06 Sebaran Normal dan Sampling Matakuliah: >K0614/ >FISIKA Tahun: >2006.

Advanced Topics in Data Mining Special focus: Social Networks.

Chapter 6 The Normal Distribution and Other Continuous Distributions

Analysis of Social Information Networks Thursday January 27 th, Lecture 3: Popularity-Power law 1.

1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 7 May 14, 2006

Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.

1 A Random-Surfer Web-Graph Model Avrim Blum, Hubert Chan, Mugizi Rwebangira Carnegie Mellon University.

1 Dynamic Models for File Sizes and Double Pareto Distributions Michael Mitzenmacher Harvard University.

Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution Business Statistics: A First Course 5 th.

Jointly distributed Random variables

Chapter 4 Continuous Random Variables and Probability Distributions

References for M/G/1 Input Process

Unit 4: Mathematics Introduce the laws of Logarithms. Aims Objectives

Information Networks Power Laws and Network Models Lecture 3.

Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014.

Chapter 7: The Normal Probability Distribution

Author: M.E.J. Newman Presenter: Guoliang Liu Date:5/4/2012.

Models and Algorithms for Complex Networks Power laws and generative processes.

Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.

1 Spring 2003 Prof. Tim Warburton MA557/MA578/CS557 Lecture 5a.

5.4 Exponential Functions: Differentiation and Integration.

Applied Quantitative Analysis and Practices LECTURE#11 By Dr. Osman Sadiq Paracha.

Computational Biology, Part 15 Biochemical Kinetics I Robert F. Murphy Copyright  1996, 1999, 2000, All rights reserved.

TDTS21: Advanced Networking Lecture 7: Internet topology Based on slides from P. Gill and D. Choffnes Revised 2015 by N. Carlsson.

Random-Graph Theory The Erdos-Renyi model. G={P,E}, PNP 1,P 2,...,P N E In mathematical terms a network is represented by a graph. A graph is a pair of.

Chapter Four Random Variables and Their Probability Distributions

1 Special Continuous Probability Distributions -Exponential Distribution -Weibull Distribution Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering.

Markov Chains and Random Walks. Def: A stochastic process X={X(t),t ∈ T} is a collection of random variables. If T is a countable set, say T={0,1,2, …

Chapter 9 Efficiency of Algorithms. 9.1 Real Valued Functions.

Maurizio Naldi Università di Roma “Tor Vergata” POPULARITY DISTRIBUTIONS AND INTERNET TRAFFIC MODELLING Workshop “Statistica e Telecomunicazioni”, Roma.

The thermodynamical limit of abstract composition rules T. S. Bíró, KFKI RMKI Budapest, H Non-extensive thermodynamics Composition rules and formal log-s.

KPS 2007 (April 19, 2007) On spectral density of scale-free networks Doochul Kim (Department of Physics and Astronomy, Seoul National University) Collaborators:

Most of contents are provided by the website Network Models TJTSD66: Advanced Topics in Social Media (Social.

Basic Business Statistics

RTM: Laws and a Recursive Generator for Weighted Time-Evolving Graphs Leman Akoglu, Mary McGlohon, Christos Faloutsos Carnegie Mellon University School.

Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 6-1 The Normal Distribution.

© 2002 Prentice-Hall, Inc.Chap 5-1 Statistics for Managers Using Microsoft Excel 3 rd Edition Chapter 5 The Normal Distribution and Sampling Distributions.

Chapter 4 Continuous Random Variables and Probability Distributions  Probability Density Functions.2 - Cumulative Distribution Functions and E Expected.

Section 6 – Ec1818 Jeremy Barofsky March 10 th and 11 th, 2010.

1 HEINZ NIXDORF INSTITUTE University of Paderborn Algorithms and Complexity Christian Schindelhauer Search Algorithms Winter Semester 2004/ Dec.

Chapter 6 The Normal Distribution and Other Continuous Distributions

Chapter 4 Continuous Random Variables and Probability Distributions

Normal Distribution and Parameter Estimation

Topics In Social Computing (67810)

CS224W: Social and Information Network Analysis

Generative Model To Construct Blog and Post Networks In Blogosphere

The likelihood of linking to a popular website is higher

Peer-to-Peer and Social Networks

The Normal Distribution

Random variable. Let (, , P) is probability space.

Presentation transcript:

Power Law and Its Generative Models Bo Young Kim

Contents 1.Recall The Definition of Power Law 2.Recall Some Properties of Power Law 3.Generative Models for Power Law - Power Laws via Preferential Attachment - Power Laws via Multiplicative Processes 2Applied Algorithm Lab.

1.Recall The Definition of Power Law 2.Recall Some Properties of Power Law 3.Generative Models for Power Law - Power Laws via Preferential Attachment - Power Laws via Multiplicative Processes 3Applied Algorithm Lab.

1. Recall The Definition of Power Law X: a nonnegative random variable Def Power Law X is said to have a power law distribution if Pr[X≥x]~cx -α for constants c>0, α>0 Def f(x)~g(x) ⇔ lim x f(x)/g(x) = 1 What does this mean? In a power law distribution, asymptotically the tails fall according to the power α. (heavier tail than exponential distribution) 4Applied Algorithm Lab.

1.Recall The Definition of Power Law 2.Recall Some Properties of Power Law 3.Generative Models for Power Law - Power Laws via Preferential Attachment - Power Laws via Multiplicative Processes 5Applied Algorithm Lab.

2. Recall Some Properties of Power Law E.g. The Pareto distribution Pr[X≥x]=(x/k) -α ln(Pr[X≥x])=-α(ln(x)-ln(k)) * Linear Log-log plot (complementary cumulative distribution function) - X has a power law distribution - Then a log-log plot behavior is a straight line. (asymptotic sense) 6Applied Algorithm Lab.

2. Recall Some Properties of Power Law “Scale Invariance” - Let f(x) := P[X≥x] - f(x) ~ cx -α - f(kx) ~ c(kx) -α = k -α (cx -α ) = k’f(x) ∝ f(x) (k’=k -α ) - Scaling by a constant simply multiplies the original power law relation by the constant k’. - If we change the measurement unit(=scale), it retains the same power law form w/ the same exponent.  We cannot decide what scale we’re observing. (like Fractals) 7Applied Algorithm Lab.

2. Recall Some Properties of Power Law Web follows power law. [4] Recall (Rank exponent) - d v : outdegree of a node v - r v : the rank of a node v d v =k*r v R (R,k: constant) Designing random graph models that yield Web-like graphs? i.e. that yields power law distributions for the indegree and outdegree? 8Applied Algorithm Lab.

1.Recall The Definition of Power Law 2.Recall Some Properties of Power Law 3.Generative Models for Power Law - Power Laws via Preferential Attachment - Power Laws via Multiplicative Processes 9Applied Algorithm Lab.

Generative Models for Power Law - Power Laws via Preferential Attachment Def Preferential Attachment Process (=Yule Process) Any process s.t. some quantity (some form of wealth) is distributed among a number of individuals according to how much they already have, so that those who are already wealthy receive more than those who are not. ”The rich get richer” 10Applied Algorithm Lab.

The Chinese Restaurant Process - A Chinese restaurant has infinitely many tables - Each table can seat infinitely many customers - At each time step, customer X t comes into the restaurant. When X t+1 comes into here… (CRP1) Sits at an already occupied table k w/ prob. N k /(t+α) (N k : # of customers at table k  Σ k N k =t) (CRP2)or, sits at the next unoccupied table w/ prob. α/(t+α) Generative Models for Power Law - Power Laws via Preferential Attachment 11Applied Algorithm Lab.

When X t+1 comes into here… (CRP1) Sits at an already occupied table k w/ prob. N k /(t+α) (N k : # of customers at table k  Σ k N k =t) (CRP2)or, sits at the next unoccupied table w/ prob. α/(t+α) Generative Models for Power Law - Power Laws via Preferential Attachment 12Applied Algorithm Lab.

CPR rule: Next customer sits at a table w/ prob. Proportional to # of customers already sitting at it(and sits at new table w/ prob. Proportional to α)  Customers tend to sit at most popular tables  Most popular tables attract the most new customers, and become even more popular The concentration parameter α: how likely customer is to sit at a fresh table Generative Models for Power Law - Power Laws via Preferential Attachment 13Applied Algorithm Lab.

Generating Power law distribution via Preference Attachment (Most models are variations of this form) Let’s say “Web Page Process” Start w/ a single page This single page has a link to itself At each time step, a new page appears, w/ outdegree 1 Generative Models for Power Law - Power Laws via Preferential Attachment (WPP1) The link of new page points to a page chosen u.a.r. w/ prob. α<1 (WPP2) The link of new page points to page chosen proportionally to the indegree of the page w/ prob. 1- α 14Applied Algorithm Lab.

X j (t): # of pages w/ indegree j when ∃ t pages in the system Pr[X j increase] = αX j-1 /t+(1-α)(j-1)X j-1 /t Pr[X j decrease] = αX j /t+(1-α)jX j /t Generative Models for Power Law - Power Laws via Preferential Attachment (WPP1) The link of new page points to a page chosen u.a.r. w/ prob. α<1 (WPP2) The link of new page points to page chosen proportionally to the indegree of the page w/ prob. 1- α 15Applied Algorithm Lab.

Pr[X j increase] = αX j-1 /t+(1-α)(j-1)X j-1 /t Pr[X j decrease] = αX j /t+(1-α)jX j /t  dX j /dt = {α(X j-1 -X j )+(1-α)((j-1)X j-1 -jX j-1 )}/t Intuitively appealing, BUT how continuous DE describes a discrete process?  This can be justified formally using martingales [Kumar et al 00] & theoretical frameworks of Kurtz, Wormald [Drinea et al. 00, Kurtz 81, Wormald 95]. Generative Models for Power Law - Power Laws via Preferential Attachment 16Applied Algorithm Lab.

dX 0 /dt=1-αX 0 /t Suppose in the steady state limit: X j (t)=c j *t (portion c j )  c 0 =dX 0 /dt=1-αX 0 /t=1-αc 0 ⇔ c 0 = 1/(α+1) Substitute this assumption for dX j /dt = {α(X j-1 -X j )+(1-α)((j-1)X j-1 -jX j-1 )}/t  c j (1+α+j(1-α))=c j-1 (α+(j-1)(1-α))  We can determine c j exactly. Focusing on the asymptotic, for large j c j /c j-1 =1-(2-α)/(1+α+j(1-α))~1-{(2-α)/(1-α)}*(1/j) Generative Models for Power Law - Power Laws via Preferential Attachment 17Applied Algorithm Lab.

We have c j ~cj^(- ) for some constant c, giving a power law. Note c j ~cj^(- ) implies WTS: Σ j≥k c j behave the tail of power law distribution (Proof) For some constant c’. So, we’re done. Generative Models for Power Law - Power Laws via Preferential Attachment 18Applied Algorithm Lab.

1.Recall The Definition of Power Law 2.Recall Some Properties of Power Law 3.Generative Models for Power Law - Power Laws via Preferential Attachment - Power Laws via Multiplicative Processes 19Applied Algorithm Lab.

Pareto: income distribution obeys power law [Champernowne 53] offered an explanation for this behavior. Partition income in the following manner: 1 st range: between m and γm for some γ>1 2 nd range: between γm and γ 2 m … persons in class j: their income is between γ j-1 m and γ j m P ij : prob. of a person moving from class i to class j At each time step, P ij depends only on the value (j-i).  Under this assumption, Pareto distribution can be obtained. Generative Models for Power Law - Power Laws via Multiplicative Processes 20Applied Algorithm Lab.

E.g. γ=2, P ij =2/3 if j-i=-1 P ij =1/3 if j-i=1 Special case: i=1  P 11 =2/3 The equilibrium property of being in class k: 1/2 k X: a person’s income  Pr[X≥2 k-1 m]=1/2 k-1 Pr[X ≥ x]=m/x for x= 2 k-1 m This is a power law distribution. Generative Models for Power Law - Power Laws via Multiplicative Processes 21Applied Algorithm Lab.

References [1]M. Mitzenmacher, A Brief History of Generative Models for Power Law and Lognormal Distributions, Internet Mathematics, vol 1, No. 2, pp , [2]Mark Johnson, Chinese Restaurant Processes(CG168 notes), cog.brown.edu/~mj/classes/cg168/.../ChineseRestaurants.pdf [3]The lecture notes of C. Faloutsos, Carnegie Mellon University, Multimedia Databases and Data Mining, Spring pdf/195_powerLaws.pdfhttp:// pdf/195_powerLaws.pdf [4]Bruno Bassetti, Mina Zarei, Marco Cosentino Lagomarsino, and Ginestra Bianconi., Statistical mechanics of the “Chinese restaurant process”: Lack of self-averaging, anomalous finite- size effects, and condensation, Phys. Rev. E 80, (2009) [4 pages] [5] Applied Algorithm Lab.