Lecture 17: Search in structured networks CS 765 Complex Networks Slides are modified from Lada Adamic.

Slides:



Advertisements
Similar presentations
Routing in Poisson small-world networks A. J. Ganesh Microsoft Research, Cambridge Joint work with Moez Draief.
Advertisements

The Small World Phenomenon: An Algorithmic Perspective Speaker: Bradford Greening, Jr. Rutgers University – Camden.
1 Analyzing Kleinberg’s Small-world Model Chip Martel and Van Nguyen Computer Science Department; University of California at Davis.
Small-world networks.
Online Social Networks and Media Navigation in a small world.
What did we see in the last lecture?. What are we going to talk about today? Generative models for graphs with power-law degree distribution Generative.
Information Networks Small World Networks Lecture 5.
Advanced Topics in Data Mining Special focus: Social Networks.
Identity and search in social networks Presented by Pooja Deodhar Duncan J. Watts, Peter Sheridan Dodds and M. E. J. Newman.
Lecture 7 CS 728 Searchable Networks. Errata: Differences between Copying and Preferential Attachment In generative model: let p k be fraction of nodes.
CS 599: Social Media Analysis University of Southern California1 The Basics of Network Analysis Kristina Lerman University of Southern California.
School of Information University of Michigan SI 614 Search in structured networks Lecture 15.
Company LOGO 1 Identity and Search in Social Networks D.J.Watts, P.S. Dodds, M.E.J. Newman Maryam Fazel-Zarandi.
Search in Power-Law Networks Presented by Hakim Weatherspoon CS294-4: Peer-to-Peer Systems Slides also borrowed from the following paper Path Finding Strategies.
CS728 Lecture 5 Generative Graph Models and the Web.
Small-World Graphs for High Performance Networking Reem Alshahrani Kent State University.
Social Networks 101 P ROF. J ASON H ARTLINE AND P ROF. N ICOLE I MMORLICA.
1 Analyzing Kleinberg’s (and other) Small-world Models Chip Martel and Van Nguyen Computer Science Department; University of California at Davis.
CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.
Building Low-Diameter P2P Networks Eli Upfal Department of Computer Science Brown University Joint work with Gopal Pandurangan and Prabhakar Raghavan.
Advanced Topics in Data Mining Special focus: Social Networks.
Analysis of Social Information Networks Thursday January 27 th, Lecture 2: Algorithmic Small World 1.
The Small World Phenomenon: An Algorithmic Perspective by Anton Karatoun.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 7 May 14, 2006

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis 1  How many samples do we need to converge?  How many Random Walk steps to get.
School of Information University of Michigan SI 614 Small Worlds Lecture 5 Instructor: Lada Adamic.
Lecture 18: Small World Networks CS 790g: Complex Networks
Information Networks Power Laws and Network Models Lecture 3.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
Geogra phic routing in social networks David Liben-Nowell, Jasmine Novak, Ravi Kumar, Prabhakar Raghavan, Andrew Tomkins Presentation prepared by Dor Medalsy.
The Science of Networks 6.1 Overview Social Goal. Explain why information and disease spread so quickly in social networks. Mathematical Approach. Model.
Proof of Kleinberg’s small-world theorems
Lada Adamic, HP Labs, Palo Alto, CA. Talk outline Information flow through blogs Information flow through Search through networks Search within.
School of Information University of Michigan Unless otherwise noted, the content of this course material is licensed under a Creative Commons Attribution.
Small World Problem Christopher McCarty. Small World Phenomenon You meet someone, seemingly randomly, who has a connection to someone you know – Person.
Small-world networks. What is it? Everyone talks about the small world phenomenon, but truly what is it? There are three landmark papers: Stanley Milgram.
School of Information University of Michigan Search in networks Lada Adamic (U. Michigan) NetSci Workshop May 16 th, 2006.
School of Information University of Michigan Unless otherwise noted, the content of this course material is licensed under a Creative Commons Attribution.
Gennaro Cordasco - How Much Independent Should Individual Contacts be to Form a Small-World? - 19/12/2006 How Much Independent Should Individual Contacts.
Online Social Networks and Media
School of Information University of Michigan SI 614 Livejournal Lecture 23.
Navigation in small worlds Social Networks: Models and Applications Seminar Toronto, Fall 2007 (based on a presentation by Stratis Ioannidis)
It's a small world but I wouldn't want to paint it - Stephen Wright.
Complex Network Theory – An Introduction Niloy Ganguly.
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Complex Network Theory – An Introduction Niloy Ganguly.
Most of contents are provided by the website Network Models TJTSD66: Advanced Topics in Social Media (Social.
What Is A Network? (and why do we care?). An Introduction to Network Theory | Kyle Findlay | SAMRA 2010 | 2 “A collection of objects (nodes) connected.
Clusters Recognition from Large Small World Graph Igor Kanovsky, Lilach Prego Emek Yezreel College, Israel University of Haifa, Israel.
How Do “Real” Networks Look?
Small World Social Networks With slides from Jon Kleinberg, David Liben-Nowell, and Daniel Bilar.
Performance Evaluation Lecture 1: Complex Networks Giovanni Neglia INRIA – EPI Maestro 10 December 2012.
Siddhartha Gunda Sorabh Hamirwasia.  Generating small world network model.  Optimal network property for decentralized search.  Variation in epidemic.
Models and Algorithms for Complex Networks
Models of Web-Like Graphs: Integrated Approach
Navigation in Networks, Revisited Networked Life MKSE 112 Fall 2012 Prof. Michael Kearns.
Topics In Social Computing (67810) Module 1 Introduction & The Structure of Social Networks.
Small-world phenomenon: An Algorithmic Perspective Jon Kleinberg.
Search in structured networks
Identity and Search in Social Networks
Peer-to-Peer and Social Networks
Small-World Navigability
How Do “Real” Networks Look?
How Do “Real” Networks Look?
How Do “Real” Networks Look?
How Do “Real” Networks Look?
Lecture 9: Network models CS 765: Complex Networks
Navigation and Propagation in Networks
Advanced Topics in Data Mining Special focus: Social Networks
Presentation transcript:

Lecture 17: Search in structured networks CS 765 Complex Networks Slides are modified from Lada Adamic

Mary Bob Jane Who could introduce me to Richard Gere? How do we search? 2

number of nodes found power-law graph 3

93 number of nodes found Poisson graph 4

How would you search for a node here? 5

What about here? 6

gnutella network fragment 7

step cumulative nodes found at step high degree seeking 1st neighbors high degree seeking 2nd neighbors 50% of the files in a 700 node network can be found in < 8 steps Gnutella network 8

And here? 9

here? 10

How to choose among hundreds of acquaintances? Strategy: Simple greedy algorithm - each participant chooses correspondent who is closest to target with respect to the given property Models geography Kleinberg (2000) hierarchical groups Watts, Dodds, Newman (2001), Kleinberg(2001) high degree nodes Adamic, Puniyani, Lukose, Huberman (2001), Newman(2003) How are people are able to find short paths? 11

How many hops actually separate any two individuals in the world? People use only local information not perfect in routing messages “The accuracy of small world chains in social networks” Peter D. Killworth, Chris McCarty, H. Russell Bernard & Mark House: Analyzed 10,920 shortest path connections between 105 members of an interviewing bureau, together with the equivalent conceptual, or ‘small world’ routes, which use individuals’ selections of intermediaries. study the impact of accuracy within small world chains. The mean small world path length (3.23) is 40% longer than the mean of the actual shortest paths (2.30) Model suggests that people make a less than optimal small world choice more than half the time. 12

nodes are placed on a lattice and connect to nearest neighbors additional links placed with p uv ~ review: Spatial search “The geographic movement of the [message] from Nebraska to Massachusetts is striking. There is a progressive closing in on the target area as each new person is added to the chain” S.Milgram ‘The small world problem’, Psychology Today 1,61,1967 Kleinberg, ‘The Small World Phenomenon, An Algorithmic Perspective’ Proc. 32nd ACM Symposium on Theory of Computing. (Nature 2000) 13

demo how does the probability of long-range links affect search? 14

When r=0, links are randomly distributed, ASP ~ log(n), n size of grid When r=0, any decentralized algorithm is at least a 0 n 2/3 no locality When r<2, expected time at least  r n (2-r)/3 15

Overly localized links on a lattice When r>2 expected search time ~ N (r-2)/(r-1) 16

Links balanced between long and short range When r=2, expected time of a DA is at most C (log N) 2 17

Use a well defined network: HP Labs correspondence over 3.5 months Edges are between individuals who sent at least 6 messages each way 450 users median degree = 10, mean degree = 13 average shortest path = 3 Node properties specified: degree geographical location position in organizational hierarchy Testing search models on social networks advantage: have access to entire communication network and to individual’s attributes 18

Power-law degree distribution of all senders of passing through HP labs Strategy 1: High degree search number of recipients sender has sent to proportion of senders 19

Filtered network (at least 6 messages sent each way) Degree distribution no longer power-law, but Poisson It would take 40 steps on average (median of 16) to reach a target! 20

Strategy 2: Geography 21

1U 2L3L 3U 2U 4U 1L 87 % of the 4,000 links are between individuals on the same floor Communication across corporate geography 22

Cubicle distance vs. probability of being linked optimum for search source: Adamic and Adar, How to search a social network, Social Networks,How to search a social network 23

Livejournal LiveJournal provides an API to crawl the friendship network + profiles friendly to researchers great research opportunity basic statistics Users (stats from April 2006) How many users, and how many of those are active? Total accounts: 9,980, active in some way: 1,979, that have ever updated: 6,755, update in last 30 days: 1,300, update in last 7 days: 751, update in past 24 hours: 216,581 24

Predominantly female & young demographic Male: 1,370,813 (32.4%) Female: 2,856,360 (67.6%) Unspecified: 1,575, Age distribution 25

Geographic Routing in Social Networks David Liben-Nowell, Jasmine Novak, Ravi Kumar, Prabhakar Raghavan, and Andrew Tomkins (PNAS’05) data used Feb ,000 LiveJournal users with US locations giant component (77.6%) of the network clustering coefficient:

Degree distributions The broad degree distributions we’ve learned to know and love but more probably lognormal than power law broader in degree than outdegree distribution Source: 27

Results of a simple greedy geographical algorithm Choose source s and target t randomly Try to reach target’s city – not target itself At each step, the message is forwarded from the current message holder u to the friend v of u geographically closest to t stop if d(v,t) > d(u,t) 13% of the chains are completed if d(v,t) > d(u,t) pick a neighbor at random in the same city if possible, else stop 80% of the chains are completed 28

the geographic basis of friendship  = d(u,v) the distance between pairs of people The probability that two people are friends given their distance is equal to P(  ) =  + f(  ),  is a constant independent of geography  is 5.0 x for LiveJournal users who are very far apart 29

the geographic basis of friendship The average user will have ~ 2.5 non-geographic friends The other friends (5.5 on average) are distributed according to an approximate 1/distance relationship But 1/d was proved not to be navigable by Kleinberg, so what gives? 30

Navigability in networks of variable geographical density Kleinberg assumed a uniformly populated 2D lattice But population is far from uniform population networks and rank-based friendship probability of knowing a person depends not on absolute distance but on relative distance i.e. how many people live closer Pr[u ->v] ~ 1/rank u (v) 31

what if we don’t have geography? 32

does community structure help? 33

Kleinberg, ‘Small-World Phenomena and the Dynamics of Information’ Individuals classified into a hierarchy, h ij = height of the least common ancestor. Group structure models: Individuals belong to nested groups q = size of smallest group that v,w belong to f(q) ~ q -  Theorem: If  = 1 and outdegree is polylogarithmic, can s ~ O(log n) h b=3 e.g. state-county-city-neighborhood industry-corporation-division-group review: hierarchical small world models 34

Why search is fast in hierarchical topologies T S R R’ 35

individuals belong to hierarchically nested groups multiple independent hierarchies h=1,2,..,H coexist corresponding to occupation, geography, hobbies, religion… p ij ~ exp(-  x) Source: Identity and Search in Social Networks: Duncan J. Watts, Peter Sheridan Dodds, and M. E. J. Newman; hierarchical models with multiple hierarchies 36

Source: Identity and Search in Social Networks: Duncan J. Watts, Peter Sheridan Dodds, and M. E. J. Newman; 37

Identity and search in social networks Watts, Dodds, Newman (2001) Message chains fail at each node with probability p Network is ‘searchable’ if a fraction r of messages reach the target N= N= N= Source: Identity and Search in Social Networks: Duncan J. Watts, Peter Sheridan Dodds, and M. E. J. Newman; 38

Small World Model, Watts et al. Fits Milgram’s data well Model parameters: N = 10 8 z = 300 g = 100 b = 10  = 1, H = 2 L model = 6.7 L data = more slides on this: 39

does it work in practice? back to HP Labs: Organizational hierarchy Strategy 3: Organizational Hierarchy 40

correspondence superimposed on the organizational hierarchy 41

Example of search path distance 1 distance 2 hierarchical distance = 5 search path distance = 4 distance 1 42

Probability of linking vs. distance in hierarchy in the ‘searchable’ regime: 0 <  < 2 (Watts, Dodds, Newman 2001) 43

Results distancehierarchygeographygeodesicorgrandom median mean5.7 (4.7) hierarchy geography source: Adamic and Adar, How to search a social network, Social Networks, 27(3), p , 2005.How to search a social network 44

Individuals associate on different levels into groups. Group structure facilitates decentralized search using social ties. Hierarchy search faster than geographical search A fraction of ‘important’ individuals are easily findable Humans may be more resourceful in executing search tasks: making use of weak ties using more sophisticated strategies conclusions 45