Lecture 19: Information diffusion in networks CS 765 Complex Networks Slides are modified from Lada Adamic, David Kempe, Bill Hackbor.

Slides:



Advertisements
Similar presentations
Mobile Communication Networks Vahid Mirjalili Department of Mechanical Engineering Department of Biochemistry & Molecular Biology.
Advertisements

The Strength of Weak Ties
Dr. Henry Hexmoor Department of Computer Science Southern Illinois University Carbondale Network Theory: Computational Phenomena and Processes Social Network.
COMP 621U Week 3 Social Influence and Information Diffusion
Spread of Influence through a Social Network Adapted from :
Maximizing the Spread of Influence through a Social Network
Stelios Lelis UAegean, FME: Special Lecture Social Media & Social Networks (SM&SN)
Λ14 Διαδικτυακά Κοινωνικά Δίκτυα και Μέσα Strong and Weak Ties Chapter 3, from D. Easley and J. Kleinberg book.
Modeling Malware Spreading Dynamics Michele Garetto (Politecnico di Torino – Italy) Weibo Gong (University of Massachusetts – Amherst – MA) Don Towsley.
Based on chapter 3 in Networks, Crowds and markets (by Easley and Kleinberg) Roy Mitz Supervised by: Prof. Ronitt Rubinfeld November 2014 Strong and weak.
Maximizing the Spread of Influence through a Social Network
Nodes, Ties and Influence
Maximizing the Spread of Influence through a Social Network By David Kempe, Jon Kleinberg, Eva Tardos Report by Joe Abrams.
Population dynamics of infectious diseases Arjan Stegeman.
Epidemics in Social Networks
1 Epidemic Spreading in Real Networks: an Eigenvalue Viewpoint Yang Wang Deepayan Chakrabarti Chenxi Wang Christos Faloutsos.
Structure of Information Pathways in a Social Communication Network Gueorgi KossinetsJon Kleinberg Duncan Watts.
Influence and Correlation in Social Networks Aris Anagnostopoulos Ravi Kumar Mohammad Mahdian.
Social Networks: Advertising, Pricing and All That Zvi Topol & Itai Yarom.
Graphs and Topology Yao Zhao. Background of Graph A graph is a pair G =(V,E) –Undirected graph and directed graph –Weighted graph and unweighted graph.
Using Social Networks to Analyze Sexual Relations by Amanda Dargie.
Multicast Communication Multicast is the delivery of a message to a group of receivers simultaneously in a single transmission from the source – The source.
INNOVATION SPREADING: A PROCESS ON MULTIPLE SCALES János Kertész Central European University Center for Network Science Lorentz Centre, Leiden, 2013.
A Measurement-driven Analysis of Information Propagation in the Flickr Social Network WWW09 报告人: 徐波.
Maximizing Product Adoption in Social Networks
Models of Influence in Online Social Networks
Using Friendship Ties and Family Circles for Link Prediction Elena Zheleva, Lise Getoor, Jennifer Golbeck, Ugur Kuter (SNAKDD 2008)
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
Online Social Networks and Media Epidemics and Influence.
A Distributed and Privacy Preserving Algorithm for Identifying Information Hubs in Social Networks M.U. Ilyas, Z Shafiq, Alex Liu, H Radha Michigan State.
Influence and Correlation in Social Networks Mohammad Mahdian Yahoo! Research Joint work with Aris Anagnostopoulos and Ravi Kumar to appear in KDD’08.
Information diffusion
CS 599: Social Media Analysis University of Southern California1 Social Ties and Information Diffusion Kristina Lerman University of Southern California.
© 2004 M. Van Alstyne, All rights reserved. Information, & Output MIT Center for E-Business Marshall Van Alstyne with N. Bulkley, E. Brynjolfsson,
Information diffusion
V5 Epidemics on networks
Today’s topics Strength of Weak Ties Next Topic Acknowledgements
Maximizing the Spread of Influence through a Social Network David Kempe, Jon Kleinberg, Eva Tardos Cornell University KDD 2003.
Professor Yashar Ganjali Department of Computer Science University of Toronto
Maximizing the Spread of Influence through a Social Network Authors: David Kempe, Jon Kleinberg, É va Tardos KDD 2003.
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
I NFORMATION C ASCADE Priyanka Garg. OUTLINE Information Propagation Virus Propagation Model How to model infection? Inferring Latent Social Networks.
+ Big Data, Network Analysis Week How is date being used Predict Presidential Election - Nate Silver –
On Bharathi-Kempe-Salek Conjecture about Influence Maximization Ding-Zhu Du University of Texas at Dallas.
CS 590 Term Project Epidemic model on Facebook
1 Finding Spread Blockers in Dynamic Networks (SNAKDD08)Habiba, Yintao Yu, Tanya Y., Berger-Wolf, Jared Saia Speaker: Hsu, Yu-wen Advisor: Dr. Koh, Jia-Ling.
The Social Network within a Management Recruiting Firm: Network Structure and Output Neil Gandal (Tel Aviv University and CEPR), Charles King (Greylock.
Steffen Staab 1WeST Web Science & Technologies University of Koblenz ▪ Landau, Germany Network Theory and Dynamic Systems Cascading.
LECTURE 14 The Diffusion of Innovations II 1. Cumulative and Individual Adoption Patterns 2.
Topics In Social Computing (67810) Module 2 (Dynamics) Cascades, Memes, and Epidemics (Networks Crowds & Markets Ch. 21)
Effects of Social Network Structure on Diffusion and Adoption of Agricultural Technology: Evidence from Rural Ethiopia Yasuyuki Todo, Petr Matous (The.
Topics In Social Computing (67810) Module 1 Introduction & The Structure of Social Networks.
Contagion in Networks Networked Life NETS 112 Fall 2015 Prof. Michael Kearns.
Wenyu Zhang From Social Network Group
Hiroki Sayama NECSI Summer School 2008 Week 2: Complex Systems Modeling and Networks Network Models Hiroki Sayama
Independent Cascade Model and Linear Threshold Model
Information diffusion
New models for power law:
What Stops Social Epidemics?
Information diffusion
Information diffusion in networks
Networked Life NETS 112 Fall 2018 Prof. Michael Kearns
Network Science: A Short Introduction i3 Workshop
Independent Cascade Model and Linear Threshold Model
Networked Life NETS 112 Fall 2017 Prof. Michael Kearns
Networked Life NETS 112 Fall 2016 Prof. Michael Kearns
Diffusion in Networks Dr. Henry Hexmoor Department of Computer Science Southern Illinois University Carbondale 1/17/2019.
Independent Cascade Model and Linear Threshold Model
“The Spread of Physical Activity Through Social Networks”
Networked Life NETS 112 Fall 2019 Prof. Michael Kearns
Presentation transcript:

Lecture 19: Information diffusion in networks CS 765 Complex Networks Slides are modified from Lada Adamic, David Kempe, Bill Hackbor

outline factors influencing information diffusion network structure: which nodes are connected? strength of ties: how strong are the connections? studies in information diffusion: Granovetter: the strength of weak ties J-P Onnela et al: strength of intermediate ties Kossinets et al: strength of backbone ties Davis: board interlocks and adoption of practices network position and access to information Burt: Structural holes and good ideas Aral and van Alstyne: networks and information advantage networks and innovation Lazer and Friedman: innovation

factors influencing diffusion network structure (unweighted) density degree distribution clustering connected components community structure strength of ties (weighted) frequency of communication strength of influence spreading agent attractiveness and specificity of information

Strong tie defined A strong tie frequent contact affinity many mutual contacts Less likely to be a bridge (or a local bridge) “forbidden triad”: strong ties are likely to “close” Source: Granovetter, M. (1973). "The Strength of Weak Ties",

school kids and 1 st through 8 th choices of friends snowball sampling: will you reach more different kids by asking each kid to name their 2 best friends, or their 7 th & 8 th closest friend? Source: M. van Alstyne, S. Aral. Networks, Information & Social Capital

outline factors influencing information diffusion network structure: which nodes are connected? strength of ties: how strong are the connections? studies in information diffusion: Granovetter: the strength of weak ties J-P Onnela et al: strength of intermediate ties Kossinets et al: strength of backbone ties Davis: board interlocks and adoption of practices network position and access to information Burt: Structural holes and good ideas Aral and van Alstyne: networks and information advantage networks and innovation Lazer and Friedman: innovation

how does strength of a tie influence diffusion? M. S. Granovetter: The Strength of Weak Ties, AJS, 1973: finding a job through a contact that one saw frequently (2+ times/week) 16.7% occasionally (more than once a year but < 2x week) 55.6% rarely 27.8% but… length of path is short contact directly works for/is the employer or is connected directly to employer

strength of tie: frequency of communication Kossinets, Watts, Kleinberg, KDD 2008: which paths yield the most up to date info? how many of the edges form the “backbone”? source: Kossinets et al. “The structure of information pathways in a social communication network”

the strength of intermediate ties strong ties frequent communication, but ties are redundant due to high clustering weak ties reach far across network, but communication is infrequent… “Structure and tie strengths in mobile communication networks” use nation-wide cellphone call records and simulate diffusion using actual call timing

source: Onnela J. et.al. Structure and tie strengths in mobile communication networks Localized strong ties slow infection spread.

how can information diffusion be different from simple contagion (e.g. a virus)? simple contagion: infected individual infects neighbors with information at some rate threshold contagion: individuals must hear information (or observe behavior) from a number or fraction of friends before adopting in lab: complex contagion (Centola & Macy, AJS, 2007) how do you pick individuals to “infect” such that your opinion prevails

Framework The network of computers consists of nodes (computers) and edges (links between nodes) Each node is in one of two states Susceptible (in other words, healthy) Infected Susceptible-Infected-Susceptible (SIS) model Cured nodes immediately become susceptible SusceptibleInfected Infected by neighbor Cured internally

Framework (Continued) Homogeneous birth rate β on all edges between infected and susceptible nodes Homogeneous death rate δ for infected nodes Infected Healthy XN1 N3 N2 Prob. β Prob. δ

SIR and SIS Models An SIR model consists of three group Susceptible: Those who may contract the disease Infected: Those infected Recovered: Those with natural immunity or those that have died. An SIS model consists of two group Susceptible: Those who may contract the disease Infected: Those infected

Important Parameters α is the transmission coefficient, which determines the rate at which the disease travels from one population to another. γ is the recovery rate: (I persons)/(days required to recover) R 0 is the basic reproduction number. (Number of new cases arising from one infective) x (Average duration of infection) If R 0 > 1 then ∆I > 0 and an epidemic occurs

SIR and SIS Models SIR Model: SIS Model:

diffusion of innovation surveys: farmers adopting new varieties of hybrid corn by observing what their neighbors were planting (Ryan and Gross, 1943) doctors prescribing new medication (Coleman et al. 1957) spread of obesity & happiness in social networks (Christakis and Fowler, 2008) online behavioral data: Spread of Flickr photos & Digg stories (Lerman, 2007) joining LiveJournal groups & CS conferences (Backstrom et al. 2006) + others e.g. Anagnostopoulos et al. 2008

18 Open question: how do we tell influence from correlation? approaches: time resolved data: if adoption time is shuffled, does it yield the same patterns? if edges are directed: does reversing the edge direction yield less predictive power?

poison pills diffused through interlocks geography had little to do with it more likely to be influenced by tie to firm doing something similar & having similar centrality golden parachutes did not diffuse through interlocks geography was a significant factor more likely to follow “central” firms why did one diffuse through the “network” while the other did not? Example: adopting new practices Source: Corporate Elite Networks and Governance Changes in the 1980s.

Social Network and Spread of Influence Social network plays a fundamental role as a medium for the spread of INFLUENCE among its members Opinions, ideas, information, innovation… Direct Marketing takes the “word-of-mouth” effects to significantly increase profits Gmail, Tupperware popularization, Microsoft Origami …

Problem Setting Given a limited budget B for initial advertising e.g. give away free samples of product estimates for influence between individuals Goal trigger a large cascade of influence e.g. further adoptions of a product Question Which set of individuals should B target at? Application besides product marketing spread an innovation detect stories in blogs

Models of Influence First mathematical models [Schelling '70/'78, Granovetter '78] Large body of subsequent work: [Rogers '95, Valente '95, Wasserman/Faust '94] Two basic classes of diffusion models: threshold and cascade General operational view: A social network is represented as a directed graph, with each person (customer) as a node Nodes start either active or inactive An active node may trigger activation of neighboring nodes Monotonicity assumption: active nodes never deactivate

Threshold dynamics The network: a ij is the adjacency matrix ( N ×N ) un-weighted undirected The nodes: are labelled i, i from 1 to N; have a state ; and a threshold r i from some distribution.

The fraction of nodes in state v i =1 is  (t) : Threshold dynamics Updating: Neighbourhood average: Node i has state and threshold

Linear Threshold Model A node v has random threshold θ v ~ U[0,1] A node v is influenced by each neighbor w according to a weight b vw such that A node v becomes active when at least (weighted) θ v fraction of its neighbors are active

Example Inactive Node Active Node Threshold Active neighbors v w Stop! U X

Independent Cascade Model When node v becomes active, it has a single chance of activating each currently inactive neighbor w. The activation attempt succeeds with probability p vw.

Example v w Inactive Node Active Node Newly active node Successful attempt Unsuccessful attempt Stop! U X

outline factors influencing information diffusion network structure: which nodes are connected? strength of ties: how strong are the connections? studies in information diffusion: Granovetter: the strength of weak ties J-P Onnela et al: strength of intermediate ties Kossinets et al: strength of backbone ties Davis: board interlocks and adoption of practices network position and access to information Burt: Structural holes and good ideas Aral and van Alstyne: networks and information advantage networks and innovation Lazer and Friedman: innovation

Burt: structural holes and good ideas Managers asked to come up with an idea to improve the supply chain Then asked: whom did you discuss the idea with? whom do you discuss supply-chain issues with in general do those contacts discuss ideas with one another? 673 managers (455 (68%) completed the survey) ~ 4000 relationships (edges)

results people whose networks bridge structural holes have higher compensation positive performance evaluations more promotions more good ideas these brokers are more likely to express ideas less likely to have their ideas dismissed by judges more likely to have their ideas evaluated as valuable

networks & information advantage BetweennessConstrained vs. Unconstrained Source: M. van Alstyne, S. Aral. Networks, Information & Social Capital slides: Marshall van Alstyne

Aral & Alstyne: Study of a head hunter firm Three firms initially Unusually measurable inputs and outputs 1300 projects over 5 yrs and 125,000 messages over 10 months (avg 20% of time!) Metrics (i) Revenues per person and per project, (ii) number of completed projects, (iii) duration of projects, (iv) number of simultaneous projects, (v) compensation per person Main firm 71 people in executive search (+2 firms partial data) 27 Partners, 29 Consultants, 13 Research, 2 IT staff Four Data Sets per firm 52 Question Survey (86% response rate) Accounting 15 Semi-structured interviews

structure matters Coefficients a (Base Model) Best structural pred. Ave. Size Colleagues’ Ave. Response Time BStd. Error Unstandardized Coefficients Adj. R 2 Sig. F  Dependent Variable: Bookings02 a. Coefficients a BStd. Error Unstandardized Coefficients Adj. R 2 Sig. F  Dependent Variable: Billings02 a. New Contract RevenueContract Execution Revenue *** ** ** * ** Base Model: YRS_EXP, PARTDUM, %_CEO_SRCH, SECTOR(dummies), %_SOLO. b. N=39. *** p<.01, ** p<.05, * p<.1 b. Sending shorter helps get contracts and finish them. Faster response from colleagues helps finish them.

diverse networks drive performance by providing access to novel information network structure (having high degree) correlates with receiving novel information sooner (as deduced from hashed versions of their ) getting information sooner correlates with $$ brought in controlling for # of years worked job level ….

Network Structure Matters Coefficients a (Base Model) Size Struct. Holes Betweenness BStd. Error Unstandardized Coefficients Adj. R 2 Sig. F  Dependent Variable: Bookings02 a. Coefficients a BStd. Error Unstandardized Coefficients Adj. R 2 Sig. F  Dependent Variable: Billings02 a. New Contract RevenueContract Execution Revenue *** * * ** Base Model: YRS_EXP, PARTDUM, %_CEO_SRCH, SECTOR(dummies), %_SOLO. b. N=39. *** p<.01, ** p<.05, * p<.1 b. Bridging diverse communities is significant. Being in the thick of information flows is significant.

outline factors influencing information diffusion network structure: which nodes are connected? strength of ties: how strong are the connections? studies in information diffusion: Granovetter: the strength of weak ties J-P Onnela et al: strength of intermediate ties Kossinets et al: strength of backbone ties Davis: board interlocks and adoption of practices network position and access to information Burt: Structural holes and good ideas Aral and van Alstyne: networks and information advantage networks and innovation Lazer and Friedman: innovation

networks and innovation: is more information diffusion always better? Nodes can innovate on their own (slowly) or adopt their neighbor’s solution Best solutions propagate through the network source: Lazer and Friedman, The Parable of the Hare and the Tortoise: Small Worlds, Diversity, and System Performance linear network fully connected network

networks and innovation fully connected network converges more quickly on a solution, but if there are lots of local maxima in the solution space, it may get stuck without finding optimum. linear network (fewer edges) arrives at better solution eventually because individuals innovate longer

lab: networks and coordination Kearns et al. “An Experimental Study of the Coloring Problem on Human Subject Networks” network structure affects convergence in coordination games, e.g. graph coloring

to sum up network structure influences information diffusion strength of tie matters diffusion can be simple (person to person) or complex (individuals having thresholds) people in special network positions (the brokers) have an advantage in receiving novel info & coming up with “novel” ideas in some scenarios, information diffusion may hinder innovation