Modeling Blog Dynamics Speaker: Michaela Götz Joint work with: Jure Leskovec, Mary McGlohon, Christos Faloutsos Cornell University Carnegie Mellon University.

Slides:



Advertisements
Similar presentations
RTM: Laws and a Recursive Generator for Weighted Time-Evolving Graphs Leman Akoglu, Mary McGlohon, Christos Faloutsos Carnegie Mellon University School.
Advertisements

1 Dynamics of Real-world Networks Jure Leskovec Machine Learning Department Carnegie Mellon University
Jurij Leskovec, CMU Jon Kleinberg, Cornell Christos Faloutsos, CMU
1 Realistic Graph Generation and Evolution Using Kronecker Multiplication Jurij Leskovec, CMU Deepay Chakrabarti, CMU/Yahoo Jon Kleinberg, Cornell Christos.
Albert-László Barabási
Spread of Influence through a Social Network Adapted from :
Cost-effective Outbreak Detection in Networks Jure Leskovec, Andreas Krause, Carlos Guestrin, Christos Faloutsos, Jeanne VanBriesen, Natalie Glance.
Power Laws By Cameron Megaw 3/11/2013. What is a Power Law?
Analysis and Modeling of Social Networks Foudalis Ilias.
Jure Leskovec, CMU Lars Backstrom, Cornell Ravi Kumar, Yahoo! Research Andrew Tomkins, Yahoo! Research.
RESILIENCE NOTIONS FOR SCALE-FREE NETWORKS GUNES ERCAL JOHN MATTA 1.
Lecture 21 Network evolution Slides are modified from Jurij Leskovec, Jon Kleinberg and Christos Faloutsos.
Patterns of Influence in a Recommendation Network Jure Leskovec, CMU Ajit Singh, CMU Jon Kleinberg, Cornell School of Computer Science Carnegie Mellon.
Efficient Distribution Mining and Classification Yasushi Sakurai (NTT Communication Science Labs), Rosalynn Chong (University of British Columbia), Lei.
SLAW: A Mobility Model for Human Walks Lee et al..
1 A Random-Surfer Web-Graph Model (Joint work with Avrim Blum & Hubert Chan) Mugizi Rwebangira.
Masters Thesis Defense Amit Karandikar Advisor: Dr. Anupam Joshi Committee: Dr. Finin, Dr. Yesha, Dr. Oates Date: 1 st May 2007 Time: 9:30 am Place: ITE.
Directional triadic closure and edge deletion mechanism induce asymmetry in directed edge properties.
Scale Free Networks Robin Coope April Abert-László Barabási, Linked (Perseus, Cambridge, 2002). Réka Albert and AL Barabási,Statistical Mechanics.
Weighted Graphs and Disconnected Components Patterns and a Generator Mary McGlohon, Leman Akoglu, Christos Faloutsos Carnegie Mellon University School.
Social Networks and Graph Mining Christos Faloutsos CMU - MLD.
Cascading Behavior in Large Blog Graphs Patterns and a Model Leskovec et al. (SDM 2007)
Web as Graph – Empirical Studies The Structure and Dynamics of Networks.
CS Lecture 6 Generative Graph Models Part II.
Blogosphere  What is blogosphere?  Why do we need to study Blog-space or Blogosphere?
Sampling from Large Graphs. Motivation Our purpose is to analyze and model social networks –An online social network graph is composed of millions of.
RTG: A Recursive Realistic Graph Generator using Random Typing Leman Akoglu and Christos Faloutsos Carnegie Mellon University.
Cascading Behavior in Large Blog Graphs: Patterns and a model offence.
Analysis of the Internet Topology Michalis Faloutsos, U.C. Riverside (PI) Christos Faloutsos, CMU (sub- contract, co-PI) DARPA NMS, no
1 Characterizing Selfishly Constructed Overlay Routing Networks March 11, 2004 Byung-Gon Chun, Rodrigo Fonseca, Ion Stoica, and John Kubiatowicz University.
1 Exploring Blog Networks Patterns and a Model for Information Propagation Mary McGlohon In collaboration with Jure Leskovec, Christos Faloutsos Natalie.
Optimization Based Modeling of Social Network Yong-Yeol Ahn, Hawoong Jeong.
九大数理集中講義 Comparison, Analysis, and Control of Biological Networks (3) Domain-Based Mathematical Models for Protein Evolution Tatsuya Akutsu Bioinformatics.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
Limits of Local Algorithms in Random Graphs
Weighted Graphs and Disconnected Components Patterns and a Generator IDB Lab 현근수 In KDD 08. Mary McGlohon, Leman Akoglu, Christos Faloutsos.
To Blog or Not to Blog: Characterizing and Predicting Retention in Community Blogs Imrul Kayes 1, Xiang Zuo 1, Da Wang 2, Jacob Chakareski 3 1 University.
Jure Leskovec Computer Science Department Cornell University / Stanford University Joint work with: Jon Kleinberg (Cornell), Christos.
Emergence of Scaling and Assortative Mixing by Altruism Li Ping The Hong Kong PolyU
1 Graph mining techniques applied to blogs Mary McGlohon Seminar on Social Media Analysis- Oct
Maximizing the Spread of Influence through a Social Network David Kempe, Jon Kleinberg, Eva Tardos Cornell University KDD 2003.
Butterfly model slides. Topological Model: “Butterfly” Objective: Develop model to help explain behavioral mechanisms that cause observed properties,
Du, Faloutsos, Wang, Akoglu Large Human Communication Networks Patterns and a Utility-Driven Generator Nan Du 1,2, Christos Faloutsos 2, Bai Wang 1, Leman.
RTM: Laws and a Recursive Generator for Weighted Time-Evolving Graphs Leman Akoglu, Mary McGlohon, Christos Faloutsos Carnegie Mellon University School.
Speaker : Yu-Hui Chen Authors : Dinuka A. Soysa, Denis Guangyin Chen, Oscar C. Au, and Amine Bermak From : 2013 IEEE Symposium on Computational Intelligence.
Anonymous communication over social networks Shishir Nagaraja and Ross Anderson Security Group Computer Laboratory.
Jure Leskovec, CMU Lars Backstrom, Cornell Ravi Kumar, Yahoo! Research Andrew Tomkins, Yahoo! Research.
1 Patterns of Cascading Behavior in Large Blog Graphs Jure Leskoves, Mary McGlohon, Christos Faloutsos, Natalie Glance, Matthew Hurst SDM 2007 Date:2008/8/21.
Of 17 Limits of Local Algorithms in Random Graphs Madhu Sudan MSR Joint work with David Gamarnik (MIT) 7/11/2013Local Algorithms on Random Graphs1.
Dynamic Network Analysis Case study of PageRank-based Rewiring Narjès Bellamine-BenSaoud Galen Wilkerson 2 nd Second Annual French Complex Systems Summer.
Dynamics of Real-world Networks
SLAW: A Mobility Model for Human Walks
Topics In Social Computing (67810)
On Growth of Limited Scale-free Overlay Network Topologies
Who are the most influential bloggers?
Center for Complexity in Business, R. Smith School of Business
CS224W: Social and Information Network Analysis
Generative Model To Construct Blog and Post Networks In Blogosphere
Part 1: Graph Mining – patterns
A Locality Model of the Evolution of Blog Networks
Lecture 13 Network evolution
The likelihood of linking to a popular website is higher
Dynamics of Real-world Networks
Graph and Tensor Mining for fun and profit
Modelling Structure and Function in Complex Networks
Cost-effective Outbreak Detection in Networks
Lecture 21 Network evolution
Malik Magdon-Ismail, Konstantin Mertsalov, Mark Goldberg
Affiliation Network Models of Clusters in Networks
Human-centered Machine Learning
Presentation transcript:

Modeling Blog Dynamics Speaker: Michaela Götz Joint work with: Jure Leskovec, Mary McGlohon, Christos Faloutsos Cornell University Carnegie Mellon University Stanford University

Modeling the Blogosphere Blogosphere  System of interactions: ▫Entities: Bloggers, Posts, Topics, … ▫Relations:  Blogger publishes post  Post is about a topic  Post links to post Model  Simple set of rules (followed by a blogger) that creates these interactions Evaluation  Creating a synthetic blogosphere and comparing it to real blogosphere Motivation  Forecasting, advertising Michaela GötzModeling Blog DynamicsICWSM 2009

How is this different from modeling social networks? ▫2 networks combined: Blog vs. Post network B1 B4 B3 B2 B6 B7 B5 Blogosphere B1 B4 B3 B2 B6 B7 B Postnetwork Blognetwork Cascade Michaela GötzModeling Blog DynamicsICWSM 2009

▫2 networks combined: Blog vs. Post network ▫Complex temporal dynamics B4 B3 B2 B6 B7 B5 B1 How is this different from modeling social networks? Michaela GötzModeling Blog DynamicsICWSM 2009

▫2 networks combined: Blog vs. Post network ▫Complex temporal dynamics B1 B4 B3 B2 B6 B7 B5 B8 How is this different from modeling social networks? Our goal: Model micro-level interactions to create the macro-level patterns (both the temporal and topological) of the blogosphere Michaela GötzModeling Blog DynamicsICWSM 2009

Models – Related Work Michaela GötzModeling Blog DynamicsICWSM 2009 Realistic Topological PatternsRealistic Dynamical Patterns Links (SVM) [Adar and Adamic] Blog Mortality [Venolia] Links (Epidemiological Model) [Gruhl et al.] Links (Epidemiological Model) [Leskovec et al.] Inter-arrival time of s (Markov Chain) [Kleinberg] Links (Preferential Attachment) [Karandikar et al.] Time between answering two consecutive s (PA) [Barabasi; Vazquez] Blog

What are the properties of the blogosphere? Topological properties ▫Blog network ▫Post network Temporal properties ▫User posting activity ▫Popularity of posts over time (link creation) Michaela GötzModeling Blog DynamicsICWSM 2009

Some Topological Patterns Michaela GötzModeling Blog DynamicsICWSM 2009 Power law: [Leskovec et al.] Blog In-Degree Post In-Degree Cascade Size

Michaela GötzModeling Blog DynamicsICWSM 2009 Burstiness & Self-Similarity: Measure: Entropy at various resolutions Plot: Burstiness & Self-Similarity: Measure: Entropy at various resolutions Plot: Self-Similarity -> Linearity Burstiness -> Slope < 1 Self-Similarity -> Linearity Burstiness -> Slope < 1 Blogosphere – Some Temporal Patterns Time vs. Count Example 1 Aggregation Level vs. Entropy Example 2

Blogosphere – Some Temporal Patterns Michaela GötzModeling Blog DynamicsICWSM 2009 [McGlohon et al.] Inter-Posting Time Popularity: time t vs. #in-links received t days after publishing Burstiness of Posting Activity Burstiness & Self-Similarity: Measure: Entropy at various resolutions Plot: Burstiness & Self-Similarity: Measure: Entropy at various resolutions Plot:

Blogosphere - Dynamics B1 B4 B3 B2 B6 B7 B5 Michaela GötzModeling Blog DynamicsICWSM 2009 B8 WANTED: Model - Simple (no parameters) - Intuitive (local rules) - Creating Realistic Topology and Dynamics WANTED: Model - Simple (no parameters) - Intuitive (local rules) - Creating Realistic Topology and Dynamics

First try solution Let’s assume: ▫ Inter-posting times are sampled from exponential distribution ▫Links are created using Preferential Attachment But it won’t work: Inter-Posting Time Burstiness of Posting Activity Blog In-Degree Michaela GötzModeling Blog DynamicsICWSM 2009

Michaela GötzModeling Blog DynamicsICWSM 2009 Our Model ZC In Every Round For Every Blog In Every Round For Every Blog post P no link create link choose neighbor B choose neighbor B choose non-neighbor B choose non-neighbor B choose post Q of B choose post Q of B link to random posts upward in the cascade link to random posts upward in the cascade “random walk” …… “explore”“exploit” “link expansion” /2

Michaela GötzModeling Blog DynamicsICWSM 2009 Our Model ZC In Every Round For Every Blog In Every Round For Every Blog post P no link create link choose neighbor B choose neighbor B choose non-neighbor B choose non-neighbor B choose post Q of B choose post Q of B link to random posts upward in the cascade link to random posts upward in the cascade “random walk” …… “explore”“exploit” “link expansion” /2 B3 B2 B

Michaela GötzModeling Blog DynamicsICWSM 2009 Our Model ZC In Every Round For Every Blog In Every Round For Every Blog post P no link create link choose neighbor B choose neighbor B choose non-neighbor B choose non-neighbor B choose post Q of B choose post Q of B link to random posts upward in the cascade link to random posts upward in the cascade “random walk” …… “explore”“exploit” “link expansion” /2 B3 B2 B

Michaela GötzModeling Blog DynamicsICWSM 2009 Our Model ZC In Every Round For Every Blog In Every Round For Every Blog post P no link create link choose neighbor B choose neighbor B choose non-neighbor B choose non-neighbor B choose post Q of B choose post Q of B link to random posts upward in the cascade link to random posts upward in the cascade “random walk” …… “explore”“exploit” “link expansion” /2 B3 B2 B ?

Michaela GötzModeling Blog DynamicsICWSM 2009 Our Model ZC In Every Round For Every Blog In Every Round For Every Blog post P no link create link choose neighbor B choose neighbor B choose non-neighbor B choose non-neighbor B choose post Q of B choose post Q of B link to random posts upward in the cascade link to random posts upward in the cascade “random walk” …… “explore”“exploit” “link expansion” /2 B3 B2 B

Michaela GötzModeling Blog DynamicsICWSM 2009 Our Model ZC In Every Round For Every Blog In Every Round For Every Blog post P no link create link choose neighbor B choose neighbor B choose non-neighbor B choose non-neighbor B choose post Q of B choose post Q of B link to random posts upward in the cascade link to random posts upward in the cascade “random walk” …… “explore”“exploit” “link expansion” /2 B3 B2 B

Michaela GötzModeling Blog DynamicsICWSM 2009 Our Model ZC In Every Round For Every Blog In Every Round For Every Blog post P no link create link choose neighbor B choose neighbor B choose non-neighbor B choose non-neighbor B choose post Q of B choose post Q of B link to random posts upward in the cascade link to random posts upward in the cascade “random walk” …… “explore”“exploit” “link expansion” /2 B3 B2 B

Michaela GötzModeling Blog DynamicsICWSM 2009 post P no link create link choose neighbor B choose neighbor B choose non-neighbor B choose non-neighbor B choose post Q of B choose post Q of B link to random posts upward in the cascade link to random posts upward in the cascade “random walk” …… “explore”“exploit” “link expansion” /2 Our Model ZC Properties - Simple (no parameters) - Intuitive (local rules) Properties - Simple (no parameters) - Intuitive (local rules) Theorem: The interposting time of ZC follows a power- law with exponent -1.5 Theorem: The interposting time of ZC follows a power- law with exponent -1.5 Theorem: The posting activity is self-similar and bursty.

Michaela GötzModeling Blog DynamicsICWSM 2009 Our Model ZC – Experimental Evaluation Inter-Posting TimeBurstiness of Posting Activity Real Data Simulation Popularity over time 45k blogs, 2.2 million posts [Glance et al.] 45k blogs, 2.2 million posts

Michaela GötzModeling Blog DynamicsICWSM 2009 Post In-Degree Cascade Size Our Model ZC – Experimental Evaluation Blog In-Degree Real Data Simulation

Michaela GötzModeling Blog DynamicsICWSM 2009 Conclusions Model ZC of the Blogosphere - Simple (no parameters) - Intuitive (local rules) - Creating Realistic Blogosphere: - evaluated on 3 topological patterns - evaluated on 3 temporal patterns - Useful for forecasting and to explore dynamics for advertising Model ZC of the Blogosphere - Simple (no parameters) - Intuitive (local rules) - Creating Realistic Blogosphere: - evaluated on 3 topological patterns - evaluated on 3 temporal patterns - Useful for forecasting and to explore dynamics for advertising post P no link create link choose neighbor B choose neighbor B choose non-neighbor B choose non-neighbor B choose post Q of B choose post Q of B link to random posts upward in the cascade link to random posts upward in the cascade “random walk” …… “explore”“exploit” “link expansion” /2

Michaela GötzModeling Blog DynamicsICWSM 2009 Thanks! post P no link create link choose neighbor B choose neighbor B choose non-neighbor B choose non-neighbor B choose post Q of B choose post Q of B link to random posts upward in the cascade link to random posts upward in the cascade “random walk” …… “explore”“exploit” “link expansion” /2